[Archivesspace_Users_Group] PUI Ordering and Sorting Issue

Andrew Morrison andrew.morrison at bodleian.ox.ac.uk
Wed May 11 05:14:54 EDT 2022


Sorry, make that:

[^\p{L}\d\s]

Otherwise it'll strip out digits.

Andrew.


On 11/05/2022 09:53, Andrew Morrison wrote:
>
> I'd forgotten about clean_for_sort. I've overridden it myself, in a 
> plug-in. The simplest way is:
>
>  1. In plugins/local, create a subfolder called "indexer"
>  2. Create a file in that subfolder called "indexer_common_override.rb"
>  3. In the file, declare "class IndexerCommon"
>  4. Copy the def for clean_for_sort into that, and adjust it to meet
>     your requirements.
>  5. Restart and re-index.
>
> Something like:
>
> [^\p{L}\s]
>
> Would be a less anglocentric substitute for:
>
> [^\w\s]
>
> If that works, you probably won't have any empty title_sort values, 
> but in general you can control the sorting of records without values 
> in schema.xml with the sortMissingLast attribute, see:
>
> https://solr.apache.org/guide/8_8/field-type-definitions-and-properties.html
>
> Andrew.
>
>
> On 11/05/2022 08:37, 松山 ひとみ wrote:
>>
>> Hello, again.
>>
>> We’ve tried as Andrew kindly suggested.
>>
>> However, it didn’t work as well as expected…
>>
>> We think using “clean_for_sort; IndexerCommon” may interfere when 
>> “title” is set to “title_sort”.
>>
>> “clean_for_sort” looks eliminating anything except alphabets and 
>> numbers when sorting.
>>
>> It would affect not only Japanese character but also any non-alphabet 
>> characters, such as Hangul or Cyrillic, we suppose.
>>
>> We also wonder what order is applied when “title_sort” is empty.
>>
>>
>>
>> Thanks,
>>
>> Hitomi
>>
>> Hitomi Matsuyama, Audiovisual Archivist
>>
>> Nakanoshima Museum of Art, Osaka
>>
>> 4-3-1 Nakanoshima, Kita-ku
>>
>> Osaka 530-0005 JAPAN
>>
>> tel. +81 (0)6 64 79 05 58
>>
>> email. matsuyama-h at nakka-art.jp <mailto:matsuyama-h at nakka-art.jp>
>>
>> *From:*archivesspace_users_group-bounces at lyralists.lyrasis.org 
>> <archivesspace_users_group-bounces at lyralists.lyrasis.org> *On Behalf 
>> Of *Andrew Morrison
>> *Sent:* Thursday, April 28, 2022 8:46 PM
>> *To:* archivesspace_users_group at lyralists.lyrasis.org
>> *Subject:* Re: [Archivesspace_Users_Group] PUI Ordering and Sorting Issue
>>
>> I forgot to mention that you probably have to re-index after changing 
>> schema.xml and reloading the core.
>>
>> Andrew.
>>
>> On 28/04/2022 10:51, 松山 ひとみwrote:
>>
>>     Thanks again Andrew!
>>
>>     We’ll try applying what you gave to our current AS.
>>
>>     Hitomi
>>
>>     *From:*archivesspace_users_group-bounces at lyralists.lyrasis.org
>>     <archivesspace_users_group-bounces at lyralists.lyrasis.org>
>>     <mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org>
>>     *On Behalf Of *Andrew Morrison
>>     *Sent:* Thursday, April 28, 2022 6:09 PM
>>     *To:* archivesspace_users_group at lyralists.lyrasis.org
>>     *Subject:* Re: [Archivesspace_Users_Group] PUI Ordering and
>>     Sorting Issue
>>
>>     If you're using the schema.xml that came with ArchivesSpace 3.0.1
>>     in your external Solr 8.11, then it will still define the
>>     "sort_icu" fieldType as an instance of the solr.TextField class.
>>     If you look below that, there is a commented-out alternative
>>     fieldType definition which is an instance of
>>     solr.ICUCollationField. ArchivesSpace 3.2.0 has changed to that
>>     (because it no longer has to support the previously-built-in Solr
>>     4.10) but you don't need to upgrade to it, you can just edit your
>>     schema.xml, then reload the Solr core. See the link in my
>>     previous email for help on how to set that up to be optimized for
>>     Japanese characters.
>>
>>     Andrew.
>>
>>     On 28/04/2022 09:44, 松山 ひとみ wrote:
>>
>>         Thank you Andrew!
>>
>>         Our IT says we’ve already been using an external Solr 8.11
>>         with ArchivesSpace 3.0.1, not the one built-in.
>>
>>         We’re thinking of upgrading our AS to 3.2.0. Do you think we
>>         will get a better result?
>>
>>         Hitomi
>>
>>         *From:*archivesspace_users_group-bounces at lyralists.lyrasis.org
>>         <archivesspace_users_group-bounces at lyralists.lyrasis.org>
>>         <mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org>
>>         *On Behalf Of *Andrew Morrison
>>         *Sent:* Thursday, April 28, 2022 4:47 PM
>>         *To:* archivesspace_users_group at lyralists.lyrasis.org
>>         *Subject:* Re: [Archivesspace_Users_Group] PUI Ordering and
>>         Sorting Issue
>>
>>         Are you using the built-in Solr search engine that comes with
>>         ArchivesSpace 3.0.1? If so, your sorting problems could be
>>         because it uses a very old version, because newer ones aren't
>>         compatible with the method of embedding it in a bigger
>>         application. But there is the option to configure
>>         ArchivesSpace to use an external Solr service:
>>
>>         https://archivesspace.github.io/tech-docs/provisioning/solr.html
>>
>>         That allows you to run a more up-to-date version, which would
>>         enable use of the solr.ICUCollationField class for sort
>>         fields. That can be adjusted to sort different languages
>>         according to their own sorting rules, as described here:
>>
>>         https://solr.apache.org/guide/8_11/language-analysis.html#unicode-collation
>>
>>         ArchivesSpace 3.2.0 removes the built-in Solr, so running an
>>         external Solr service will be necessary if you upgrade in the
>>         future.
>>
>>         As for adding the option to sort on identifiers, I don't
>>         think there is a configuration option or simple interface for
>>         adding them. But it would probably be possible to develop a
>>         plug-in to override certain Ruby methods in the core code to
>>         do it.
>>
>>         Andrew.
>>
>>         On 27/04/2022 11:04, 松山 ひとみwrote:
>>
>>             Hello all,
>>
>>             We’ve been stuck in the “ordering and sorting” issue in
>>             [~/repositories/resources]. Our AS is version 3.0.1.
>>
>>             Presumably, because we use Japanese Character, our
>>             resource list cannot be displayed in a right,
>>             alphabetical order when sorted by Title.
>>
>>             Could we add Identifier to the category of sorting;
>>             Relevance/Title(Asc/Desc)/Year(Asc/Desc), as alternative?
>>
>>             We’d very much appreciate you helping solve our issue!
>>
>>             All the best,
>>
>>             Hitomi Matsuyama, Audiovisual Archivist
>>
>>             Nakanoshima Museum of Art, Osaka
>>
>>             4-3-1 Nakanoshima, Kita-ku
>>
>>             Osaka 530-0005 JAPAN
>>
>>             tel. +81 (0)6 64 79 05 58
>>
>>             email. matsuyama-h at nakka-art.jp
>>             <mailto:matsuyama-h at nakka-art.jp>
>>
>>
>>
>>
>>
>>             _______________________________________________
>>
>>             Archivesspace_Users_Group mailing list
>>
>>             Archivesspace_Users_Group at lyralists.lyrasis.org
>>
>>             http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group
>>
>>
>>
>>
>>         _______________________________________________
>>
>>         Archivesspace_Users_Group mailing list
>>
>>         Archivesspace_Users_Group at lyralists.lyrasis.org
>>
>>         http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group
>>
>>
>>
>>     _______________________________________________
>>
>>     Archivesspace_Users_Group mailing list
>>
>>     Archivesspace_Users_Group at lyralists.lyrasis.org
>>
>>     http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group
>>
>>
>> _______________________________________________
>> Archivesspace_Users_Group mailing list
>> Archivesspace_Users_Group at lyralists.lyrasis.org
>> http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group
>
> _______________________________________________
> Archivesspace_Users_Group mailing list
> Archivesspace_Users_Group at lyralists.lyrasis.org
> http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lyralists.lyrasis.org/pipermail/archivesspace_users_group/attachments/20220511/659bb381/attachment.html>


More information about the Archivesspace_Users_Group mailing list