[Archivesspace_Users_Group] FW: normalization in ArchivesSpace

Trevor Thornton trthorn2 at ncsu.edu
Mon Apr 20 18:44:14 EDT 2020


>From what I can tell, the Solr Standard Tokenizer
<https://lucene.apache.org/solr/guide/6_6/tokenizers.html#Tokenizers-StandardTokenizer>
(which I think is the one used for most text fields) doesn't exclude the
apostrophe or use it as a delimiter to split the word (as it does with
other punctuation marks), so a query for "Governor’s" won't match
"Governors" and vice versa. I don't know of a convenient workaround
(without modifying the Solr schema).

On Mon, Apr 20, 2020 at 4:38 PM Hoffner, Bailey E. <baileys at ou.edu> wrote:

> Hello All,
>
>
>
> One of our catalogers noticed an issue with search functionality and
> normalization (see below). Has anyone dealt with this issue before, or know
> of a workaround?
>
>
>
> Thanks!
>
>
>
> -Bailey
>
>
>
> Bailey Hoffner, MLIS
>
> Metadata and Collections Management Archivist
>
> University of Oklahoma Libraries
>
> 405-325-1566
>
>
>
>
>
> *From: *"Steele, Thomas D." <Thomas.D.Steele-1 at ou.edu>
> *Date: *Monday, April 20, 2020 at 3:26 PM
> *To: *"Hoffner, Bailey E." <baileys at ou.edu>
> *Subject: *normalization in ArchiveSpace
>
>
>
> Searching for a term such as “Governors’” yields no hits if you spell it
> as “Governor’s”.  both terms should normalize to “Governors”, but it’s
> possible the latter is normalizing to “Governor s”
>
>
>
> Tom Steele
>
> Science and Technology Cataloger
>
> University of Oklahoma Libraries
>
> Norman, OK   73019
>
> (405) 325-4082
>
> Thomas.D.Steele-1 at ou.edu
>
>
>
> *"Books constitute capital. A library book lasts as long as a house, for
> hundreds of years. It is not, then, an article of mere consumption but
> fairly of capital, and often in the case of professional men, setting out
> in life, it is their only capital*.*" -- Thomas Jefferson*
>
>
> _______________________________________________
> Archivesspace_Users_Group mailing list
> Archivesspace_Users_Group at lyralists.lyrasis.org
> http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group
>


-- 
Trevor Thornton
Applications Developer, Digital Library Initiatives
North Carolina State University Libraries
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lyralists.lyrasis.org/pipermail/archivesspace_users_group/attachments/20200420/542bb0fc/attachment.html>


More information about the Archivesspace_Users_Group mailing list