[Archivesspace_Users_Group] FW: normalization in ArchivesSpace
Hoffner, Bailey E.
baileys at ou.edu
Tue Apr 21 16:04:21 EDT 2020
Thanks, Trevor!
Bailey Hoffner, MLIS
Metadata and Collections Management Archivist
University of Oklahoma Libraries
405-325-1566
From: <archivesspace_users_group-bounces at lyralists.lyrasis.org> on behalf of Trevor Thornton <trthorn2 at ncsu.edu>
Reply-To: Archivesspace Users Group <Archivesspace_Users_Group at lyralists.lyrasis.org>
Date: Monday, April 20, 2020 at 5:44 PM
To: Archivesspace Users Group <Archivesspace_Users_Group at lyralists.lyrasis.org>
Subject: Re: [Archivesspace_Users_Group] FW: normalization in ArchivesSpace
From what I can tell, the Solr Standard Tokenizer <https://lucene.apache.org/solr/guide/6_6/tokenizers.html#Tokenizers-StandardTokenizer> (which I think is the one used for most text fields) doesn't exclude the apostrophe or use it as a delimiter to split the word (as it does with other punctuation marks), so a query for "Governor’s" won't match "Governors" and vice versa. I don't know of a convenient workaround (without modifying the Solr schema).
On Mon, Apr 20, 2020 at 4:38 PM Hoffner, Bailey E. <baileys at ou.edu> wrote:
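To illustrate the behavior described above, here is a minimal Python sketch. It is not Solr's code: the regular expression only approximates the tokenizer's handling of apostrophes (Solr's Standard Tokenizer follows the Unicode UAX #29 word-boundary rules), and the lower-casing step and both function names are assumptions made for the example. The second function shows the kind of apostrophe-stripping normalization that would have to be applied at both index and query time for the two spellings to match.

```python
import re

# Rough approximation (an assumption, not Solr's implementation):
# a token is a run of letters/digits, and an apostrophe is kept only
# when it sits between two such runs. Other punctuation splits tokens.
TOKEN_RE = re.compile(r"[^\W_]+(?:['\u2019][^\W_]+)*")


def tokenize(text):
    """Split text into lower-cased tokens, keeping word-internal apostrophes."""
    return [token.lower() for token in TOKEN_RE.findall(text)]


def tokenize_without_apostrophes(text):
    """Hypothetical normalization: drop apostrophes before tokenizing."""
    return tokenize(re.sub(r"['\u2019]", "", text))


if __name__ == "__main__":
    for term in ("Governor\u2019s", "Governors\u2019", "Governors"):
        print(term, tokenize(term), tokenize_without_apostrophes(term))
```

In this sketch the internal apostrophe in "Governor’s" survives tokenization, while the trailing one in "Governors’" is dropped, which is consistent with the mismatch reported in the original message. With the apostrophes stripped first, all three spellings reduce to the same token.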
Hello All,
One of our catalogers noticed an issue with search functionality and normalization (see below). Has anyone dealt with this issue before, or know of a workaround?
Thanks!
-Bailey
Bailey Hoffner, MLIS
Metadata and Collections Management Archivist
University of Oklahoma Libraries
405-325-1566
From: "Steele, Thomas D." <Thomas.D.Steele-1 at ou.edu>
Date: Monday, April 20, 2020 at 3:26 PM
To: "Hoffner, Bailey E." <baileys at ou.edu>
Subject: normalization in ArchiveSpace
Searching for a term such as “Governors’” yields no hits if you spell it as “Governor’s”. Both terms should normalize to “Governors”, but it’s possible the latter is normalizing to “Governor s”.
Tom Steele
Science and Technology Cataloger
University of Oklahoma Libraries
Norman, OK 73019
(405) 325-4082
Thomas.D.Steele-1 at ou.edu
"Books constitute capital. A library book lasts as long as a house, for hundreds of years. It is not, then, an article of mere consumption but fairly of capital, and often in the case of professional men, setting out in life, it is their only capital." -- Thomas Jefferson
_______________________________________________
Archivesspace_Users_Group mailing list
Archivesspace_Users_Group at lyralists.lyrasis.org
http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group
--
Trevor Thornton
Applications Developer, Digital Library Initiatives
North Carolina State University Libraries