[Archivesspace_Users_Group] [EXTERNAL] Re: Hard re-indexing question

Rees, John (NIH/NLM) [E] reesj at mail.nlm.nih.gov
Tue Oct 18 08:53:55 EDT 2022


Thanks. They weren't deleted records, but published-then-suppressed records.

Because the digital object (DO) records had once been published, their subject terms were published too, and those terms then persisted in the PUI as ghost terms with no linked records. Suppression doesn't change system_mtime, so the subject terms weren't re-indexed for the PUI until either the subject term was touched (when we ran the full re-index) or the published state of the DO record was toggled.

We learned something new.

John

From: archivesspace_users_group-bounces at lyralists.lyrasis.org <archivesspace_users_group-bounces at lyralists.lyrasis.org> On Behalf Of Andrew Morrison
Sent: Tuesday, October 18, 2022 4:09 AM
To: archivesspace_users_group at lyralists.lyrasis.org
Subject: [EXTERNAL] Re: [Archivesspace_Users_Group] Hard re-indexing question


There is a bug involving the "deleted_records" table that can cause records to disappear after a re-index:



https://archivesspace.atlassian.net/browse/ANW-1607



But it only affects things that have been transferred from one repository to another, then back again. And that doesn't apply to subjects, as they don't belong to any one repository. There could be another mechanism that triggers the same bug for subjects. If so, the SQL file attached to that issue could be adapted to find and fix them.



If subjects are missing from the public interface but not from the staff interface, then it is a problem with the indexer, or the code it calls to check that each subject is linked to at least one published record.
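
To make that check concrete, here is a minimal sketch of the "linked to at least one published record" test, run against an in-memory SQLite stand-in rather than a real ArchivesSpace database. The table and column names (subject, subject_rlshp, digital_object, publish, suppressed) are assumptions for illustration, it only looks at digital objects, and it is not the indexer's actual code.

```python
import sqlite3


def ghost_subject_ids(conn):
    """Return ids of subjects with no link to a published, unsuppressed digital object."""
    rows = conn.execute(
        """
        SELECT s.id
        FROM subject s
        WHERE NOT EXISTS (
            SELECT 1
            FROM subject_rlshp r
            JOIN digital_object d ON d.id = r.digital_object_id
            WHERE r.subject_id = s.id
              AND d.publish = 1
              AND d.suppressed = 0
        )
        ORDER BY s.id
        """
    ).fetchall()
    return [row[0] for row in rows]


# Stand-in schema and data (hypothetical, for illustration only).
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE subject (id INTEGER PRIMARY KEY, title TEXT);
    CREATE TABLE digital_object (
        id INTEGER PRIMARY KEY, publish INTEGER, suppressed INTEGER
    );
    CREATE TABLE subject_rlshp (subject_id INTEGER, digital_object_id INTEGER);
    INSERT INTO subject VALUES (1, 'Anatomy'), (2, 'Surgery');
    -- DO 10 is published; DO 11 was published and then suppressed.
    INSERT INTO digital_object VALUES (10, 1, 0), (11, 1, 1);
    INSERT INTO subject_rlshp VALUES (1, 10), (2, 11);
    """
)

ghosts = ghost_subject_ids(conn)
print(ghosts)
```

In this toy data, subject 2 is linked only to the published-then-suppressed digital object, so it is the one reported. A real check would also have to consider the other record types a subject can be linked to.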



Andrew.




On 17/10/2022 17:48, Blake Carver wrote:




>> We recently tried a hard re-index and discovered that for the PUI, subjects linked only to digital objects were not re-indexed and dropped from the PUI. Is this expected behavior?



Maybe related to what's in your deleted_records table? How many rows in that table?



>> Related, is there a table/tables in the dbase that tell Solr when to index that we could investigate,



Many things have a system_mtime that you can set to NOW().
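
As a rough sketch of that idea, the snippet below touches system_mtime on selected subject rows so that a periodic indexer would see them as changed. It runs against an in-memory SQLite stand-in, so it uses CURRENT_TIMESTAMP where MySQL would use NOW(). The subject table layout and the touch_subjects helper are assumptions for illustration; back up the database before trying anything like this on a real instance.

```python
import sqlite3


def touch_subjects(conn, subject_ids):
    """Set system_mtime to the current time for the given subjects; return rows changed."""
    if not subject_ids:
        return 0
    placeholders = ",".join("?" for _ in subject_ids)
    cur = conn.execute(
        # CURRENT_TIMESTAMP is the SQLite stand-in for MySQL's NOW().
        f"UPDATE subject SET system_mtime = CURRENT_TIMESTAMP "
        f"WHERE id IN ({placeholders})",
        list(subject_ids),
    )
    conn.commit()
    return cur.rowcount


# Stand-in table and data (hypothetical, for illustration only).
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE subject (id INTEGER PRIMARY KEY, title TEXT, system_mtime TEXT);
    INSERT INTO subject VALUES
        (1, 'Anatomy', '2020-01-01 00:00:00'),
        (2, 'Surgery', '2020-01-01 00:00:00');
    """
)

touched = touch_subjects(conn, [2])

# Rows that still carry the old timestamp were not touched.
stale = [
    row[0]
    for row in conn.execute(
        "SELECT id FROM subject WHERE system_mtime = '2020-01-01 00:00:00' ORDER BY id"
    )
]
print(touched, stale)
```

Only the rows named in the call get a new timestamp; everything else keeps its old system_mtime and would not be picked up on that basis.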







_______________________________________________

Archivesspace_Users_Group mailing list

Archivesspace_Users_Group at lyralists.lyrasis.org

http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group
