[Archivesspace_Users_Group] Phantom records post-migration to version 3.2.0
Andrew Morrison
andrew.morrison at bodleian.ox.ac.uk
Fri Aug 12 03:39:04 EDT 2022
That plug-in does the equivalent of the "Full reindex" process described
here:
https://archivesspace.github.io/tech-docs/administration/indexes.html
So, it is not something you want running upon startup of a production
system, as records will be unavailable until they've been re-indexed,
which might take a long time, depending on the size of your collections.
It was probably written to help automate the management of Lyrasis's own
sandbox.archivesspace.org and test.archivesspace.org servers, which only
contain a few test records.
A one-off "full reindex" is probably required to fix your phantom
records. But you need to watch the application log to confirm that the
indexing process starts and runs all the way to the end without
errors. Personally, I'd trigger it manually, but if you prefer to use
the plug-in, then you need to change 4567 in the curl command to the
port on which your backend is listening. Usually that is 8089, but
double-check the AppConfig[:backend_url] setting in config.rb. If you
don't know how to obtain the session token in $SESSION, see the
instructions here:
https://archivesspace.github.io/tech-docs/api/
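
For anyone who would rather script the two API calls, here is a minimal
Python sketch: log in to obtain a session token, then POST to the
plug-in's endpoint with that token in the X-ArchivesSpace-Session header.
The backend URL, the "admin" user name and the function names are
assumptions for illustration only. The login call follows the pattern in
the API documentation linked above, so check the details against your own
installation before relying on it.

```python
import json
import urllib.parse
import urllib.request

# Assumed value; confirm against AppConfig[:backend_url] in config.rb.
BACKEND_URL = "http://localhost:8089"


def login_request(backend_url, username, password):
    """Build the POST request that exchanges a password for a session token."""
    url = f"{backend_url.rstrip('/')}/users/{urllib.parse.quote(username)}/login"
    body = urllib.parse.urlencode({"password": password}).encode()
    return urllib.request.Request(url, data=body, method="POST")


def reindex_request(backend_url, session):
    """Build the POST request for the plug-in's /plugins/reindex endpoint."""
    url = f"{backend_url.rstrip('/')}/plugins/reindex"
    return urllib.request.Request(
        url,
        data=b"",
        method="POST",
        headers={"X-ArchivesSpace-Session": session},
    )


def trigger_reindex(backend_url, username, password):
    """Log in, then ask the plug-in to start a reindex. Returns (status, body)."""
    with urllib.request.urlopen(login_request(backend_url, username, password)) as resp:
        session = json.load(resp)["session"]
    with urllib.request.urlopen(reindex_request(backend_url, session)) as resp:
        return resp.status, resp.read().decode()


if __name__ == "__main__":
    import getpass

    # "admin" is a placeholder user name; use an account with sufficient rights.
    print(trigger_reindex(BACKEND_URL, "admin", getpass.getpass("Password: ")))
```

Nothing here replaces watching the application log: the call only starts
the reindex, it does not tell you whether it finished cleanly.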
If you want to figure out the cause of the phantom records, you should
identify some by comparing what's in Solr against what's in MySQL,
before running the re-index. Then you could check when they were deleted
in the "deleted_records" MySQL table.
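
A rough sketch of that comparison in Python, assuming you have already
exported two lists by whatever means you prefer: the ids of the resource
documents in Solr (assumed here to be record URIs such as
/repositories/2/resources/7) and the repo_id and id values of the
corresponding rows in MySQL. The function names and sample values are
hypothetical.

```python
def resource_uri(repo_id, record_id):
    """Build the URI a resource row is assumed to have in the index."""
    return f"/repositories/{repo_id}/resources/{record_id}"


def find_phantoms(solr_uris, db_uris):
    """Return URIs that are present in Solr but absent from the database."""
    return sorted(set(solr_uris) - set(db_uris))


if __name__ == "__main__":
    # Sample data only; real lists would come from Solr and MySQL exports.
    solr = [
        "/repositories/2/resources/1",
        "/repositories/2/resources/7",
        "/repositories/2/resources/3",
    ]
    database = [resource_uri(2, 1), resource_uri(2, 3)]
    for uri in find_phantoms(solr, database):
        print(uri)
```

Any URIs it reports are the ones worth looking up in "deleted_records".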
Andrew.
On 11/08/2022 15:45, Jerry Boggio wrote:
>
> Hi Don and other ArchivesSpace Users;
>
> Referring to the page Don suggested:
>
> *ArchivesSpace Reindexer plugin*
>
> This plugin can be used in two ways:
>
> 1. On system startup to initiate a reindex
>
> 2. Via the api to trigger a reindex
>
> TODO: consider running as a job and making it available that way too.
>
> *On startup*
>
> · Set AppConfig[:reindex_on_startup] = true in config.rb
>
> · Restart ArchivesSpace
>
> *Via the api*
>
> curl -H "X-ArchivesSpace-Session: $SESSION" -X POST \
>   http://localhost:4567/plugins/reindex
>
> We have set “AppConfig[:reindex_on_startup] = true” in config.rb before
> the last restart and still have “phantom” records.
>
> Looking at the API call, which port should be used here? The port for
> the External Solr Index, the one specified, or something else?
>
> We would also like to know the cause of the “phantom” records. Does it
> have something to do with the setup-database.sh script?
>
> By the way, when running the checkindex.sh script in Linux it
> generates an error. Should there be a new script for External Solr?
>
> /apps/archivesspace/scripts>ll
>
> total 52
>
> -rw-r--r-- 1 archspc users 317 Apr 16 2021 backup.bat
>
> -rwxr-xr-x 1 archspc users 365 Apr 16 2021 backup.sh
>
> -rwxr-xr-x 1 archspc users 271 Apr 16 2021 checkindex.bat
>
> -rwxr-xr-x 1 archspc users 360 Apr 16 2021 checkindex.sh
>
> -rw-r--r-- 1 archspc users 290 Apr 16 2021 ead_export.bat
>
> -rwxr-xr-x 1 archspc users 350 Apr 16 2021 ead_export.sh
>
> -rwxr-xr-x 1 archspc users 217 Apr 16 2021 find-base.sh
>
> -rw-r--r-- 1 archspc users 496 Apr 16 2021 initialize-plugin.bat
>
> -rwxr-xr-x 1 archspc users 804 Dec 22 2021 initialize-plugin.sh
>
> -rw-r--r-- 1 archspc users 295 Apr 16 2021 password-reset.bat
>
> -rwxr-xr-x 1 archspc users 353 Apr 16 2021 password-reset.sh
>
> drwxr-xr-x 2 archspc users 46 Feb 4 2022 rb
>
> -rwxr-xr-x 1 archspc users 304 Apr 16 2021 setup-database.bat
>
> -rwxr-xr-x 1 archspc users 322 Apr 16 2021 setup-database.sh
>
> /apps/archivesspace/scripts>./checkindex.sh
>
> RuntimeError: Solr war file not found
>
> find_solr_war at ../scripts/rb/checkindex.rb:29
>
> check at ../scripts/rb/checkindex.rb:10
>
> <main> at ../scripts/rb/checkindex.rb:86
>
> /apps/archivesspace/scripts>
>
> I know there are a lot of questions here and we would appreciate your
> help in getting answers to all of them.
>
> Thank you again!
>
> *Gerard (Jerry) Boggio* | *MITRE Corporation* | R124 - Collaboration &
> Info Management| 781-271-2719
>
> Greg,
> One easy way to rebuild the index is to install the Lyrasis reindexer
> plugin: https://github.com/lyrasis/aspace-reindexer. Once installed, you
> can either have it rebuild the index via the startup config or via
> the endpoint that the plugin adds to the API. We recently migrated from
> 3.0.2 to 3.2.0 and everything went smoothly for us. Once the new
> version of Aspace was running we hit the endpoint and the index was
> rebuilt.
> Don
> Donald R. Mennerich, digital archivist
> New York University Libraries
> don.mennerich at nyu.edu
> (212) 992-6264
>
> *From:* Jerry Boggio <gboggio at mitre.org>
> *Sent:* Tuesday, August 9, 2022 2:22 PM
> *To:* archivesspace_users_group at lyralists.lyrasis.org
> *Cc:* Erin Faulder <EFAULDER at mitre.org>
> *Subject:* Phantom records post-migration to version 3.2.0
>
> Hello ASpace Users;
>
> We just upgraded from ArchivesSpace version 2.7.1 to version 3.2.0 and
> now have “phantom” records in Resources and Subjects. By phantom I
> mean records that show up in ASpace but are not in the database, and
> trying to view such a record returns:
>
> As part of this upgrade on our Test machine we:
>
> * Started using External Solr as opposed to Internal
> * Purged the contents of the following directories, but did not
>   delete the directories:
>     o /apps/archivesspace/data/indexer_pui_state
>     o /apps/archivesspace/data/indexer_state
>     o /apps/archivesspace/data/tmp
> * Kept, but moved /apps/archivesspace/data/solr_backups to
>   /apps/archivesspace/data/old_solr_backups; and created a new
>   /apps/archivesspace/data/solr_backups directory
> * Refreshed our Test MySQL database from our v2.7.1 Prod version and
>   ran the setup-database.sh script to convert to v3.2.0. We are
>   running MySQL version 8.
>
> This left the following directory structure under the data directory:
>
> /apps/archivesspace/data>ll
>
> total 36
>
> drwxr-xr-x 5 archspc users 147 Oct 16 2019 archivesspace_demo_db
>
> drwxr-xr-x 9 archspc users 4096 Oct 18 2019 demo_db_backups
>
> -rw-r--r-- 1 archspc users 32 Oct 8 2019 frontend_cookie_secret_cookie_secret.dat
>
> -rw-r--r-- 1 archspc users 32 May 14 2020 frontend_cookie_secret.dat
>
> drwxr-xr-x 2 archspc users 4096 Aug 9 13:59 indexer_pui_state
>
> drwxr-xr-x 2 archspc users 4096 Aug 9 13:59 indexer_state
>
> drwxr-xr-x 13 archspc users 4096 Aug 9 00:00 old_solr_backups
>
> -rw-r--r-- 1 archspc users 32 Oct 8 2019 public_cookie_secret_cookie_secret.dat
>
> -rw-r--r-- 1 archspc users 32 May 14 2020 public_cookie_secret.dat
>
> drwxr-xr-x 3 archspc users 22 Oct 28 2019 shared
>
> drwxr-xr-x 2 archspc users 6 Aug 9 10:33 solr_backups
>
> drwxr-xr-x 13 archspc users 4096 Aug 9 13:59 tmp
>
> /apps/archivesspace/data>
>
> External Solr was installed on the same Linux server as ArchivesSpace.
> We installed solr-8.10.0, but see that a newer version solr-8.11.2 is
> available. Should we be using the newer version? How can we clear the
> External Solr index in order to rebuild it?
>
> What needs to be done to eliminate these phantom records? We are
> assuming it is something left over from the prior version.
>
> Please advise or let us know if you need more information.
>
> Thank you.
>
> *Gerard (Jerry) Boggio* | *MITRE Corporation* | R124 - Collaboration &
> Info Management| 781-271-2719
>
>
> _______________________________________________
> Archivesspace_Users_Group mailing list
> Archivesspace_Users_Group at lyralists.lyrasis.org
> http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.png
Type: image/png
Size: 39627 bytes
Desc: not available
URL: <http://lyralists.lyrasis.org/pipermail/archivesspace_users_group/attachments/20220812/2b1357ac/attachment.png>