<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style type="text/css" style="display:none;"> P {margin-top:0;margin-bottom:0;} </style>
</head>
<body dir="ltr">
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Thanks, Andrew! So far we haven't had any issues with indexer commit timeouts, but I'll keep that in mind.<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
There's an extra wrinkle that I forgot to mention. Its a 2.5.0 -> 2.7.1 upgrade and the data the indexer gathers has changed (both from core and from some local indexer changes) so I think I'm going to follow something similar to a suggestion Mark Custer made.</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Basically, I'm thinking of spinning up a clone of the 2.5.0 app and db and let that index, then point our PUI to that, then upgrade the app and run the db and indexer updates on the now hidden from the public production version, and then point the PUI back
to the old production system once the indexer has finished up. And, finally, remove the clone.</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Something like that seems the most foolproof in concept. And we'll be doing it during our intercession to minimize any risk.</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Joshua<br>
</div>
<div>
<div id="appendonsend"></div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="divRplyFwdMsg" dir="ltr"><font style="font-size:11pt" face="Calibri, sans-serif" color="#000000"><b>From:</b> archivesspace_users_group-bounces@lyralists.lyrasis.org <archivesspace_users_group-bounces@lyralists.lyrasis.org> on behalf of Andrew Morrison
<andrew.morrison@bodleian.ox.ac.uk><br>
<b>Sent:</b> Tuesday, April 28, 2020 9:20 AM<br>
<b>To:</b> Archivesspace Users Group <archivesspace_users_group@lyralists.lyrasis.org><br>
<b>Subject:</b> Re: [Archivesspace_Users_Group] Best Way to Reindex with PUI Live?</font>
<div> </div>
</div>
<div>
<p style="margin-top: 0px; margin-bottom: 0px;">Not knowing which version you are using I cannot be absolutely sure, but for versions released in recent years deleting the
<i>indexer_state</i> and <i>indexer_pui_state</i> subfolders inside the <i>data</i> directory will not cause downtime or missing records for PUI users (nor staff.)<br>
</p>
<p style="margin-top: 0px; margin-bottom: 0px;"><br>
</p>
<p style="margin-top: 0px; margin-bottom: 0px;">If you are re-indexing because you've made changes to config.rb it will require an application restart to put the change into effect. Delete those state folders immediately after running the restart command, and
the indexer will begin refreshing records in batches once it is back up and running. If the changes you've made affect how certain records are indexed (e.g. inherited_fields for archival_objects) then there will be some inconsistency until every record has
been overwritten in Solr's memory by the ArchivesSpace indexer. But it is unlikely any end user will notice.<br>
</p>
<p style="margin-top: 0px; margin-bottom: 0px;"><br>
</p>
<p style="margin-top: 0px; margin-bottom: 0px;">If you do decide to block user access during the re-index, you should note it is possible for the indexer to go into a loop when doing a full re-index, and never finish. But only if you've got lots of complex
records in a single repository. That is because the last step in re-indexing each repository is to send an instruction to Solr to commit all changes in memory to disk. Depending on the speed of whatever storage layer your system uses that can take longer than
5 minutes, in which case the indexer will start again from scratch. We've set AppConfig[:indexer_solr_timeout_seconds] to 1800 to give it half an hour, to avoid this.<br>
</p>
<p style="margin-top: 0px; margin-bottom: 0px;"><br>
</p>
<p style="margin-top: 0px; margin-bottom: 0px;">Andrew.</p>
<p style="margin-top: 0px; margin-bottom: 0px;"><br>
</p>
<p style="margin-top: 0px; margin-bottom: 0px;"><br>
</p>
<div class="x_moz-cite-prefix">On 27/04/2020 21:00, Joshua D. Shaw wrote:<br>
</div>
<blockquote type="cite">
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Hey Blake-</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
I usually empty the indexer states directories and the data/solr_index/index directory when I do a fresh index run, but this is the first time I've had to do a re-index while the PUI is live. Staff I can give a heads up and they typically don't work weekends
anyway. But students & faculty are a different ballgame!<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Do you inform users of the PUI that its down? Or do your stats indicate that the use on weekends is low enough not to warrant that step? I'm loathe to completely take down an online resource - especially now when Dartmouth is in the middle of its spring quarter.<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
I guess I'll try a couple of different approaches on our dev site and see which turns out to be best. If none of those work, postponing the update till early June is probably the best option for us (when classes and finals end).<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Thanks!</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Joshua<br>
</div>
<div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="x_divRplyFwdMsg" dir="ltr"><font style="font-size:11pt" face="Calibri, sans-serif" color="#000000"><b>From:</b>
<a class="x_moz-txt-link-abbreviated" href="mailto:archivesspace_users_group-bounces@lyralists.lyrasis.org">
archivesspace_users_group-bounces@lyralists.lyrasis.org</a> <a class="x_moz-txt-link-rfc2396E" href="mailto:archivesspace_users_group-bounces@lyralists.lyrasis.org">
<archivesspace_users_group-bounces@lyralists.lyrasis.org></a> on behalf of Blake Carver
<a class="x_moz-txt-link-rfc2396E" href="mailto:blake.carver@lyrasis.org"><blake.carver@lyrasis.org></a><br>
<b>Sent:</b> Monday, April 27, 2020 2:47 PM<br>
<b>To:</b> Archivesspace Users Group <a class="x_moz-txt-link-rfc2396E" href="mailto:archivesspace_users_group@lyralists.lyrasis.org">
<archivesspace_users_group@lyralists.lyrasis.org></a><br>
<b>Subject:</b> Re: [Archivesspace_Users_Group] Best Way to Reindex with PUI Live?</font>
<div> </div>
</div>
<div dir="ltr">
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<span style="font-family:Calibri,Arial,Helvetica,sans-serif; background-color:rgb(255,255,255); display:inline!important">Theoretically another way to do it is to update system_mtime on everything as well.</span><br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<a href="https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgist.github.com%2FBlake-%2F538c8d7cc7ade39efc372a3e3e190873&data=02%7C01%7Cjoshua.d.shaw%40dartmouth.edu%7C3c5b615bb3ed4ab561ae08d7eb76ee55%7C995b093648d640e5a31ebf689ec9446f%7C0%7C0%7C637236768364882877&sdata=PMVjhoP6JNV%2BBSqc%2FK3mOm2Ml2PuBoNGKo1nnqcs5u0%3D&reserved=0" originalsrc="https://gist.github.com/Blake-/538c8d7cc7ade39efc372a3e3e190873" shash="hclANNUI6O9oaDEDH7eRN03cK2s+Jm87GFlnvSB7SjQxYH7LYVblkUHNCCfJq5WcyYoVGLv5VhaQiCEWLzum1/oClyPE6z+DNl+zxEVBGprWLY2O2NkqKQYQtlI9MtIzh4aYxccxQKLVVN82SdexAsafF3AVHuWYcGXTTKtkXLk=">https://gist.github.com/Blake-/538c8d7cc7ade39efc372a3e3e190873</a><br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Someplace in the official solr docs they say the best way to do it is to wipe everything. I've found it best to empty /data/.</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
We'll usually do the full reindexes on a Friday night, most sites will have finished up by Monday.</div>
<div>
<div id="x_x_Signature">
<div id="x_x_divtagdefaultwrapper" dir="ltr" style="color:rgb(0,0,0); background-color:rgb(255,255,255)">
</div>
</div>
</div>
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="x_x_divRplyFwdMsg" dir="ltr"><font style="font-size:11pt" face="Calibri, sans-serif" color="#000000"><b>From:</b>
<a class="x_moz-txt-link-abbreviated" href="mailto:archivesspace_users_group-bounces@lyralists.lyrasis.org">
archivesspace_users_group-bounces@lyralists.lyrasis.org</a> <a class="x_moz-txt-link-rfc2396E" href="mailto:archivesspace_users_group-bounces@lyralists.lyrasis.org">
<archivesspace_users_group-bounces@lyralists.lyrasis.org></a> on behalf of Joshua D. Shaw
<a class="x_moz-txt-link-rfc2396E" href="mailto:Joshua.D.Shaw@dartmouth.edu"><Joshua.D.Shaw@dartmouth.edu></a><br>
<b>Sent:</b> Monday, April 27, 2020 1:01 PM<br>
<b>To:</b> Archivesspace Users Group <a class="x_moz-txt-link-rfc2396E" href="mailto:archivesspace_users_group@lyralists.lyrasis.org">
<archivesspace_users_group@lyralists.lyrasis.org></a><br>
<b>Subject:</b> [Archivesspace_Users_Group] Best Way to Reindex with PUI Live?</font>
<div> </div>
</div>
<div dir="ltr">
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Hi all-</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Just wondering what people have been doing when they need to do a total reindex and they have a live PUI? Our reindex takes about 4-6 hours typically and I'm looking to avoid 4-6 hours of PUI downtime if at all possible.<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
I'm planning to just wipe the indexer_state files and leave the index itself in place while the re-index occurs, but I'm wondering if there are better/alternate methods? Theoretically the PUI should still be functional while the reindex takes place if only
the indexer_state files are wiped.<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Thanks!</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Joshua<br>
</div>
<div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<div id="x_x_x_Signature">
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
___________________</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Joshua Shaw (he, him)<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Technology Coordinator</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Rauner Special Collections Library & Digital Library Technologies Group<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Dartmouth College<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
603.646.0405<br>
</div>
</div>
</div>
</div>
</div>
</div>
<br>
<fieldset class="x_mimeAttachmentHeader"></fieldset>
<pre class="x_moz-quote-pre">_______________________________________________
Archivesspace_Users_Group mailing list
<a class="x_moz-txt-link-abbreviated" href="mailto:Archivesspace_Users_Group@lyralists.lyrasis.org">Archivesspace_Users_Group@lyralists.lyrasis.org</a>
<a class="x_moz-txt-link-freetext" href="https://nam12.safelinks.protection.outlook.com/?url=http%3A%2F%2Flyralists.lyrasis.org%2Fmailman%2Flistinfo%2Farchivesspace_users_group&data=02%7C01%7Cjoshua.d.shaw%40dartmouth.edu%7C3c5b615bb3ed4ab561ae08d7eb76ee55%7C995b093648d640e5a31ebf689ec9446f%7C0%7C0%7C637236768364882877&sdata=rEdTqReJX3IyRQvVKTaOTC8IB44GIgVWZ%2FLPTBM0BK8%3D&reserved=0" originalsrc="http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group" shash="elVnrNSqvYDPcVSFTfUIsEJsUcH03F8dBsLSYkiNpYfXhAXlU+zJwFbu/PR10jTBVUejdrueYI/3xA7rv5ph6cVDr8d8EJjOKTNNA8XFD5cjo3Rv2gAkLI9oJNHnr9IeCKExAMzshhioJzJeF77tpcGBMjriDFA5mK9e5QxAQc8=">http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group</a>
</pre>
</blockquote>
</div>
</div>
</body>
</html>