[Archivesspace_Users_Group] Check for Broken URLs Report Plugin
VivianLea.Solek at Kofc.Org
Tue Jul 27 15:02:25 EDT 2021
Congrats Corey - cool plugin!
All the best,
Knights of Columbus Supreme Council Archives
1 State Street
New Haven, CT 06511-6702
Phone 203 752-4578
Fax 203 865-0351
From: archivesspace_users_group-bounces at lyralists.lyrasis.org <archivesspace_users_group-bounces at lyralists.lyrasis.org> On Behalf Of Corey Schmidt
Sent: Tuesday, July 27, 2021 2:59 PM
To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org>
Subject: [Archivesspace_Users_Group] Check for Broken URLs Report Plugin
Hello, this is Corey, ArchivesSpace PM at the University of Georgia. I hope everyone is well, healthy, and staying cool!
I'm excited to say we at UGA created our first custom plugin report for ArchivesSpace and wanted to share it with the community. The report looks for and returns broken URLs that may exist in note fields across all repositories in an ArchivesSpace instance. Those notes come from resources, archival objects, digital objects, digital object components, digital object file versions (URLs), subject scope and contents notes, and agents (person, corporate entity, family, and software). We've used it to find your standard 404 errors, but also other fun ones like 403s and malformed links.
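For anyone curious what this kind of check involves, here is a minimal Python sketch of the general idea: pull URLs out of note text, request each one, and report anything that does not come back with a 200. This is not the plugin's code (the plugin is an ArchivesSpace report, and its implementation is in the repository linked below). The function names, the URL pattern, and the (record_id, note_text) input shape are all assumptions made for illustration.

```python
import http.client
import re
import urllib.error
import urllib.request

# Hypothetical pattern: http(s) URLs up to whitespace, quotes, or angle brackets.
URL_PATTERN = re.compile(r"https?://[^\s\"'<>]+")


def extract_urls(note_text):
    """Return every http(s) URL found in a block of note text."""
    # Trailing sentence punctuation is trimmed so "…/page." does not look broken.
    return [url.rstrip(".,;)") for url in URL_PATTERN.findall(note_text)]


def fetch_status(url, timeout=10):
    """Return the HTTP status code for url, or a short error label."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # e.g. 404 or 403
    except (OSError, ValueError, http.client.HTTPException) as err:
        return f"error: {err}"  # unreachable host, timeout, or malformed link


def find_broken_urls(notes, fetch=fetch_status):
    """Yield (record_id, url, status) for every URL that does not return 200.

    notes is an iterable of (record_id, note_text) pairs; this input shape
    is an assumption, not how ArchivesSpace stores notes.
    """
    for record_id, note_text in notes:
        for url in extract_urls(note_text):
            status = fetch(url)
            if status != 200:
                yield record_id, url, status
```

Passing the fetch function as a parameter keeps the lookup step swappable, which makes the logic easy to try on sample notes without touching the network. Note that some servers reject HEAD requests, so a real check may need to fall back to GET before calling a link broken.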
You can find the code for the plugin here; just download the check_urls folder: https://github.com/uga-libraries/uga-archivesspace-reports. Info on how to install an ArchivesSpace plugin can be found here: https://archivesspace.github.io/tech-docs/customization/plugins.html.
The plugin isn't perfect: the report has to be exported in CSV format, so if you install it and test it, please set the report format to CSV. Additionally, because it's doing many lookups, expect the report to run for a long time. We have over 5000 resources across five repositories and it takes us just under an hour to complete. Lastly, there is currently no way to limit which repositories or notes are checked. Filtering results is best done in Excel by clicking on the third header row and using the Data > Filter feature. If anyone has any advice on how to limit or filter the report within ASpace, I would greatly appreciate the feedback.
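If Excel isn't handy, the same kind of filtering can be done on the exported CSV with a few lines of Python. The sketch below assumes the column headers sit on the third row of the file, with title lines above them. That layout is a guess based on the Excel tip above, and the column name in the usage example is hypothetical, so check your own export before relying on it.

```python
import csv
import io


def filter_report(csv_text, column, wanted, header_row=3):
    """Return rows (as dicts) whose `column` value equals `wanted`.

    header_row is 1-based; rows above it are treated as title lines and
    skipped. The default of 3 is an assumption about the export layout.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    header = rows[header_row - 1]
    body = rows[header_row:]
    index = header.index(column)  # raises ValueError if the column is absent
    return [
        dict(zip(header, row))
        for row in body
        if len(row) > index and row[index] == wanted
    ]
```

For example, with the report saved as report.csv, filter_report(open("report.csv", newline="").read(), "repository", "Hargrett") would keep only the rows for one repository, where "repository" and "Hargrett" stand in for whatever column name and value your export actually contains.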
A special thanks to Dallas Pillen, who helped us solve the last puzzle of exporting the data in a usable fashion, and Alicia Detelich for her awesome tutorial on how to make a custom reports plugin (https://www.youtube.com/watch?v=ruRWpOGaj1A) and general advice. For anyone else I missed, thank you for your advice and patience.
Please reach out if you have any questions or feedback on the plugin and if you find it useful.
University of Georgia Special Collections Libraries | ArchivesSpace Project Manager
706-542-8151 | Corey.Schmidt at uga.edu