[Archivesspace_Users_Group] MARC XML -->resource record batch import

Rees, John (NIH/NLM) [E] reesj at mail.nlm.nih.gov
Tue Oct 4 16:33:37 EDT 2022


Hi Sarah,

My path (pre-COVID) used MarcEdit:

  1.  Grab MARCXML via Z39.50
  2.  Use MarcEdit's MARC-EAD transform XSL (with local edits)
  3.  Import the EAD into ASpace
  4.  Massage/publish agents, subjects, etc.

We do this annually to add accession records as resources (we don't use the ASpace accession module). I used the same routine for our pre-ASpace, EAD-centric discovery software.

John

John P. Rees
Archivist and Digital Resources Manager
History of Medicine Division
National Library of Medicine
301-827-4510




From: archivesspace_users_group-bounces at lyralists.lyrasis.org <archivesspace_users_group-bounces at lyralists.lyrasis.org> On Behalf Of Newhouse, Sarah
Sent: Tuesday, October 4, 2022 12:28 PM
To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org>
Subject: [EXTERNAL] [Archivesspace_Users_Group] MARC XML -->resource record batch import

Hi all,

We're trying to figure out the fastest way to generate stub resource records in ASpace from MARC XML for about 300 collections. The MARC XML we get from our Sierra OPAC creates really messy resource records using the MARC import function baked into ASpace, so while we could do that and spend our time on clean-up after ingest, I feel like there must be a way to convert the MARC XML into clean JSON in bulk (maybe with a CSV intermediary step?) and then import via the API. Before I go too far down the "I'll just do it all myself" route, has anyone done this before and could share workflows or scripts? Or gently tell me why this won't work?
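
A minimal sketch of the MARC XML to JSON step described above, using only the Python standard library. It is an illustration under stated assumptions, not a tested workflow: the choice of 245 $a/$b for the title and controlfield 001 for the identifier is hypothetical, the helper names (subfields, stub_resource, stub_resources) are made up for this example, and the JSON keys reflect a general understanding of the ArchivesSpace resource record. The real schema requires more fields (dates, extents, language, and so on), so check the schema for your ASpace version before posting anything to the API.

```python
import json
import xml.etree.ElementTree as ET

# Standard MARCXML ("MARC 21 slim") namespace.
MARC_URI = "http://www.loc.gov/MARC21/slim"
MARC_NS = {"marc": MARC_URI}


def subfields(record, tag, codes):
    """Join the text of the wanted subfield codes from the first datafield with this tag."""
    field = record.find(f"marc:datafield[@tag='{tag}']", MARC_NS)
    if field is None:
        return ""
    parts = [
        sf.text.strip()
        for sf in field.findall("marc:subfield", MARC_NS)
        if sf.get("code") in codes and sf.text
    ]
    return " ".join(parts)


def stub_resource(record):
    """Build a bare-bones resource dict from one MARC record (assumed mapping)."""
    # Assumption: title comes from 245 $a and $b, with trailing ISBD punctuation trimmed.
    title = subfields(record, "245", "ab").rstrip(" /:,.")
    # Assumption: the 001 control number is used as the resource identifier.
    record_id = record.findtext(
        "marc:controlfield[@tag='001']", default="", namespaces=MARC_NS
    ).strip()
    return {
        "jsonmodel_type": "resource",
        "title": title,
        "id_0": record_id,
        "level": "collection",
        "publish": False,
        # Other required fields (dates, extents, language) are deliberately omitted here.
    }


def stub_resources(xml_text):
    """Return one stub dict per <record>, whether or not they sit inside a <collection>."""
    root = ET.fromstring(xml_text)
    return [stub_resource(rec) for rec in root.iter(f"{{{MARC_URI}}}record")]


# Made-up sample record, for illustration only.
SAMPLE = """<collection xmlns="http://www.loc.gov/MARC21/slim">
  <record>
    <controlfield tag="001">b1234567</controlfield>
    <datafield tag="245" ind1="1" ind2="0">
      <subfield code="a">Example family papers,</subfield>
      <subfield code="f">1900-1950.</subfield>
    </datafield>
  </record>
</collection>"""

if __name__ == "__main__":
    print(json.dumps(stub_resources(SAMPLE), indent=2))
```

Keeping the mapping in one small function makes it easy to review the generated JSON (or dump it to CSV for clean-up) before any import is attempted.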

I'm aware of this from Smith: https://github.com/smith-special-collections/a2c-tools/tree/master/mrbc_resources
and the API Playbook <https://support.atlas-sys.com/hc/en-us/articles/360052217114-The-ArchivesSpace-API-Playbook> from Atlas, the official API wiki <https://archivesspace.github.io/archivesspace/api/> and its sample scripts, and various blog posts/guides for ArchivesSnake and PyMARC.

Other suggestions?

Thank you!

__________________________________

Sarah Newhouse   (she, her, hers)
Digital Preservation Archivist
Othmer Library of Chemical History
t. +1.215.873.8249

Science History Institute
Chemistry * Engineering * Life Sciences
315 Chestnut Street * Philadelphia, PA 19106 * U.S.A.
Learn about the scientific discoveries that changed our world at sciencehistory.org/learn <https://www.sciencehistory.org/learn>
