[Archivesspace_Users_Group] [EXTERNAL] Re: MARC XML -->resource record batch import

Rees, John (NIH/NLM) [E] reesj at mail.nlm.nih.gov
Wed Oct 5 13:48:25 EDT 2022


Nope, all at once, though we're only doing a few dozen at most (acquisitions are slow these days). You could probably send bigger batches over as these would be small file sizes.

John


From: archivesspace_users_group-bounces at lyralists.lyrasis.org <archivesspace_users_group-bounces at lyralists.lyrasis.org> On Behalf Of Newhouse, Sarah
Sent: Tuesday, October 4, 2022 5:20 PM
To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org>
Subject: [EXTERNAL] Re: [Archivesspace_Users_Group] MARC XML -->resource record batch import

Thanks, John! That's really useful. (And honestly, shame on me for forgetting that MarcEdit exists.) Are you importing those EAD XML files one at a time?

Science History Institute
Chemistry * Engineering * Life Sciences
315 Chestnut Street * Philadelphia, PA 19106 * U.S.A.
Learn about the scientific discoveries that changed our world at sciencehistory.org/learn<https://gcc02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.sciencehistory.org%2Flearn&data=05%7C01%7Creesj%40mail.nlm.nih.gov%7C91446f8f94154e834b9f08daa64e3f5e%7C14b77578977342d58507251ca2dc2b06%7C0%7C0%7C638005152821288955%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=dUrhCfgHMYKPtCUBRfgI3KIc50ymq%2F3m2yzpZMIsuHA%3D&reserved=0>
__________________________________

Sarah Newhouse   (she, her, hers)
Digital Preservation Archivist
Othmer Library of Chemical History
t. +1.215.873.8249
________________________________
From: archivesspace_users_group-bounces at lyralists.lyrasis.org<mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org> <archivesspace_users_group-bounces at lyralists.lyrasis.org<mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org>> on behalf of Rees, John (NIH/NLM) [E] <reesj at mail.nlm.nih.gov<mailto:reesj at mail.nlm.nih.gov>>
Sent: Tuesday, October 4, 2022 4:33 PM
To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org<mailto:archivesspace_users_group at lyralists.lyrasis.org>>
Subject: Re: [Archivesspace_Users_Group] MARC XML -->resource record batch import


Hi Sarah,



My path (pre-covid) used MarcEdit

  1.  grab MARCXML via z39.50
  2.  use marcedit's MARC-EAD transform xsl (w/ local edits)
  3.  import EAD into aspace
  4.  massage/publish agents, subjects, etc.



We do this annually for adding accession records as resources (we don't use the aspace accession module). I had done this same routine for our pre-aspace EAD-centric discovery software.



John



John P. Rees

Archivist and Digital Resources Manager

History of Medicine Division

National Library of Medicine

301-827-4510









From: archivesspace_users_group-bounces at lyralists.lyrasis.org<mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org> <archivesspace_users_group-bounces at lyralists.lyrasis.org<mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org>> On Behalf Of Newhouse, Sarah
Sent: Tuesday, October 4, 2022 12:28 PM
To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org<mailto:archivesspace_users_group at lyralists.lyrasis.org>>
Subject: [EXTERNAL] [Archivesspace_Users_Group] MARC XML -->resource record batch import



Hi all,



We're trying to figure out the fastest way to generate stub resource records in ASpace from MARC XML for about 300 collections. The MARC XML we get from our Sierra OPAC creates really messy resource records using the MARC import function baked into ASpace, so while we could do that and spend our time on clean-up after ingest, I feel like there must be a way to convert the MARC XML into clean JSON in bulk (maybe with a CSV intermediary step?) and then import via the API. Before I go too far down the "I'll just do it all myself" route, has anyone done this before and could share workflows or scripts? Or gently tell me why this won't work?



I'm aware of this from Smith: https://github.com/smith-special-collections/a2c-tools/tree/master/mrbc_resources<https://gcc02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fsmith-special-collections%2Fa2c-tools%2Ftree%2Fmaster%2Fmrbc_resources&data=05%7C01%7Creesj%40mail.nlm.nih.gov%7C91446f8f94154e834b9f08daa64e3f5e%7C14b77578977342d58507251ca2dc2b06%7C0%7C0%7C638005152821288955%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=yriunAvEF1yxHzxy4T8Cn4S%2BhFWkBgPmce%2FA%2F5fmSC4%3D&reserved=0>

and the API Playbook<https://gcc02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fsupport.atlas-sys.com%2Fhc%2Fen-us%2Farticles%2F360052217114-The-ArchivesSpace-API-Playbook&data=05%7C01%7Creesj%40mail.nlm.nih.gov%7C91446f8f94154e834b9f08daa64e3f5e%7C14b77578977342d58507251ca2dc2b06%7C0%7C0%7C638005152821288955%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=tEdykf9JrlNM7QWB1oHuRdZaZGN0aU%2BVL%2Bhjp%2BalKDc%3D&reserved=0> from Atlas, the official API wiki<https://gcc02.safelinks.protection.outlook.com/?url=https%3A%2F%2Farchivesspace.github.io%2Farchivesspace%2Fapi%2F&data=05%7C01%7Creesj%40mail.nlm.nih.gov%7C91446f8f94154e834b9f08daa64e3f5e%7C14b77578977342d58507251ca2dc2b06%7C0%7C0%7C638005152821445172%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=1gJo2cuw7xqUiwSEuJrXGsIbavZ0vrmVs6Kh2R0KdVo%3D&reserved=0> and its sample scripts, and various blog posts/guides for ArchivesSnake and PyMARC.



Other suggestions?



Thank you!

__________________________________

Sarah Newhouse   (she, her, hers)
Digital Preservation Archivist
Othmer Library of Chemical History
t. +1.215.873.8249

Science History Institute
Chemistry * Engineering * Life Sciences
315 Chestnut Street * Philadelphia, PA 19106 * U.S.A.
Learn about the scientific discoveries that changed our world at sciencehistory.org/learn<https://gcc02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.sciencehistory.org%2Flearn&data=05%7C01%7Creesj%40mail.nlm.nih.gov%7C91446f8f94154e834b9f08daa64e3f5e%7C14b77578977342d58507251ca2dc2b06%7C0%7C0%7C638005152821445172%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=g%2F4FzZXKynxrygOSzWyf8qen%2F%2FkJJCWz1l1ttxSig5Q%3D&reserved=0>

CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you recognize the sender and are confident the content is safe.


CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you recognize the sender and are confident the content is safe.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lyralists.lyrasis.org/pipermail/archivesspace_users_group/attachments/20221005/e8412af4/attachment.html>


More information about the Archivesspace_Users_Group mailing list