[Archivesspace_Users_Group] Ingesting nested <list> elements

Noah Huffman noah.huffman at duke.edu
Mon Feb 29 16:11:53 EST 2016


Hi Johanna,

We had a similar problem at Duke with nested lists in <controlaccess>.  We decided just to flatten the hierarchy to get the data to import.

If you're not concerned about preserving the hierarchy (indentation?) of your items and just want the data to import, I've attached some XSLT that will copy an entire EAD as is, but "un-nest" lists within <arrangement>.

It should output all <item> elements at the same level no matter how deeply they are nested.

Here's the end result for your example:

<arrangement>
         <list>
            <item>Series I. Biographical and personal</item>
            <item>Series II. Gwynne family</item>
            <item>Series III. Correspondence</item>
            <item>Subseries A. Personal</item>
            <item>Subseries B. Professional</item>
            <item>Series IV. Writings</item>
            <item>Subseries A. Articles</item>
            <item>Subseries B. Monographs </item>
            <item>Subseries C. Pamphlets</item>
            <item>Series V. Elizabeth David Ltd.</item>
            <item>Series VI. Recipes and research</item>
            <item>Series VII. Artemis Cooper and Jill Norman</item>
            <item>Series VIII. Photographs and oversized</item>
            <item>Subseries A. Photographs </item>
            <item>Subseries B. Oversized</item>
         </list>
      </arrangement>

If you have nested lists outside of arrangement, you'd need to modify the script to account for that.  I haven't tested this extensively, so disclaimers as usual.

-Noah

================
Noah Huffman
Archivist for Metadata, Systems, and Digital Records
David M. Rubenstein Rare Book & Manuscript Library
Duke University | 919-660-5982
http://library.duke.edu/rubenstein/


From: archivesspace_users_group-bounces at lyralists.lyrasis.org [mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org] On Behalf Of Carll, Johanna
Sent: Monday, February 29, 2016 12:16 PM
To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org>
Subject: [Archivesspace_Users_Group] Ingesting nested <list> elements

We have over 5,000 <list> elements in our finding aids, many of them containing nested <list>s.  Since the AS data model only allows for simple ordered lists, we are finding that while our <list> elements with nested <list>s are ingesting, much of the data in the element is missing.  For example, when attempting to ingest the below <list> found within an <arrangement> tag, only the <item> elements in the final nested <list> ingest


<list><file:///C:\Users\jac811\Desktop\sch01172.xml><item>Series I. Biographical and personal</item><item>Series II. Gwynne family</item><item><file:///C:\Users\jac811\Desktop\sch01172.xml> Series III. Correspondence<list><file:///C:\Users\jac811\Desktop\sch01172.xml><item>Subseries A. Personal</item><item>Subseries B. Professional</item></list></item><item><file:///C:\Users\jac811\Desktop\sch01172.xml> Series IV. Writings<list><file:///C:\Users\jac811\Desktop\sch01172.xml><item>Subseries A. Articles</item><item>Subseries B. Monographs </item><item>Subseries C. Pamphlets</item></list></item><item> Series V. Elizabeth David Ltd.</item><item> Series VI. Recipes and research</item><item> Series VII. Artemis Cooper and Jill Norman</item><item><file:///C:\Users\jac811\Desktop\sch01172.xml> Series VIII. Photographs and oversized<list><item>Subseries A. Photographs </item><item>Subseries B. Oversized</item></list></item></list></arrangement>
Has anyone else encountered this issue?  If so, how have you addressed the problem?

Thanks

Johanna Carll
Archivist and Metadata Specialist
Schlesinger Library
Radcliffe Institute for Advanced Study
Harvard University
10 Garden Street
Cambridge, MA 02138
617-495-8524
jcarll at radcliffe.harvard.edu<mailto:jcarll at radcliffe.harvard.edu>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lyralists.lyrasis.org/pipermail/archivesspace_users_group/attachments/20160229/17864c78/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ead_list_denester.xsl
Type: text/xml
Size: 1449 bytes
Desc: ead_list_denester.xsl
URL: <http://lyralists.lyrasis.org/pipermail/archivesspace_users_group/attachments/20160229/17864c78/attachment.xml>


More information about the Archivesspace_Users_Group mailing list