[Archivesspace_Users_Group] Invalid EAD export

Wendler, Robin King robin_wendler at harvard.edu
Tue Sep 15 08:47:27 EDT 2015


Hi, both Chrises,
We have a project now to prep for importing ~6K EADs into ASpace, and are running into the same issue. In my case, the importer removes all the <p>s from a long, complex bioghist where the initial <p> is immediately followed by <persname>. Once we add them back in, the export is correct, but that is not practical at this scale.

As part of our project, we're identifying what we can fix in preprocessing and what needs to be fixed in the importer itself. In this case, preprocessing won't do it.  Chris F., is there a reason this can't be fixed in the importer?

Thanks,
Robin

Robin Wendler
Library Technology Services
Harvard University
90 Mt. Auburn St.
Cambridge, MA 02138
617-495-3724
r_wendler at harvard.edu



From: archivesspace_users_group-bounces at lyralists.lyrasis.org On Behalf Of Chris Fitzpatrick
Sent: Tuesday, September 15, 2015 3:11 AM
To: archivesspace_users_group
Subject: Re: [Archivesspace_Users_Group] Invalid EAD export


Hi Chris,



Yes, the <p> is one of the more unfortunate aspects of EAD.

For this use case (where you start the note with markup), you have to add your own <p> tags to wrap the note.

b,chris.




Chris Fitzpatrick | Developer, ArchivesSpace
Skype: chrisfitzpat  | Phone: 918.236.6048
http://archivesspace.org/

________________________________
From: archivesspace_users_group-bounces at lyralists.lyrasis.org on behalf of Chris Powell <sooty at umich.edu>
Sent: Monday, September 14, 2015 5:42 PM
To: archivesspace_users_group
Subject: Re: [Archivesspace_Users_Group] Invalid EAD export

Please disregard the hyphen in the example EAD import! The hazards of cutting and pasting out of Internet Explorer.

<bioghist encodinganalog="545"><p><persname>Francis Steiner</persname>was born January 16, 1895 in New Jersey, to German parents. He was the oldest of three children.</p><p>A communist and conscientious objector [etc.] </p><p>There is no information regarding Francis Steiner after his last letter of November 7, 1920. </p></bioghist>

On Mon, Sep 14, 2015 at 11:15 AM, Chris Powell <sooty at umich.edu> wrote:
Hello --

It appears that if the first word or phrase in any of the "notes" elements that contain a text block and support mixed content (such as abstract, bioghist, or scopecontent) is wrapped in a persname, the EAD export is invalid because all of the paragraphs lose their p element wrappers.

I've tested this with other elements starting the first paragraph, and with persname starting the second paragraph, and neither causes problems; only a persname at the start of the first paragraph does.

Example bioghist EAD prior to import:

<bioghist encodinganalog="545">-<p><persname>Francis Steiner</persname>was born January 16, 1895 in New Jersey, to German parents. He was the oldest of three children.</p><p>A communist and conscientious objector [etc.] </p><p>There is no information regarding Francis Steiner after his last letter of November 7, 1920. </p></bioghist>
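To gauge how many source files contain this pattern before import, something like the following Python sketch might help. It is only an illustration, not part of ArchivesSpace: the function name `notes_at_risk`, the `NOTE_TAGS` list, and the choice to look only at the first <p> child of each note are all assumptions based on the behavior described above.

```python
import xml.etree.ElementTree as ET

# Assumed list of note elements to scan; extend it to suit your own EADs.
NOTE_TAGS = {"bioghist", "scopecontent"}


def local(tag):
    """Strip an XML namespace prefix, if present, from a tag name."""
    return tag.rsplit("}", 1)[-1]


def notes_at_risk(xml_text, trigger="persname"):
    """Return names of notes whose first <p> opens directly with `trigger`."""
    root = ET.fromstring(xml_text)
    hits = []
    for note in root.iter():
        if local(note.tag) not in NOTE_TAGS:
            continue
        # First <p> that is a direct child of the note, if there is one.
        first_p = next((c for c in note if local(c.tag) == "p"), None)
        if first_p is None or len(first_p) == 0:
            continue
        # Text that appears before the first child element of the <p>.
        leading_text = (first_p.text or "").strip()
        if not leading_text and local(first_p[0].tag) == trigger:
            hits.append(local(note.tag))
    return hits
```

Calling `notes_at_risk()` on the text of each EAD file and counting the non-empty results would give a rough idea of how many files are affected.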

Example bioghist EAD after export:

<bioghist id="aspace_6e16003b2d18f8ad6c487cd5712fc162"><head>Biographical / Historical</head><persname>Francis Steiner</persname>was born January 16, 1895 in New Jersey, to German parents. He was the oldest of three children. A communist and conscientious objector [etc.] There is no information regarding Francis Steiner after his last letter of November 7, 1920. </bioghist>
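An export like the one above could be caught automatically with a check along these lines. This is a rough sketch under stated assumptions rather than a schema validation: `notes_missing_p` and `NOTE_TAGS` are hypothetical names, and abstract is deliberately left out of the list because it normally holds text directly rather than <p> elements.

```python
import xml.etree.ElementTree as ET

# Assumed list of notes that should keep their text inside <p> elements.
NOTE_TAGS = {"bioghist", "scopecontent"}


def local(tag):
    """Strip an XML namespace prefix, if present, from a tag name."""
    return tag.rsplit("}", 1)[-1]


def notes_missing_p(xml_text):
    """Return names of notes that have content sitting outside any <p>."""
    root = ET.fromstring(xml_text)
    flagged = []
    for el in root.iter():
        if local(el.tag) not in NOTE_TAGS:
            continue
        # Text directly under the note, or trailing one of its children.
        loose_text = bool((el.text or "").strip()) or any(
            (child.tail or "").strip() for child in el
        )
        # A <persname> that is a direct child of the note itself.
        loose_inline = any(local(child.tag) == "persname" for child in el)
        if loose_text or loose_inline:
            flagged.append(local(el.tag))
    return flagged
```

Running this over each exported file would flag the notes to inspect; a full validation against the EAD schema remains the more reliable test.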


Chris Powell
University of Michigan
Digital Library Production Service
