[Archivesspace_Users_Group] Problem importing EAD file
Custer, Mark
mark.custer at yale.edu
Fri Oct 9 14:47:31 EDT 2015
Mark, all:
I should’ve also added that:
1) in the file that I attached previously, I changed those @type="Series" values to @type="Box". If that’s not what you want, then don’t use that file without changing those values back.
2) If you’re importing EAD into ArchivesSpace, you might want to check out this blog post (http://campuspress.yale.edu/yalearchivesspace/2015/07/22/validation-scenarios/), which details how you can bulk validate your files to see if they’re ready for ingest into ASpace. The validation scenario discussed there doesn’t check all of the edge cases, but it will let you know if you have, for instance, any extent statements that ASpace doesn’t like.
From: archivesspace_users_group-bounces at lyralists.lyrasis.org [mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org] On Behalf Of Custer, Mark
Sent: Friday, October 09, 2015 2:38 PM
To: mcyzyk at jhu.edu; Archivesspace Users Group
Subject: Re: [Archivesspace_Users_Group] Problem importing EAD file
Mark,
I notice three issues that you might want to investigate.
Firstly, this file is not strictly valid EAD, since the eadid/@mainagencycode and archdesc/did/unitid/@repositorycode values aren't valid according to the EAD 2002 schema. This shouldn't cause any import problems, though, since ArchivesSpace doesn't check whether files are valid EAD. So, if this file is just being used to transport the data, there’s really no need to fix anything here.
Secondly, the one thing that will cause an import problem is this type of encoding for the extent:
<physdesc label="Physical Description"><extent encodinganalog="300" type="Photographs">15000.00</extent></physdesc>
(which is perfectly fine EAD, but not something that ASpace will accept)
To fix that, just change the text value 15000.00 to "15000 photographs".
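If there are many such extents, the same edit could be scripted. Here is a minimal Python sketch of that idea. It is not how ASpace itself handles extents, the function name fix_extents is hypothetical, and it assumes that every affected <extent> holds a bare number and carries a @type attribute that is suitable as the extent type:

```python
import re
import xml.etree.ElementTree as ET


def local_name(element):
    """Return the tag name without any namespace prefix."""
    return element.tag.split("}")[-1] if isinstance(element.tag, str) else ""


def fix_extents(root):
    """Rewrite bare-number extents as "<number> <type>".

    Assumption (not from ASpace documentation): the @type attribute
    names the unit, e.g. type="Photographs" -> "photographs".
    Returns the number of <extent> elements that were changed.
    """
    changed = 0
    for element in root.iter():
        if local_name(element) != "extent":
            continue
        text = (element.text or "").strip()
        unit = element.get("type")
        # Only touch extents whose entire text is a plain number.
        if unit and re.fullmatch(r"\d+(\.\d+)?", text):
            number = float(text)
            # Drop a trailing ".00" so 15000.00 becomes 15000.
            number_text = str(int(number)) if number.is_integer() else text
            element.text = f"{number_text} {unit.lower()}"
            changed += 1
    return changed


sample = (
    '<physdesc label="Physical Description">'
    '<extent encodinganalog="300" type="Photographs">15000.00</extent>'
    "</physdesc>"
)
root = ET.fromstring(sample)
fix_extents(root)
print(root.find("extent").text)
```

Extents that already contain text other than a bare number are left alone, so it is worth reviewing the result before importing.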
Thirdly, there are a lot of container groups that have values like this:
<container type="Series">2</container>
<container type="Folder">105</container>
I'm assuming that the first container should actually be type="Box" instead of "Series"? Either way, given how the current EAD importer works, those container elements will be imported as separate instances (probably not what you want) unless they have matching @id and @parent attributes. To add the @id and @parent attributes, you can use a style sheet like this one: https://github.com/YaleArchivesSpace/xslt-files/blob/master/EAD_add_IDs_to_containers.xsl
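For anyone who prefers a script to XSLT, the same idea can be sketched in Python. This is not a port of the style sheet linked above. The function name link_containers and the "cid" prefix are hypothetical, and the sketch assumes that all <container> elements inside one <did> belong together, with the first one acting as the parent:

```python
import xml.etree.ElementTree as ET
from itertools import count


def local_name(element):
    """Return the tag name without any namespace prefix."""
    return element.tag.split("}")[-1] if isinstance(element.tag, str) else ""


def link_containers(root, prefix="cid"):
    """Give containers in each <did> an @id, and point later ones at the first.

    Assumptions (illustrative only): every <container> directly inside a
    <did> is part of the same group, and generated ids such as "cid1" do
    not collide with ids already present in the file.
    """
    counter = count(1)
    for did in root.iter():
        if local_name(did) != "did":
            continue
        containers = [child for child in did if local_name(child) == "container"]
        if not containers:
            continue
        first = containers[0]
        if first.get("id") is None:
            first.set("id", f"{prefix}{next(counter)}")
        for later in containers[1:]:
            if later.get("id") is None:
                later.set("id", f"{prefix}{next(counter)}")
            # Link this container (e.g. the folder) to the first one (e.g. the box).
            later.set("parent", first.get("id"))


sample = (
    "<did>"
    '<container type="Box">2</container>'
    '<container type="Folder">105</container>'
    "</did>"
)
did = ET.fromstring(sample)
link_containers(did)
print(ET.tostring(did, encoding="unicode"))
```

After running it, the Folder container carries a @parent that matches the Box container's @id, which is the pairing described above.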
I've made adjustments for all 3 issues and then was able to import the file just fine. I've also attached the EAD that I used.
I hope that helps,
Mark
-----Original Message-----
From: archivesspace_users_group-bounces at lyralists.lyrasis.org [mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org] On Behalf Of Mark Cyzyk
Sent: Friday, October 09, 2015 1:14 PM
To: archivesspace_users_group at lyralists.lyrasis.org
Subject: [Archivesspace_Users_Group] Problem importing EAD file
All,
I am having a vexing issue: I have exported an EAD file out of an old instance of Archon and am trying to get it to import into the very latest instance of ArchivesSpace. I've gone back and forth a bit, fixing things on the Archon side that were required on the ArchivesSpace side. I have, however, run into one last ArchivesSpace error that I can't figure out how to solve. I have attached my Archon-exported EAD file to this message, and I've pasted the ArchivesSpace error log below.
ArchivesSpace is indicating that there is something wrong with our Extent field, but I just don't see what it could be.
Advice appreciated!
Mark
--
<><><><><><><><><><><><><><><><><><><><><><><><><><><><><><><><><><>
Mark Cyzyk, M.A., M.L.S.
Scholarly Communication Architect
User Interface Applications Group
The Sheridan Libraries
The Johns Hopkins University
mcyzyk at jhu.edu
Verba volant, scripta manent.
ArchivesSpace Error Log:
==================================================
photographic_collection.xml
==================================================
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
IMPORT ERROR
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
The following errors were found:
extents : At least 1 item(s) is required
For JSONModel(:resource):
#<JSONModel(:resource) {"jsonmodel_type"=>"resource", "external_ids"=>[], "subjects"=>[], "linked_events"=>[], "extents"=>[], "dates"=>[{"jsonmodel_type"=>"date", "uri"=>nil, "import_context"=>"<unitdate class=\"cdata\" encodinganalog=\"245$f\"
label=\"Dates\" normal=\"1925/1984\" type=\"inclusive\"> ...
</unitdate>", "date_type"=>"inclusive", "expression"=>"1925-1984", "label"=>"creation", "begin"=>"1925", "end"=>"1984"}], "external_documents"=>[], "rights_statements"=>[], "linked_agents"=>[], "restrictions"=>false, "revision_statements"=>[], "instances"=>[], "deaccessions"=>[], "related_accessions"=>[], "classifications"=>[], "notes"=>[{"jsonmodel_type"=>"note_singlepart", "content"=>["<extent encodinganalog=\"300\" type=\"Photographs\">15000.00</extent>"],
"uri"=>nil, "import_context"=>"<physdesc class=\"cdata\"
label=\"Physical Description\"> ... </physdesc>", "type"=>"physdesc"}], "uri"=>"/repositories/import/resources/import_7f8679e3-a736-4187-bfbb-61a253bbf602",
"ead_id"=>"02/II/II", "finding_aid_title"=>"Guide to the Photographic Collection - BCH 1925- June 30, 1984", "finding_aid_filing_title"=>"Photographic Collection", "level"=>"collection", "publish"=>true, "title"=>"Photographic Collection - BCH 1925- June 30, 1984", "id_0"=>"02/II/II"}>
In :
<ead audience="external" class="cdata" schemaLocation="http://www.loc.gov/ead/
http://www.loc.gov/ead/ead.xsd"
xmlns="urn:isbn:1-931666-22-9"
xmlns:xlink="http://www.w3.org/1999/xlink"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"> ...
</ead>