[Archivesspace_Users_Group] batch delete dates saga continues

Lora Woodford lwoodford at jhu.edu
Mon Nov 13 10:39:29 EST 2017


Hey and thanks for the plug, Patrick!

My co-workshop lead, Valerie Addonizio, and I tried to make the resources we put up in that GitHub stand on their own without the in-person part of the workshop.  It was really important to us that everyone – not just those able to get to and pay for a 1-day workshop – be able to tinker about.  As Patrick has said, tinkering is the BEST way to learn.  It’s certainly how I did.  If anyone wants assistance in working through these workshop materials, definitely give a shout.  Like all of the rest of you, I have a super busy day job, but I’m happy to help as time allows!

Regarding the time to do this, everyone’s situation is going to be different, but if your institution supports the idea of professional development time/personal goals, it can be a good idea to talk about this with supervisors and make it an articulated part of your job.  Last year my “personal goal” was to increase my proficiency with Rails, so I tried to set aside an hour or so every couple weeks to dedicate myself to that, in the same way that I try to stay on top of the literature in ours and allied fields. Your learning style may vary, but I find it is best to just set aside a little bit of time over a long period, rather than trying to “cram” everything at once. Approach it as you would approach learning a foreign language. You aren’t going to do it in one weekend, one month, or, even, one year. It will grow naturally over time as you expose yourself to it. There are SO MANY wonderful resources out there (most for free), and you’ve got a community of folks who can help connect you to them. I won’t go too much into specifics as my answers will vary depending on what you want to do/what your specific setup looks like, but Code Academy, Lynda.com, the Hitchhiker’s Guide to Python (http://docs.python-guide.org/en/latest/), and some of the “better” coding boot camps (https://www.switchup.org/research/best-coding-bootcamps) offer free online courses (but, be forewarned, if you sign up with one of them they WILL hound you to apply to their programs and actively try and recruit you to their school ;-) ).

Good luck, all,

Lora

--
Lora J. Woodford
The Sheridan Libraries
Johns Hopkins University
3400 North Charles Street
Baltimore, MD     21228
(410) 516-5898
lwoodford at jhu.edu<mailto:lwoodford at jhu.edu>

From: <archivesspace_users_group-bounces at lyralists.lyrasis.org> on behalf of "Galligan, Patrick" <PGalligan at rockarch.org>
Reply-To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org>
Date: Monday, November 13, 2017 at 9:52 AM
To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org>
Subject: Re: [Archivesspace_Users_Group] batch delete dates saga continues

Hey all,

I think a great place to get started learning about Python, APIs, and ArchivesSpace would be some of the API tutorials that members of the ArchivesSpace community have taught.

Lora Woodford from JHU taught a workshop about 4 months ago and made the materials available on Github: https://github.com/jhu-archives-and-manuscripts/ASpace_API_Workshop.

Personally, I’ve found that I learn best by doing. So I learned what I did about the AS JSON responses by downloading Postman (https://www.getpostman.com/) and performing a few API calls against our development database. Also, back in 2014, Maureen Callahan and a few other archivists ran a blog about dealing with legacy data. They have some posts about AS API work. I personally liked this post as a beginner: https://icantiemyownshoes.wordpress.com/2014/08/01/a-very-brief-guide-to-deleting-records-with-the-archivesspace-api-from-a-total-tyro/.

This is a great resource for learning python: https://automatetheboringstuff.com/.

I’m sure there are a ton of other resources out there, but those are the few off the top of my head. I’m willing to jump in and give some more recommendations as needed.

-Patrick Galligan

From: archivesspace_users_group-bounces at lyralists.lyrasis.org [mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org] On Behalf Of Margaret Kidd
Sent: Friday, November 10, 2017 5:31 PM
To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org>
Subject: Re: [Archivesspace_Users_Group] batch delete dates saga continues

I second that last part of Lydia's remarks about where to begin learning more about XML, JSON, python, ruby, etc. and applying it to ArchivesSpace. I want to learn, but it is overwhelming to know where to begin and my time to devote to it is rather limited. Every time I start trying to teach myself I have other work priorities that take up all my time and soon forget whatever progress I have made.

Thanks,

Margaret


________________________________

Margaret T. Kidd

Project Archivist, Special Collections & Archives

VCU Libraries | Tompkins-McCaw Library for the Health Sciences

509 N. 12th Street / Box 980582, Richmond, VA 23298-0582

(804) 828-3152
[mage removed by sender. em_twitter.png]<https://twitter.com/VCUTMLibrary>[mage removed by sender. em_fb.png]<https://www.facebook.com/VCUTMLib>


[mage removed by sender.]<http://www.vcu.edu/>     [mage removed by sender.] <http://vaheritage.org>


On Fri, Nov 10, 2017 at 12:02 PM, Tang, Lydia <ltang5 at lib.msu.edu<mailto:ltang5 at lib.msu.edu>> wrote:
Mark,
Thank you for identifying my problem!  Just for the sake of technological dummies like me, what should I do?  Everything there (besides removing the <unitid> tags is exactly as it spat out of ArchivesSpace, so I wonder if the export allowed the invalid characters?  I also wonder if the importing process could be improved within the ArchivesSpace code to search for “aspace_" and not batch add it as well as recognize (or not export out in the first place) the "Linear Feet" / linear_feet controlled vocabulary?  Ideally, after establishing with the database that linear_feet should publish as Linear Feet, I wish it would continue to recognize that rule when new stuff is imported in.
I was also meaning poll people on “how do I even get started” with learning more about working the ArchivesSpace guts?  I can understand EAD but obviously don’t know the wizardry possible with XML, there’s JSON, python, etc, and I would be interested in starting courses with Code Academy to learn, but I don’t even know where to begin.  Advice appreciated!
Lydia

From: <archivesspace_users_group-bounces at lyralists.lyrasis.org<mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org>> on behalf of "Custer, Mark" <mark.custer at yale.edu<mailto:mark.custer at yale.edu>>
Reply-To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org<mailto:archivesspace_users_group at lyralists.lyrasis.org>>
Date: Friday, November 10, 2017 at 11:50 AM
To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org<mailto:archivesspace_users_group at lyralists.lyrasis.org>>
Subject: Re: [Archivesspace_Users_Group] batch delete dates saga continues

Lydia,

In the EAD files that you attached, both have invalid XML characters in them (Unicode: 0x14).  Those are easy to remove before re-importing in an XML editor like oXygen, but I'm curious how they got into ASpace in the first place?   In any event, it's possible that that's what's blocking your imports this time around, and if that's the issue, if you just fix on those issues, then the ASpace importer won't tell you about the next issue until it runs again.

In any event, I'd also suggest making  the following changes to your XML file before re-importing, so perhaps the snag is a good thing for now 😊 :

·         ASpace adds "aspace_" to all of the @id values in the EAD file upon export.  If you don't remove those before reimporting, then on the next export you'll get "aspace_aspace_".  Removing them will invalidate the EAD file, but ASpace doesn't care whether the file is valid or not upon re-importing it.
·         ASpace expects to have the database values for the controlled value terms in the exports, not the translation values.  So, if you don't change things like "Linear Feet" to linear_feet and "Mixed Materials" to mixed_materials, then you'll wind up with new database values in ASpace after the import.  Those of course can be merged after the fact, but that's another step, and it would be unfortunate to have to do that on every occasion.

There's a lot more to say on the practice of re-importing EAD into ASpace, but I haven't still come up with a great strategy for that, so basically we try to avoid it 😊

Mark


-----Original Message-----
From: archivesspace_users_group-bounces at lyralists.lyrasis.org<mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org> [mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org<mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org>] On Behalf Of Tang, Lydia
Sent: Friday, 10 November, 2017 10:47 AM
To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org<mailto:archivesspace_users_group at lyralists.lyrasis.org>>
Subject: Re: [Archivesspace_Users_Group] batch delete dates saga continues

Wizardry!  Thank you, Olivia!  I would have NEVER figured that out.  It seemed to take out the <unitdate> tags perfectly but I ended up getting snagged on importing because of, um, a seemingly benign line which was marked as not well-formed.  Oof, so close, but snagged up!
Lydia

From: <archivesspace_users_group-bounces at lyralists.lyrasis.org<mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org>> on behalf of Olivia S Solis <livsolis at utexas.edu<mailto:livsolis at utexas.edu>>
Reply-To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org<mailto:archivesspace_users_group at lyralists.lyrasis.org>>
Date: Friday, November 10, 2017 at 10:36 AM
To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org<mailto:archivesspace_users_group at lyralists.lyrasis.org>>
Subject: Re: [Archivesspace_Users_Group] batch delete dates saga continues

<unitda[^>]+>[^<]+<\/unitdate>
_______________________________________________
Archivesspace_Users_Group mailing list
Archivesspace_Users_Group at lyralists.lyrasis.org<mailto:Archivesspace_Users_Group at lyralists.lyrasis.org>
http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lyralists.lyrasis.org/pipermail/archivesspace_users_group/attachments/20171113/d6d89c3b/attachment.html>


More information about the Archivesspace_Users_Group mailing list