[Archivesspace_Users_Group] Data cleanup

Olivia S Solis livsolis at utexas.edu
Tue Feb 13 09:25:56 EST 2018

Hi there,

Yes, it is definitely possible to use OpenRefine to migrate data into the
system. It's my primary tool for our data migration. One of the nice things
about OpenRefine is its Templating export option. I was inspired by an
extremely helpful University of Maryland Chaos to Order post:

That is how we've been creating records, and we have developed a number of
templates to export the JSON. There are a few differences in our process.
For instance, the bash script didn't like spaces in content, so we added a

and all of my templates end in a pipe.
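
To make the template step concrete, here is a minimal sketch of reading such
an export back in. It assumes the trailing pipe acts as a record delimiter,
so each chunk between pipes is one JSON record. The function name
split_records and the sample field values are hypothetical, not taken from
our actual templates.

```python
import json
from typing import Dict, List


def split_records(exported: str, delimiter: str = "|") -> List[Dict]:
    """Split a delimited Templating export into one dict per record.

    Assumption: the delimiter never occurs inside the record content.
    A pipe inside a title or note would break this naive split.
    """
    records = []
    for chunk in exported.split(delimiter):
        chunk = chunk.strip()
        if chunk:  # the trailing delimiter leaves an empty final chunk
            records.append(json.loads(chunk))
    return records


# Hypothetical export text: two records, each ending in a pipe.
sample = (
    '{"title": "Smith Family Papers", "id_0": "2018-001"}|'
    '{"title": "Jones Letters", "id_0": "2018-002"}|'
)
records = split_records(sample)
print(len(records), records[0]["title"])
```

Parsing each chunk with json.loads also acts as a quick validity check before
anything is sent to the system.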

To update records, we were inspired by a Duke Python script:
as described in the blog post above.
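
For readers who have not seen this kind of update, below is a rough sketch of
the usual fetch, modify, and post-back cycle against the ArchivesSpace backend
API. It is not the Duke script. The helper names (login, call, apply_changes,
update_record) and the sample record are hypothetical; the endpoints and the
X-ArchivesSpace-Session header are the standard API ones.

```python
import json
import urllib.parse
import urllib.request


def login(base_url, username, password):
    """Return a session token from POST /users/:username/login."""
    url = f"{base_url}/users/{urllib.parse.quote(username)}/login"
    data = urllib.parse.urlencode({"password": password}).encode()
    with urllib.request.urlopen(urllib.request.Request(url, data=data)) as resp:
        return json.load(resp)["session"]


def call(base_url, uri, session, payload=None):
    """GET the record at `uri`, or POST `payload` to it when one is given."""
    headers = {"X-ArchivesSpace-Session": session}
    data = None
    if payload is not None:
        data = json.dumps(payload).encode()
        headers["Content-Type"] = "application/json"
    req = urllib.request.Request(base_url + uri, data=data, headers=headers)
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def apply_changes(record, changes):
    """Return a copy of `record` with top-level fields from `changes` overwritten."""
    updated = dict(record)
    updated.update(changes)
    return updated


def update_record(base_url, session, uri, changes):
    """Fetch the current record, apply the changes, and post it back."""
    current = call(base_url, uri, session)  # carries the current lock_version
    return call(base_url, uri, session, apply_changes(current, changes))


# Offline illustration with a made-up record; no server is contacted here.
record = {"uri": "/repositories/2/resources/1", "title": "Old title", "lock_version": 3}
updated = apply_changes(record, {"title": "New title"})
print(updated["title"], updated["lock_version"])
```

Fetching the full record first matters because the API expects the whole
record, including its current lock_version, rather than only the changed
fields.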

So for instance, we migrated our EAD by using BaseX to extract elements. I
created minimal resource records and incrementally added notes.
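
As an illustration of the "extract, then add notes" idea, here is a small
sketch that uses Python's ElementTree as a stand-in for BaseX (we did the
extraction with BaseX, not with this code). It assumes EAD 2002 with its
standard namespace; the function names and the sample EAD are hypothetical.

```python
import xml.etree.ElementTree as ET

EAD_NS = {"ead": "urn:isbn:1-931666-22-9"}  # EAD 2002 namespace


def extract_paragraphs(ead_xml, element):
    """Return whitespace-normalized <p> texts from archdesc/<element>."""
    root = ET.fromstring(ead_xml)
    node = root.find(f".//ead:archdesc/ead:{element}", EAD_NS)
    if node is None:
        return []
    return [
        " ".join("".join(p.itertext()).split())
        for p in node.findall("ead:p", EAD_NS)
    ]


def make_note(note_type, paragraphs):
    """Build an ArchivesSpace multipart note holding a single text subnote."""
    return {
        "jsonmodel_type": "note_multipart",
        "type": note_type,
        "publish": True,
        "subnotes": [
            {
                "jsonmodel_type": "note_text",
                "content": "\n\n".join(paragraphs),
                "publish": True,
            }
        ],
    }


def add_note(resource, note):
    """Append a note to a resource record (modifies the dict in place)."""
    resource.setdefault("notes", []).append(note)
    return resource


# Made-up EAD fragment and a made-up minimal resource record.
sample_ead = """<ead xmlns="urn:isbn:1-931666-22-9">
  <archdesc level="collection">
    <scopecontent><p>Letters and   diaries, 1850-1900.</p></scopecontent>
  </archdesc>
</ead>"""
resource = {"title": "Smith Family Papers", "notes": []}
paragraphs = extract_paragraphs(sample_ead, "scopecontent")
add_note(resource, make_note("scopecontent", paragraphs))
print(resource["notes"][0]["subnotes"][0]["content"])
```

Adding one note type at a time keeps each pass small, so a problem with, say,
the scope and content notes does not block the rest of the record.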

The process is somewhat laid out in the slides I made for a presentation:

Hopefully, this helps.


On Mon, Feb 12, 2018 at 9:45 AM, Joan Curbow <CurbowJ at bvu.edu> wrote:

> We are a new archives, so we have not had to import any existing data.
> Lucky us, right? But I’m all-too-human, and I’d now like to do some data
> cleanup. Is it possible to do data cleanup using OpenRefine? Theoretically,
> it seems possible to export “stuff” and pull it into OpenRefine, but I’ve
> only ever used OpenRefine in a classroom situation, where the data was
> already populated for us. We did not import the data back into an existing
> database, either, so my experience is limited to just the mechanics of
> OpenRefine. Has anyone used OpenRefine in a real-world situation with data
> that’s already in Aspace? Or is there a better method for data cleanup?
> A further question/complication is that I’m a lone arranger, and my
> instance is hosted by Libraryhost, so any data cleanup may have to be done
> by them? My tech skills are rudimentary, so I’m not clear just how I could
> get this to work. I asked them once, but didn’t get a real answer.
> Sincerely,
> *Joan Curbow*
> Reference Librarian and Archivist
> Buena Vista University Library
> Buena Vista University
> 610 West Fourth Street
> Storm Lake, Iowa  50588
> 712.749.2094
> www.library.bvu.edu
> _______________________________________________
> Archivesspace_Users_Group mailing list
> Archivesspace_Users_Group at lyralists.lyrasis.org
> http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group

Olivia Solis, MSIS
Metadata Coordinator
Dolph Briscoe Center for American History
The University of Texas at Austin
2300 Red River St. Stop D1100
Austin TX, 78712-1426
(512) 232-8013