[Archivesspace_Users_Group] Does the API endpoint for PDFs work?

Benn Joseph benn.joseph at northwestern.edu
Wed May 16 18:56:59 EDT 2018


Hi Celia,
Yes, we have a lot of records with unescaped ampersands, and unfortunately I don’t know of any tools to check for those. Our plan was to compile a list of the resource records that failed the staff PDF export, then export and validate the EAD for each of them to find the issues. That’s largely a manual process, which is why we haven’t done it yet.
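
For what it’s worth, a check for unescaped ampersands could be sketched roughly as below. This is only an illustration, not an existing tool: the function name and the regular expression are assumptions, and it treats anything shaped like a named or numeric reference (&amp;, &#38;, &#x26;) as already escaped, so it will not notice an entity name that EAD doesn’t define.

```python
import re

# An ampersand counts as "bare" when it is NOT the start of a named
# reference (&amp;), a decimal reference (&#38;) or a hex reference (&#x26;).
BARE_AMPERSAND = re.compile(r"&(?!(?:[A-Za-z][A-Za-z0-9]*|#[0-9]+|#x[0-9A-Fa-f]+);)")


def find_bare_ampersands(text):
    """Return the character offsets of ampersands that look unescaped."""
    return [match.start() for match in BARE_AMPERSAND.finditer(text)]


if __name__ == "__main__":
    # Hypothetical note text, standing in for a field pulled from a record.
    sample = "Papers of Smith & Sons, see also Jones &amp; Co."
    for offset in find_bare_ampersands(sample):
        print(f"possible unescaped ampersand at offset {offset}")
```

Run over exported EAD or over note text pulled from the database, this would only flag candidates for a person to review.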

The other option we’re exploring is to create a standalone application that will pull a resource record’s data from the ArchivesSpace tables and generate an .rtf document as a printable finding aid. This gives us more leeway to control the document’s look and feel (since it’s hard to edit that PDF document in any meaningful way) and to dictate what it includes (since the <bioghist> notes in our Agent records aren’t included in a collection’s staff-side PDF export). I can put you in touch with the staff member here who’s working on that effort if you’re interested. Ideally we wanted to be able to edit the PDF template that ArchivesSpace uses, and we still might try that, but we weren’t able to find enough documentation to feel comfortable giving it a go.

Best,
--Benn

Benn Joseph
Head of Archival Processing
Northwestern University Libraries
Northwestern University
www.library.northwestern.edu
benn.joseph at northwestern.edu
847.467.6581

From: archivesspace_users_group-bounces at lyralists.lyrasis.org <archivesspace_users_group-bounces at lyralists.lyrasis.org> On Behalf Of Celia Caust-Ellenbogen
Sent: Wednesday, May 16, 2018 10:51 AM
To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org>
Subject: Re: [Archivesspace_Users_Group] Does the API endpoint for PDFs work?

Thanks, Steve, for the tip! Validating the EAD showed me an unescaped ampersand that was causing the problem.

I'm worried there might be more culprits like this lurking in our database...does anyone know of an existing script to check for rogue unencoded special characters?

And I am still curious about the API endpoint for EAD...has anyone used this successfully? Is the problem specific to me or should I create a JIRA ticket?

On Tue, May 15, 2018 at 4:19 PM, Majewski, Steven Dennis (sdm7g) <sdm7g at virginia.edu> wrote:
Try exporting the EAD and validating.
There are still some cases where it exports malformed XML when there is mixed content in a note.
The staff PDF export first exports EAD, which then goes to the as-ead-pdf.xsl stylesheet and then to the FOP processor.
Feeding badly formed XML to the stylesheet will fail, and the error messages don’t seem to be caught.
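
A quick well-formedness check on the exported EAD can be sketched with Python’s standard library, as below. The function name is made up for illustration, and this only tests whether the XML parses; it does not validate against the EAD schema, and it is not part of ArchivesSpace itself.

```python
import xml.etree.ElementTree as ET


def check_well_formed(xml_text):
    """Return (True, None) if xml_text parses, else (False, (line, column))."""
    try:
        ET.fromstring(xml_text)
    except ET.ParseError as err:
        # err.position is a (line, column) tuple pointing at the problem.
        return False, err.position
    return True, None


if __name__ == "__main__":
    # Hypothetical fragment with a bare ampersand, like the one in this thread.
    ok, position = check_well_formed("<ead><p>Smith & Sons</p></ead>")
    if not ok:
        print(f"not well-formed at line {position[0]}, column {position[1]}")
```

The reported line and column give a place to start looking in the export before it ever reaches the stylesheet.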

— Steve.



On May 15, 2018, at 4:12 PM, Celia Caust-Ellenbogen <ccauste1 at swarthmore.edu> wrote:

This is on 2.3.2. I can print PDFs manually from the PUI but when I try to automate printing via the API, I get these blank PDFs.

I'm doing all this because I have a particular resource record that is mysteriously giving me an error message when I try to print a PDF from the PUI (this one: http://archives.tricolib.brynmawr.edu/repositories/9/resources/5678) and I am trying to understand what might be causing the problem.

Thanks in advance for any advice or insight!

On Tue, May 15, 2018 at 3:05 PM, Seth Shaw <seth.shaw at unlv.edu> wrote:
What version do you have running? PDF export via API broke with 2.2.0 but was fixed in 2.3.0.

On Tue, May 15, 2018 at 11:55 AM, Celia Caust-Ellenbogen <ccauste1 at swarthmore.edu> wrote:
Hello all,

Has anyone had luck using the API endpoint to print a PDF? I am getting blank PDFs that have appropriate bookmarks (different for each PDF I print, appropriate headings for the specific resource) but no text in the document. Am I doing this wrong or does the endpoint not work?

I noticed the API documentation says "EAD" instead of "PDF" ([:GET] /repositories/:repo_id/resource_descriptions/:id.pdf Description: Get an EAD representation of a Resource) so I suspect this feature may not be fully developed.
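
For reference, the kind of request in question could be sketched as below. This is a rough sketch under stated assumptions, not a confirmed recipe: the backend address (localhost on port 8089), the session token, and all function names are placeholders, and it assumes a session token has already been obtained by logging in and is passed in the X-ArchivesSpace-Session header. The magic-number check only confirms that the response is a PDF at all; it would not catch the blank-page problem described above.

```python
import urllib.request

# Assumed backend address; adjust for the actual installation.
BASE_URL = "http://localhost:8089"


def build_pdf_request(repo_id, resource_id, session_token, base_url=BASE_URL):
    """Build a GET request for the documented .pdf endpoint (nothing is sent)."""
    url = f"{base_url}/repositories/{repo_id}/resource_descriptions/{resource_id}.pdf"
    request = urllib.request.Request(url, method="GET")
    request.add_header("X-ArchivesSpace-Session", session_token)
    return request


def looks_like_pdf(data):
    """True if the bytes start with the PDF magic number."""
    return data.startswith(b"%PDF-")


def fetch_pdf(repo_id, resource_id, session_token, out_path):
    """Download the PDF and write it in binary mode; returns the byte count."""
    request = build_pdf_request(repo_id, resource_id, session_token)
    with urllib.request.urlopen(request) as response:
        data = response.read()
    if not looks_like_pdf(data):
        raise ValueError("response does not look like a PDF")
    with open(out_path, "wb") as handle:
        handle.write(data)
    return len(data)
```

A call such as fetch_pdf(9, 5678, token, "resource.pdf") would then save the file, where token is whatever the login step returned.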

Thanks!
Celia

--
Celia Caust-Ellenbogen
Friends Historical Library of Swarthmore College
610-328-8496
ccauste1 at swarthmore.edu
she/her/hers



_______________________________________________
Archivesspace_Users_Group mailing list
Archivesspace_Users_Group at lyralists.lyrasis.org
http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group

