[Archivesspace_Users_Group] Staff PDF Gen Error

Busch, Ed buschedw at msu.edu
Mon Aug 14 15:00:44 EDT 2023


Darn, that was a good catch but changed the error. ☹


Generating PDF for Helen Strow collection
org.xml.sax.SAXParseException; lineNumber: 64; columnNumber: 76417; The content of elements must consist of well-formed character data or markup.
net.sf.saxon.s9api.DocumentBuilder.build(net/sf/saxon/s9api/DocumentBuilder.java:360)
java.lang.reflect.Method.invoke(java/lang/reflect/Method.java:498)
org.jruby.javasupport.JavaMethod.invokeDirectWithExceptionHandling(org/jruby/javasupport/JavaMethod.java:456)
org.jruby.javasupport.JavaMethod.invokeDirect(org/jruby/javasupport/JavaMethod.java:317)
RUBY.build(/archivesspace/gems/gems/saxon-rb-0.8.3-java/lib/saxon/document_builder.rb:225)
RUBY.to_fo(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/AS_fop.rb:42)
RUBY.to_pdf(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/AS_fop.rb:58)
RUBY.run(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/job_runners/print_to_pdf_runner.rb:43)
archivesspace.data.tmp.jetty_minus_0_0_0_0_minus_8089_minus_backend_war_minus___minus_any_minus_2950795293246301533.webapp.WEB_minus_INF.app.lib.request_context.open(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/request_context.rb:24)
RUBY.run(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/job_runners/print_to_pdf_runner.rb:11)
archivesspace.data.tmp.jetty_minus_0_0_0_0_minus_8089_minus_backend_war_minus___minus_any_minus_2950795293246301533.webapp.WEB_minus_INF.app.lib.background_job_queue.invokeOther31:run(archivesspace/data/tmp/jetty_minus_0_0_0_0_minus_8089_minus_backend_war_minus___minus_any_minus_2950795293246301533/webapp/WEB_minus_INF/app/lib//archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/background_job_queue.rb:126)
archivesspace.data.tmp.jetty_minus_0_0_0_0_minus_8089_minus_backend_war_minus___minus_any_minus_2950795293246301533.webapp.WEB_minus_INF.app.lib.background_job_queue.run_pending_job(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/background_job_queue.rb:126)
RUBY.start_background_thread(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/background_job_queue.rb:169)
org.jruby.RubyProc.call(org/jruby/RubyProc.java:318)
java.lang.Thread.run(java/lang/Thread.java:750)



From: archivesspace_users_group-bounces at lyralists.lyrasis.org <archivesspace_users_group-bounces at lyralists.lyrasis.org> On Behalf Of Brian Harrington
Sent: Monday, August 14, 2023 2:50 PM
To: Archivesspace Users Group <archivesspace_users_group at lyralists.lyrasis.org>
Subject: Re: [Archivesspace_Users_Group] Staff PDF Gen Error

Hi Ed,

It looks like your problem is here:


<unittitle>Extension Work in ICA Cooperative Countries</emph>. Mimeo. ICA</unittitle>



Since the beginning <emph> is missing, the XML parser thinks you’re trying to close a <unittitle> with an </emph>.



I hope this helps.



Brian

--
Brian Harrington (he/him)
Data Migration Specialist
LYRASIS
brian.harrington at lyrasis.org<mailto:brian.harrington at lyrasis.org>

From: archivesspace_users_group-bounces at lyralists.lyrasis.org<mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org> <archivesspace_users_group-bounces at lyralists.lyrasis.org<mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org>> on behalf of Busch, Ed <buschedw at msu.edu<mailto:buschedw at msu.edu>>
Date: Monday, August 14, 2023 at 2:38 PM
To: archivesspace_users_group at lyralists.lyrasis.org<mailto:archivesspace_users_group at lyralists.lyrasis.org> <archivesspace_users_group at lyralists.lyrasis.org<mailto:archivesspace_users_group at lyralists.lyrasis.org>>
Subject: [Archivesspace_Users_Group] Staff PDF Gen Error


Hi-

We have a weird one. We have a collection that was migrated years ago from AT but never printed from staff side. When we run the Staff side Export >Generate PDF we get
Generating PDF for Helen Strow collection
org.xml.sax.SAXParseException; lineNumber: 64; columnNumber: 10726; The element type "unittitle" must be terminated by the matching end-tag "</unittitle>".
net.sf.saxon.s9api.DocumentBuilder.build(net/sf/saxon/s9api/DocumentBuilder.java:360)
java.lang.reflect.Method.invoke(java/lang/reflect/Method.java:498)
org.jruby.javasupport.JavaMethod.invokeDirectWithExceptionHandling(org/jruby/javasupport/JavaMethod.java:456)
org.jruby.javasupport.JavaMethod.invokeDirect(org/jruby/javasupport/JavaMethod.java:317)
RUBY.build(/archivesspace/gems/gems/saxon-rb-0.8.3-java/lib/saxon/document_builder.rb:225)
RUBY.to_fo(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/AS_fop.rb:42)
RUBY.to_pdf(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/AS_fop.rb:58)
RUBY.run(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/job_runners/print_to_pdf_runner.rb:43)
archivesspace.data.tmp.jetty_minus_0_0_0_0_minus_8089_minus_backend_war_minus___minus_any_minus_2950795293246301533.webapp.WEB_minus_INF.app.lib.request_context.open(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/request_context.rb:24)
RUBY.run(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/job_runners/print_to_pdf_runner.rb:11)
archivesspace.data.tmp.jetty_minus_0_0_0_0_minus_8089_minus_backend_war_minus___minus_any_minus_2950795293246301533.webapp.WEB_minus_INF.app.lib.background_job_queue.invokeOther31:run(archivesspace/data/tmp/jetty_minus_0_0_0_0_minus_8089_minus_backend_war_minus___minus_any_minus_2950795293246301533/webapp/WEB_minus_INF/app/lib//archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/background_job_queue.rb:126)
archivesspace.data.tmp.jetty_minus_0_0_0_0_minus_8089_minus_backend_war_minus___minus_any_minus_2950795293246301533.webapp.WEB_minus_INF.app.lib.background_job_queue.run_pending_job(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/background_job_queue.rb:126)
RUBY.start_background_thread(/archivesspace/data/tmp/jetty-0_0_0_0-8089-backend_war-_-any-2950795293246301533/webapp/WEB-INF/app/lib/background_job_queue.rb:169)
org.jruby.RubyProc.call(org/jruby/RubyProc.java:318)
java.lang.Thread.run(java/lang/Thread.java:750)

When I Export EAD and look for line 64, for some reason it has put all of the archivalobjects on one xml “line”. We tried moving things around and nothing changes other than the order they are on that one line. Still get an error.

I did a count of unittitle and it’s an even number so it doesn’t seem like one is missing. There are a bunch of emph tags but they look ok to me.
Omololu, A. <emph render="italic">Coordination of Relevant Governments Services for Nutrition Education</emph>. Ibadan University

Printing from Public side works fine.

Anyone have a suggestion?

Ed Busch, MLIS
Electronic Records Archivist
Michigan State University Archives and Historical Collections
Conrad Hall
943 Conrad Road, Room 101
East Lansing, MI 48824
517-884-6438
buschedw at msu.edu<mailto:buschedw at msu.edu>
he/him/his

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lyralists.lyrasis.org/pipermail/archivesspace_users_group/attachments/20230814/05148c0e/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: UA.17.443_20230814_185619_UTC__ead.xml
Type: application/xml
Size: 366244 bytes
Desc: UA.17.443_20230814_185619_UTC__ead.xml
URL: <http://lyralists.lyrasis.org/pipermail/archivesspace_users_group/attachments/20230814/05148c0e/attachment.xml>


More information about the Archivesspace_Users_Group mailing list