<div dir="ltr"><div dir="ltr">Thanks for posting this! I created a ticket for this issue some time ago - <a href="https://archivesspace.atlassian.net/projects/ANW/issues/ANW-758">https://archivesspace.atlassian.net/projects/ANW/issues/ANW-758</a>. The issue appears to be that the base PDF font set is limited in its character support, and does not handle diacritics/non-Latin characters well - it either "flattens" them to ASCII, or replaces them with "#".</div><div dir="ltr"><br></div><div>I'm unaware of any workarounds in the meantime, but it's entirely a PDF rendering issue - your data should be fine as-is.<br></div><div dir="ltr"><br></div><div>Thanks,</div><div>--Alex<br></div></div><br><div class="gmail_quote"><div dir="ltr">On Tue, Dec 11, 2018 at 12:57 PM Zalduendo, Ines &lt;<a href="mailto:izalduendo@gsd.harvard.edu">izalduendo@gsd.harvard.edu</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div lang="EN-US">
<div class="gmail-m_-8284767169117567458WordSection1">
<p class="MsoNormal"><span style="color:rgb(31,73,125)">Thanks Benn for sending this along.<u></u><u></u></span></p>
<p class="MsoNormal"><span style="color:rgb(31,73,125)">The same is going on with Japanese characters. They display correctly in ArchivesSpace but the PDF doesn’t display them.<u></u><u></u></span></p>
<p class="MsoNormal"><span style="color:rgb(31,73,125)">Here’s an example: <a href="https://hollisarchives.lib.harvard.edu/repositories/7/resources/201" target="_blank">
https://hollisarchives.lib.harvard.edu/repositories/7/resources/201</a> (top right button for PDF)<u></u><u></u></span></p>
<p class="MsoNormal"><span style="color:rgb(31,73,125)">I never reported this to the users group, but am glad others are interested in this being looked into. I was told core developers already know about this.<u></u><u></u></span></p>
<p class="MsoNormal"><span style="color:rgb(31,73,125)">Ines <u></u><u></u></span></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal"><span style="font-size:10pt;font-family:&quot;Calibri Light&quot;,sans-serif;color:rgb(127,127,127)">Special Collections Archivist / Frances Loeb Library / Harvard University Graduate School of Design / 48 Quincy Street, Cambridge, MA 02138 / T. 617.496.1300</span><u></u><u></u></p>
<p class="MsoNormal"><span style="color:rgb(31,73,125)"><u></u> <u></u></span></p>
<div>
<div style="border-color:rgb(225,225,225) currentcolor currentcolor;border-style:solid none none;border-width:1pt medium medium;padding:3pt 0in 0in">
<p class="MsoNormal"><b>From:</b> <a href="mailto:archivesspace_users_group-bounces@lyralists.lyrasis.org" target="_blank">archivesspace_users_group-bounces@lyralists.lyrasis.org</a> &lt;<a href="mailto:archivesspace_users_group-bounces@lyralists.lyrasis.org" target="_blank">archivesspace_users_group-bounces@lyralists.lyrasis.org</a>&gt;
<b>On Behalf Of </b>Benn Joseph<br>
<b>Sent:</b> Tuesday, December 11, 2018 11:19 AM<br>
<b>To:</b> Archivesspace Users Group &lt;<a href="mailto:archivesspace_users_group@lyralists.lyrasis.org" target="_blank">archivesspace_users_group@lyralists.lyrasis.org</a>&gt;<br>
<b>Subject:</b> [Archivesspace_Users_Group] diacritics in Title and Filing Title fields<u></u><u></u></p>
</div>
</div>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Not sure if there’s a ticket for this, but we’re seeing some tricky behavior with diacritics in both the Title and Filing Title fields when trying to print a PDF as a background job.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Here’s an example: the collection name is “Camille Saint-Saëns correspondence”, and the umlaut displays correctly in the public interface.
<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">If this text is input into the Title field without any character encoding, i.e. if the “ë” is just pasted in there, then when I print a PDF as a background job in the staff interface it shows up like this:<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">“Camille Saint-Sae#ns correspondence”<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">If I encode the character, whether as an HTML named entity (&amp;euml;) or a numeric character reference (&amp;#235;), the title ends up looking like this in the PDF output:<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">“Camille Saint-Sa&amp;#235;ns correspondence”<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">…because the ampersand gets converted to “&amp;amp;” in the XML and ends up as “&amp;amp;#235;”. I’m not seeing this behavior in any other fields though. Does this mean that no diacritics are allowed in the Title fields? Or am I just inputting
this wrong? When generating a PDF from the public interface, it seems to remove the encoding entirely, so the title fields end up as “Saint-Saens” in each case, although I understand that PDF creation process to be different from the one done as a background
job.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Thanks!<u></u><u></u></p>
<p class="MsoNormal">--Benn<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal"><b><span style="font-size:9pt;font-family:&quot;Arial&quot;,sans-serif;color:rgb(13,13,13)">Benn Joseph<u></u><u></u></span></b></p>
<p class="MsoNormal"><span style="font-size:9pt;font-family:&quot;Arial&quot;,sans-serif;color:rgb(13,13,13)">Head of Archival Processing<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:9pt;font-family:&quot;Arial&quot;,sans-serif;color:rgb(13,13,13)">Northwestern University Libraries<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:9pt;font-family:&quot;Arial&quot;,sans-serif;color:rgb(78,42,132)">Northwestern University<u></u><u></u></span></p>
<p class="MsoNormal"><a href="http://www.library.northwestern.edu" target="_blank"><span style="font-size:9pt;font-family:&quot;Arial&quot;,sans-serif;color:windowtext">www.library.northwestern.edu</span></a><span style="font-size:9pt;font-family:&quot;Arial&quot;,sans-serif"><u></u><u></u></span></p>
<p class="MsoNormal"><a href="mailto:benn.joseph@northwestern.edu" target="_blank"><span style="font-size:9pt;font-family:&quot;Arial&quot;,sans-serif;color:windowtext">benn.joseph@northwestern.edu</span></a><u><span style="font-size:9pt;font-family:&quot;Arial&quot;,sans-serif"><u></u><u></u></span></u></p>
<p class="MsoNormal"><span style="font-size:9pt;font-family:&quot;Arial&quot;,sans-serif;color:rgb(13,13,13)">847.467.6581</span><span style="font-size:9pt"><u></u><u></u></span></p>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
</div>
_______________________________________________<br>
Archivesspace_Users_Group mailing list<br>
<a href="mailto:Archivesspace_Users_Group@lyralists.lyrasis.org" target="_blank">Archivesspace_Users_Group@lyralists.lyrasis.org</a><br>
<a href="http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group" rel="noreferrer" target="_blank">http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group</a><br>
</blockquote></div><br clear="all"><br>-- <br><div dir="ltr" class="gmail_signature"><div dir="ltr"><div><div dir="ltr">Alexander Duryee<div>Metadata Archivist</div><div>New York Public Library</div><div>(917)-229-9590</div><div><a href="mailto:alexanderduryee@nypl.org" target="_blank">alexanderduryee@nypl.org</a></div></div></div></div></div>