<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<!--[if !mso]><style>v\:* {behavior:url(#default#VML);}
o\:* {behavior:url(#default#VML);}
w\:* {behavior:url(#default#VML);}
.shape {behavior:url(#default#VML);}
</style><![endif]--><style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
span.EmailStyle19
{mso-style-type:personal-reply;
font-family:"Calibri",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;
mso-ligatures:none;}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="EN-US" link="blue" vlink="purple" style="word-wrap:break-word">
<div class="WordSection1">
<p class="MsoNormal">I was able to do PDF to text, then regex, and then Excel to import a large part of an old index. It took some manipulation initially, but it was certainly easier than rekeying.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div>
<p class="MsoNormal"><span style="color:black">--- Matthew<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:black"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="color:black"><o:p> </o:p></span></p>
<table class="MsoNormalTable" border="0" cellspacing="0" cellpadding="0" width="650" style="width:487.5pt">
<tbody>
<tr style="height:27.9pt">
<td width="90" style="width:67.5pt;padding:0in 0in 0in 0in;height:27.9pt">
<p class="MsoNormal"><img width="85" height="56" style="width:.8854in;height:.5833in" id="Picture_x0020_1" src="cid:image001.jpg@01D9CA0C.30BA14E0" alt="alma college logo bw"><o:p></o:p></p>
</td>
<td width="560" style="width:420.0pt;padding:0in 0in 0in 0in;height:27.9pt">
<p class="MsoNormal" style="line-height:9.0pt"><b><span style="font-size:8.0pt">Matthew Collins, Ph.D., MLIS</span></b><span style="font-size:8.0pt;font-family:"Arial",sans-serif;color:#333333"><br>
Library Director and Archivist<br>
Alma College, 614 W. Superior St., Alma, MI 48801<br>
(989)463-7342 | <a href="mailto:collinsms@alma.edu"><span style="color:#0563C1">collinsms@alma.edu</span></a>
<o:p></o:p></span></p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal" style="margin-left:.5in"><b><span style="font-size:12.0pt;color:black">From:
</span></b><span style="font-size:12.0pt;color:black">"Joshua D. Shaw" &lt;Joshua.D.Shaw@dartmouth.edu&gt;<br>
<b>Reply-To: </b>Archivesspace Users Group &lt;archivesspace_users_group@lyralists.lyrasis.org&gt;<br>
<b>Date: </b>Monday, August 7, 2023 at 5:12 PM<br>
<b>To: </b>Archivesspace Users Group &lt;archivesspace_users_group@lyralists.lyrasis.org&gt;<br>
<b>Subject: </b>Re: [Archivesspace_Users_Group] PDF files<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black">For something like that, I'd definitely think about ways to convert that to something you can import rather than rekeying that many entries. <o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black"><o:p> </o:p></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black">I'd probably try pdf to text and then some find/replace/regex work to get it into something importable, maybe one of the import spreadsheets
if you are using a later version of AS. Or EAD as a 'less than ideal, but probably still doable' option.<o:p></o:p></span></p>
</div>
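<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black">A rough sketch of the regex step in Python, assuming the PDF has already been converted to plain text. The line layout (file number, title, year, box, separated by two or more spaces), the sample lines, and the column names are all made up for illustration; the real inventory will look different, and the headers should be changed to match whatever import spreadsheet template you use.</span></p>
<pre>
```python
import csv
import io
import re

# Hypothetical line layout (an assumption, not the real inventory):
#   file number, title, year, box number, separated by 2+ spaces.
LINE_RE = re.compile(r"^(\d+)\s{2,}(.+?)\s{2,}(\d{4})\s{2,}Box\s+(\d+)$")


def parse_inventory(text):
    """Return (rows, rejects): parsed tuples and lines that did not match."""
    rows, rejects = [], []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        match = LINE_RE.match(line)
        if match:
            rows.append(match.groups())
        else:
            rejects.append(line)  # keep for manual review, never drop silently
    return rows, rejects


def to_csv(rows):
    """Serialize rows to CSV text with illustrative column names."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["file_number", "title", "year", "box"])
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up sample standing in for the text extracted from the PDF.
sample = (
    "101  Smith v. Jones  1887  Box 12\n"
    "Page 3 of 467\n"
    "102  State v. Doe  1901  Box 12\n"
)
rows, rejects = parse_inventory(sample)
print(to_csv(rows))
```
</pre>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black">Lines that do not match the pattern, such as page headers, end up in the rejects list so they can be checked by hand rather than lost.</span></p>
</div>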
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black"><o:p> </o:p></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black">jds<o:p></o:p></span></p>
</div>
<div class="MsoNormal" align="center" style="margin-left:.5in;text-align:center">
<hr size="0" width="100%" align="center">
</div>
<div id="divRplyFwdMsg">
<p class="MsoNormal" style="margin-left:.5in"><b><span style="color:black">From:</span></b><span style="color:black"> archivesspace_users_group-bounces@lyralists.lyrasis.org &lt;archivesspace_users_group-bounces@lyralists.lyrasis.org&gt; on behalf of Dean DeBolt
&lt;ddebolt@uwf.edu&gt;<br>
<b>Sent:</b> Monday, August 7, 2023 5:08 PM<br>
<b>To:</b> Archivesspace Users Group &lt;archivesspace_users_group@lyralists.lyrasis.org&gt;<br>
<b>Subject:</b> Re: [Archivesspace_Users_Group] PDF files</span> <o:p></o:p></p>
<div>
<p class="MsoNormal" style="margin-left:.5in"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal" style="margin-left:.5in">Thanks. We're getting local court records (1820-1930) and they sent a nice PDF inventory (467 pp. of 26,000 files) and I'd hoped to make that searchable through ArchivesSpace.<o:p></o:p></p>
<div>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in">Dean<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><br clear="all">
<o:p></o:p></p>
<div>
<div>
<div>
<div>
<div>
<div>
<div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="color:#000099;background:white">Dean DeBolt, University Librarian (Professor)/University Archivist<br>
UWF Archives and West Florida History Center</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="color:#000099;background:white">University of West Florida Library<br>
11000 University Parkway<br>
Pensacola, FL 32514-5750<br>
</span><a href="mailto:ddebolt@uwf.edu" target="_blank"><span style="color:#000099;background:white">ddebolt@uwf.edu</span></a><span style="color:#000099;background:white">; 850-474-2213</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="color:#000099;background:white">West Florida History Center is the largest and most comprehensive</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="color:#000099;background:white">history collection about Pensacola and the West Florida region.</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="color:#000099;background:white"><a href="http://libguides.uwf.edu/universityarchives" target="_blank">http://libguides.uwf.edu/universityarchives</a></span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="color:#000099;background:white">Digital collections can be found at: <a href="http://archives.uwf.edu/" target="_blank">http://uwf.lyrasistechnology.org </a></span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="color:#000099;background:white">and
<a href="http://uwf.digital.flvc.org/" target="_blank">http://uwf.digital.flvc.org</a></span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in">If we've been of service, please let us know or our administration,<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in">Dean of Libraries <<a href="mailto:sclark2@uwf.edu" target="_blank">sclark2@uwf.edu</a>><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
</div>
</div>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
<div>
<div>
<p class="MsoNormal" style="margin-left:.5in">On Mon, Aug 7, 2023 at 4:01 PM Joshua D. Shaw <<a href="mailto:Joshua.D.Shaw@dartmouth.edu">Joshua.D.Shaw@dartmouth.edu</a>> wrote:<o:p></o:p></p>
</div>
<blockquote style="border:none;border-left:solid #CCCCCC 1.0pt;padding:0in 0in 0in 6.0pt;margin-left:4.8pt;margin-right:0in">
<div>
<div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black">Hi Dean<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black"><o:p> </o:p></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black">Unfortunately, no. AS only stores metadata about an object, not the object itself. You
<b>can</b> point AS to a storage system (like a digital preservation system or website, etc). That's what Digital Objects are typically used for.<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black"><o:p> </o:p></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black">For your particular case, if you converted the pdf to something that you could import as metadata (EAD or something), then that would be searchable as the entries would
then have become entries in the usual archival object tree for the resource(s) the pdf is describing. But it sounds like that's not quite what you are describing.<o:p></o:p></span></p>
</div>
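<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black">As a sketch of what the EAD route could look like, the Python below turns a few entries into EAD 2002 style file-level components using the standard library. The entries, the function name, and the choice of fields (title, year, box) are assumptions for illustration. This only builds a dsc fragment; an importable file would still need the rest of the EAD document around it (ead, eadheader, archdesc), and what the importer accepts should be checked against your AS version.</span></p>
<pre>
```python
import xml.etree.ElementTree as ET


def entry_to_component(title, year, box):
    """Build one file-level component (c/did) for a single inventory entry."""
    c = ET.Element("c", {"level": "file"})
    did = ET.SubElement(c, "did")
    ET.SubElement(did, "unittitle").text = title
    ET.SubElement(did, "unitdate", {"normal": year}).text = year
    ET.SubElement(did, "container", {"type": "box"}).text = box
    return c


# Hypothetical entries standing in for rows parsed out of the PDF text.
entries = [("Smith v. Jones", "1887", "12"), ("State v. Doe", "1901", "12")]

# Collect the components under a dsc element, one per entry.
dsc = ET.Element("dsc")
for title, year, box in entries:
    dsc.append(entry_to_component(title, year, box))

xml_text = ET.tostring(dsc, encoding="unicode")
print(xml_text)
```
</pre>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black">Once imported, each component would become an archival object under the resource, which is what makes the individual entries searchable.</span></p>
</div>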
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black"><o:p> </o:p></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black">Also make sure that, if the PDF is something you need to keep, it's stored in another system.<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black"><o:p> </o:p></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black">Best,<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><span style="font-size:12.0pt;color:black">Joshua<o:p></o:p></span></p>
</div>
<div class="MsoNormal" align="center" style="margin-left:.5in;text-align:center">
<hr size="0" width="100%" align="center">
</div>
<div id="x_m_5890890424577666272divRplyFwdMsg">
<p class="MsoNormal" style="margin-left:.5in"><b><span style="color:black">From:</span></b><span style="color:black">
<a href="mailto:archivesspace_users_group-bounces@lyralists.lyrasis.org" target="_blank">
archivesspace_users_group-bounces@lyralists.lyrasis.org</a> <<a href="mailto:archivesspace_users_group-bounces@lyralists.lyrasis.org" target="_blank">archivesspace_users_group-bounces@lyralists.lyrasis.org</a>> on behalf of Dean DeBolt <<a href="mailto:ddebolt@uwf.edu" target="_blank">ddebolt@uwf.edu</a>><br>
<b>Sent:</b> Monday, August 7, 2023 4:35 PM<br>
<b>To:</b> Archivesspace Users Group <<a href="mailto:archivesspace_users_group@lyralists.lyrasis.org" target="_blank">archivesspace_users_group@lyralists.lyrasis.org</a>><br>
<b>Subject:</b> [Archivesspace_Users_Group] PDF files</span> <o:p></o:p></p>
<div>
<p class="MsoNormal" style="margin-left:.5in"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal" style="margin-left:.5in">Can anyone tell me quickly if a PDF document is searchable in ArchivesSpace? We've been given an inventory (467 pp.) as a PDF and I put that in ArchivesSpace.<o:p></o:p></p>
<div>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in">Dean<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:.5in"><br clear="all">
<o:p></o:p></p>
<div>
<div>
<div>
<div>
<div>
<div>
<div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
<p class="MsoNormal" style="margin-left:.5in">_______________________________________________<br>
Archivesspace_Users_Group mailing list<br>
<a href="mailto:Archivesspace_Users_Group@lyralists.lyrasis.org" target="_blank">Archivesspace_Users_Group@lyralists.lyrasis.org</a><br>
<a href="http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group" target="_blank">http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group</a><o:p></o:p></p>
</div>
</blockquote>
</div>
</div>
</div>
</body>
</html>