[Archivesspace_Users_Group] unpublished resource showing up as PDF download in a Google search

Mang Sun mang.sun at rice.edu
Tue Jul 5 16:17:02 EDT 2016


Amanda,


When did you change the Publish? flag of those resources to "No"? I
vaguely recall that, at the very beginning, all resources were set to
Publish? YES. If that is true, Google had already crawled and indexed
the shortcut link to the EAD-PDF for each resource before its Publish?
flag was set to NO. In our current version, 1.4.2, the code behind the
EAD-PDF link does not seem to check the Publish? flag of the resource.
The shortcut link, which can also be assembled manually by following a
pattern, therefore remains valid, and Google will keep crawling and
indexing it even for resources that were once published and have since
been unpublished. If a visibility check were introduced when generating
the EAD-PDF and similar outputs, the problem would be solved. To keep
Google from remembering a shortcut link under our current version, a
new resource should be set to Publish? NO from the very beginning. Even
so, this cannot prevent power users from handcrafting the link to get
the EAD-PDF output of an invisible resource if they know the generated
or assigned resource number.
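
To illustrate the kind of visibility check I mean, here is a minimal
Python sketch. It is not ArchivesSpace code: Record, published_view and
pdf_response_status are hypothetical names made up for this example, and
the real fix would have to live in the export/PDF code itself. The idea
is only that the PDF endpoint refuses an unpublished resource and drops
unpublished components before anything is rendered.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Record:
    """Hypothetical stand-in for a resource or one of its components."""
    title: str
    publish: bool
    children: List["Record"] = field(default_factory=list)


def published_view(record: Record) -> Optional[Record]:
    """Return a copy holding only published records.

    An unpublished record hides its whole subtree, so the result is
    None when the record itself is unpublished.
    """
    if not record.publish:
        return None
    kept = [v for v in (published_view(c) for c in record.children)
            if v is not None]
    return Record(record.title, True, kept)


def pdf_response_status(resource: Optional[Record]) -> int:
    """HTTP status a public PDF link could return (assumed behaviour).

    404 is used so that a handcrafted link reveals nothing about an
    unpublished resource.
    """
    if resource is None or published_view(resource) is None:
        return 404
    return 200
```

With a check like this, an unpublished resource whose components are
still marked as published would give 404 on the shortcut link, since the
parent's flag hides everything beneath it.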


Mang




On 7/5/2016 2:52 PM, Custer, Mark wrote:
>
> Amanda,
>
> So, it sounds like the PUI is working as expected in that case, but 
> that the ASpace PDF conversion process is including everything from 
> each finding aid, whether it’s listed as published or not.  Is that 
> right?  If so, it should just be a simple update to the ASpace PDF 
> stylesheet, and that type of change should definitely be in the core code.
>
> I’ll look to see if there’s an open issue for this, but if there’s 
> not, I can create one in JIRA.  I’ve made a couple updates to the 
> core ASpace PDF stylesheet, and I hope to make a few more before the 
> next PUI is released.
>
> Mark
>
> *From:*archivesspace_users_group-bounces at lyralists.lyrasis.org 
> [mailto:archivesspace_users_group-bounces at lyralists.lyrasis.org] *On 
> Behalf Of *Amanda Focke
> *Sent:* Tuesday, 05 July, 2016 2:45 PM
> *To:* archivesspace_users_group at lyralists.lyrasis.org
> *Subject:* Re: [Archivesspace_Users_Group] unpublished resource 
> showing up as PDF download in a Google search
>
> Hello Mang and all -
>
> What I did was search Google for something I know is from one of our 
> unpublished finding aids,
>
> such as this text string:
> "10002.  Genetic regulatory proteins -1 (laci with altered ligand 
> responsivity.  Kathleen Matthews"
>
> and the result was that the entire ArchivesSpace-generated PDF version 
> of the (unfinished / unpublished) finding aid is available as the 2nd 
> hit from Google's results list.
>
>
> I was hoping to attend the ArchivesSpace webinar which is going on 
> right now to see if this issue has been resolved,
> but the webinar is full. I'll just wait for the recording and if my 
> questions aren't answered there, will follow up with ArchivesSpace folks.
>
> Amanda
>
>
>
>
>
> On 7/5/2016 9:48 AM, Mang Sun wrote:
>
>     Amanda,
>
>     I am just back. I can't seem to reproduce the Google hit by
>     searching Google for "Randall Hulet", and I don't see a problem with
>     our Public interface when searching for "Randall Hulet".  Can you
>     give me a screenshot of your Google result for the title of
>     this archival object?
>
>     Mang
>
>     On 6/15/2016 1:59 PM, Amanda Focke wrote:
>
>         I think this may be AR-583 or AR-278, which both seem to say
>         they are resolved, so maybe if we upgrade this summer to the
>         new version this will be fixed....
>
>         Amanda
>
>         On 6/14/2016 4:29 PM, Amanda Focke wrote:
>
>             Hello --
>
>             We have an *unpublished* Resource in our ArchivesSpace
>             instance which is showing up
>             when I search a text string from it in Google.
>
>             I search a text string from that resource and I get a hit
>             (in Google) coming from our ArchivesSpace offering a
>             "printer friendly download" of the full PDF for the Resource.
>
>             I double checked the Resource, it is definitely
>             "unpublished" at the top level, although it has components
>             which are marked as published (I'm not sure why those are
>             published but it shouldn't matter if the parent is
>             unpublished).
>
>             Has anyone noticed this behavior?
>             Thanks,
>             Amanda
>
>             -- 
>             *Amanda Focke, CA, DAS*
>             Asst. Head of Special Collections
>             Woodson Research Center
>             Fondren Library MS-44
>             Rice University
>             6100 Main St.
>             Houston, TX 77005
>             713-348-2124 | afocke at rice.edu <mailto:afocke at rice.edu>
>             Website: http://library.rice.edu/woodson
>             Blog: http://woodsononline.wordpress.com/
>
>
>
>
>             _______________________________________________
>
>             Archivesspace_Users_Group mailing list
>
>             Archivesspace_Users_Group at lyralists.lyrasis.org
>             <mailto:Archivesspace_Users_Group at lyralists.lyrasis.org>
>
>             http://lyralists.lyrasis.org/mailman/listinfo/archivesspace_users_group
>
>
>
>
