Prototypes of pro-active approaches to support the archiving of web references for scholarly
communications
Richard Wincewicz (1), Peter Burnhill (1) & Herbert Van de Sompel (2)
(1) EDINA, University of Edinburgh, (2) Los Alamos National Laboratory
The Project Team 2013 – 2015, funded by the
Andrew W. Mellon Foundation
• Los Alamos National Laboratory:
Research Library: Herbert Van de Sompel, Harihar Shankar, [Martin Klein, Rob Sanderson]
• University of Edinburgh:
Language Technology Group: Claire Grover, Beatrice Alex, Colin Matheson, Richard Tobin, [Ke “Adam” Zhou]
EDINA*: Peter Burnhill, Muriel Mewissen (Project Manager), Tim Stickland, Richard Wincewicz, [Neil Mayo]
* Centre for Service Delivery & Digital Expertise
Overview
1. Introduction
2. Evidence
3. Remedy
1. Introduction
Reference Rot
Links to Web at Large resources are subject to Reference Rot, a combination of two factors:
• Link Rot: the link stops working
  • e.g. HTTP 404 “Not Found”
• Content Drift: the linked content changes over time
  • Possibly to the extent that it is no longer representative of the content that was initially referenced
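The two factors can be told apart mechanically. The following is a minimal sketch, assuming a hypothetical checker that already holds the HTTP status returned for a link today and the page content as it was when the reference was made. The function names, the labels and the use of SHA-256 are illustrative assumptions, not part of the project's tooling. A byte-level digest flags any change at all, so it overstates drift that a human reader would consider significant.

```python
import hashlib
from typing import Optional


def digest(content: bytes) -> str:
    """Return a SHA-256 hex digest used to compare two versions of a page."""
    return hashlib.sha256(content).hexdigest()


def classify_reference(status: int, referenced: bytes, current: Optional[bytes]) -> str:
    """Label a link as 'link rot', 'content drift' or 'intact'.

    status     -- HTTP status code returned for the link today
    referenced -- page content as it was when the reference was made
    current    -- page content today, or None if nothing was returned
    """
    # Link rot: the link no longer resolves to any content (e.g. HTTP 404).
    if status >= 400 or current is None:
        return "link rot"
    # Content drift: the link works, but the content is no longer the same.
    if digest(current) != digest(referenced):
        return "content drift"
    return "intact"
```

For example, `classify_reference(404, b"old page", None)` yields `"link rot"`, while a 200 response with different bytes yields `"content drift"`.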
2. Evidence
Articles that Link to Articles & to Web At Large Resources (PMC)
Martin Klein et al. (2014) Scholarly context not found. http://dx.doi.org/10.1371/journal.pone.0115253
Articles that Link to Articles & to Web At Large Resources (Elsevier)
Martin Klein et al. (2014) Scholarly context not found. http://dx.doi.org/10.1371/journal.pone.0115253
Articles with URI References (PMC)
Articles 479,194
with URI references 399,005
with URI references to articles 240,857
with URI references to Web at Large 156,160
Martin Klein et al. (2014) Scholarly context not found. http://dx.doi.org/10.1371/journal.pone.0115253
Link Rot (PMC)
Martin Klein et al. (2014) Scholarly context not found. http://dx.doi.org/10.1371/journal.pone.0115253
Link Rot (Elsevier)
Martin Klein et al. (2014) Scholarly context not found. http://dx.doi.org/10.1371/journal.pone.0115253
Links from arXiv, Elsevier, PMC to TLD Targets
Martin Klein et al. (2014) Scholarly context not found. In: PLOS ONE. http://dx.doi.org/10.1371/journal.pone.0115253
Grey is Link Rot – Referenced Content Not Accessible
Martin Klein et al. (2014) Scholarly context not found. In: PLOS ONE. http://dx.doi.org/10.1371/journal.pone.0115253
Grey is Not Archived - Referenced Content Lost
Martin Klein et al. (2014) Scholarly context not found. In: PLOS ONE. http://dx.doi.org/10.1371/journal.pone.0115253
Content Drift – http://dl00.org
[Figure: panels labelled 2000, 2004, 2005 and 2008]
(a) Dynamic content: values on the web page change over time
(b) Static content, but very different (often unrelated) web pages
3. Remedy
Create Snapshots of Referenced Resources
Various web archives support on-demand creation of snapshots of URIs (manually or via an API):
• archive.today
• Internet Archive
• perma.cc
• webcitation.org
When creating snapshots, maintain:
• Original URI
• Snapshot URI
• Date/Time of snapshot
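The three items to maintain can be kept together as one record per reference. This is a minimal sketch only: the class name `SnapshotRecord` and its field names are hypothetical, and the example values are the ones used on the Link Decorations slide later in this deck.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class SnapshotRecord:
    """What to keep for every snapshot taken of a referenced resource."""

    original_uri: str            # the URI the author actually referenced
    snapshot_uri: str            # where the archived copy lives
    snapshot_datetime: datetime  # when the snapshot was taken (timezone-aware)

    def version_date(self) -> str:
        """Return the snapshot date as YYYY-MM-DD in UTC."""
        return self.snapshot_datetime.astimezone(timezone.utc).strftime("%Y-%m-%d")


record = SnapshotRecord(
    original_uri="http://www.stanford.edu",
    snapshot_uri="http://archive.is/FAy6o",
    snapshot_datetime=datetime(2014, 8, 15, tzinfo=timezone.utc),
)
print(record.version_date())
```

Keeping the original URI alongside the snapshot URI matters because the snapshot alone does not tell a reader, or another archive, which resource it stands for.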
Create Snapshots of Referenced Resources
Snapshots can be created at various stages. The closer to the moment of referencing, the better the image captured.
Stage | Actor | Snapshot Quality
Preparation | Author/reference tool | best
Submission/Issue | Editor/manuscript system | good
Publication | Aggregator/publisher platform | ok
Post-publication | Librarian/IR, journal archive | better than nothing
Authoring - Zotero Plugin Demonstrator
Richard Wincewicz (2014) Prototype Hiberlink plugin for Zotero for pro-active archiving and temporal references
https://www.youtube.com/v/ZYmi_Ydr65M%26vq
Publication - OJS
Publication - HiberActive Service Demonstrator
Martin Klein et al. (2014) HiberActive: Pro-Active Archiving of web references from scholarly articles
Open Repositories 2014 http://www.slideshare.net/martinklein0815/hiberactive
Reference Resources Robustly
When referencing resources, include:
• Original URI – allows the user to revisit the URI as it is at the time of reading, if the URI is still operational
• Snapshot URI – allows the user to visit the snapshot, if one was created, and if the web archive in which it was created is still operational
• Date/Time – together with the original URI, allows the user to visit any snapshot created around that Date/Time in any web archive around the world (using Memento infrastructure)
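The Date/Time lookup relies on Memento datetime negotiation (RFC 7089): a client asks a TimeGate for the original URI and states the desired moment in an `Accept-Datetime` header. The sketch below only builds such a request and does not send it. The TimeGate base URL is an assumption about a public aggregator endpoint that may not be available, and the function name is hypothetical.

```python
from datetime import datetime, timezone
from email.utils import format_datetime
from typing import Dict, Tuple

# Assumed aggregator TimeGate endpoint; substitute any TimeGate you have access to.
TIMEGATE_BASE = "http://timetravel.mementoweb.org/timegate/"


def timegate_request(original_uri: str, when: datetime) -> Tuple[str, Dict[str, str]]:
    """Return the URL and headers for a Memento datetime-negotiation request.

    `when` must be timezone-aware; it is converted to UTC and rendered
    as an HTTP-date, the format Accept-Datetime requires.
    """
    accept_datetime = format_datetime(when.astimezone(timezone.utc), usegmt=True)
    return TIMEGATE_BASE + original_uri, {"Accept-Datetime": accept_datetime}


url, headers = timegate_request(
    "http://www.stanford.edu",
    datetime(2014, 8, 15, tzinfo=timezone.utc),
)
print(url)
print(headers["Accept-Datetime"])
```

Because the request names only the original URI and a date, it does not depend on any single archive still being operational.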
(2015) Robust Links - Motivation. http://robustlinks.mementoweb.org/about/
Reference Resources Actionably
When referencing resources, use Link Decorations to convey the Original URI, Snapshot URI and Date/Time:
<a href="http://www.stanford.edu" data-versionurl="http://archive.is/FAy6o" data-versiondate="2014-08-15">
<a href="http://www.stanford.edu" data-versiondate="2014-08-15">
<a href="http://archive.is/FAy6o" data-originalurl="http://www.stanford.edu" data-versiondate="2014-08-15">
Herbert Van de Sompel et al. (2015) Robust Links - Link Decorations. http://robustlinks.mementoweb.org/spec/
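A platform that stores the original URI, snapshot URI and date could generate such decorated links rather than write them by hand. The sketch below is illustrative only: `robust_link` is a hypothetical helper, the link text is made up, and it covers just the two forms above in which `href` points at the original URI.

```python
from html import escape
from typing import Optional


def robust_link(original_uri: str, version_date: str, text: str,
                snapshot_uri: Optional[str] = None) -> str:
    """Render an <a> element whose href is the original URI.

    data-versiondate is always emitted; data-versionurl only when a
    snapshot URI is known.
    """
    attrs = [("href", original_uri)]
    if snapshot_uri is not None:
        attrs.append(("data-versionurl", snapshot_uri))
    attrs.append(("data-versiondate", version_date))
    # Escape attribute values so unusual URIs cannot break the markup.
    rendered = " ".join(f'{name}="{escape(value, quote=True)}"' for name, value in attrs)
    return f"<a {rendered}>{escape(text)}</a>"


print(robust_link("http://www.stanford.edu", "2014-08-15", "Stanford",
                  snapshot_uri="http://archive.is/FAy6o"))
print(robust_link("http://www.stanford.edu", "2014-08-15", "Stanford"))
```

The attributes are plain `data-` attributes, so the decorated link still behaves as an ordinary link in any browser that runs no extra script.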
Robust Links Using Link Decorations, JavaScript, Memento API
Demo - http://robustlinks.mementoweb.org/demo/uri_references_js.html
robustlinks.js - https://github.com/mementoweb/robustlinks
Activate Robust Links
There are currently no Link Decorations, but there is an article publication date:
• Express the article publication date in an actionable manner (‘datePublished’ or ‘dateModified’ Schema.org properties) in HTML pages that contain URI references
• Tailor robustlinks.js to exclude links to articles
• Inject robustlinks.js into HTML pages that contain URI references
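To illustrate what "actionable" means here, the sketch below reads a `datePublished` value from HTML microdata using only the Python standard library. It is not how robustlinks.js works internally; the class and function names are hypothetical, the sample page is invented, and other encodings of Schema.org properties (JSON-LD, RDFa) are not handled.

```python
from html.parser import HTMLParser
from typing import Optional


class PublicationDateFinder(HTMLParser):
    """Record the first Schema.org datePublished value found in microdata."""

    def __init__(self) -> None:
        super().__init__()
        self.date_published: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if self.date_published is None and attributes.get("itemprop") == "datePublished":
            # Assumed carriers: <meta content="..."> or <time datetime="...">.
            self.date_published = attributes.get("content") or attributes.get("datetime")


def find_publication_date(html: str) -> Optional[str]:
    """Return the datePublished value of a page, or None if it has none."""
    finder = PublicationDateFinder()
    finder.feed(html)
    return finder.date_published


page = (
    '<article><meta itemprop="datePublished" content="2014-08-15">'
    '<a href="http://www.stanford.edu">Stanford</a></article>'
)
print(find_publication_date(page))
```

A page without such a property returns `None`, in which case there is no date to pair with the referenced URIs.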
Users Follow Robust Links into Web Archives
The combination of the referenced URI and the article publication date:
• Leads users to a snapshot in a web archive, created as close as possible to the article publication date
• Addresses link rot
• Addresses content drift
Create Archive Copies
When ingesting new content into the platform:
• Parse for URI references
• Create snapshots in web archives of selected URIs
• For these URIs, use Link Decorations in HTML to convey:
  • original URI
  • snapshot URI
  • snapshot Date/Time
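The first two ingest steps can be sketched as follows. The selection rule is an assumption made for illustration: it keeps `http`/`https` links and skips hosts in a hypothetical `ARTICLE_HOSTS` set, on the reasoning that links to articles are handled separately from Web at Large resources. The slides do not state how URIs are selected, and the snapshot step itself is left out.

```python
from html.parser import HTMLParser
from typing import List
from urllib.parse import urlparse

# Hypothetical list of hosts treated as links to articles rather than
# to Web at Large resources; a real platform would maintain its own.
ARTICLE_HOSTS = {"doi.org", "dx.doi.org"}


class LinkCollector(HTMLParser):
    """Collect the href of every <a> element in a document."""

    def __init__(self) -> None:
        super().__init__()
        self.uris: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.uris.append(href)


def select_uris_to_archive(html: str) -> List[str]:
    """Return de-duplicated http(s) URIs that do not point at article hosts."""
    collector = LinkCollector()
    collector.feed(html)
    selected: List[str] = []
    for uri in collector.uris:
        parts = urlparse(uri)
        is_web = parts.scheme in ("http", "https")
        if is_web and parts.hostname not in ARTICLE_HOSTS and uri not in selected:
            selected.append(uri)
    return selected


sample = (
    '<p><a href="http://dx.doi.org/10.1371/journal.pone.0115253">paper</a> '
    '<a href="http://www.stanford.edu">site</a> '
    '<a href="mailto:someone@example.org">mail</a> '
    '<a href="http://www.stanford.edu">site again</a></p>'
)
print(select_uris_to_archive(sample))
```

Each URI returned would then be submitted to a web archive, and the resulting snapshot URI and Date/Time written back into the HTML as Link Decorations.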
Users Follow Robust Links into Web Archives
The Link Decorations:
• Lead users to the created snapshot, if the web archive is operational
• Lead users to a snapshot in any web archive, created as close as possible to the snapshot Date/Time
• Address link rot
• Address content drift
http://hiberlink.org #hiberlink