interoperability and technical collaboration for web and social media archiving

12
Interoperability and Technical Collaboration for Web and Social Media Archiving Nicholas Taylor (@ nullhandle ) Web Archiving Service Manager Stanford University Libraries DocNow Advisory Board Meeting August 22, 2016

Upload: nullhandle

Post on 13-Apr-2017

64 views

Category:

Internet


0 download

TRANSCRIPT

Page 1: Interoperability and Technical Collaboration for Web and Social Media Archiving

Interoperability and Technical Collaboration for Web and Social Media Archiving

Nicholas Taylor (@nullhandle)Web Archiving Service ManagerStanford University Libraries

DocNow Advisory Board MeetingAugust 22, 2016

Page 3: Interoperability and Technical Collaboration for Web and Social Media Archiving

Heritrix is great for archiving the Web

“Stanford University”

Page 4: Interoperability and Technical Collaboration for Web and Social Media Archiving

…of ten years ago

Internet Archive: “Stanford University Homepage”

Page 5: Interoperability and Technical Collaboration for Web and Social Media Archiving

newer capture approaches• headless browsers

– to prospect content only apparent by executing JavaScript

• archiving proxies– to enable more, and more specialized,

capture tools to write to WARC• leveraging APIs

– to more reliably collect higher-fidelity data from major social media services

Page 6: Interoperability and Technical Collaboration for Web and Social Media Archiving

WARC in Social Feed Manager

Justin Littman: “Aligning Social Media Harvesting and Web Harvesting”

Page 8: Interoperability and Technical Collaboration for Web and Social Media Archiving

technical architectures to facilitate contributions by

a broad community?

Page 9: Interoperability and Technical Collaboration for Web and Social Media Archiving

community frameworks to enable broad

participation in shaping technologies?

Page 10: Interoperability and Technical Collaboration for Web and Social Media Archiving

how to build more, and more distributed,

capacity?

Page 11: Interoperability and Technical Collaboration for Web and Social Media Archiving

how to make web and social media archiving

more inclusive?

Page 12: Interoperability and Technical Collaboration for Web and Social Media Archiving