designing preservable websites

16
Designing Preservable Websites Nicholas Taylor @nullhandle DC, VA & MD Search Engine Marketing Meetup July 18, 2012 found glass ” by Flickr user nuanc under CC BY-NC-ND

Upload: nullhandle

Post on 08-May-2015

357 views

Category:

Technology


0 download

DESCRIPTION

Presentation given at the DC, VA and MD Search Engine Marketing Meetup on how to design websites to improve their archiving and preservation.

TRANSCRIPT

Page 1: Designing Preservable Websites

Designing Preservable Websites

Nicholas Taylor@nullhandle

DC, VA & MD Search Engine Marketing MeetupJuly 18, 2012

“found glass” by Flickr user nuanc under CC BY-NC-ND 2.0

Page 2: Designing Preservable Websites

why preserve the web?

copy of the first webpage

Page 3: Designing Preservable Websites

web archivists aren’t visible stakeholders

design

usage

archiving

Page 4: Designing Preservable Websites

search engine crawler ≠archival crawler

“GoogleBots” by Flickr user ares64 under CC BY 2.0

Page 5: Designing Preservable Websites

what is a “preservable” website?

“Fish Preserver” by Flickr user ecstaticist under CC BY-NC-SA 2.0

Page 6: Designing Preservable Websites

three priorities:

• capture: can resources be acquired by current web archiving technologies?

• replay: can the user’s experience of the original website be recreated from the archived resources?

• preservation: how can it be assured that the archived website remains coherent over time?

Page 7: Designing Preservable Websites

follow web standards and accessibility guidelines

“Web Standards Fortune Cookie” by Flickr user mherzber under CC BY-SA 2.0

Page 8: Designing Preservable Websites

be careful with robots.txt exclusions

robots.txt for Last.fm

Page 9: Designing Preservable Websites

use a site map, transparent links, and contiguous

navigation

“Card sorting” by Flickr user Manchester Library under CC BY-SA 2.0

Page 10: Designing Preservable Websites

maintain stable URLs and redirect when necessary

“Improvised detour sign” by Flickr user Jason McHuff under CC BY-SA 2.0

Page 11: Designing Preservable Websites

consider using a Creative Commons license

“2500 Creative Commons Licenses” by Flickr user qthomasbower under CC BY-SA 2.0

Page 12: Designing Preservable Websites

use durable data formats

“Lascaux cave painting” by Flickr user qoforchris under CC BY-ND 2.0

Page 13: Designing Preservable Websites

embed metadata, especially the character encoding

source code of http://www.seo.com/

Page 14: Designing Preservable Websites

use archiving-friendly platform providers and CMSs

robots.txt for Drupal 7

Page 15: Designing Preservable Websites

three tips

1. see how well your site validates on http://validator.w3.org/

2. see how your site looks on http://archive.org/

3. your favorite online sitemap generator is a good starting point

“Highlighters” by Flickr user KJGarbutt under CC BY-ND 2.0

Page 16: Designing Preservable Websites

thank you!

Nicholas Taylor@nullhandle