history in the digital age

Upload: trevor-owens

Post on 06-Apr-2018

221 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/2/2019 History in the Digital Age

    1/19

    S U S A N M C E L R A T H

    U N I V E R S I T Y A R C H I V I S T

    Digital Projects in SpecialCollections

    A M E R I C A N U N I V E R S I T Y M A R C H 7 , 2 0 1 2

  • 8/2/2019 History in the Digital Age

    2/19

    Digital Collections, Exhibits, and Repositories

    What is the difference?

    Repository multiple collections or institutions

    Collection

    Exhibit

    one theme a selection of items

  • 8/2/2019 History in the Digital Age

    3/19

    Multi-Institutional Digital Repository

  • 8/2/2019 History in the Digital Age

    4/19

    Institutional Digital Repository

  • 8/2/2019 History in the Digital Age

    5/19

    Thematic Digital Collection

  • 8/2/2019 History in the Digital Age

    6/19

    Digital Exhibit

  • 8/2/2019 History in the Digital Age

    7/19

    Digital Exhibit on 1960 San Francisco Fire

  • 8/2/2019 History in the Digital Age

    8/19

    Alternate approach to same topic

  • 8/2/2019 History in the Digital Age

    9/19

    Digitization Project Planning

    What work needs to be done;

    How it will be done (according to which standards,specifications, best practices);

    Who should do the work (and where);

    How long the work will take; How much it will cost, both to "resource" the

    infrastructure and to do the content conversion

    http://www.ncecho.org/dig/guide_1planning.shtml http://www.nyu.edu/its/humanities/ninchguide/II/

  • 8/2/2019 History in the Digital Age

    10/19

    Components of Digitization Projects

    Planning and Project Management

    Selection File Formats master & access derivatives

    Conservation Treatment

    e orma ng Metadata Design & Creation

    Quality Control

    Web Platform Open source vs. proprietary systems

    Preservation

  • 8/2/2019 History in the Digital Age

    11/19

    Selection Criteria

    Should they be digitized?

    Research Value May they be digitized? Copyright status

    Can they be digitized? Condition

    Format

    http://www.nedcc.org/resources/leaflets/6Reformatting/06Prese

    rvationAndSelection.php

    http://www.dlib.org/dlib/september09/ooghe/09ooghe.html

  • 8/2/2019 History in the Digital Age

    12/19

    Digitization Standards

    Technical Standards

    Federal Agency Digitization Guidelines Initiative (FADGI) http://www.digitizationguidelines.gov/

    NARA

    http://www.cdlib.org/services/dsc/tools/docs/cdl_gdi_v2.pdf

    University of Colorado

    https://www.cu.edu/digitallibrary/cudldigitizationbp.pdf

  • 8/2/2019 History in the Digital Age

    13/19

    Metadata Requirements

    Metadata Requirements

    Descriptive Metadata Technical & Administrative Metadata

    Element Sets and Standards

    Du n Core http://dublincore.org/documents/dces/

    METS/MODS

    http://www.loc.gov/standards/mods/

    http://www.loc.gov/standards/mets/ VRA Core

    http://www.loc.gov/standards/vracore/

  • 8/2/2019 History in the Digital Age

    14/19

    Web Platform Options

    Open Source Software

    OMEKA Greenstone

    DSpace

    Proprietary Software Contentdm (OCLC)

    Luna Insight

    Digitool

  • 8/2/2019 History in the Digital Age

    15/19

    Web Harvesting involves:

    Identifying and collecting web resources

    Providing search capability for archived webcollections

    Managing and preserving web resources

  • 8/2/2019 History in the Digital Age

    16/19

    Web Harvesting

    The most common web archiving technique uses web

    crawlers to automate the process of collecting webpages. Web crawlers typically view web pages in thesame manner that users with a browser see the Web,

    method of remotely harvesting web content.

  • 8/2/2019 History in the Digital Age

    17/19

    Web Crawling Problems

    Robots exclusion protocol may deny crawlers access

    to portions of a website. Large portions of a web site may be hidden in the

    deep Web.

    Crawler traps may cause a crawler to download aninfinite number of pages, so crawlers are usuallyconfigured to limit the number of dynamic pagesthey crawl.

    Calendars often cause problems for crawlers.

  • 8/2/2019 History in the Digital Age

    18/19

    Web Harvesting Resources

    International Internet Preservation Consortium

    http://netpreserve.org/about/index.php Library of Congress http://www.loc.gov/webarchiving

    Archive-It (Service) www.archive-it.org

  • 8/2/2019 History in the Digital Age

    19/19

    American University Digital Collections