Digital Preservation
MetaArchive Cooperative
9:00-9:45 - Session 1: Digital Preservation Overview
9:45-11:00 - Session 2: Policy & Planning Overview
11:00-11:20 - Break 11:20-12:20 - Session 3: Implementing
Policies & Solutions 12:20-12:30 – Q & A 12:30-4pm - Lunch & Post-Workshop
Informational Meeting
3
How do you start a digital preservation policy development initiative?
Who are the key people in your institution to inform and involve?
What are the basic components of a strong digital preservation policy?
What role do campus/institutional stakeholders play in your policies?
How might institutional values impact policy creation?
4
Session 1
1. What is Digital Preservation?2. A Look at Trends in Digital Preservation
The Goal:To understand the evolving landscape of digital preservation requirements and consider the potential investments needed for developing a policy-driven approach to digital preservation.
6
7
Not to be confused with digitization
8
More than a simple back-up strategy
9
Not as simple as storing things in the Cloud
Though Amazon, Microsoft, Google and now Apple would like you to think so
10
Not an archiving system
11
There has been lots of research into how to preserve digital formats and the applications that create and maintain them.
There have been lots of debates about significant properties and acceptable loss.
Still a lot of research to do, and issues to discuss.
But one thing is clear…
12
…there is no single solution.
13
Ingest Format Validation Audit Storage
Fixity Checking Geographic Replication
Access Repair Data Wrangling Metadata Testing Trust Rights
“The series of managed activities necessary to ensure continued access to digital materials for as long as necessary.”- Definition from Digital Preservation Coalition
14
Quick question?
15
Centralized & Distributed Preservation
Full & Bit-level Preservation Preservation Metadata Open Source Solutions Focus on Economies of Scale and
Benefits Roles & Responsibilities Sustainability Standards and Auditing Metrics
16
Centralized Preservation: Preservation activities managed by a single institution Examples:▪ Library of Congress - Chronicling America▪ FCLA Florida Digital Archive – DAITSS▪ OCLC Digital Archive
Distributed Preservation: Preservation activities managed by multiple
institutions replicating and/or geographically locating collections
Examples:▪ LOCKSS▪ MetaArchive Cooperative ▪ Chronopolis▪ DuraCloud
17
“Full” Preservation Focuses heavily on format migration and
normalization (may still preserve the original)▪ Highly concerned with monitoring and intervening
against format obsolescence up-front
Bit-level Preservation Focuses primarily on preserving the original bits▪ Not opposed to format migration but avoids monitoring
up-front and cites long-lived support or convertability of the majority of formats
Many archives doing a bit of both Somewhat of a false dichotomy
18
PREMIS Intellectual Entities (books, articles, databases) Objects (files, bitstreams, formats) Agents (people, organizations, software) Events (actions involving Agents & Objects) Rights (permissions)
Metadata standards are always under development – weigh issues of adoption seriously
19
Open source is a well recognized best practice at this point – real question is: How open?
Why Open source? Avoiding proprietary solutions can guard against
dependencies and sudden loss Open source formats and technologies maximize
communities of support and ensure flexibility and long-lived solutions
Open source approaches dramatically reduce technology costs and can lead to building of expertise
20
Digital preservation needs are great and digital preservation can be costly
Blue Ribbon Task Force on Sustainable Digital Preservation and Access Economies of scale can reduce staff costs Focus on communicating the benefits to the
institution aids in selection and prioritization▪ Prioritization crucial for offsetting costs▪ Define the institutional value of your assets
You don’t have (shouldn’t try) to save everything!
21
Partnering with other institutions to preserve content is becoming more popular Sharing resources and expertise reduces costs Maintains control over institutional assets rather
than handing over responsibility to third parties Consumers also becoming Producers and
Preservers of digital assets
Modularizing the chain of preservation activities (ingest, storage, dissemination) Microservices and interoperability
22
Many grant-funded projects are short-lived or narrowly focused
Institutions have been pressured to just enter the game and hope for the best
Diverse revenue streams becoming essential
Emphasis on collaborationPromoting self-sustaining cost models
23
Trustworthy digital repositories! Reference Model for an Open Archival
Information System (OAIS) - 2002 Trusted Repositories Audit &
Certification (TRAC) – 2007 Metrics for Digital Repository Audit &
Certification – soon to be an ISO standard
24
Time for one or two quick questions…
25