preservation metadata initiatives: status and direction brian lavoie senior research scientist...
DESCRIPTION
Preservation Metadata: Examples Provenance: Who has had custody/ownership of the digital object? Authenticity: Is the digital object what it purports to be? Preservation Activity: What has been done to preserve the digital object? Technical Environment: What is needed to render and use the digital object? Rights Management: What IPR must be observed?TRANSCRIPT
![Page 1: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/1.jpg)
Preservation Metadata Initiatives:Status and Direction
Brian LavoieSenior Research ScientistOffice of ResearchOCLC
Archiving Web ResourcesCanberraNovember 10, 2004
![Page 2: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/2.jpg)
Metadata and Preservation Metadata
“Structured information thatdescribes, explains, locates,or otherwise makes it easier toretrieve, use, or manage aninformation resource”
METADATA
Descriptive
Structural
Administrative
PRESERVATIONMETADATA
“Information that supportsand documents the digitalpreservation process”
Administrative
Structural
Descriptive
![Page 3: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/3.jpg)
Preservation Metadata: Examples Provenance:
• Who has had custody/ownership of the digital object?
Authenticity:• Is the digital object what it purports to be?
Preservation Activity:• What has been done to preserve the digital object?
Technical Environment:• What is needed to render and use the digital object?
Rights Management:• What IPR must be observed?
![Page 4: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/4.jpg)
Why Is Preservation Metadata Important? Digital objects are technology-dependent …
• Means to access and use archived object must be documented• Complex technological environment between content and user• Technical metadata especially important
Digital objects are mutable …• Can be easily altered, impacting look, feel, functionality• Changes to object must be documented/validated• Provenance metadata especially important
Digital objects are bound by intellectual property rights …• Preservation must proceed while copyright still in effect• May constrain preservation activities and access policies• Rights management metadata especially important
Makes digital objects self-documenting across time
![Page 5: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/5.jpg)
Preservation Metadata Initiatives …Around the World
CEDARSOCLC
NLA
NEDLIBNLNZ
U. of E.
![Page 6: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/6.jpg)
Towards Consensus:OCLC/RLG Preservation Metadata Framework Working Group
March 2000: OCLC, RLG jointly sponsored international working group on preservation metadata• Identify key issues, seek consensus
White paper (January 2001)• Defined preservation metadata; role in preservation process• Reviewed/synthesized existing preservation metadata initiatives
Preservation metadata framework (June 2002)• Comprehensive description of types of information constituting
preservation metadata• Based on OAIS information model• Set of “prototype” preservation metadata elements
![Page 7: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/7.jpg)
Next Steps: PREMIS Preservation Metadata Framework
• Consolidated expertise• Foundation for formal schemas• Shared departure point for different schema implementations• Interest in moving framework closer to an implementable status
June 2003: OCLC, RLG sponsored new working group:• PREMIS: Preservation Metadata: Implementation Strategies
Objectives• Define core set of preservation metadata elements, with
supporting data dictionary, applicable to broad range of digital preservation activities
• Identify and evaluate alternative strategies for encoding, storing, managing, and exchanging preservation metadata
![Page 8: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/8.jpg)
PREMIS: Current Status
Core Elements: element-by-element review of prototype elements from metadata framework:• Is the element “core”?• Data dictionary: definition, rationale, examples, usage rules
(data constraints, obligation, repeatability)
Implementation Strategies: survey to identify key characteristics of digital preservation repositories:• Mission, content, funding, policies, etc.• Focus on how metadata is used to support repository
processes, functions, and policies• http://www.oclc.org/research/projects/pmwg/surveyreport.pdf
Expected completion: December 2004
![Page 9: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/9.jpg)
Preservation Metadata Schemas: Perspectives
Prospects for consensus, standards …• Foundation starting to coalesce, informing current work• But must overcome differences across systems and
policies
Involve all stakeholders: creators, publishers, all cultural heritage communities
Focus on internal guidance AND interoperability
Avoid re-invention of wheels: potential overlap between other metadata initiatives
![Page 10: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/10.jpg)
Implementation Issues: Tools General consensus that:
1) Metadata is key component of digital preservation process2) Preservation metadata is expensive to create and maintain3) Need to minimize human mediation
JSTOR/Harvard Object Validation Environment (JHOVE):• Identify, validate, and characterize digital object formats• Modules for: TIFF (various versions), PDF, XML, and others
NLNZ Preservation Metadata Extract Tool:• Extracts information from digital file headers (e.g., MS Word, TIFF,
WAV, bitmaps); outputs metadata in XML format
Surface preservation metadata tools in variety of digital repository environments (Dspace, Fedora, DAITSS)
![Page 11: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/11.jpg)
Implementation Issues: Economics Develop economical ways of acquiring and maintaining preservation
metadata PRONOM File Format Registry (UK National Archives)
• Technical metadata about specific file formats• Description of software needed to create, render, migrate formats• Metadata created once, re-used many times
Automatic Exposure (RLG)• Facilitate capture of metadata specified in NISO Z39.87 (Technical Metadata
for Digital Still Images)• Dialog with digital scanner/camera manufacturers• Technical metadata automatically captured when object created
Reduce cost of metadata creation by leveraging opportunities for sharing and re-use, and diffusing metadata capture throughout information lifecycle
![Page 12: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/12.jpg)
Implementation Issues: Packaging Link (physically or logically) archived digital object and all
associated metadata
OAIS Information Package• Conceptual structure for information moving into, through, and out of
archival system• Digital object and its metadata, bound into single logical package
Metadata Encoding and Transmission Standard (METS)• XML schema for encoding descriptive, administrative, and structural
metadata associated with digital object• PREMIS elements to be implemented as METS extension schema
Sharing and re-use of preservation metadata in a networked repository environment requires standard mechanisms for encoding and exchange
![Page 13: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/13.jpg)
Implementation Issues: Perspectives
Current focus on format validation and technical metadata. Need work on tools that:• Address other forms of preservation metadata• Support formal preservation metadata schemas (PREMIS core)
Division of labor:• Map preservation metadata requirements to appropriate stages of
information lifecycle• Allocate responsibility for collecting metadata
“Quality assurance”
![Page 14: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/14.jpg)
Looking ahead … Questions of “what type”, “how much” preservation metadata
still unsettled …• Digital preservation processes still not fully tested/understood• Metadata requirements shaped by local repository characteristics
Collaboration essential:• Pooling expertise from variety of institutional perspectives
mitigates uncertainty• Highlights points of convergence/divergence; helps distinguish
metadata that is widely applicable vs. domain-specific• Helps identify best practices and encourages standards-building
In the meantime … “Good judgment is based on experience, and experience is based on bad judgment”
![Page 15: Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra](https://reader036.vdocuments.us/reader036/viewer/2022082510/5a4d1b0e7f8b9ab05998d661/html5/thumbnails/15.jpg)
More information…
PADI Preservation Metadata Bibliography:http://www.nla.gov.au/padi/topics/32.html
PREMIS: http://www.oclc.org/research/projects/pmwg/ JHOVE: http://hul.harvard.edu/jhove/jhove.html PRONOM: http://www.nationalarchives.gov.uk/pronom/ OAIS: http://ssdoo.gsfc.nasa.gov/nost/wwwclassic/documents/
pdf/CCSDS-650.0-B-1.pdf METS: http://www.loc.gov/standards/mets/