repositories and preservation by ed pinsent
Repositories and preservation by Ed Pinsent from ULCC. Presented at IRMW12.
Repositories and Preservation
Ed Pinsent
ULCC/LEAP IRM Workshop
15 June 2012
2
Why preservation?
• Long-term value
• Legal needs – compliance or rights
• Business needs
• Cost a lot to produce
• Enhance reputation
3
May depend on…
• Institutional commitment to doing it
• Mandate to preserve
• Business drivers
• Advocacy for “best practice”
• “We are all in the business of knowledge and its preservation”
4
Open Archival Information System (OAIS) Reference Model
5
EPrints and PRESERV
6
FEDORA and OAIS
7
Ingest procedure…+ tools
1. Fixity generation – MD5 checksum
2. Virus checking – AVG
3. Format identification – DROID + PRONOM
4. Format validation – JHOVE
5. Environmental metadata extraction – NLNZ Metadata Extract Tool
6. Format-specific metadata extraction – NLNZ Metadata Extract Tool
7. Store in digital archive – software script
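Step 1 of the procedure above (fixity generation) can be sketched in a few lines. This is only an illustration using Python's standard hashlib module, not the script used in the workflow described here; the function names md5_checksum and verify_fixity are hypothetical.

```python
import hashlib


def md5_checksum(path, chunk_size=65536):
    """Return the MD5 hex digest of a file, reading it in chunks.

    Hypothetical helper: chunked reading keeps memory use flat
    even for very large files.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_fixity(path, recorded_checksum):
    """Return True if the file still matches the checksum recorded at ingest."""
    return md5_checksum(path) == recorded_checksum.strip().lower()
```

The checksum would be generated once at ingest and stored with the object; running verify_fixity later against the stored value shows whether the file has changed since then.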
8
Integrated tools and plugins
• DROID (Digital Record Object Identification) – an automatic file format identification tool
  – Looks up live data held in PRONOM
  – Outputs definitive file format profile, size, extension + checksum
• EPrints plugin
  – Automates process
  – Stores in database as metadata
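As a rough illustration of the kind of per-file record such a plugin might store as metadata, the sketch below gathers size, extension and checksum with the Python standard library. It does not call DROID, PRONOM or EPrints; the function name basic_file_profile and the field names are assumptions, and the format identifier is left empty because it would come from an identification tool.

```python
import hashlib
import os


def basic_file_profile(path):
    """Build a simple metadata record for one file.

    Hypothetical structure: the field names are illustrative and
    do not reproduce DROID's actual output format.
    """
    with open(path, "rb") as handle:
        checksum = hashlib.md5(handle.read()).hexdigest()
    return {
        "filename": os.path.basename(path),
        "extension": os.path.splitext(path)[1].lstrip(".").lower(),
        "size_bytes": os.path.getsize(path),
        "md5": checksum,
        # Placeholder: a format identifier would be supplied by a
        # format identification tool, not derived here.
        "format_id": None,
    }
```

A repository could store one such record per deposited file, filling in the format identifier once identification has run.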