scidip-es components oct 22-23 2014,brussels. basic preservation strategies often stated as:...

16
SCIDIP-ES Components Oct 22-23 2014,Brussels

Upload: alexander-gilbert

Post on 17-Dec-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

SCIDIP-ES ComponentsOct 22-23 2014,Brussels

Page 2: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

Basic Preservation Strategies

Often stated as: “Emulate or Migrate”OAIS concepts change these to:• Add Representation Information

• includes emulation• Transform

• more specific than “migrate”• Hand over to another repository

Page 3: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

When things change

• We need to:• Know something has changed

• Identify the implications of that change

• Decide on the best course of action for preservation

• What RepInfo we need to fill the gaps

• Created by someone else or creating a new one

• If transformed: how to maintain data authenticity

• Alternatively: hand it over to another repository

• Make sure data continues to be usable

Orchestration Service

Gap Identification

Service

Preservation Strategy Tk

RepInfo Registry Service

Authenticity Toolkit

Storage Service

Data Virtualisat

ion Toolkit

Process Virtualisat

ion Toolkit

RepInfo

Toolkit

Page 4: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

Threat Requirement for solutionUsers may be unable to understand or use the data e.g. the semantics, format, processes or algorithms involved

Ability to create and maintain adequate Representation Information

Non-maintainability of essential hardware, software or support environment may make the information inaccessible

Ability to share information about the availability of hardware and software and their replacements/substitutes

The chain of evidence may be lost and there may be lack of certainty of provenance or authenticity

Ability to bring together evidence from diverse sources about the Authenticity of a digital object

Access and use restrictions may make it difficult to reuse data, or alternatively may not be respected in future

Ability to deal with Digital Rights correctly in a changing and evolving environment

Loss of ability to identify the location of data

An ID resolver which is really persistent

The current custodian of the data, whether an organisation or project, may cease to exist at some point in the future

Brokering of organisations to hold data and the ability to package together the information needed to transfer information between organisations ready for long term preservation

The ones we trust to look after the digital holdings may let us down

Certification process so that one can have confidence about whom to trust to preserve data holdings over the long term

RepInfo toolkit, Packager and Registry – to create and store Representation Information.In addition the Orchestration Manager and Knowledge Gap Manager help to ensure that the RepInfo is adequate .

Registry and Orchestration Manager to exchange information about the obsolescence of hardware and software, amongst other changes.The Representation Information will include such things as software source code and emulators.

Authenticity toolkit will allow one to capture evidence from many sources which may be used to judge Authenticity.

Packaging toolkit to package access rights policy into AIP

Persistent Identifier system: such a system will allow objects to be located over time.

Orchestration Manager will, amongst other things, allow the exchange of information about datasets which need to be passed from one curator to another.

Certification toolkit to help repository manager capture evidence for ISO 16363 Audit and Certification

Page 5: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

APARSEN test audit findings

• Lack of definition of Designated Community• Lack of adequate Representation Information• Inadequate Archival Information Packages• Lack of hand-over plans

Page 6: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

SCIDIP-ES – e-Infrastructure for preservation

SCIDIP-ES in brief

• Upgrade CASPAR prototype components into scalable, robust e-infrastructure components to support digital preservation of all types of digital objects

• decentralised, heterogeneous, asynchronous, no single point of failure

• Persistent, simple re-implementable interfaces

• critical mass of users:

• Earth science as initial focus

• Other disciplines via APA

DIGITAL PRESERVATION RESEARCH needed to create the tools needed to create the “metadata” used by the e-infrastructure and user applications. Tools may be domain dependent. Must include Rep. Info. Network of the metadata

SCIence Data Infrastructure for Preservation – with focus on Earth Science http://www.scidip-es.eu

Storage Service

Gap Identification

Service

Orchestration Service

RepInfo Registry Service

Preservation Strategy Toolkit

Process Virtualisation

Toolkit

Finding Aid

Toolkit

Cloud Storage

Persistent ID i/f Service

External PI

services

ISO Certification Organisation

Certification Toolkit

External Access/Use

Services

E-INFRASTRUCTURE

TOOLKITS

Archives

User applications

Domain independent Infrastructure counters threats identified by PARSE.Insight based on CASPAR prototypes

Consistent with APARSEN integrated view

Will help archives with certification

Page 7: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation
Page 8: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation
Page 9: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

Conclusions: Services and toolkits help repositories to…

• share the effort of preservation• address major threats to digital preservation by

supplementing what they currently do• proof from CASPAR and PARSE.Insight• applicable to all types of digital objects

• become trustworthy• add value to digital holdings

Page 10: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

END

Page 11: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

Add Representation Information

• OAIS introduces the concept of Representation Information• Information to help understand the digitally encoded object -

includes• emulators• bit-level descriptions• dictionaries

• Ideally description allows automated extraction of information

• In general if a digital object is no longer usable/understandable adding Representation Information digital can often solve the problem

Page 12: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

Migration• OAIS defines various types of Migration:

• Do not change the bits • Refresh• Replicate

• Change the packaging but not the content• Repackage

• Change the content• Transform (usually non-reversible)

• Need to consider “Transformational Information Properties” – important for AUTHENTICITY• Related to “Significant properties”

• Add appropriate Representation Information for the new format

Page 13: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

AND – be prepared toHand-over

• Preservation requires funding• Funding for a dataset (or a repository) may stop• Need to be ready to hand over everything needed for preservation

• OAIS (ISO 14721) defines “Archival Information Package (AIP) which brings together everything needed for long term preservation

• With information which covers• Understandability• Authenticity• How things are packaged together

• Not a one-off• Need to ensure that Understandability (for the Designated Community) is

maintained• Needs a support system

Page 14: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

Preservation Planning Processes

Scop

ing

Form

ulation

Imp

l

ESA, Rome 14/11/2013

Page 15: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

• Design Preservation Network Model (PNM)• Capture PNM properties

• cost, risks, objectives, decisions, actions links to metric evidence…

• Evaluate and select preservation solution/s

ESA, Rome 14/11/2013

Form

ulation

Preservation Strategies Toolkit

Page 16: SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation

ESA, Rome 14/11/2013

Imp

lemen

tation

• Design RepInfo Network• Create RepInfo objects

• Capture RepInfo properties• façade to various tools• Search, re-use and share Registry

objects• Maintain registry objects

Repinfo Toolkit