sier man 2009
TRANSCRIPT
-
7/30/2019 Sier Man 2009
1/9
Liber QuarterLy 19 (1), apl 2009 iSSN: 1435-5205. P1321hp://l.l..nl/ig, uch Plshng & achvng Svcs
ths wok s lcnsd nd Cv Commons aon 3.0 unpod Lcns
Liber Quarterly Volume 19 Issue 1 2009 13
The Jigsaw Puzzle o
Digital Preservation an Overview
Barbara Sierman
Team Leader, Digital Preservation Research,
Koninklijke Bibliotheek, National Library o the Netherlands,
PO Box 90407, 2509 LK The Hague,
Abstract
Beore the 22nd Annual Meeting o the Board o Directors o the Foundation CENL,
Zagreb, September 2427, 2008, the author presented a clear overview o the latest
developments in digital preservation in a European context. She dealt with organisa-
tional aspects, the digital objects themselves, and the eects o international European
collaboration. She calls on European organisations such as the Alliance or Permanent
Access to sustain the results o temporary projects like PLANETS and thereby bring
the pieces o the digital preservation puzzle together.
This paper is being published in preparation o the workshop on Curating Research:
e-Merging New Roles and Responsibilities in the European Landscape, which is being
co-organised by LIBER on 17 April 2009 at the Koninklijke Bibliotheek in The Hague.
Key Words: digital preservation; CENL; libraries; European projects
Introduction
Digital preservation is like a jigsaw puzzle: a nice box with thousands o
pieces in it and a beautiul picture on the outside, which you can see i all the
pieces o the puzzle are put together in the right way, oten ater a tremen-
dous lot o eort and perseverance. The digital preservation picture on the
lid o the box would be o a crowd o happy library users, looking, listening
and playing with digital objects which their parents and grandparents cre-
ated, but rendered in their own computer environment. When this picture
http://liber.library.uu.nl/http://creativecommons.org/licenses/by/3.0/http://liber.library.uu.nl/mailto:[email protected]://www.kb.nl/curatingresearchhttp://www.kb.nl/curatingresearchmailto:[email protected]://liber.library.uu.nl/http://creativecommons.org/licenses/by/3.0/http://creativecommons.org/licenses/by/3.0/http://liber.library.uu.nl/ -
7/30/2019 Sier Man 2009
2/9
The Jigsaw Puzzle of Digital Preservation an Overview
14 Liber Quarterly Volume 19 Issue 1 2009
becomes a reality, it will demonstrate that the library community preserved
the heritage in the right way and that it guaranteed the accessibility and
usability over the years.
In the past ew years, much eort has been devoted to raising awareness o
the issue o digital preservation, especially amongst cultural heritage institu-
tions. All those articles, presentations and discussions are gradually begin-
ning to pay o. Digital preservation is no longer a topic that needs to be
explained. On the other hand, however, the ultimate goal, the picture on the
lid o the box where all the dierent pieces will become a coherent entity, is
still not a reality. Although we are making progress, work is too ragmented
and has not led to an out-o-the-box solution. Lots o people are working in
the area o digital preservation, but still much eort is needed to integrate the
work done on separate pieces o the puzzle.
There are lots o methodologies to complete a puzzle. Some people start by
looking or the corner pieces, other people will complete the outer edges rst
and yet another category will rst collect the blue and white pieces to nish
the clouds. In digital preservation similar processes are taking place. With so
many organisations involved, the list o topics related to digital preservation
research gets longer every day. In this article I will make a selection and show
you the current state o aairs in three areas:
the place o digital preservation within an organisation;
developments with regard to the digital objects; and
the eects o international collaboration.
The Place of Digital Preservation within an Organisation
Digital preservation is an intrinsic process and not a separate activity. Whetherit concerns a library or an archive, digital preservation aects the organisa-
tion as a whole and should not be an isolated activity. Work fows need to
be designed or collection policies and management, selection and appraisal,
metadata, access procedures etc. in the same vein as or printed collections.
Several initiatives have been devised to support organisations in implementing
digital preservation, both or newcomers such as organisations starting to
think about setting up a digital repository and more experienced organisa-
http://liber.library.uu.nl/http://liber.library.uu.nl/ -
7/30/2019 Sier Man 2009
3/9
Barbara Sierman
Liber Quarterly Volume 19 Issue 1 2009 15
tions wishing to evaluate their policies and the eects o their preservation
activities.
(Self) Auditing
The status o being a trusted, or more correctly termed a trustworthy reposi-
tory is the ultimate goal o an organisation with a digital collection that needs
to be accessible and usable over time. A rst initiative designed to raise the
issue o certication and to provide guidelines or a trusted repository was
the joint publication in 2002 by the US Research Libraries Group (RLG) and
OCLC o Trusted Digital Repositories: Attributes and Responsibilities. Many
organisations are presently using this document as a checklist. The successo this document and the need or a real auditing instrument led in 2007 to
a new initiative designed to update these guidelines with the latest insights
and experiences, and to turn it into a clear, understandable ISO standard
which can be used as a certication and auditing instrument in the digital
preservation community. To involve as many parties as possible, everyone
interested can participate in this initiative; the discussions and outcomes are
publicly available.1 It can oten take years to create an ISO standard, but a
rst drat will be available by the end o this year.
Audit and certication can also be looked at rom a dierent angle, as is doneby the DRAMBORA initiative. This Digital Repository Audit Method based
on Risk Assessment looks upon digital preservation as the task o managing
risks. It oers training and tools to perorm a risk analysis o the organisa-
tion in order to identiy areas that can be improved. A third initiative is the
Catalogue of Criteria for Trusted Digital Repositories (2007)by the German nestor
group.
The three initiatives mentioned cooperate closely, and in 2007 they jointly or-
mulated the ten core principles o trust2 as leading principles or trustworthyrepositories. These ten core principles were used as input or the PLATTER
tool (Planning Tool or Trusted Electronic Repositories),3 specically devel-
oped to help organisations starting with digital preservation programmes to
implement these principles and be able to meet the audit and certication
requirements. To achieve this, trained and skilled sta is needed who con-
stantly update their knowledge. Several European projects on digital preser-
vation, such as DPE, PLANETS (Preservation and Long-term Access through
http://liber.library.uu.nl/http://www.repositoryaudit.eu/http://www.langzeitarchivierung.de/http://www.digitalpreservationeurope.eu/http://www.planets-project.eu/http://www.planets-project.eu/http://www.digitalpreservationeurope.eu/http://www.langzeitarchivierung.de/http://www.repositoryaudit.eu/http://liber.library.uu.nl/ -
7/30/2019 Sier Man 2009
4/9
The Jigsaw Puzzle of Digital Preservation an Overview
16 Liber Quarterly Volume 19 Issue 1 2009
Networked Services) and CASPAR (I will discuss these below), have speci-
cally mentioned dissemination o knowledge as one o their deliverables and
they oer training by experts who update sta on the latest insights in vari-ous aspects o digital preservation, oten in joint workshops.
The cost o digital preservation still is an interesting and very important topic.
As digital objects cannot be ignored, even or a while, at any stage during
their lie cycle, insight in costing required or the long term, is vital. One o the
major initiatives in this area is the LIFE project, LIFEcycle Inormation or E-
literature, a collaboration between University College London (UCL) Library
Services and the British Library, which is unded by the Joint Inormation
Systems Committee (JISC). The rst stage o this project resulted in the devel-
opment o a costing model or the dierent processes taking place in the lie
o a digital object. Starting with creation, and on to preservation, to access
and usability, every activity in these processes involves costs or the preserv-
ing organisation, such as acquisition activities, metadata creation, storage,
but also preservation watch and preservation action. In 20072008 a second
iteration o this LIFE project was unded by JISC. This phase will lead to an
economic evaluation o the model and an update based on the results o sev-
eral cases studies involving dierent kinds o digital material.
Related to the question o costs is that o the value o collections. How dowe value a digital collection? What material does a library need to preserve?
For example, i a library digitises part o its collection, does it need to pre-
serve the digital master les or the long term, or is it more economical to pre-
serve the paper collection and maybe digitise it again sometime in the uture?
And how about a ull domain crawl o the national websites? As websites
are growing every day, it is a huge task or a national library to organise a
representative domain crawl. The technical means to implement selections
in a ull domain harvest are limited. On the other hand storage costs might
be a reason to make choices and to select. Such a selection is one o the topicsthe European LiWA (Living Web Archives) project will ocus on, but the topic
o appraisal and selection is also requently mentioned in conerences and
articles.4
One o the aspects o digital preservation that is not solved yet, is rights man-
agement. When preserving digital material, it might be necessary to perorm
actions on the digital objects in order to keep the object accessible and usable.
These actions might confict with copyright laws. Preserving organisations
http://liber.library.uu.nl/http://www.casparpreserves.eu/http://www.life.ac.uk/http://www.liwa-project.eu/http://www.liwa-project.eu/http://www.life.ac.uk/http://www.casparpreserves.eu/http://liber.library.uu.nl/ -
7/30/2019 Sier Man 2009
5/9
Barbara Sierman
Liber Quarterly Volume 19 Issue 1 2009 17
are not always sure i they are allowed to perorm the necessary tasks. Is it
allowed to make multiple copies o a work or preservation purposes? Or to
migrate works to a new technological ormat, thus creating a new maniesta-tion o the original object? National laws are oten not updated or the digital
age, and i they are, this aspect is regularly let unresolved. Recently a study5
drew attention to this problem; in conclusion it presented a set o joint recom-
mendations to provide guidelines or national copyright and related laws.
Digital Objects
We looked into the organisational aspects and the trends in that area, but what
about the digital objects themselves? Do they change in a technical sense? For
a long time the majority o digital objects were rather straightorward, oten
consisting o one le in a well-known ormat like PDF or TIFF. Many digitisa-
tion programmes resulted in large quantities o objects in TIFF ormat. But the
digital world is getting more complicated, the users are changing and becom-
ing more demanding, and this is refected in the digital objects themselves.
Websites are a well known example, as the sites become more complex and
oer more eatures. Long-term archiving o the results o domain harvests
is a topic even the International Internet Preservation Consortium (IIPC) is
slowly taking up, ocusing more on harvesting itsel than on the long-term
archiving aspect.
There is also a tendency to link publications with data bases, websites, blogs
etc. to oer the end user a single point o entry to all related publications.
This is especially true in the world o institutional repositories, but academic
publishers also increasingly allow authors to include other types o digital
material within their article. As a memory institution you might want to pre-
serve this set o materials and oer your uture users access to it. But the vari-ous components o this package might not be located in the same repository.
The European DRIVER project will investigate the consequences o these so-
called enhanced publications or long-term preservation. One o the essential
requirements to preserve this material will be the use o persistent unique
identiers to accompany the publications during their entire lietime. Another
requirement will be interoperability between objects in dierent reposito-
ries, using standards or interoperability. These developments will not only
be interesting or institutional repositories, but, as the boundaries between a
http://liber.library.uu.nl/http://www.netpreserve.org/about/index.phphttp://www.driver-community.eu/http://www.driver-community.eu/http://www.netpreserve.org/about/index.phphttp://liber.library.uu.nl/ -
7/30/2019 Sier Man 2009
6/9
The Jigsaw Puzzle of Digital Preservation an Overview
18 Liber Quarterly Volume 19 Issue 1 2009
publication and the linked digital attachments become more blurred, it will
become important or national (deposit) libraries as well.
As accessibility and usability o digital objects are the principal goals o digital
preservation, it is important to gather all o the essential inormation needed
to render the object correctly in the uture. Apart rom le ormat and version
inormation, you want to have inormation on other aspects like behaviour
and appearance o the object at the time it was created in other words, the
signicant properties o an object. For reasons o economics and eciency,
this inormation should be collected automatically. Several reerence services
to collect these kinds o inormation are already available in a basic orm, but
they are presently being updated: the PRONOM registry o le ormat inor-
mation o the National Archives in London will be expanded in the PLANETS
project; the JHOVE project, comprising tools to validate and characterize le
ormats, received new nancial support to start JHOVE2, the UK InSPECT
project published some interesting studies, and international initiatives have
been taken to set up a Global Digital Format Registry (GDFR). Supporting
tools were also built elsewhere, such as the Metadata Extraction Tool o the
National Library o New Zealand and the XENA tool o the National Archives
o Australia which normalises various le ormats.
Although all o these initiatives are warmly welcomed by the preservationcommunity, they do have one major drawback: lack o sustainability. Nearly
all inormation about digital preservation that has been generated by all these
projects is reely available rom the internet and tools can be downloaded at
no cost at all. But there is a risk that these supportive tools or digital preser-
vation will not be managed properly ater the projects are nished. Thereore,
although there is enthusiasm about the initiaitives, organisations are hesitant
to rely on these tools and build their own, sometimes unnecessarily.
The topic o sustainability is especially important in relation to the resultsexpected rom the major European projects PLANETS and CASPAR. Who will
maintain the tools developed? Who will update and monitor the inormation
o le ormat registries that so many preserving organisations will rely on?
I this topic o sustainability is not solved, a lot o eort will be wasted. The
solution might be ound in my last topic, that o international collaboration.
http://liber.library.uu.nl/http://www.nationalarchives.gov.uk/pronom/http://confluence.ucop.edu/display/JHOVE2Info/Home;jsessionid=6A3E8B4924066A596523FF0F3127C5EFhttp://www.significantproperties.org.uk/http://www.gdfr.info/http://www.gdfr.info/http://www.significantproperties.org.uk/http://confluence.ucop.edu/display/JHOVE2Info/Home;jsessionid=6A3E8B4924066A596523FF0F3127C5EFhttp://www.nationalarchives.gov.uk/pronom/http://liber.library.uu.nl/ -
7/30/2019 Sier Man 2009
7/9
Barbara Sierman
Liber Quarterly Volume 19 Issue 1 2009 19
International Collaboration
The 6th and 7th Framework programmes o the EC generated a numbero projects with a ocus on dierent aspects o digital preservation. Three
important projects: PLANETS, CASPAR and DPE are now hal way and are
beginning to present their (intermediate) results at conerences and on their
websites. In the PLANETS project, with the main ocus on preservation plan-
ning, the PLATO tool is taking shape. This decision support tool will help an
organisation to plan its preservation activities. In two years time this support-
ive tool will be integrated with a test bed (where you can perorm tests with
samples o your collection) and registries with inormation on preservation
tools which you can use or preservation actions. As I said, the urther devel-
opment o the PRONOM registry with le ormat inormation will be part o
this project. Digital Preservation Europe (DPE) was unded to bring digital
preservation expertise together and to develop a roadmap or uture research
in the area o digital preservation.The project also published practical solu-
tions like the aorementioned audit tool DRAMBORA and the PLATTER
tool.
A new project is KEEP (Keeping Emulation Environments Portable), in which
emulation as a preservation action will be developed urther and will be inte-
grated into a ramework or use in the preservation community. KEEP willollow on rom the emulation development work done by the Koninklijke
Bibliotheek, National Library o the Netherlands in collaboration with the
National Archives o the Netherlands, which resulted in 2007 in the launch
o the DIOSCURI emulator. DIOSCURI will be urther developed within
PLANETS. The KEEP project will help to put DIOSCURI in a broader context
with other emulator tools.
The European projects also ocus on other areas, such as innovative storage
methods in the CASPAR project. The SHAMAN project (Sustaining Accessthrough Multivalent Heritage Archiving) ocuses on dierent aspects o
digital archiving systems and, as mentioned beore, the LiWA (Living Web
Archives) project deals with websites.
Several European national libraries are participating in these projects and con-
tribute with both practical as well as proessional knowledge. Participating
research institutes and commercial partners have their own skills and are
http://liber.library.uu.nl/http://dioscuri.sourceforge.net/http://shaman-ip.eu/http://shaman-ip.eu/http://dioscuri.sourceforge.net/http://liber.library.uu.nl/ -
7/30/2019 Sier Man 2009
8/9
The Jigsaw Puzzle of Digital Preservation an Overview
20 Liber Quarterly Volume 19 Issue 1 2009
oten more experienced in IT-related areas, which make them important part-
ners in urthering digital preservation. This mix o participants is crucial or
the success and acceptance o the project results.
In Conclusion
I have given you an overview o the latest developments in digital preserva-
tion, with a ocus on organisational aspects, the digital object itsel and pro-
gress in EC co-unded projects. Collaboration in digital preservation is
crucial, as is oten mentioned. Initiatives on a larger scale, like the European
Alliance or Permanent Access, should help to unite the scattered pieces and
to complete the jigsaw puzzle o digital preservation.
Websites Referred to in the Text
Alliance or Permanent Access, http://www.alliancepermanentaccess.eu
CASPAR, http://www.casparpreserves.eu/
DIOSCURI, http://dioscuri.sourceorge.net/
DPE, Digital Preservation Europe, http://www.digitalpreservationeurope.eu/
DRAMBORA, http://www.repositoryaudit.eu/
DRIVER, Digital Repository Inrastructure Vision or European Research, http://
www.driver-community.eu/
GDFR, Global Digital Format Registry, http://www.gdr.ino/
IIPC, International Internet Preservation Consortium, http://www.netpreserve.org/about/index.php
InSPECT, http://www.signicantproperties.org.uk/
JHOVE2, http://confuence.ucop.edu/display/JHOVE2Ino/Home;jsessionid=
6A3E8B4924066A596523FF0F3127C5EF
LIFE, Liecycle inormation or e-literature, http://www.lie.ac.uk/
LiWA, Living Web Archives, http://www.liwa-project.eu/
http://liber.library.uu.nl/http://www.alliancepermanentaccess.eu/http://www.alliancepermanentaccess.eu/http://www.casparpreserves.eu/http://dioscuri.sourceforge.net/http://www.digitalpreservationeurope.eu/http://www.repositoryaudit.eu/http://www.driver-community.eu/http://www.driver-community.eu/http://www.gdfr.info/http://www.netpreserve.org/about/index.phphttp://www.netpreserve.org/about/index.phphttp://www.significantproperties.org.uk/http://confluence.ucop.edu/display/JHOVE2Info/Home;jsessionid=6A3E8B4924066A596523FF0F3127C5EFhttp://confluence.ucop.edu/display/JHOVE2Info/Home;jsessionid=6A3E8B4924066A596523FF0F3127C5EFhttp://www.life.ac.uk/http://www.liwa-project.eu/http://www.liwa-project.eu/http://www.life.ac.uk/http://confluence.ucop.edu/display/JHOVE2Info/Home;jsessionid=6A3E8B4924066A596523FF0F3127C5EFhttp://confluence.ucop.edu/display/JHOVE2Info/Home;jsessionid=6A3E8B4924066A596523FF0F3127C5EFhttp://www.significantproperties.org.uk/http://www.netpreserve.org/about/index.phphttp://www.netpreserve.org/about/index.phphttp://www.gdfr.info/http://www.driver-community.eu/http://www.driver-community.eu/http://www.repositoryaudit.eu/http://www.digitalpreservationeurope.eu/http://dioscuri.sourceforge.net/http://www.casparpreserves.eu/http://www.alliancepermanentaccess.eu/http://www.alliancepermanentaccess.eu/http://liber.library.uu.nl/ -
7/30/2019 Sier Man 2009
9/9
Barbara Sierman
Liber Quarterly Volume 19 Issue 1 2009 21
nestor, http://www.langzeitarchivierung.de
PLANETS, Preservation and Long-term Access through Networked Services, http://
www.planets-project.eu/
PRONOM, http://www.nationalarchives.gov.uk/pronom/
SHAMAN, Sustaining Heritage Access through Multivalent Archiving,
http://shaman-ip.eu/
Notes
1 All links were checked on 5 September 2008. See http://wiki.
digitalrepositoryauditandcertication.org/bin/view/Main/WebHome
2 These principles are described in the DPE Repository Planning Checklist and
Guidance DPED3.2, p. 9, http://www.digitalpreservationeurope.eu/publications/
reports/Repository_Planning_Checklist_and_Guidance.pd
3 Published in 2008, see note 2.
4
See: S. Ross (2007), Digital Preservation, Archival Science and MethodologicalFoundations for Digital Libraries, Keynote Address at the 11th European Conerence
on Digital Libraries (ECDL), Budapest (17 September 2007).
5 International Study on the Impact o Copyright Law on Digital Preservation. A Joint
Report o The Library o Congress National Digital Inormation Inrastructure and
Preservation Program, the Joint Inormation Systems Committee, the Open Access
to Knowledge (OAK) Law Project and the SURFoundation, 2008, http://www.
digitalpreservation.gov/partners/resources/pubs/digital_preservation_nal_
report2008.pd
http://liber.library.uu.nl/http://www.langzeitarchivierung.de/http://www.planets-project.eu/http://www.planets-project.eu/http://www.nationalarchives.gov.uk/pronom/http://shaman-ip.eu/http://wiki.digitalrepositoryauditandcertification.org/bin/view/Main/WebHomehttp://wiki.digitalrepositoryauditandcertification.org/bin/view/Main/WebHomehttp://www.digitalpreservationeurope.eu/publications/reports/Repository_Planning_Checklist_and_Guidance.pdfhttp://www.digitalpreservationeurope.eu/publications/reports/Repository_Planning_Checklist_and_Guidance.pdfhttp://www.digitalpreservation.gov/partners/resources/pubs/digital_preservation_final_report2008.pdfhttp://www.digitalpreservation.gov/partners/resources/pubs/digital_preservation_final_report2008.pdfhttp://www.digitalpreservation.gov/partners/resources/pubs/digital_preservation_final_report2008.pdfhttp://www.digitalpreservation.gov/partners/resources/pubs/digital_preservation_final_report2008.pdfhttp://www.digitalpreservation.gov/partners/resources/pubs/digital_preservation_final_report2008.pdfhttp://www.digitalpreservation.gov/partners/resources/pubs/digital_preservation_final_report2008.pdfhttp://www.digitalpreservationeurope.eu/publications/reports/Repository_Planning_Checklist_and_Guidance.pdfhttp://www.digitalpreservationeurope.eu/publications/reports/Repository_Planning_Checklist_and_Guidance.pdfhttp://wiki.digitalrepositoryauditandcertification.org/bin/view/Main/WebHomehttp://wiki.digitalrepositoryauditandcertification.org/bin/view/Main/WebHomehttp://shaman-ip.eu/http://www.nationalarchives.gov.uk/pronom/http://www.planets-project.eu/http://www.planets-project.eu/http://www.langzeitarchivierung.de/http://liber.library.uu.nl/