sier man 2009

Upload: bunjah

Post on 14-Apr-2018

224 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/30/2019 Sier Man 2009

    1/9

    Liber QuarterLy 19 (1), apl 2009 iSSN: 1435-5205. P1321hp://l.l..nl/ig, uch Plshng & achvng Svcs

    ths wok s lcnsd nd Cv Commons aon 3.0 unpod Lcns

    Liber Quarterly Volume 19 Issue 1 2009 13

    The Jigsaw Puzzle o

    Digital Preservation an Overview

    Barbara Sierman

    Team Leader, Digital Preservation Research,

    Koninklijke Bibliotheek, National Library o the Netherlands,

    PO Box 90407, 2509 LK The Hague,

    [email protected]

    Abstract

    Beore the 22nd Annual Meeting o the Board o Directors o the Foundation CENL,

    Zagreb, September 2427, 2008, the author presented a clear overview o the latest

    developments in digital preservation in a European context. She dealt with organisa-

    tional aspects, the digital objects themselves, and the eects o international European

    collaboration. She calls on European organisations such as the Alliance or Permanent

    Access to sustain the results o temporary projects like PLANETS and thereby bring

    the pieces o the digital preservation puzzle together.

    This paper is being published in preparation o the workshop on Curating Research:

    e-Merging New Roles and Responsibilities in the European Landscape, which is being

    co-organised by LIBER on 17 April 2009 at the Koninklijke Bibliotheek in The Hague.

    Key Words: digital preservation; CENL; libraries; European projects

    Introduction

    Digital preservation is like a jigsaw puzzle: a nice box with thousands o

    pieces in it and a beautiul picture on the outside, which you can see i all the

    pieces o the puzzle are put together in the right way, oten ater a tremen-

    dous lot o eort and perseverance. The digital preservation picture on the

    lid o the box would be o a crowd o happy library users, looking, listening

    and playing with digital objects which their parents and grandparents cre-

    ated, but rendered in their own computer environment. When this picture

    http://liber.library.uu.nl/http://creativecommons.org/licenses/by/3.0/http://liber.library.uu.nl/mailto:[email protected]://www.kb.nl/curatingresearchhttp://www.kb.nl/curatingresearchmailto:[email protected]://liber.library.uu.nl/http://creativecommons.org/licenses/by/3.0/http://creativecommons.org/licenses/by/3.0/http://liber.library.uu.nl/
  • 7/30/2019 Sier Man 2009

    2/9

    The Jigsaw Puzzle of Digital Preservation an Overview

    14 Liber Quarterly Volume 19 Issue 1 2009

    becomes a reality, it will demonstrate that the library community preserved

    the heritage in the right way and that it guaranteed the accessibility and

    usability over the years.

    In the past ew years, much eort has been devoted to raising awareness o

    the issue o digital preservation, especially amongst cultural heritage institu-

    tions. All those articles, presentations and discussions are gradually begin-

    ning to pay o. Digital preservation is no longer a topic that needs to be

    explained. On the other hand, however, the ultimate goal, the picture on the

    lid o the box where all the dierent pieces will become a coherent entity, is

    still not a reality. Although we are making progress, work is too ragmented

    and has not led to an out-o-the-box solution. Lots o people are working in

    the area o digital preservation, but still much eort is needed to integrate the

    work done on separate pieces o the puzzle.

    There are lots o methodologies to complete a puzzle. Some people start by

    looking or the corner pieces, other people will complete the outer edges rst

    and yet another category will rst collect the blue and white pieces to nish

    the clouds. In digital preservation similar processes are taking place. With so

    many organisations involved, the list o topics related to digital preservation

    research gets longer every day. In this article I will make a selection and show

    you the current state o aairs in three areas:

    the place o digital preservation within an organisation;

    developments with regard to the digital objects; and

    the eects o international collaboration.

    The Place of Digital Preservation within an Organisation

    Digital preservation is an intrinsic process and not a separate activity. Whetherit concerns a library or an archive, digital preservation aects the organisa-

    tion as a whole and should not be an isolated activity. Work fows need to

    be designed or collection policies and management, selection and appraisal,

    metadata, access procedures etc. in the same vein as or printed collections.

    Several initiatives have been devised to support organisations in implementing

    digital preservation, both or newcomers such as organisations starting to

    think about setting up a digital repository and more experienced organisa-

    http://liber.library.uu.nl/http://liber.library.uu.nl/
  • 7/30/2019 Sier Man 2009

    3/9

    Barbara Sierman

    Liber Quarterly Volume 19 Issue 1 2009 15

    tions wishing to evaluate their policies and the eects o their preservation

    activities.

    (Self) Auditing

    The status o being a trusted, or more correctly termed a trustworthy reposi-

    tory is the ultimate goal o an organisation with a digital collection that needs

    to be accessible and usable over time. A rst initiative designed to raise the

    issue o certication and to provide guidelines or a trusted repository was

    the joint publication in 2002 by the US Research Libraries Group (RLG) and

    OCLC o Trusted Digital Repositories: Attributes and Responsibilities. Many

    organisations are presently using this document as a checklist. The successo this document and the need or a real auditing instrument led in 2007 to

    a new initiative designed to update these guidelines with the latest insights

    and experiences, and to turn it into a clear, understandable ISO standard

    which can be used as a certication and auditing instrument in the digital

    preservation community. To involve as many parties as possible, everyone

    interested can participate in this initiative; the discussions and outcomes are

    publicly available.1 It can oten take years to create an ISO standard, but a

    rst drat will be available by the end o this year.

    Audit and certication can also be looked at rom a dierent angle, as is doneby the DRAMBORA initiative. This Digital Repository Audit Method based

    on Risk Assessment looks upon digital preservation as the task o managing

    risks. It oers training and tools to perorm a risk analysis o the organisa-

    tion in order to identiy areas that can be improved. A third initiative is the

    Catalogue of Criteria for Trusted Digital Repositories (2007)by the German nestor

    group.

    The three initiatives mentioned cooperate closely, and in 2007 they jointly or-

    mulated the ten core principles o trust2 as leading principles or trustworthyrepositories. These ten core principles were used as input or the PLATTER

    tool (Planning Tool or Trusted Electronic Repositories),3 specically devel-

    oped to help organisations starting with digital preservation programmes to

    implement these principles and be able to meet the audit and certication

    requirements. To achieve this, trained and skilled sta is needed who con-

    stantly update their knowledge. Several European projects on digital preser-

    vation, such as DPE, PLANETS (Preservation and Long-term Access through

    http://liber.library.uu.nl/http://www.repositoryaudit.eu/http://www.langzeitarchivierung.de/http://www.digitalpreservationeurope.eu/http://www.planets-project.eu/http://www.planets-project.eu/http://www.digitalpreservationeurope.eu/http://www.langzeitarchivierung.de/http://www.repositoryaudit.eu/http://liber.library.uu.nl/
  • 7/30/2019 Sier Man 2009

    4/9

    The Jigsaw Puzzle of Digital Preservation an Overview

    16 Liber Quarterly Volume 19 Issue 1 2009

    Networked Services) and CASPAR (I will discuss these below), have speci-

    cally mentioned dissemination o knowledge as one o their deliverables and

    they oer training by experts who update sta on the latest insights in vari-ous aspects o digital preservation, oten in joint workshops.

    The cost o digital preservation still is an interesting and very important topic.

    As digital objects cannot be ignored, even or a while, at any stage during

    their lie cycle, insight in costing required or the long term, is vital. One o the

    major initiatives in this area is the LIFE project, LIFEcycle Inormation or E-

    literature, a collaboration between University College London (UCL) Library

    Services and the British Library, which is unded by the Joint Inormation

    Systems Committee (JISC). The rst stage o this project resulted in the devel-

    opment o a costing model or the dierent processes taking place in the lie

    o a digital object. Starting with creation, and on to preservation, to access

    and usability, every activity in these processes involves costs or the preserv-

    ing organisation, such as acquisition activities, metadata creation, storage,

    but also preservation watch and preservation action. In 20072008 a second

    iteration o this LIFE project was unded by JISC. This phase will lead to an

    economic evaluation o the model and an update based on the results o sev-

    eral cases studies involving dierent kinds o digital material.

    Related to the question o costs is that o the value o collections. How dowe value a digital collection? What material does a library need to preserve?

    For example, i a library digitises part o its collection, does it need to pre-

    serve the digital master les or the long term, or is it more economical to pre-

    serve the paper collection and maybe digitise it again sometime in the uture?

    And how about a ull domain crawl o the national websites? As websites

    are growing every day, it is a huge task or a national library to organise a

    representative domain crawl. The technical means to implement selections

    in a ull domain harvest are limited. On the other hand storage costs might

    be a reason to make choices and to select. Such a selection is one o the topicsthe European LiWA (Living Web Archives) project will ocus on, but the topic

    o appraisal and selection is also requently mentioned in conerences and

    articles.4

    One o the aspects o digital preservation that is not solved yet, is rights man-

    agement. When preserving digital material, it might be necessary to perorm

    actions on the digital objects in order to keep the object accessible and usable.

    These actions might confict with copyright laws. Preserving organisations

    http://liber.library.uu.nl/http://www.casparpreserves.eu/http://www.life.ac.uk/http://www.liwa-project.eu/http://www.liwa-project.eu/http://www.life.ac.uk/http://www.casparpreserves.eu/http://liber.library.uu.nl/
  • 7/30/2019 Sier Man 2009

    5/9

    Barbara Sierman

    Liber Quarterly Volume 19 Issue 1 2009 17

    are not always sure i they are allowed to perorm the necessary tasks. Is it

    allowed to make multiple copies o a work or preservation purposes? Or to

    migrate works to a new technological ormat, thus creating a new maniesta-tion o the original object? National laws are oten not updated or the digital

    age, and i they are, this aspect is regularly let unresolved. Recently a study5

    drew attention to this problem; in conclusion it presented a set o joint recom-

    mendations to provide guidelines or national copyright and related laws.

    Digital Objects

    We looked into the organisational aspects and the trends in that area, but what

    about the digital objects themselves? Do they change in a technical sense? For

    a long time the majority o digital objects were rather straightorward, oten

    consisting o one le in a well-known ormat like PDF or TIFF. Many digitisa-

    tion programmes resulted in large quantities o objects in TIFF ormat. But the

    digital world is getting more complicated, the users are changing and becom-

    ing more demanding, and this is refected in the digital objects themselves.

    Websites are a well known example, as the sites become more complex and

    oer more eatures. Long-term archiving o the results o domain harvests

    is a topic even the International Internet Preservation Consortium (IIPC) is

    slowly taking up, ocusing more on harvesting itsel than on the long-term

    archiving aspect.

    There is also a tendency to link publications with data bases, websites, blogs

    etc. to oer the end user a single point o entry to all related publications.

    This is especially true in the world o institutional repositories, but academic

    publishers also increasingly allow authors to include other types o digital

    material within their article. As a memory institution you might want to pre-

    serve this set o materials and oer your uture users access to it. But the vari-ous components o this package might not be located in the same repository.

    The European DRIVER project will investigate the consequences o these so-

    called enhanced publications or long-term preservation. One o the essential

    requirements to preserve this material will be the use o persistent unique

    identiers to accompany the publications during their entire lietime. Another

    requirement will be interoperability between objects in dierent reposito-

    ries, using standards or interoperability. These developments will not only

    be interesting or institutional repositories, but, as the boundaries between a

    http://liber.library.uu.nl/http://www.netpreserve.org/about/index.phphttp://www.driver-community.eu/http://www.driver-community.eu/http://www.netpreserve.org/about/index.phphttp://liber.library.uu.nl/
  • 7/30/2019 Sier Man 2009

    6/9

    The Jigsaw Puzzle of Digital Preservation an Overview

    18 Liber Quarterly Volume 19 Issue 1 2009

    publication and the linked digital attachments become more blurred, it will

    become important or national (deposit) libraries as well.

    As accessibility and usability o digital objects are the principal goals o digital

    preservation, it is important to gather all o the essential inormation needed

    to render the object correctly in the uture. Apart rom le ormat and version

    inormation, you want to have inormation on other aspects like behaviour

    and appearance o the object at the time it was created in other words, the

    signicant properties o an object. For reasons o economics and eciency,

    this inormation should be collected automatically. Several reerence services

    to collect these kinds o inormation are already available in a basic orm, but

    they are presently being updated: the PRONOM registry o le ormat inor-

    mation o the National Archives in London will be expanded in the PLANETS

    project; the JHOVE project, comprising tools to validate and characterize le

    ormats, received new nancial support to start JHOVE2, the UK InSPECT

    project published some interesting studies, and international initiatives have

    been taken to set up a Global Digital Format Registry (GDFR). Supporting

    tools were also built elsewhere, such as the Metadata Extraction Tool o the

    National Library o New Zealand and the XENA tool o the National Archives

    o Australia which normalises various le ormats.

    Although all o these initiatives are warmly welcomed by the preservationcommunity, they do have one major drawback: lack o sustainability. Nearly

    all inormation about digital preservation that has been generated by all these

    projects is reely available rom the internet and tools can be downloaded at

    no cost at all. But there is a risk that these supportive tools or digital preser-

    vation will not be managed properly ater the projects are nished. Thereore,

    although there is enthusiasm about the initiaitives, organisations are hesitant

    to rely on these tools and build their own, sometimes unnecessarily.

    The topic o sustainability is especially important in relation to the resultsexpected rom the major European projects PLANETS and CASPAR. Who will

    maintain the tools developed? Who will update and monitor the inormation

    o le ormat registries that so many preserving organisations will rely on?

    I this topic o sustainability is not solved, a lot o eort will be wasted. The

    solution might be ound in my last topic, that o international collaboration.

    http://liber.library.uu.nl/http://www.nationalarchives.gov.uk/pronom/http://confluence.ucop.edu/display/JHOVE2Info/Home;jsessionid=6A3E8B4924066A596523FF0F3127C5EFhttp://www.significantproperties.org.uk/http://www.gdfr.info/http://www.gdfr.info/http://www.significantproperties.org.uk/http://confluence.ucop.edu/display/JHOVE2Info/Home;jsessionid=6A3E8B4924066A596523FF0F3127C5EFhttp://www.nationalarchives.gov.uk/pronom/http://liber.library.uu.nl/
  • 7/30/2019 Sier Man 2009

    7/9

    Barbara Sierman

    Liber Quarterly Volume 19 Issue 1 2009 19

    International Collaboration

    The 6th and 7th Framework programmes o the EC generated a numbero projects with a ocus on dierent aspects o digital preservation. Three

    important projects: PLANETS, CASPAR and DPE are now hal way and are

    beginning to present their (intermediate) results at conerences and on their

    websites. In the PLANETS project, with the main ocus on preservation plan-

    ning, the PLATO tool is taking shape. This decision support tool will help an

    organisation to plan its preservation activities. In two years time this support-

    ive tool will be integrated with a test bed (where you can perorm tests with

    samples o your collection) and registries with inormation on preservation

    tools which you can use or preservation actions. As I said, the urther devel-

    opment o the PRONOM registry with le ormat inormation will be part o

    this project. Digital Preservation Europe (DPE) was unded to bring digital

    preservation expertise together and to develop a roadmap or uture research

    in the area o digital preservation.The project also published practical solu-

    tions like the aorementioned audit tool DRAMBORA and the PLATTER

    tool.

    A new project is KEEP (Keeping Emulation Environments Portable), in which

    emulation as a preservation action will be developed urther and will be inte-

    grated into a ramework or use in the preservation community. KEEP willollow on rom the emulation development work done by the Koninklijke

    Bibliotheek, National Library o the Netherlands in collaboration with the

    National Archives o the Netherlands, which resulted in 2007 in the launch

    o the DIOSCURI emulator. DIOSCURI will be urther developed within

    PLANETS. The KEEP project will help to put DIOSCURI in a broader context

    with other emulator tools.

    The European projects also ocus on other areas, such as innovative storage

    methods in the CASPAR project. The SHAMAN project (Sustaining Accessthrough Multivalent Heritage Archiving) ocuses on dierent aspects o

    digital archiving systems and, as mentioned beore, the LiWA (Living Web

    Archives) project deals with websites.

    Several European national libraries are participating in these projects and con-

    tribute with both practical as well as proessional knowledge. Participating

    research institutes and commercial partners have their own skills and are

    http://liber.library.uu.nl/http://dioscuri.sourceforge.net/http://shaman-ip.eu/http://shaman-ip.eu/http://dioscuri.sourceforge.net/http://liber.library.uu.nl/
  • 7/30/2019 Sier Man 2009

    8/9

    The Jigsaw Puzzle of Digital Preservation an Overview

    20 Liber Quarterly Volume 19 Issue 1 2009

    oten more experienced in IT-related areas, which make them important part-

    ners in urthering digital preservation. This mix o participants is crucial or

    the success and acceptance o the project results.

    In Conclusion

    I have given you an overview o the latest developments in digital preserva-

    tion, with a ocus on organisational aspects, the digital object itsel and pro-

    gress in EC co-unded projects. Collaboration in digital preservation is

    crucial, as is oten mentioned. Initiatives on a larger scale, like the European

    Alliance or Permanent Access, should help to unite the scattered pieces and

    to complete the jigsaw puzzle o digital preservation.

    Websites Referred to in the Text

    Alliance or Permanent Access, http://www.alliancepermanentaccess.eu

    CASPAR, http://www.casparpreserves.eu/

    DIOSCURI, http://dioscuri.sourceorge.net/

    DPE, Digital Preservation Europe, http://www.digitalpreservationeurope.eu/

    DRAMBORA, http://www.repositoryaudit.eu/

    DRIVER, Digital Repository Inrastructure Vision or European Research, http://

    www.driver-community.eu/

    GDFR, Global Digital Format Registry, http://www.gdr.ino/

    IIPC, International Internet Preservation Consortium, http://www.netpreserve.org/about/index.php

    InSPECT, http://www.signicantproperties.org.uk/

    JHOVE2, http://confuence.ucop.edu/display/JHOVE2Ino/Home;jsessionid=

    6A3E8B4924066A596523FF0F3127C5EF

    LIFE, Liecycle inormation or e-literature, http://www.lie.ac.uk/

    LiWA, Living Web Archives, http://www.liwa-project.eu/

    http://liber.library.uu.nl/http://www.alliancepermanentaccess.eu/http://www.alliancepermanentaccess.eu/http://www.casparpreserves.eu/http://dioscuri.sourceforge.net/http://www.digitalpreservationeurope.eu/http://www.repositoryaudit.eu/http://www.driver-community.eu/http://www.driver-community.eu/http://www.gdfr.info/http://www.netpreserve.org/about/index.phphttp://www.netpreserve.org/about/index.phphttp://www.significantproperties.org.uk/http://confluence.ucop.edu/display/JHOVE2Info/Home;jsessionid=6A3E8B4924066A596523FF0F3127C5EFhttp://confluence.ucop.edu/display/JHOVE2Info/Home;jsessionid=6A3E8B4924066A596523FF0F3127C5EFhttp://www.life.ac.uk/http://www.liwa-project.eu/http://www.liwa-project.eu/http://www.life.ac.uk/http://confluence.ucop.edu/display/JHOVE2Info/Home;jsessionid=6A3E8B4924066A596523FF0F3127C5EFhttp://confluence.ucop.edu/display/JHOVE2Info/Home;jsessionid=6A3E8B4924066A596523FF0F3127C5EFhttp://www.significantproperties.org.uk/http://www.netpreserve.org/about/index.phphttp://www.netpreserve.org/about/index.phphttp://www.gdfr.info/http://www.driver-community.eu/http://www.driver-community.eu/http://www.repositoryaudit.eu/http://www.digitalpreservationeurope.eu/http://dioscuri.sourceforge.net/http://www.casparpreserves.eu/http://www.alliancepermanentaccess.eu/http://www.alliancepermanentaccess.eu/http://liber.library.uu.nl/
  • 7/30/2019 Sier Man 2009

    9/9

    Barbara Sierman

    Liber Quarterly Volume 19 Issue 1 2009 21

    nestor, http://www.langzeitarchivierung.de

    PLANETS, Preservation and Long-term Access through Networked Services, http://

    www.planets-project.eu/

    PRONOM, http://www.nationalarchives.gov.uk/pronom/

    SHAMAN, Sustaining Heritage Access through Multivalent Archiving,

    http://shaman-ip.eu/

    Notes

    1 All links were checked on 5 September 2008. See http://wiki.

    digitalrepositoryauditandcertication.org/bin/view/Main/WebHome

    2 These principles are described in the DPE Repository Planning Checklist and

    Guidance DPED3.2, p. 9, http://www.digitalpreservationeurope.eu/publications/

    reports/Repository_Planning_Checklist_and_Guidance.pd

    3 Published in 2008, see note 2.

    4

    See: S. Ross (2007), Digital Preservation, Archival Science and MethodologicalFoundations for Digital Libraries, Keynote Address at the 11th European Conerence

    on Digital Libraries (ECDL), Budapest (17 September 2007).

    5 International Study on the Impact o Copyright Law on Digital Preservation. A Joint

    Report o The Library o Congress National Digital Inormation Inrastructure and

    Preservation Program, the Joint Inormation Systems Committee, the Open Access

    to Knowledge (OAK) Law Project and the SURFoundation, 2008, http://www.

    digitalpreservation.gov/partners/resources/pubs/digital_preservation_nal_

    report2008.pd

    http://liber.library.uu.nl/http://www.langzeitarchivierung.de/http://www.planets-project.eu/http://www.planets-project.eu/http://www.nationalarchives.gov.uk/pronom/http://shaman-ip.eu/http://wiki.digitalrepositoryauditandcertification.org/bin/view/Main/WebHomehttp://wiki.digitalrepositoryauditandcertification.org/bin/view/Main/WebHomehttp://www.digitalpreservationeurope.eu/publications/reports/Repository_Planning_Checklist_and_Guidance.pdfhttp://www.digitalpreservationeurope.eu/publications/reports/Repository_Planning_Checklist_and_Guidance.pdfhttp://www.digitalpreservation.gov/partners/resources/pubs/digital_preservation_final_report2008.pdfhttp://www.digitalpreservation.gov/partners/resources/pubs/digital_preservation_final_report2008.pdfhttp://www.digitalpreservation.gov/partners/resources/pubs/digital_preservation_final_report2008.pdfhttp://www.digitalpreservation.gov/partners/resources/pubs/digital_preservation_final_report2008.pdfhttp://www.digitalpreservation.gov/partners/resources/pubs/digital_preservation_final_report2008.pdfhttp://www.digitalpreservation.gov/partners/resources/pubs/digital_preservation_final_report2008.pdfhttp://www.digitalpreservationeurope.eu/publications/reports/Repository_Planning_Checklist_and_Guidance.pdfhttp://www.digitalpreservationeurope.eu/publications/reports/Repository_Planning_Checklist_and_Guidance.pdfhttp://wiki.digitalrepositoryauditandcertification.org/bin/view/Main/WebHomehttp://wiki.digitalrepositoryauditandcertification.org/bin/view/Main/WebHomehttp://shaman-ip.eu/http://www.nationalarchives.gov.uk/pronom/http://www.planets-project.eu/http://www.planets-project.eu/http://www.langzeitarchivierung.de/http://liber.library.uu.nl/