PEPRS and the Keepers Registry

Finding out about the preservation of e-journals: an overview of the PEPRS project and the Keepers Registry Beta service. Fred Guy, EDINA, University of Edinburgh. British Library, Boston Spa, 13th October 2011. On behalf of the Project Team – EDINA/ISSN-IC


DESCRIPTION

Presentation given on PEPRS and the Keepers Registry at the British Library UKSG 2011 event held in Boston Spa on Thursday 13th October 2011

TRANSCRIPT

Page 1: PEPRS and the Keepers Registry

Finding out about the preservation of e-journals: an overview of the PEPRS project and the Keepers Registry Beta service

Fred Guy,

EDINA, University of Edinburgh

British Library, Boston Spa

13th October 2011

on behalf of the Project Team – EDINA/ISSN-IC

Page 2: PEPRS and the Keepers Registry

http://www.flickr.com/photos/sinclairlibrary/769777273/sizes/z/in/photostream/

Page 3: PEPRS and the Keepers Registry

Computer room in London School of Economics, 1981

http://www.flickr.com/photos/lselibrary/4401344940/sizes/o/in/photostream/

Page 4: PEPRS and the Keepers Registry

Statistics related to e-journals

[Chart: Online availability of journals by discipline (Arts, Humanities and Social Sciences; Science, Technology and Medicine), 2003, 2005 and 2008. Data labels shown: 82.6%, 92.7%, 96.1%]

[Chart: Average percentage of titles online by publisher size (small, medium, large), 2005 and 2008]

[Chart: Downloads of SHEDL content in Scottish universities, 2007 to 2009]

[Chart: Downloads in UK universities and colleges, 2005-06 to 2007-08]

RIN. E-only scholarly journals: overcoming the barriers. November 2010.

Page 5: PEPRS and the Keepers Registry

Print – key aspects

• Once purchased, print is owned by the library and can be retained, transferred to a remote store or disposed of as the library determines

• Library can check if other libraries hold the material and it can be consulted on the premises or be available via Inter-Library loan

• Likely that it will be available in a national library via legal deposit legislation (goes back to 17th century in UK)

Page 6: PEPRS and the Keepers Registry

E-journals: key aspects

• Libraries are licensed for usage – do not host the material

• Control lies with the publisher rather than with the subscriber

• Publishers are not a constant in the life of a journal – titles are often transferred between publishers

• Publishers may decide that they do not want to host back material

• Legislation for legal deposit is not yet in place in UK and many other countries

Page 7: PEPRS and the Keepers Registry

Why a Preservation Registry?

• Many schemes emerging to meet challenge

• But who is doing what?

– How can libraries & policy-makers assess which e-journals are being archived, by what methods, and under what terms of access?

• JISC commissioned a scoping study for an e-journals preservation registry – the idea had been mentioned in the literature

Page 8: PEPRS and the Keepers Registry

Scoping Study for a Registry

Page 9: PEPRS and the Keepers Registry

Scoping Study Report Precedes PEPRS

• Rightscom / Loughborough University, 2007

– Confirmed expressed need among libraries and policy makers

– Warned of potential burden on digital preservation agencies

– Recommended:

* an e-journals preservation registry should be built

* UK Union Catalogue of Serials (SUNCAT) or SHERPA (Open Access) get involved

– SUNCAT is hosted and managed at EDINA

Page 10: PEPRS and the Keepers Registry

PROJECT DETAILS

• Phase 1 funded by JISC (Preservation Programme) from August 2008 – July 2010

• EDINA, University of Edinburgh, grant recipient

• Project partner – ISSN International Centre, Paris

• Evaluation carried out by Charles Beagrie Limited for the JISC in February 2010

Page 11: PEPRS and the Keepers Registry

Digital Preservation Agencies in the Pilot

• Two 3rd Party Organisations

– CLOCKSS (Controlled Lots Of Copies Keeps Stuff Safe)

– Portico

• Two National Libraries (c.f. legal deposit)

– British Library (BL): British Library e-Journal Digital Archive

– Koninklijke Bibliotheek (KB e-Depot): KB, National Library of the Netherlands

• Two library cooperatives

– General LOCKSS Network (Lots Of Copies Keeps Stuff Safe)

– HathiTrust

Page 12: PEPRS and the Keepers Registry

The Agencies - LOCKSS

• LOCKSS (Lots Of Copies Keeps Stuff Safe), based at Stanford University Libraries, is an international community initiative that provides libraries with digital preservation tools and support so that they can easily and inexpensively collect and preserve their own copies of authorized e-content.

Page 13: PEPRS and the Keepers Registry

The Agencies - CLOCKSS

• CLOCKSS (Controlled LOCKSS) is a not for profit joint venture between the world’s leading scholarly publishers and research libraries whose mission is to build a sustainable, geographically distributed dark archive with which to ensure the long-term survival of Web-based scholarly publications for the benefit of the greater global research community.

Page 14: PEPRS and the Keepers Registry

The Agencies - Portico

• Portico provides libraries and publishers with a reliable, cost-effective solution to one of the most critical challenges facing the scholarly community today: ensuring that the electronic resources you rely on every day will be accessible to future researchers, scholars, and students.

Page 15: PEPRS and the Keepers Registry

The Agencies – e-Depot

• The e-Depot is a digital archiving environment that ensures long-term access to digital objects.

• e-Depot is based at the Koninklijke Bibliotheek in The Hague

Page 16: PEPRS and the Keepers Registry

The Agencies – British Library

• The BL preserves digital content that is collected but also material that is created, such as digitised collections. The store is an important component for forthcoming e-Legal Deposit.

Page 17: PEPRS and the Keepers Registry

The Agencies – the HathiTrust

HathiTrust is a partnership of major research institutions and libraries working to ensure that the cultural record is preserved and accessible long into the future. There are more than fifty partners in HathiTrust, and membership is open to institutions worldwide.

Page 18: PEPRS and the Keepers Registry

What is in the vaults?

http://www.flickr.com/photos/wka/4283285201/

http://www.flickr.com/photos/mcfull/421644442/sizes/s/in/photostream/

Page 19: PEPRS and the Keepers Registry

http://www.flickr.com/photos/akeeh/4300472592/sizes/z/in/photostream/

[Diagram: several sources labelled "Agency metadata" and The Keepers Registry]

Page 20: PEPRS and the Keepers Registry

Creating the database

[Diagram labels: Agency data; ISSNs; ISSN Register; ISSN-L + p-ISSN & e-ISSN; Register metadata; Agency metadata; The Keepers Registry]
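One way to picture the linking step on this slide is sketched below: ISSNs supplied by the agencies are looked up in the ISSN Register, and agency records are grouped under the ISSN-L that the register returns. This is a minimal Python sketch, not the project's own code (which the slides describe as custom Perl). The register extract, the agency records, the ISSNs and the function name `build_registry` are all invented for illustration.

```python
from collections import defaultdict

# Hypothetical extract of the ISSN Register: ISSN -> (ISSN-L, medium).
# The ISSNs here are invented for illustration only.
REGISTER = {
    "1234-5679": ("1234-5679", "print"),
    "8765-4326": ("1234-5679", "online"),
}

# Hypothetical agency submissions: (agency, ISSN as supplied, holdings text).
AGENCY_RECORDS = [
    ("Portico", "8765-4326", "v. 40 - 47"),
    ("e-Depot", "1234-5679", "v. 1 - 36"),
    ("CLOCKSS", "0000-0000", "v. 1"),  # not present in the register extract
]


def build_registry(register, agency_records):
    """Group agency records under the ISSN-L the register gives for their ISSN."""
    by_title = defaultdict(list)
    unmatched = []
    for agency, issn, holdings in agency_records:
        entry = register.get(issn)
        if entry is None:
            # No register record: keep the record aside for manual review.
            unmatched.append((agency, issn))
            continue
        issn_l, medium = entry
        by_title[issn_l].append(
            {"agency": agency, "issn": issn, "medium": medium, "holdings": holdings}
        )
    return dict(by_title), unmatched


registry, unmatched = build_registry(REGISTER, AGENCY_RECORDS)
for issn_l, records in registry.items():
    print(issn_l, [r["agency"] for r in records])
print("unmatched:", unmatched)
```

Keying on ISSN-L means a print ISSN and an e-ISSN for the same title end up in one registry entry, while records whose ISSN is absent from the register are set aside rather than silently dropped.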

Page 21: PEPRS and the Keepers Registry

Open Source components used in the Keepers Registry

Component | Software choice | Comment

User interface | Apache::ASP http://www.apache-asp.org/ | Offers fast and easy development and is extremely flexible

Database: metadata hosted by the Keepers Registry | Zebra http://www.indexdata.dk/zebra/ | Provides structured text indexing and retrieval. Fast and scales well. Provides powerful and flexible text retrieval capabilities.

Harvester | Custom Perl and CPAN packages | Data files will be collected using FTP and HTTP.

Normalisation | Custom Perl and CPAN packages including MARC::Record http://search.cpan.org/~gmcharlt/MARC-Record-2.0.2/ | Each preservation agency supplies custom data at the moment, so scripts will be created for each data source. ISSN data is in MARC21 format and will be processed using the MARC::Record CPAN package.

Z39.50 support in Perl | ZOOM http://zoom.z3950.org/api/ | Abstract Perl API supporting search and retrieval. Based on YAZ toolkit.

Page 22: PEPRS and the Keepers Registry

Beta service demonstration

• The Keepers Registry

Page 23: PEPRS and the Keepers Registry

Home page

Page 24: PEPRS and the Keepers Registry
Page 25: PEPRS and the Keepers Registry

Search Results screen – multiple records

Page 26: PEPRS and the Keepers Registry

Full record display

Page 27: PEPRS and the Keepers Registry

Variant title

Page 28: PEPRS and the Keepers Registry

4 agencies

Status

Page 29: PEPRS and the Keepers Registry

HathiTrust - summary

Page 30: PEPRS and the Keepers Registry

HathiTrust – full record display

Page 31: PEPRS and the Keepers Registry

Journal browse

Page 32: PEPRS and the Keepers Registry

Browse by publishers

Page 33: PEPRS and the Keepers Registry

PEPRS Phase 2

• Funding provided from August 2010 – July 2012

• Beta service – end of April 2011 www.peprs.org/

• The Keepers Registry – October 2011 http://thekeepers.org

• Full service – 2012

• Involve international users in testing

Page 34: PEPRS and the Keepers Registry

Phase 2: key stages

[Timeline chart, August 2010 to December 2012, with one row per activity: Phase 2 start and end; PEPRS development activity; PEPRS Beta service start and end; The Keepers Registry Beta service start; Full service operation; Software releases (0.1, 0.2, 0.3, 0.4); Establishment of Advisory Board; Business Planning]

Page 35: PEPRS and the Keepers Registry

ISSN issues

• ISSNs missing in some agency records, and some supplied ISSNs not found in the ISSN Register

• Some duplicate records

• Some p-ISSNs used as e-ISSNs

• Some p-ISSNs linked via a common ISSN-L to a number of e-ISSNs, but which one is correct?

• Some ISSNs were incorrect
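Some of the problems listed above, such as incorrect ISSNs, can be caught before any register lookup by checking the ISSN check digit. The Python sketch below applies the standard ISSN check-digit rule (weights 8 down to 2 over the first seven digits, modulo 11, with X standing for 10). It is an illustration only, not the project's Perl normalisation scripts, and the function names are assumptions.

```python
import re

# Accepts 'NNNN-NNNC' or 'NNNNNNNC', where C is a digit or X.
ISSN_PATTERN = re.compile(r"^([0-9]{4})-?([0-9]{3})([0-9Xx])$")


def issn_check_digit(first_seven):
    """Compute the ISSN check character for the first seven digits."""
    total = sum(int(d) * w for d, w in zip(first_seven, range(8, 1, -1)))
    value = (11 - total % 11) % 11
    return "X" if value == 10 else str(value)


def normalise_issn(raw):
    """Return 'NNNN-NNNC' for a well-formed ISSN with a valid check digit, else None."""
    match = ISSN_PATTERN.match(raw.strip())
    if not match:
        return None
    body = match.group(1) + match.group(2)
    check = match.group(3).upper()
    if issn_check_digit(body) != check:
        return None
    return f"{body[:4]}-{body[4:]}{check}"


for candidate in ["1234-5679", "12345679", "1234-5678", "0000-006x"]:
    print(candidate, "->", normalise_issn(candidate))
```

A check like this only detects mistyped or malformed ISSNs. Deciding whether a valid ISSN is a p-ISSN wrongly used as an e-ISSN, or which of several e-ISSNs under one ISSN-L is the right one, still needs the ISSN Register data.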

Page 36: PEPRS and the Keepers Registry

Holdings information - variation

e-Depot: Preserved: v. 1 - 36, 38 - 46.

UK LOCKSS Alliance: Preserved: v. 42 - 45. In progress: v. 46, 47.

Portico: Preserved: (2002-2009) v.40, v.41, v.42, v.43, v.44, v.45, v.46, v.47.

HathiTrust: Preserved: n.s. v. 3 (1883/85); n.s. v. 12 (1897/98); n.s. v. 16 (1903/04); 1864-1865; n.s. v. 4
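To compare holdings across agencies, statements like these have to be reduced to a common form. The sketch below is one hedged way to do that in Python for the simpler volume statements (the e-Depot, LOCKSS and Portico styles above). It is not how the Keepers Registry itself processes holdings; the function name `parse_volumes` is an assumption, the caller is expected to pass only the text after a label such as "Preserved:", and the irregular HathiTrust style is deliberately out of scope.

```python
import re


def parse_volumes(statement):
    """Expand a simple volume statement, e.g. 'v. 1 - 36, 38 - 46.', into a sorted list."""
    # Drop parenthesised year spans such as '(2002-2009)' so they are not read as volumes.
    text = re.sub(r"\([^)]*\)", " ", statement)
    volumes = set()
    # Each match is a single volume number or a 'start - end' range.
    for start, end in re.findall(r"(\d+)(?:\s*-\s*(\d+))?", text):
        first = int(start)
        last = int(end) if end else first
        volumes.update(range(first, last + 1))
    return sorted(volumes)


e_depot = parse_volumes("v. 1 - 36, 38 - 46.")
lockss = parse_volumes("v. 42 - 45.")
portico = parse_volumes("(2002-2009) v.40, v.41, v.42, v.43, v.44, v.45, v.46, v.47.")

# Volumes preserved by all three agencies in this example.
print(sorted(set(e_depot) & set(lockss) & set(portico)))
```

Once every statement is a list of volume numbers, gaps (volume 37 is absent from the e-Depot statement) and overlaps between agencies can be found with ordinary set operations.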

Page 37: PEPRS and the Keepers Registry

Terms used by preservation agencies

Agencies compared: CLOCKSS, LOCKSS, e-Depot, Portico, BL

No action: The agency has no relationship with this title at present. (Or is it best simply not to mention your agency with regard to this title?)

√ √ √

Committed: the publisher has agreed that the agency may preserve the title but the ingest process has not yet begun.

√ √ √ (with qualifications)

Queued: Publisher technical work is complete, but the preservation agency has not yet processed the title

√ √ (use term with different meaning)

Archived: The title has been ingested into the archive

√ √ √ √

Available for Library Archiving: The title has been made available for preservation by a library, subject to a library’s subscription rights.

Page 38: PEPRS and the Keepers Registry

Involvement with international initiatives

• Print Archives Program of the Center for Research Libraries

– “CRL is working with consortial partners to plan a prototype print archives framework to link existing print archiving efforts.”

– CRL has developed a searchable Print Archives Registry of information about print-archiving initiatives, including:

– Projects

– Serial Holdings

Page 39: PEPRS and the Keepers Registry

Additional agencies

• National Science Library, Chinese Academy of Sciences (NSLC) (c.20,000 journals with ISSNs) - metadata submitted.

• Others to be considered: California Digital Library; Ontario Scholars Portal; Archaeology Data Service (York)

• Drawing up inclusion criteria

Page 40: PEPRS and the Keepers Registry

PEPRS: Further information and Contact details

The Keepers Registry Beta service

http://thekeepers.org

Project website

http://edina.ac.uk/projects/peprs/index.html

Help Desk [email protected]

Fred Guy, EDINA, University of Edinburgh

[email protected]