data curation and preservation: the digital curation centre

18
a centre of expertise in data curation and preservation Support for e-Research, Edinburgh, 14 June 2007 Funded by: Data Curation and Preservation: the Digital Curation Centre Michael Day DCC Research Team UKOLN, University of Bath [email protected] http://www.ukoln.ac.uk/

Upload: michael-day

Post on 10-Nov-2014

4.776 views

Category:

Technology


2 download

DESCRIPTION

Slides from a presentation given at: Support for e-Research: Filling the Library Skills Gap, National e-Science Centre, Edinburgh, UK, 14-15 June 2007

TRANSCRIPT

Page 1: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

Funded by:

Data Curation and Preservation: the Digital Curation Centre

Michael DayDCC Research Team

UKOLN, University of [email protected]

http://www.ukoln.ac.uk/

Page 2: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

2

Outline

• Contexts• DCC aims and objectives• Major DCC activities

Page 3: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

3

Contexts (1)

• There are increasing amounts of information in digital form being used in HE, e.g.:– Research outputs (publications, data)– Learning objects– Administrative records (electronic records

management systems, databases, Web sites)– Information licensed from third-parties (e.g., e-

journals, research databases)

Page 4: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

4

Contexts (2)

• There is a strategic need to manage these assets on behalf of the institution, e.g.:– Compliance with:

• Freedom of Information (FoI) legislation• Data Protection legislation

– Verifiability and reproducibility of research• Research Council rules on data retention

– The Open Access agenda

Page 5: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

5

Contexts (3)

• Institutional responses include:– Electronic Records Management Systems– Institutional Repositories

• Supra-institutional initiatives:– Some research councils fund central

repositories for certain types of data– Many other discipline-based databases

Page 6: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

6

Contexts (4)

• The main drivers for digital curation:– An increasing awareness that digital assets are

vulnerable– Continuing access is vital to ensure that

contemporary scholarship is reproducible and verifiable

– Digital assets can be re-used in innovative ways to create new research

Page 7: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

7

Digital Curation Centre

• Launched: Edinburgh, 5 November 2004

• Grant funding from:

– Joint Information Systems Committee (JISC)

– UK e-Science Core Programme (Engineering and Physical Sciences Research Council)

• Main activities:

– Development, services and outreach in digital curation

– Research programme

• Now in second phase

Page 8: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

8

DCC partners• University of Edinburgh

– Database Research Group (School of Informatics)– AHRC Research Centre for Studies in Intellectual Property and

Technology Law– EDINA– National e-Science Centre

• University of Glasgow– Humanities Advanced Technology and Information Institute

• UKOLN, University of Bath

• Science and Technology Facilities Council– Rutherford Appleton and Daresbury laboratories

Page 9: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

9

Digital curation

• Active management of data over life-cycle of scholarly and scientific interest– Reproducibility– Reuse

• Appreciation of differences between disciplines• Importance of lifecycles

– Conception, creation, use, re-use– Potentially involving a lifetime of endeavour

Page 10: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

10

DCC purpose

• Supporting and promoting continuing improvement in the quality of data curation and digital preservation activity

Page 11: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

11

DCC vision

• Centre of excellence in digital curation and preservation in the UK

• Authoritative source of advocacy and expert advice and guidance to the community

• Key facilitator of an informed research community with established collaborative networks of digital curators

• Service provider of a wide range of resources, software, tools and support services

Page 12: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

12

DCC objectives

• Provide strategic leadership in digital curation and preservation for the UK research community, with particular emphasis on science data

• Influence and inform national and international policy• Provide advocacy and expert advice and guidance to practitioners and

funding bodies• Create, manage and develop an outstanding suite of resources and

tools• Raise the level of awareness and expertise amongst data creators

and curators, and other individuals with a curation role• Strengthen community curation networks and collaborative

partnerships• Continue strong association with our research programme

Page 13: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

13

DCC research goals

• Bringing Strands of Curation together including– Traditional archiving functions– The curation of evolving knowledge, e.g. as

seen in scientific databases• Conduct research in areas crucial to digital curation• To institute two-way conduits between research

activity and service provision

Page 14: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

14

DCC research agenda

• Data integration and publishing• Annotation• Provenance and data quality• Data citation• Metadata extraction• Archiving and appraisal• Legal issues• Networks of trusted repositories • Economic cost-benefit analysis of curation

Page 15: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

15

DCC tools and infrastructure

• Representation Information Registry and Repository– Representation Information is all of the

information needed to turn byte-streams into something meaningful

– Pilot registry developed in phase 1; it now needs to be deployed as a service

• Toolkits for other types of metadata• Packaging tools, e.g. XFDU (XML Formatted Data

Unit), SAFE (Standard Archive Format for Europe)

Page 16: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

16

DCC user services

• Resources:– Helpdesk

– Publications

– Databases of external resources and standards (DIFFUSE)

• Curation services– e.g., DRAMBORA (Digital Repository Audit Method Based on

Risk Assessment) Toolkit: http://www.repositoryaudit.eu/

• Professional development (training events)• LOCKSS Technical Support Service

Page 17: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

17

DCC community development

• Raising awareness of DCC and dissemination of results:– Web portal (http://www.dcc.ac.uk/)– International Journal of Digital Curation (IJDC)– International Conference (annual)

• Associates Network• Understanding users and their needs, e.g.:

– Specific events organised with data centres– SCARP - separately funded project

Page 18: Data curation and preservation: the Digital Curation Centre

a centre of expertise in data curation and preservation

Support for e-Research, Edinburgh, 14 June 2007

Funded by:

Data Curation and Preservation: the Digital Curation Centre

Michael DayDCC Research Team

UKOLN, University of [email protected]

http://www.dcc.ac.uk/http://www.ukoln.ac.uk/