data provision and the sbb-spk - oclc...data provision and the sbb-spk current status and...
TRANSCRIPT
Data provision and the SBB-SPKCurrent status and perspectives
Reinhard Altenhöner, Staatsbibiothek zu Berlin – Stiftung Preußischer Kulturbesitz
Berlin, 21.02.2017
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 2
Who we are…
Five institutions united within the Foundation
Staatliche Museen zu Berlin
Staatsbibliothek zu Berlin
Geheimes Staatsarchiv
Preußischer Kulturbesitz
Ibero-Amerikanisches Institut
Staatliches Institut
für Musikforschung
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 3
Prussian Cultural Heritage Foundation
museums, libraries, archives, and research institutes
collections with an universal approach showing the evolution of human culture from its beginnings to the present in Europe and on other continents
Continuous acquisition and cooperative research
largest employer in the cultural field (2,000 employees)
The state library was founded in 1661
The biggest research library in the German-speaking countries
more than 11 mio books (huge special collections (manuscripts, early books, music sheets, …), 25 mio items overall
focus on the humanities and social sciences, scientific literature in all languages, all times and all countries
Also a digital library, special attention for access to retrodigitised material
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 4
Promoting our identity and the identity of our collections
What we all want Institutional visibility (in general), prominance (locally, regional, global)
Dedicated allocation of trust & quality for our services, becoming a brand
Recognition of our services, acknowledgment in terms of ressources
One way to reach this: Provide your (open) data (metadata and digital objects) What does this mean, what should we do?
What are the limitations?
How can we measure success?
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 5
The SPK in a Digital World
Digital WorldSBB
SPK
IAISMB
GSTA
SIM
From: Institutional paths
(with digital sidelines)
To: One SPK strategy
with a digital core
SPK
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 6
Data production and services in the SBB/SPK: Metadata, sharing, visibility I
Metadata production in SBB & SPK:
Many dedicated systems for different kind of material and departments
different access systems
Some aggregated stores with unified collections, metastore (but strong limits)
flat web syndication
Uncertainties around the legal situation
SBB: Metadata governance
Shared environment (with plenty of exceptions)
Clear rules for semantic enrichement of data (authority files)
No clearly mandated responsibility for metadata governance
Marketing of data and services
Separated and isolated view, PR-driven
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 7
Data production and services in the SBB/SPK: Metadata, sharing, visibility II
Metadata proliferation
Context sensitive for download of structured data (single units or small blocksof data), OAI – Interface for broader request
Data provision / sharing (GBV, Worldcat, DDB)
Linked Data Services e.g. ISIL
Accepted service
Usage rates?
To sum up:
Difficulties to measure (Download rates, registered users vs. open dataapproach)
What we can say: Minor use overall
Global visibility? Institutional promotion?
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 8
OAI-Interface - result set
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 9
ISIL – Linked Data Service
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 10
Reuse of Metadata
Big move to expose metadata – a common approach with many examples
DNB, BNF, KB of Sweden, OCLC (still experimental), …
LD4P, LD4L
BIBFRAME, Schema.org, Google's data- vocabulary.org… Advanced translationsets how bibliographic data can be shifted into the broader net
It‘s proven (?): We have
new ways to expose and distribute metadata
Extended options to connect data
Potentials for a new infrastructure platform
But: This is from specialists to specialists: „Semantic-Web-Community”
Low rate of reuse outside the community
Visibility of data, institutional promotion?
Only one (additional) way
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 11
Some interim results for SBB / SPK, orientation
Metadata production
Shift to aligned data schemas offering more options in order to facilitate theinteroperability of data
Entity based recording of objects
Shared environments
Community-based standard development, but in broader scope, e.g. provenance information
First steps for a semantic web standard-driven ecosystem
Metadata diffusion
Traditional way is not so bad and almost sufficient
Support for broader platforms
Open data – invisibility?
Change the expectation for success
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 12
Broader scope of visibility
Open data. It’s nearly impossible to claim for originality of data
We can’t measure the “success” of metadata provision
It’s more on sharing knowledge in the (extended) community and empowering the network
Our customers are asking for content
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 13
Visibility of data in the web
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 14
Extended visibility in the net
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 15
Another example from the SBB: OPAC
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 16
Another example from the SBB: Presentation
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 17
Second example I
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 18
Second example II
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 19
An example from the DDB I
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 20
An example from the DDB II
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 21
Results for SBB / SPK
Metadata! Openess, interoperability, capacity building
Lifting up the Metadata to the top of the list (Foundation)
Extension of functions w.r.t. content‒ Flexible PDF module
‒ Further formats for the full texts - such as TEI, mobi, ePub.
‒ Extended recognition and linking of authorised entities
‒ Enhanced OAI interface: full-text data.
‒ CCO license for everything
‒ Extending the full-text availability
‒ IIIF compliance
Universal reuse of images: IIIF (for reference Presentation API 2.1), but e.g. a legal statement is linked out, availability of a fulltext transcript, NER and dedicated tools support is not covered in a standardised way
Metadata to indicate in a registry / search systems, what we offer.
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 22
Discussion
Linked (open) data‒ Impact for visibility is in fact limited
‒ Level of complexity to high
‒ High potential for reuse in a joined eco-system in the LAM-sector – so let‘s move theview
‒ Reduced / simple ontologies for reuse in the net
Dedicated role of OCLC !?
The SBB/SPK-case‒ Focus on collections, extension of (data)services
‒ Relevant data outside the library
‒ Involved in an overall marketing strategy
‒ Part of the digital transformation agenda (started in 2016)
‒ Not only a technical challenge but even an organisational
‒ Integration of machine-readable information into metadata (like REL)
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 23
Data Management: Analysis of the ZDB
27.02.2017OCLC EMEA Berlin 2017: The Institution Identity - Provisioning Our Data for Greater Impact / Altenhöner Page 24
Discussing
Linked (open) data‒ Impact for visibility is in fact limited
‒ Level of complexity to high
‒ High potential for reuse in a joined eco-system in the LAM-sector – so let‘s move theview
‒ Reduced / simple ontologies for reuse in the net
Dedicated role of OCLC !?
The SBB/SPK-case‒ Focus on collections, extension of (data)services
‒ Relevant data outside the library
‒ Involved in an overall marketing strategy
‒ Part of the digital transformation agenda (started in 2016)
‒ Not only a technical challenge but even an organisational
‒ Integration of machine-readable information into metadata (like REL)
Thank [email protected]
Berlin, 27.02.2017