DG Joint Research Centre
13th March 2014
Alice Vasilescu - Deloitte
Study and guidelines on Geospatial Linked Data as part of ISA Action 1.17
Persistent IDentifiers (PIDs)
Click to edit Master title style
Agenda
Introduction
Context
3
2
2
Preliminary results 4
5 Sharing your experience – What barriers could there be for governing PIDs?
1
Next steps6
Approach
Click to edit Master title style
Introduction
3
We will provide an outline of the study and our work so far with regards to PIDs. This webinar is also an opportunity to provide
feedback and to exchange experiences: What barriers could there be for governing PIDs?
Persistent Identifiers
(PID)
GUIDELINES
WEBINAR II Guidelines on methodologies
Presentation of the first version of the guidelines for a common RDF Vocabulary for INSPIRE data and an approach to PID governance: opportunities for community-led improvements and next steps towards an official INSPIRE encoding.
Today’s webinar
1. Setting the scene
3. Refinement & recommendations
2.
Developing
proposals
β
Click to edit Master title style
Agenda
Introduction
Context
3
2
4
Preliminary results 4
5 Sharing your experience – What barriers could there be for governing PIDs?
1
Next steps6
Approach
Click to edit Master title style
Objectives
5
This study has been prepared in the context of
the Interoperability for European Public
Administrations (ISA) Programme and, in
particular A Reusable INSPIRE Reference
Platform (ARE3NA, ISA Action 1.17)
Connecting Geospatial DataThis study should provide:
1. Shared evidence about the current status
in Europe of linked (geospatial) data
related to INSPIRE.
2. An initial common/agreed methodology
and guidelines towards RDF encodings for
INSPIRE
3. Recommendations for how location PIDs
could be governed for INSPIRE and other
relevant activities.
Click to edit Master title style
Scope
6
PIDs
Real World Things
Data Sets/Objects
Resources on the web that
contain data about real-world
things
Real-world things such as a river
basin, train station, a country, etc.
Linked Data Principles
1) Use URIs as names for things
2) Use HTTP URIs, so that people can look up those names
3) When someone looks up a URI, useful information should be provided, using the standards (RDF, SPARQL)
4) Include links to other URIs, so that they can discover more things
<<assign>> <<assign>>
<<include>>
PID = A persistent identifier (PI) is a long-lasting reference to a digital object—a single file or set of files.
Noted persistent identifier systems include: Archival Resource Keys (ARKs), Digital Object Identifiers (DOIs),
Persistent Uniform Resource Locators (PURLs), Uniform Resource Names (URNs), and Extensible Resource
Identifiers (XRIs)
Click to edit Master title style
Agenda
Introduction
Context
3
2
7
Preliminary results 4
5 Sharing your experience – What barriers could there be for governing PIDs?
1
Next steps6
Approach
Click to edit Master title style
Approach
Governance Operations
Financing Architecture
Deloitte Data Governance
Framework (COBIT based)
DAMA Framework
(DAMA-DMBOK Guide)
Tools & Technology
Policies & Principles
Organi-zation
Processes
Governance Controls
DataGovernance
Strategy
Click to edit Master title style
Evaluation Framework
9
Governance Operations
Financing Architecture
Governance refers to the control and
the steering of the decisions on PIDs
Operations is about the processes
and tools needed to run PIDs
Financing is about the resources
needed for the operations and the
architectural updates
Architecture corresponds to a formal
specifications around PIDs
Click to edit Master title style
10
Evaluation Framework
Click to edit Master title style
Literature
11
Author(s) Date Title Type Description
Archer, P., Dekkers,
M., Goedertier, S., &
Loutas, N.
201
3
Study on business models for Linked
Open Data (BM4LOGD)
R Source:
https://joinup.ec.europa.eu/sites/default/files/Study_on_business_m
odels_for_Linked_Open_Government_Data_BM4LOGD_v1.00_2.p
df
Overbeek, H. & van
den Brink, L.
201
3
Towards a national URI Strategy for
Linked Data of the
Dutch public sector
R Source:
http://www.pilod.nl/images/a/aa/D1-2013-09-
19_Towards_a_NL_URI_Strategy.pdf
Overbeek, H. &
Brentjes, T.
201
3
Concept URI Strategy for the NL Public
Sector
PPT Source:
http://www.geonovum.nl/sites/default/files/2013-03-12_uri-strat.pdf
Portele, C.201
3
URI strategy of INSPIRE PPT Source:
http://www.geonovum.nl/sites/default/files/4clemensportele.pdf
Vanbockryck, J. &
Robbrecht, J.
201
2
Concepts of Meta-SDI PPT Source: http://www.poweredbyinspire.eu/documents/0403-
linkeddata-robbrechtvanbockryck.pdf
Berendt, B.201
4
USEWOD 2014: Building a Web
Observatory for research on LOD usage
W Source:
http://people.cs.kuleuven.be/~bettina.berendt/USEWOD2014/
DDGI201
3
Open Government Data –
Verwaltungsdaten frei für Wirtschaft und
Gesellschaft
PP Source:
http://www.google.be/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1
&ved=0CCsQFjAA&url=http%3A%2F%2Fwww.ddgi.de%2Fdownloa
ds-topmenu-8%2Fdoc_download%2F1066-positionspapier-open-
government-data-verwaltungsdaten-frei-fuer-wirtschaft-und-
gesellschaft&ei=Rl3vUvnAOMijhgeLx4D4Dg&usg=AFQjCNHDyPud
3AJxKZTzBI7ovAJ3z3lzWg
Geonovum201
3
Projectplan Linked Open Data R Source:
http://www.geonovum.nl/sites/default/files/projectplan_linked_open_
data_2012-2013.pdf
Schade, S. & Smits,
P.
201
2
Why Linked Data should not lead to next
generation SDI
P Source: IEEE International on Geoscience and Remote Sensing
Symposium (IGARSS), pp. 2894-2897
Archer, P.,
Goedertier, S. &
Loutas, N.
201
2
10 Rules for Persistent URIs RSource: https://joinup.ec.europa.eu/community/semic/document/10-
rules-persistent-uris
W3C201
2
223 Best Practices URI Construction.
W3C GLD WG wiki.
W Source:
http://www.w3.org/2011/gld/wiki/223_Best_Practices_URI_Construc
tion
Davidson, P.,
Murray, K. &
Williams, S.
201
1
Designing URI Sets for Location. Version
1.0.
R Source:
http://data.gov.uk/sites/default/files/Designing_URI_Sets_for_Locati
on-V1.0_10.pdf
Click to edit Master title style
Literature
12
Author(s) Date Title Type Description
Davidson, P.201
0
Designing URI sets for the UK public
sector. Version 1.0. Interim paper.
P Source:
https://www.gov.uk/government/uploads/system/uploads/attachment
_data/file/60975/designing-URI-sets-uk-public-sector.pdf
UK CTOC201
4
URI patterns for identifiers proposal W http://standards.data.gov.uk/proposal/uri-patterns-identifiers-
proposal
Abbas, S. & Ojo, A.201
4
Applying Design Patterns in URI
Strategies - Naming in Linked Geospatial
Data Infrastructure
P Source:
https://www.deri.ie/content/applying-design-patterns-uri-strategies-
naming-linked-geospatial-data-infrastructure
Sauermann, L. &
Cyganiak, R.
200
8
Cool URIs for the Semantic Web. W3C
Interest Group Note
WSource : http://www.w3.org/TR/cooluris/
201
4
Digital Object Identifier System (DOI) WSource: http://www.doi.org/
201
4
The Handle System WSource : http://www.handle.net/
201
4
EZID (Easy ID) – Long Term Identifiers
Made Easy
Whttp://n2t.net/ezid/
EC, DG JRCImplementation of Identifiers using URIs in
INSPIRE – Frequently Asked Questions.
Whttp://inspire.ec.europa.eu/index.cfm/pageid/5120
PURL201
3
Persistent Uniform Resource Locators Whttp://purl.org/
KE – Knowledge
Exchange
201
3
Persistent Identifiers Studies Whttp://www.knowledge-exchange.info/Default.aspx?ID=332
Click to edit Master title style
Literature
13
If there is anything relevant missing from our list, please contact us:
The list will be provided together with this current presentation.
Click to edit Master title style
Agenda
Introduction
Context
3
2
14
Preliminary results 4
5 Sharing your experience – What barriers could there be for governing PIDs?
1
Next steps6
Approach
Click to edit Master title style
Governance
Click to edit Master title style
Governance
16
Organization Austria – REEEP(Renewable Energyand Energy Efficiency Partnership)Reegle.info
Belgium – Agiv Flemish LandAgency
Denmark – Knowledge Exchange
Germany - German NationalLibrary
Italy - Agenzia per l'Italia Digitale
Netherlands – SURF Initiative
United Kingdom - ChiefTechnology Officer Council
W3CWork Study on business models for
Linked Open Data (BM4LOGD)
Concepts of Meta-SDI
URI Strategy of INSPIRE
Designing URI sets for UK PublicSector
Designing URI sets for Location(UK)Input reviewed
Unknown yet
Click to edit Master title style
Governance
17
Preliminary conclusions
Based on available data analyzed until now:
1) Policy is often associated with URI design
patterns and the management of domains.
However, there is definitely more than these
two elements.
2) When a policy exists it isn’t always well
formalized depending on the size and
organizational structure.
3) There seem to be no examples of PID
organizational structures in the context of
INSPIRE and in e-Government.
4) There is a lot of theoretical work about the
“technical side” of persistent identifiers, often
associated with linked data
5) Lack of a holistic picture that integrates the
several GOFA dimensions.
Barriers
Based on available data analyzed until now:
1) PIDs governance is an inherently complex
topic given the way that “ICT governance and
management” is done in the different Member
States
2) There is “no silver bullet” for PID governance
Click to edit Master title style
Financing
18
Governance Operations
Financing Architecture
PIDs PolicyDefines what and
how can be assigned a PDI
ValidationValidates the
format, location and the uniqueness of
PIDs
RegistrationCreates and registers
PIDs
Redirection Uses persistent PIDs
to redirect to the
desired Web resource
Long Term Preservation Archives fullback copies of “dead”
resources
Business CaseJustifies the investment in persistent identifiers
Cost ModelDefines the allocation of costs and charges associated with the provision of persistent PIDs.
PIDs Organisational structureDefines who has the authority and control (planning, monitoring and enforcement) over the management of PIDs
Service ModelDescribes the set of services that will benefit from the use of persistent PIDs, this encompasses both data and metadata about services
Financing
Click to edit Master title style
Financing – Business case
19
Organisation United Kingdom - Department ofEnvironment, Food and RuralAffairs (DEFRA)
Work Study on business models forLinked Open Data (BM4LOGD)
Input reviewed
Unknown yet
Click to edit Master title style
Financing – Cost model
20
OrganisationEuropean Commission
Austria – REEEP (RenewableEnergy and Energy EfficiencyPartnership) Reegle.info
Belgium – INSPIRE State Report
Germany – INSPIRE State Report
Italy- Agenzia per l'Italia Digitale
UK – INSPIRE State ReportWork Study on business models for
Linked Open Data (BM4LOGD)
Belgium – INSPIRE State Report
Germany – INSPIRE State Report
Input reviewed
Unknown yet
Click to edit Master title style
Financing
21
Preliminary conclusions
Based on available data analyzed until now:
1) There are no formalized business cases for
PIDs – we believe that the interest in linked
data is sometimes enough:
e.g. John Sheridan, Head of
Legislation Services “A business
case for using linked data would like
making a business case for using
electricity.”
2) There seems to be no cost model for PIDs.
For Linked Data or the implementation of
INSPIRE, when they can be found, they are
very different from one case to the other.
Below are a few examples:
Development costs, Maintenance costs, Promotion
costs – Linked Data
Operating costs of the IT infrastructure, Production of
interoperability Business model for the implementation
of INSPIRE, IT structure Processing of spatial data,
Personnel – INSPIRE Germany
Barriers
1) No clear understanding on the investment
needed to set up PIDs.
Click to edit Master title style
Financing
22
Governance Operations
Financing Architecture
PIDs PolicyDefines what and
how can be assigned a PDI
ValidationValidates the
format, location and the uniqueness of
PIDs
RegistrationCreates and registers
PIDs
Redirection Uses persistent PIDs
to redirect to the
desired Web resource
Long Term Preservation Archives fullback copies of “dead”
resources
Business CaseJustifies the investment in persistent identifiers
Cost ModelDefines the allocation of costs and charges associated with the provision of persistent PIDs.
PIDs Organisational structureDefines who has the authority and control (planning, monitoring and enforcement) over the management of PIDs
Service ModelDescribes the set of services that will benefit from the use of persistent PIDs, this encompasses both data and metadata about services
Operational
Click to edit Master title style
Operational
23
OrganisationGermany - German NationalLibrary
Netherlands - SURF
United Kingdom - ChiefTechnology Officer Council
Work PersID III.a – Current State andState of the Art & III.b – UserRequirements
10 Rules for Persistent URIs
URI Strategy of INSPIRE
Designing URI sets for UK PublicSector
Designing URI sets for Location(UK)
Towards a national URI-‐Strategyfor Linked Data of the Dutchpublic
Input reviewed
Unknown yet
Click to edit Master title style
Operational
24
Preliminary conclusions
Based on available data analyzed until now:
1) The domain owner is implicitly considered
to be responsible for registration and
validation
2) There is little discussion about long term
preservation and most papers mention 303
Redirection
3) There is little discussion about validation
of PIDs (it is much more about
standardization and very little about control)
Barriers
Click to edit Master title style
Financing
25
Governance Operations
Financing Architecture
PIDs PolicyDefines what and
how can be assigned a PDI
ValidationValidates the
format, location and the uniqueness of
PIDs
RegistrationCreates and registers
PIDs
Redirection Uses persistent PIDs
to redirect to the
desired Web resource
Long Term Preservation Archives fullback copies of “dead”
resources
Business CaseJustifies the investment in persistent identifiers
Cost ModelDefines the allocation of costs and charges associated with the provision of persistent PIDs.
PIDs Organisational structureDefines who has the authority and control (planning, monitoring and enforcement) over the management of PIDs
Service ModelDescribes the set of services that will benefit from the use of persistent PIDs, this encompasses both data and metadata about services
Architecture
Click to edit Master title style
Architecture
26
OrganisationAustria – REEEP(RenewableEnergy and Energy EfficiencyPartnership) Reegle.info
Belgium – Agiv Flemish LandAgency
Denmark – Knowledge Exchange
Germany - German NationalLibrary
Ireland - Digital EnterpriseResearch Institute, NationalUniversity of Ireland, Galway
Italy - Agenzia per l'Italia Digitale
Netherlands - SURF
United Kingdom - ChiefTechnology Officer Council
W3C
Work Study on business models forLinked Open Data (BM4LOGD)
Concepts of Meta-SDI
URI Strategy of INSPIRE
Designing URI sets for UK PublicSector
Designing URI sets for Location(UK)
Input reviewed
Unknown yet
Click to edit Master title style
Architecture
27
Preliminary conclusions
Most work has been done in this area.
1) Most referenced work is the 10 Rules for
Persistent URI and Designing URI sets for
UK Public Sector
2) There is no EU agreed policy for PIDs for
centrally managed, shared resources
3) Several guidelines have been put forward
for the URI persistence ( W3C - RFC2616., UK
- Designing URI sets for UK Public Sector,
ISA), but currently these are just
recommendations and not always aligned
Barriers
1) Control and validation are as important as
standardization but often overlooked.
Click to edit Master title style
Agenda
Introduction
Context
3
2
28
Preliminary results 4
5 Sharing your experience – What barriers could there be for governing PIDs?
1
Next steps6
Approach
Sharing your experience – What barriers could there be for governing PIDs?
Click to edit Master title style
Sharing your experience – What barriers could there be for governing PIDs?
12/12
30
Operations
Financing
Architecture
Or our approach
Governance
Click to edit Master title style
Agenda
Introduction
Context
3
2
31
Preliminary results 4
5 Sharing your experience – What barriers could there be for governing PIDs?
1
Next steps6
Approach
Click to edit Master title style
Next steps
32
WEBINAR II: Guidelines & methodologies 5 & 6 May 2014
1. Highlights
2. Refinement
Follow us on Joinup. We will provide the highlights of this
webinar
AND….
At any time, your feedback is very appreciated.
Click to edit Master title style
Next steps
33
Join the collaborative platforms in
other areas of Open Source &
Semantic Interoperability on
https://joinup.ec.europa.eu/
Connecting Geospatial Data
Thank you for your participation!