DG Joint Research Centre
13th March 2014
Alice Vasilescu - Deloitte
Study and guidelines on Geospatial Linked Data as part of ISA Action 1.17
Persistent IDentifiers (PIDs)
Agenda
1 Introduction
2 Context
3 Approach
4 Preliminary results
5 Sharing your experience – What barriers could there be for governing PIDs?
6 Next steps
Introduction
We will provide an outline of the study and our work so far with regard to PIDs. This webinar is also an opportunity to provide feedback and to exchange experiences: what barriers could there be for governing PIDs?
Today’s webinar: Persistent Identifiers (PIDs)
Webinar II: Guidelines on methodologies
Presentation of the first version of the guidelines for a common RDF Vocabulary for INSPIRE data and an approach to PID governance: opportunities for community-led improvements and next steps towards an official INSPIRE encoding.
1. Setting the scene
2. Developing proposals
3. Refinement & recommendations
Objectives
This study has been prepared in the context of the Interoperability Solutions for European Public Administrations (ISA) Programme and, in particular, A Reusable INSPIRE Reference Platform (ARE3NA, ISA Action 1.17).
Connecting Geospatial Data
This study should provide:
1. Shared evidence about the current status in Europe of linked (geospatial) data related to INSPIRE.
2. An initial common/agreed methodology and guidelines towards RDF encodings for INSPIRE.
3. Recommendations for how location PIDs could be governed for INSPIRE and other relevant activities.
Scope
Resources on the web that contain data about real-world things
Real-world things such as a river basin, train station, a country, etc.
Linked Data Principles
1) Use URIs as names for things.
2) Use HTTP URIs, so that people can look up those names.
3) When someone looks up a URI, useful information should be provided, using the standards (RDF, SPARQL).
4) Include links to other URIs, so that they can discover more things.
PID: A persistent identifier (PI) is a long-lasting reference to a digital object (a single file or set of files). Noted persistent identifier systems include Archival Resource Keys (ARKs), Digital Object Identifiers (DOIs), Persistent Uniform Resource Locators (PURLs), Uniform Resource Names (URNs), and Extensible Resource Identifiers (XRIs).
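As a rough illustration of the four Linked Data principles listed on this slide, the following minimal sketch names a real-world thing with an HTTP URI and describes it in RDF (serialised as N-Triples), including a link to another URI. The base URI, the station identifier, the label and the linked URI are all hypothetical and use reserved example domains; none of them come from the study.

```python
from urllib.parse import urlparse

# Hypothetical base URI (assumption): example.org is reserved for documentation.
BASE = "http://example.org/id"

# Well-known RDF vocabulary terms used for the description.
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def thing_uri(kind: str, local_id: str) -> str:
    """Principle 1: name a real-world thing with a URI."""
    return f"{BASE}/{kind}/{local_id}"


def is_http_uri(uri: str) -> bool:
    """Principle 2: the name should be an HTTP(S) URI so it can be looked up."""
    return urlparse(uri).scheme in ("http", "https")


def describe(uri: str, label: str, same_as: list) -> list:
    """Principles 3 and 4: RDF statements (N-Triples) with a label and links."""
    escaped = label.replace("\\", "\\\\").replace('"', '\\"')
    lines = [f'<{uri}> <{RDFS_LABEL}> "{escaped}" .']
    lines += [f"<{uri}> <{OWL_SAME_AS}> <{other}> ." for other in same_as]
    return lines


# Hypothetical train station and a hypothetical external URI for the same thing.
station = thing_uri("train-station", "0042")
triples = describe(
    station,
    "Example Central Station",
    ["http://example.com/stations/central"],
)
for line in triples:
    print(line)
```

In a real deployment the description would be returned when the URI is looked up over HTTP; here it is only printed.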
Approach
Governance, Operations, Financing, Architecture
Deloitte Data Governance Framework (COBIT based)
DAMA Framework (DAMA-DMBOK Guide)
Data Governance
Strategy
Policies & Principles
Organization
Processes
Tools & Technology
Governance Controls
Evaluation Framework
• Governance refers to the control and steering of decisions on PIDs.
• Operations is about the processes and tools needed to run PIDs.
• Financing is about the resources needed for the operations and the architectural updates.
• Architecture corresponds to the formal specifications around PIDs.
Literature
Archer, P., Dekkers, M., Goedertier, S., & Loutas, N. 2013. Study on business models for Linked Open Data (BM4LOGD). Type: R. Source: https://joinup.ec.europa.eu/sites/default/files/Study_on_business_models_for_Linked_Open_Government_Data_BM4LOGD_v1.00_2.pdf
Overbeek, H. & van den Brink, L. 2013. Towards a national URI Strategy for Linked Data of the Dutch public sector. Type: R. Source: http://www.pilod.nl/images/a/aa/D1-2013-09-19_Towards_a_NL_URI_Strategy.pdf
Overbeek, H. & Brentjes, T. 2013. Concept URI Strategy for the NL Public Sector. Type: PPT. Source: http://www.geonovum.nl/sites/default/files/2013-03-12_uri-strat.pdf
Portele, C. 2013. URI strategy of INSPIRE. Type: PPT. Source: http://www.geonovum.nl/sites/default/files/4clemensportele.pdf
Vanbockryck, J. & Robbrecht, J. 2012. Concepts of Meta-SDI. Type: PPT. Source: http://www.poweredbyinspire.eu/documents/0403-linkeddata-robbrechtvanbockryck.pdf
Berendt, B. 2014. USEWOD 2014: Building a Web Observatory for research on LOD usage. Type: W. Source: http://people.cs.kuleuven.be/~bettina.berendt/USEWOD2014/
DDGI. 2013. Open Government Data – Verwaltungsdaten frei für Wirtschaft und Gesellschaft. Type: PP. Source: http://www.google.be/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&ved=0CCsQFjAA&url=http%3A%2F%2Fwww.ddgi.de%2Fdownloads-topmenu-8%2Fdoc_download%2F1066-positionspapier-open-government-data-verwaltungsdaten-frei-fuer-wirtschaft-und-gesellschaft&ei=Rl3vUvnAOMijhgeLx4D4Dg&usg=AFQjCNHDyPud3AJxKZTzBI7ovAJ3z3lzWg
Geonovum. 2013. Projectplan Linked Open Data. Type: R. Source: http://www.geonovum.nl/sites/default/files/projectplan_linked_open_data_2012-2013.pdf
Schade, S. & Smits, P. 2012. Why Linked Data should not lead to next generation SDI. Type: P. Source: IEEE International Geoscience and Remote Sensing Symposium (IGARSS), pp. 2894-2897
Archer, P., Goedertier, S. & Loutas, N. 2012. 10 Rules for Persistent URIs. Type: R. Source: https://joinup.ec.europa.eu/community/semic/document/10-rules-persistent-uris
W3C. 2012. 223 Best Practices URI Construction. W3C GLD WG wiki. Type: W. Source: http://www.w3.org/2011/gld/wiki/223_Best_Practices_URI_Construction
Davidson, P., Murray, K. & Williams, S. 2011. Designing URI Sets for Location. Version 1.0. Type: R. Source: http://data.gov.uk/sites/default/files/Designing_URI_Sets_for_Location-V1.0_10.pdf
Davidson, P. 2010. Designing URI sets for the UK public sector. Version 1.0. Interim paper. Type: P. Source: https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/60975/designing-URI-sets-uk-public-sector.pdf
UK CTOC. 2014. URI patterns for identifiers proposal. Type: W. Source: http://standards.data.gov.uk/proposal/uri-patterns-identifiers-proposal
Abbas, S. & Ojo, A. 2014. Applying Design Patterns in URI Strategies - Naming in Linked Geospatial Data Infrastructure. Type: P. Source: https://www.deri.ie/content/applying-design-patterns-uri-strategies-naming-linked-geospatial-data-infrastructure
Sauermann, L. & Cyganiak, R. 2008. Cool URIs for the Semantic Web. W3C Interest Group Note. Type: W. Source: http://www.w3.org/TR/cooluris/
2014. Digital Object Identifier System (DOI). Type: W. Source: http://www.doi.org/
2014. The Handle System. Type: W. Source: http://www.handle.net/
2014. EZID (Easy ID) – Long Term Identifiers Made Easy. Type: W. Source: http://n2t.net/ezid/
EC, DG JRC. Implementation of Identifiers using URIs in INSPIRE – Frequently Asked Questions. Type: W. Source: http://inspire.ec.europa.eu/index.cfm/pageid/5120
PURL. 2013. Persistent Uniform Resource Locators. Type: W. Source: http://purl.org/
KE – Knowledge Exchange. 2013. Persistent Identifiers Studies. Type: W. Source: http://www.knowledge-exchange.info/Default.aspx?ID=332
If there is anything relevant missing from our list, please contact us: [email protected]
The list will be provided together with this current presentation.
Governance
Organisation:
• Austria – REEEP (Renewable Energy and Energy Efficiency Partnership), Reegle.info
• Belgium – Agiv Flemish Land Agency
• Denmark – Knowledge Exchange
• Germany – German National Library
• Italy – Agenzia per l'Italia Digitale
• Netherlands – SURF Initiative
• United Kingdom – Chief Technology Officer Council
• W3C
Work:
• Study on business models for Linked Open Data (BM4LOGD)
• Concepts of Meta-SDI
• URI Strategy of INSPIRE
• Designing URI sets for UK Public Sector
• Designing URI sets for Location (UK)
Input reviewed
Unknown yet
Governance
Preliminary conclusions
Based on available data analyzed until now:
1) Policy is often associated with URI design patterns and the management of domains. However, policy involves more than these two elements.
2) When a policy exists, it is not always well formalized, depending on the size and organizational structure of the organization.
3) There seem to be no examples of PID organizational structures in the context of INSPIRE and in e-Government.
4) There is a lot of theoretical work about the “technical side” of persistent identifiers, often associated with linked data.
5) A holistic picture that integrates the GOFA dimensions (Governance, Operations, Financing, Architecture) is lacking.
Barriers
Based on available data analyzed until now:
1) PID governance is an inherently complex topic, given the different ways in which “ICT governance and management” is done in the Member States.
2) There is “no silver bullet” for PID governance.
Financing
Evaluation framework dimensions: Governance, Operations, Financing, Architecture
• PIDs Policy: Defines what can be assigned a PID and how.
• PIDs Organisational structure: Defines who has the authority and control (planning, monitoring and enforcement) over the management of PIDs.
• Validation: Validates the format, location and uniqueness of PIDs.
• Registration: Creates and registers PIDs.
• Redirection: Uses PIDs to redirect to the desired Web resource.
• Long Term Preservation: Archives fallback copies of “dead” resources.
• Business Case: Justifies the investment in persistent identifiers.
• Cost Model: Defines the allocation of costs and charges associated with the provision of PIDs.
• Service Model: Describes the set of services that will benefit from the use of PIDs; this encompasses both data and metadata about services.
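To make the validation, registration, redirection and long-term preservation components above more concrete, here is a minimal in-memory sketch. It is not taken from the study: the `PidRegistry` class, the PID pattern and all URLs are hypothetical, and a real service would be backed by persistent storage and an agreed PID policy.

```python
import re
from dataclasses import dataclass, field
from typing import Dict, Optional

# Hypothetical policy pattern (assumption), for illustration only.
PID_PATTERN = re.compile(r"^http://example\.org/id/[a-z-]+/[A-Za-z0-9._-]+$")


@dataclass
class PidRegistry:
    """In-memory stand-in for a PID registration and redirection service."""

    targets: Dict[str, Optional[str]] = field(default_factory=dict)  # PID -> live location
    archive: Dict[str, str] = field(default_factory=dict)  # PID -> archived copy

    def validate(self, pid: str, location: str) -> None:
        """Validation: check the format, location and uniqueness of a PID."""
        if not PID_PATTERN.match(pid):
            raise ValueError(f"PID does not match the policy pattern: {pid}")
        if not location.startswith(("http://", "https://")):
            raise ValueError(f"Location is not an HTTP(S) URL: {location}")
        if pid in self.targets:
            raise ValueError(f"PID is already registered: {pid}")

    def register(self, pid: str, location: str) -> None:
        """Registration: create the PID once it has passed validation."""
        self.validate(pid, location)
        self.targets[pid] = location

    def retire(self, pid: str, archived_copy: str) -> None:
        """Long-term preservation: the resource is 'dead', keep a fallback copy."""
        if pid not in self.targets:
            raise KeyError(pid)
        self.targets[pid] = None
        self.archive[pid] = archived_copy

    def resolve(self, pid: str) -> str:
        """Redirection: return the live location, or the archived copy if retired."""
        location = self.targets[pid]
        return location if location is not None else self.archive[pid]


registry = PidRegistry()
pid = "http://example.org/id/river-basin/rb-001"
registry.register(pid, "http://data.example.org/basins/rb-001.html")
live_location = registry.resolve(pid)
registry.retire(pid, "http://archive.example.org/basins/rb-001.html")
archived_location = registry.resolve(pid)
```

The point of the sketch is that the PID itself never changes: only the location it resolves to does, including after the original resource has disappeared.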
Financing – Business case
Organisation:
• United Kingdom – Department for Environment, Food and Rural Affairs (DEFRA)
Work:
• Study on business models for Linked Open Data (BM4LOGD)
Input reviewed
Unknown yet
Financing – Cost model
Organisation:
• European Commission
• Austria – REEEP (Renewable Energy and Energy Efficiency Partnership), Reegle.info
• Belgium – INSPIRE State Report
• Germany – INSPIRE State Report
• Italy – Agenzia per l'Italia Digitale
• UK – INSPIRE State Report
Work:
• Study on business models for Linked Open Data (BM4LOGD)
• Belgium – INSPIRE State Report
• Germany – INSPIRE State Report
Input reviewed
Unknown yet
Financing
Preliminary conclusions
Based on available data analyzed until now:
1) There are no formalized business cases for PIDs; we believe that the interest in linked data is sometimes enough. For example, John Sheridan, Head of Legislation Services: “A business case for using linked data would be like making a business case for using electricity.”
2) There seems to be no cost model for PIDs. Cost models for Linked Data or for the implementation of INSPIRE, when they can be found, differ greatly from one case to another. A few examples:
• Linked Data: development costs, maintenance costs, promotion costs
• INSPIRE Germany: operating costs of the IT infrastructure, production of interoperability, business model for the implementation of INSPIRE, IT structure, processing of spatial data, personnel
Barriers
1) There is no clear understanding of the investment needed to set up PIDs.
Operational
Organisation:
• Germany – German National Library
• Netherlands – SURF
• United Kingdom – Chief Technology Officer Council
Work:
• PersID III.a – Current State and State of the Art & III.b – User Requirements
• 10 Rules for Persistent URIs
• URI Strategy of INSPIRE
• Designing URI sets for UK Public Sector
• Designing URI sets for Location (UK)
• Towards a national URI Strategy for Linked Data of the Dutch public sector
Input reviewed
Unknown yet
Operational
Preliminary conclusions
Based on available data analyzed until now:
1) The domain owner is implicitly considered to be responsible for registration and validation.
2) There is little discussion about long-term preservation, and most papers mention 303 redirection.
3) There is little discussion about validation of PIDs (it is much more about standardization and very little about control).
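For readers unfamiliar with the 303 redirection mentioned in the conclusions above, the following minimal sketch shows the general idea using only the Python standard library: a request for the identifier of a real-world thing is answered with HTTP status 303 (See Other) and a `Location` header pointing at a document about that thing. The paths and the `DOCUMENTS` mapping are hypothetical, and this is an illustration of the pattern rather than any of the reviewed implementations.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical mapping (assumption) from identifier paths to document paths.
DOCUMENTS = {"/id/river-basin/danube": "/doc/river-basin/danube"}


class PidHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        document = DOCUMENTS.get(self.path)
        if document is None:
            self.send_error(404, "Unknown identifier")
            return
        # 303 See Other: the identifier names a thing; the document describes it.
        self.send_response(303)
        self.send_header("Location", document)
        self.send_header("Content-Length", "0")
        self.end_headers()

    def log_message(self, format, *args):
        pass  # keep the example quiet


def fetch_status(port, path):
    """Issue one GET without following redirects; return (status, Location)."""
    conn = http.client.HTTPConnection("127.0.0.1", port)
    conn.request("GET", path)
    response = conn.getresponse()
    response.read()
    result = (response.status, response.getheader("Location"))
    conn.close()
    return result


# Start a local server on a free port, query it twice, then shut it down.
server = HTTPServer(("127.0.0.1", 0), PidHandler)
threading.Thread(target=server.serve_forever, daemon=True).start()
port = server.server_address[1]
known = fetch_status(port, "/id/river-basin/danube")
unknown = fetch_status(port, "/id/river-basin/unknown")
server.shutdown()
server.server_close()
print(known, unknown)
```

The sketch only covers redirection; it does nothing about validation or control of the identifiers it serves.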
Barriers
Architecture
Organisation:
• Austria – REEEP (Renewable Energy and Energy Efficiency Partnership), Reegle.info
• Belgium – Agiv Flemish Land Agency
• Denmark – Knowledge Exchange
• Germany – German National Library
• Ireland – Digital Enterprise Research Institute, National University of Ireland, Galway
• Italy – Agenzia per l'Italia Digitale
• Netherlands – SURF
• United Kingdom – Chief Technology Officer Council
• W3C
Work:
• Study on business models for Linked Open Data (BM4LOGD)
• Concepts of Meta-SDI
• URI Strategy of INSPIRE
• Designing URI sets for UK Public Sector
• Designing URI sets for Location (UK)
Input reviewed
Unknown yet
Architecture
Preliminary conclusions
Most work has been done in this area.
1) The most referenced works are 10 Rules for Persistent URIs and Designing URI sets for UK Public Sector.
2) There is no EU-agreed policy for PIDs for centrally managed, shared resources.
3) Several guidelines have been put forward for URI persistence (W3C – RFC 2616; UK – Designing URI sets for UK Public Sector; ISA), but currently these are only recommendations and are not always aligned.
Barriers
1) Control and validation are as important as standardization but are often overlooked.
Sharing your experience – What barriers could there be for governing PIDs?
Governance, Operations, Financing, Architecture, or our approach
Next steps
WEBINAR II: Guidelines & methodologies 5 & 6 May 2014
1. Highlights
2. Refinement
Follow us on Joinup, where we will provide the highlights of this webinar.
Your feedback is much appreciated at any time.
Join the collaborative platforms in other areas of Open Source & Semantic Interoperability on https://joinup.ec.europa.eu/
Connecting Geospatial Data
Thank you for your participation!