from quality on spatial data to data quality vocabulary (dqv) · 2018-06-22 · that a potential...

15
2nd International Workshop on Spatial Data Quality From quality on spatial data to data quality vocabulary (DQV) for the Semantic Web - the Norwegian experience of aligning ISO 19157 Data Quality to DQV. Morten Borrebæk/Magni Busterud Norwegian Mapping Authority

Upload: others

Post on 17-Jun-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

2nd International Workshop on Spatial Data Quality

From quality on spatial data to data quality vocabulary (DQV) for the Semantic Web - the Norwegian experience of aligning ISO 19157 Data

Quality to DQV.

Morten Borrebæk/Magni Busterud

Norwegian Mapping Authority

Page 2: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

Norway

• 5 million inhabitants

• 18 counties

• 422 municipalities

• 324 000 km2 land territory

• 2 million km2 sea territory

• 3275 datasets

• >200 INSPIRE datasets

What about quality??

Page 3: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

Background

ISO 19157:2013 Data Quality

• Quality model• Evaluation method• Reporting• Conformance tests• National register of quality meassures

Spatial data quality

Production of basis spatial data

Page 4: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

Backgroundclass Fig C.1 - Specification Packages

ISO 19110 Methodology for feature cataloguing

(from ISO TC211)

«Leaf»

Specification Feature Based Information

(from Specification Content and Structure)

Specification Content and StructureISO 19115:2006 Metadata (Corrigendum)

(from ISO 19115-All Metadata)

Specification Additional Information

«Leaf»

Specification Scopes

«Leaf»

Specification Reference System

«Leaf»

Specification Portrayal Information

«Leaf»

Specification Maintenance Information

«Leaf»

Specification Identification

«Leaf»

Specification Deliv ery Information

«Leaf»

Specification Data Quality Requirement

«Leaf»

Specification Data Capture Information

«Leaf»

Specification Cov erages and Images

(from Specification Content and Structure)

«Leaf»

DPS

Packages in a DPSclass Class Model

Data product

specificationData Product Dataset Metadata

+describedBy

1..

+specifies

0..*

+implementedAs

0..*

Page 5: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

Status (Geonorge.no)

Not all products have a product specificationNot all product specifications have quality requirements

Still a long way to go!

Page 6: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

Spatial data and eGovernment in Norway

1st priority: Data Catalog

Other domain-specific cataloges

Spatial data

https://fellesdatakatalog.brreg.no/datasets/

https://kartkatalog.geonorge.no/search

http://inspire-geoportal.ec.europa.eu/

https://www.europeandataportal.eu/

https://data.europa.eu/euodp/en/data/

Page 7: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

BIM

ICT policy

GIS

W3C(Semantic web)

Information management

ISO/TC 59/SC13 and CEN/TC 442

ISO/TC 211 - OGC

Geospatial solutions(983K Euro)

(TOGAF/ARCHIMATE)

(UML)

(Express)

EIF

131 Million Euro(2016-2020)

URI-pattern

Spatial data and eGovernment in Norway

Page 8: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

ISA – Interoperability Solutions for European public administration – impact assessment

Experts(By invitation only)

Public hearingEUEEC

”Endorsement”

Public Authorities

Data Catalog

Vocabulary(DCAT)

Agency for Public Management and eGovernment

Page 9: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

Quality in eGoverment

There is a «strong» national recommendation to describe quality of datasets in the catalogue of publicsector datasets.

When the public sectors have described quality in theirdatasets, it shall be implemented in the catalogue.

The DCAT Application Profile for data portals (DCAT-AP) is a specification based on the Data Cataloguevocabulary (DCAT).

DCAT-AP provides a common specification for describing public sector datasets in Europe to enablethe exchange of descriptions of datasets among data portals.

DCAT-AP has an extension GeoDCAT-AP for describing geospatial datasets, dataset series and services.

Quality will be implemented as an extension to theDCAT-AP-NO profile.

Automatic processing and digital services can be improvedand made more efficient withgood access to qualitycontrolled data from publicsector datasets.

Digital agendaNorway

Page 10: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

International standard ontologyhttps://github.com/ISO-TC211/GOM/tree/master/isotc211_GOM_harmonizedOntology/19157/2013

Derived from the UML models in ISO 19157 Data Quality, applying the mappingrules in ISO 19150-2:2015, which defines the conversion of the UML static view modeling elements used in the ISO geographic information standards into OWL/RDF.

Page 11: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

Data Quality vocabulary (DCAT)

Provided by W3C, draft document that may be updated, replaced or obsoleted by other documents at any time.

It provides a framework in which the quality of a dataset can be described, whether by the dataset publisher or by a broader community of users. It does not provide a formal, complete definition of quality, rather, it sets out a consistent means by which information can be provided such that a potential user of a dataset can make his/her own judgment about its fitness for purpose.

Mapping to ISO/IEC 25012:2008 Software engineering -- Software product Quality Requirements and Evaluation (SQuaRE) -- Data quality model. No mapping to ISO 19157 Data Quality

Page 12: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

The model shows the most relevant classes in DQV (source: W3C DQV ).

Page 13: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

Quality in eGovernmentclass Hov eddiagram

(dqv :Metric)

Kvalitetsmål

(dqv :qualityMeasurement)

Måleresultat

(dqv :Dimension)

Kv alitetsdimensjon

(dqv :QualityAnnotation)

Kv alitetsnote

(dcat:datasett eller dcat:Distribution)

Dataset

(dct:Standard)

Standard/Spesifikasjon

(oa:TextualBody)

TekstligBeskriv else

(dqv :UserQualityFeedback)

BrukerKv alitetstilbakemelding

(oa:Motiv ation)

Motiv asjon(dqv :Dimension)

Kv alitetsdeldimensjon

Hvorav den ene må være

dqv:QualityAssessment

(dqv:InDimension)

erIDimensjon

0..*

(dqv:IsMeasurementOf)

erMåleresultatAv 1

(dqv:inDimension)

erIDimensjon0..*

(dqv:hasQualityMeasurement)

harKvantifiserbarKvalitet

0..*

(dqv:inDimension)

erIDelDimensjon 1

(dct:conformsTo)

iSamvarMed 0..*

(oa:motivatedBy)

motivertAv1..*

(dqv:hasQualityAnnotation)

harKvalitetsnote

0..*

+skos:broader

1

(oa:hasBody)

harTekst 0..*

Page 14: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

Examples:

Quality sub-dimension

RDF

Currentness <> dqv:hasQualityAnnotation [ a dqv:QualityAnnotation ; # kvalitetsnotedqv:inDimension iso:Currentness ; oa:hasBody [

rdf:value=”Enhetsregisteret er kontinuerlig oppdatert, men egenskapen antall ansatte oppdateres månedlig fra Aa-registeret”@no;

] .] .

Conformance <>dcat:conformsTo [ skos:prefLabel “Produktspesifikasjon NVE flomsoner

1.0”@no rdfs:seeAlsohttp://sosi.geonorge.no/Produktspesifikasjoner/Produktspesifikasjon_NVE_Flomsoner_1%200.pdf

] .

Page 15: From Quality on spatial data to data quality vocabulary (DQV) · 2018-06-22 · that a potential user of a dataset can make his/her own judgment about its fitness for purpose. Mapping

Examples:

Quality sub-dimension

RDF

Completeness <> dqv:hasQualityAnnotation [ a dqv:QualityAnnotation ; dqv:inDimension iso:Completeness ; oa:hasBody [ rdf:value=”Enhetsregisteret inneholder ikke slettede selskaper før 1994.”@no; ] . ] .