connecting heterogeneous collections using linked data

24
Connecting Heterogeneous Collections using Linked Data Victor de Boer Web & Media Group, CS, Vrije Universiteit Amsterdam Netherlands Institute for Sound and Vision

Upload: victor-de-boer

Post on 15-Apr-2017

80 views

Category:

Education


1 download

TRANSCRIPT

Page 1: Connecting Heterogeneous Collections using Linked Data

Connecting Heterogeneous Collections using Linked Data

Victor de BoerWeb & Media Group, CS, Vrije Universiteit Amsterdam

Netherlands Institute for Sound and Vision

Page 2: Connecting Heterogeneous Collections using Linked Data

The Problem:(historical) data is not integrated

• Researchers’ data is “lost” or published without reusability– In different physical locations– In different file formats– In different semantic structures

• We do not want to force one monolithic data model

• Flexible integration

• Re-use existing data sources

Page 3: Connecting Heterogeneous Collections using Linked Data

Linked Data for Digital History

• Represent heterogeneous datasets with their own data models in common format: Resource Description Format (RDF)– Link what can be linked

• re-use and re-usability

• Linked Data is the (technically) best way to publish and share your (research) data

OBJECT EVENT

PLACE

TIME

PERSON

CONCEPT

PROVENANCE

Page 4: Connecting Heterogeneous Collections using Linked Data

ww

w.w

3.org/designissues/linkeddata.html

Page 5: Connecting Heterogeneous Collections using Linked Data

URIsUse Web URIs for things you want to talk about.

http://rijksmuseum.nl/data/painting1

My application can go there using HTTP (the Web) and I get information about it

HTML page for humansRDF data for machines

rijks:Painting001

Page 6: Connecting Heterogeneous Collections using Linked Data

RDF Triples form Graphs

rijks:Painting001

geo:Haarlem

dcterms:spatial

dcterms:creatorrijks:Frans_Hals

147590geo:population

52.38084, 4.63683

geo:partOf

geo:Noord-Holland

geo:Netherlands

geo:coordinates

geo:part

Of

rijks:Painting002dcterms:creator

Page 7: Connecting Heterogeneous Collections using Linked Data

http://lod-cloud.net/

Page 8: Connecting Heterogeneous Collections using Linked Data

Dutch Ships and Sailors

KB NEWSPAPERS

Dutch-Asiatic Shipping “VOC Opvarenden”

Jur LeinengaMatthias van Rossum

Elbing voyagesArchangel voyages

Page 9: Connecting Heterogeneous Collections using Linked Data

HETEROGENEOUS but LINKED DATAMODELS

dss:Recordgzmvoc:Telling

gzmvoc:telling-1046-De_Berkel

__bnode_1

gzmvoc:aziatischeBemanning

dss:Shipgzmvoc:Schip

gzmvoc: schip-1046-De_Berkel

dss:has_shipgzmvoc:schip

"1046"

“Schip”

“De Berkel”

rdfs:labeldss:scheepsnaam

gzmvoc:scheepsnaam

dss:ShipTypegzmvoc:Scheepstype

gzmvoc: type-Shipdss:has_shiptype

gzmvoc:has_shiptype

gzmvoc:scheepstype

“21”

“Moorse mattroosen”

dss:azRegistratieKop

gzmvoc:azAantalMatrozen

gzmvoc:telling

gzmvoc:heeft DAS heenreis

dss:Recorddas:Voyage

das:voyage-1918_61sameAs

Integrate datasetsNo monolithic datamodel neededNo normalisation / dumbing down of data neededRetain original model and intent

Page 10: Connecting Heterogeneous Collections using Linked Data

Reuse: Links to other web documents

Historical Newspapershttp://delpher.nl

isReferencedBy

Page 11: Connecting Heterogeneous Collections using Linked Data

mdb:Schip1 mdb:Kof

mdb:scheepsType

das:ShipX das:Kofship

das:typeOfShip

Aat:Kof

Aat:Platbodems

owl:sameAs

skos: broader

Reuse: background knowledge

AAT = Art & Architecture Thesaurus http://www.getty.edu/research/tools/vocabularies/aat/

owl:sameAs

OWL = Web ontology languagehttps://www.w3.org/OWL/

Page 12: Connecting Heterogeneous Collections using Linked Data

Data analysis and visualisation

The interlinked knowledge graph makes it possible to efficiently investigate integrated research questions.

Page 14: Connecting Heterogeneous Collections using Linked Data

Wrap up1. Linked Data allows for flexible

integration of heterogeneous data, metadata and background knowledge media

2. (Re)use of web resources, vocabularies and ontologies allows for efficient and novel research questions

3. Data provenance fits DH requirements

Page 15: Connecting Heterogeneous Collections using Linked Data

Thank you

Page 16: Connecting Heterogeneous Collections using Linked Data
Page 17: Connecting Heterogeneous Collections using Linked Data

BIG ? LINKED? OPEN?

Page 18: Connecting Heterogeneous Collections using Linked Data

Wrap up• Graphs, not tables

• Web standards and technologies• URIs for things• RDF (triples) to describe these things and have links

• Distributed, heterogeneous (meta)data• Integrate datasets in a flexible way• Cross-collection, -institution, -domain

• Re-use background knowledge

• Provenance fits DH requirements well

• Knowledge graphs enable efficient investigation of integrated research questions.

Page 19: Connecting Heterogeneous Collections using Linked Data

BIG DATA

LINKED DATA

OPEN DATA

Page 20: Connecting Heterogeneous Collections using Linked Data

Four V’s of Big Data http://ww

w.ey.com

/GL/en/Services/Advisory/EY-big-data-big-opportunities-big-challenges

Page 21: Connecting Heterogeneous Collections using Linked Data

BIG DATA

LINKED DATA

VARIANCE as one of the V’s of Big Data

VOLUME, VELOCITY are challenges for LD

Page 22: Connecting Heterogeneous Collections using Linked Data
Page 23: Connecting Heterogeneous Collections using Linked Data

LINKED DATA

OPEN DATA

LINKED OPEN DATA

as the way to publish and reuse datasets across the Web

Page 24: Connecting Heterogeneous Collections using Linked Data

www.w3.org/designissues/linkeddata.html