an introduction to the semantic web and the web of...

26
An Introduction to the Semantic Web and the Web of Data Prof. Dr. Philippe Cudré-Mauroux Director, eXascale Infolab University of Fribourg, Switzerland May 12, 2011,VeriSign EMEA,Villars-sur-Glane, Switzerland

Upload: others

Post on 07-Jun-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

An Introduction to the Semantic Web and the Web of Data

Prof. Dr. Philippe Cudré-MaurouxDirector, eXascale Infolab

University of Fribourg, Switzerland

May 12, 2011, VeriSign EMEA, Villars-sur-Glane, Switzerland

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Page 2: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Overview■The Vision behind the Semantic Web■Applications■Building Blocks■RDF/S■SPARQL■OWL

■A Few Research Topics■idMesh■GraphDB

■Q&A

2

Page 3: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

My Background in 10 Bullet Points■HP Europe■EPFL M.Sc., Eurécom Institute + INRIA M.Sc.■IBM Watson Research (NY) ■EPFL Ph.D. (best thesis award 2006, EPFL Press Mention 2006)

■Emergent Semantics, RDF PDMS■U.C. Berkeley■Microsoft Research Asia■M.I.T.■Database Systems

■Swiss NSF Prof. at Uni Fribourg

3

Page 4: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

SW and ME

■ Interested for years in sharing data on the Web■RDF/S is the only reasonable solution!

■ International Semantic Web Conference (ISWC)■Co-organizer in 2007, 2009, 2010■Vice-Chair in 2011 (Bonn)■PC-Chair in 2012 (Boston)■Hope to see you there!

■Research Topics■Decentralized Data Integration■RDF Storage (P2P/Cloud)■Semantic DNS

4

Page 5: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

The Semantic Web Vision

■Provide interoperable, machine-processable data on the Web■Machines cannot meaningfully parse HTML pages■Relational DBMSs were designed to be centralized

(implicit schema, local data, outside of the Web)■XML is document-oriented

■Pushed by Sir Tim Berners-Lee■Standardized by the W3C (RDF/S, OWL)■Now promoted by major vendors & actors■Google, Yahoo!, Facebook, Oracle, ■Pharmaceuticals, Governments, News industry, etc.

5

Page 6: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Advantages

■Standardized!■Flexible schemas■Also supports highly-expressive ontologies■ inference!

■Easy to ■export (Rel2RDF)■publish (SPARQL end-point)■discover (SW search-engines)■consume (standard libraries)■integrate (links bw datasets)

6

Page 7: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Application (1/3): Knowledge Bases■US National Center for Biotechnology

Information, "Oncology Metathesaurus"■50,000+ classes, ~8 people supporting full time■OWL DL rigorously followed■Provably consistent

■BBC World Cup 2010 Web Site■Dynamic publishing

7

Page 8: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Applications (2/3): Web Metadata

8

■RNews■New IPTC

standard (News)■Better news search■Better news alerts■Better ad

placement■Better analytics

Page 9: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Applications (2/3): Web Metadata

■Metadata parsing supported by major search engines

■Also: Facebook Open Graph API

9

Page 10: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Application (3/3): Decentralized DBMS

10

■ Taken from data.gov■ Easy publication and integration of data

Page 11: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Application (3/3): Decentralized DBMS

■For the first time, a giant Web of Data is emerging!

➡Linked Open Data

11

Page 12: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Building Block 1: RDF

■Triples!

1:subject, 2:predicate, 3:object

ex.: philippe made idmesh_paper:1: http://data.semanticweb.org/person/philippe-cudre-mauroux2: http://xmlns.com/foaf/0.1/made

3: http://data.semanticweb.org/conference/www/2009/paper/60

■Everything is a URI!■Literals / XML data types also possible for object■That’s it!

12

Page 13: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Naturally Forms Distributed Graphs

13

Graphs © Sandro Hawke, W3C

Page 14: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

A Bit of Structure

■RDF Schema (RDFS)■Like a DB schema but

Web-based■Discoverable, sharable

■Defines classes, subclasses, properties, lists, etc.■Vocabularies

(schemas) can be global (foaf) or local

14

Page 15: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

RDF Schema Example

15■Provides structure! easier to check, process

Page 16: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

XML Serialization

■Exchange format■Not meant for humans (ugly!)■Not meant for DBMS (verbose!)

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:db="http://dbpedia.org/resource/"> <rdf:Description rdf:about="http://dbpedia.org/resource/Massachusetts"> <db:Governor> <rdf:Description rdf:about="http://dbpedia.org/resource/Deval_Patrick" /> </db:Governor> <db:Nickname>Bay State</db:Nickname> <db:Capital> <rdf:Description rdf:about="http://dbpedia.org/resource/Boston"> <db:Nickname>Beantown</db:Nickname> </rdf:Description> </db:Capital> </rdf:Description></rdf:RDF>

16

Page 17: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

RDFa

■Embedding RDF information in HTML pages■Supported by Google, Yahoo, etc.

<body> <div about="http://dbpedia.org/resource/Massachusetts">The Massachusetts governor is <span rel="db:Governor">! <span about="http://dbpedia.org/resource/Deval_Patrick">Deval! Patrick! </span>, </span> the nickname is "<span property="db:Nickname">Bay State</span>", and the capital <span rel="db:Capital">! <span about="http://dbpedia.org/resource/Boston">! has the nickname "<span property="db:Nickname">Beantown</span>".! </span> </span> </div> </body>

17

Page 18: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Building Block 2: SPARQL

■Declarative query language for RDF/S■SQL for the Semantic Web!

prefix db: <http://dbpedia.org/resource/>prefix dbo: <http://dbpedia.org/ontology/>

SELECT ?cap WHERE { db:Massachusetts dbo:capital ?cap }

18

Page 19: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Building Block 3: OWL

■The Web Ontology Language■Very expressive schemas! (ontologies)■Description Logics■ ... and several flavours

■Example: OWL 2 EL axioms:■ class inclusion (SubClassOf)■ class equivalence (EquivalentClasses)■ class disjointness (DisjointClasses)■ object property inclusion (SubObjectPropertyOf) with or without property chains, and data property inclusion (SubDataPropertyOf)■ property equivalence (EquivalentObjectProperties and EquivalentDataProperties),■ transitive object properties (TransitiveObjectProperty)■ reflexive object properties (ReflexiveObjectProperty)■ domain restrictions (ObjectPropertyDomain and DataPropertyDomain)■ range restrictions (ObjectPropertyRange and DataPropertyRange)■ assertions (SameIndividual, DifferentIndividuals, ClassAssertion, ObjectPropertyAssertion, DataPropertyAssertion,

NegativeObjectPropertyAssertion, andNegativeDataPropertyAssertion)■ functional data properties (FunctionalDataProperty)■ keys (HasKey)

■ Inference! ex.: TransitiveObjectProperty(hasAncestor)hasAncestor(x, y) ∧ hasAncestor(y, z) → hasAncestor(x, z)

19

Page 20: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Research Topic 1:idMesh--Disambiguation of URIs

■The World, modeled as a collection of identifiers■Linked Data■Semantic Web■RESTful services■...

http://data.semanticweb.org/person/philippe-cudre-mauroux

http://data.semanticweb.org/conference/www/2009/paper/60

foaf:made

20

Page 21: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Naming & Decentralization

■The great thing about unique identifiers is that there are so many to choose from■Top-down efforts to impose canonical ontological

names (instances) have failed largely■Decentralized naming game■Soaring dimensions in Web 2.0 / 3.0 contexts

http://semanticweb.org/id/Philippe_Cudre-Maurouxhttp://data.semanticweb.org/person/philippe-cudre-maurouxhttp://people.csail.mit.edu/pcm/i http://lsirpeople.epfl.ch/pcudre/ihttp://semanticweb.org/wiki/Special:ExportRDF/Philippe_Cudr%C3%A9-Maurouxhttp://tw.rpi.edu/wiki/Special:ExportRDF/Philippe_Cudr%C3%A9-Mauroux http://wiki.ontoworld.org/index.php/Special:ExportRDF/Philippe_Cudr%C3%A9-Maurouxhttp://korrekt.org/index.php/Special:ExportRDF/Philippe_Cudr%C3%A9-Maurouxhttp://prauw.cs.vu.nl:8080/flink/graph?profile=http%3A%2F%2Fwww.cs.vu.nl%2F%7Epmika%2Fsocionet%23Philippe%2BCudre-Maurouxhttp://www.zoominfo.com/PersonID=402960578 http://www.flickr.com/photos/28735...@N00/http://www.facebook.com/profile.php?id=1251943... .......

ID Ju

ngle

21

Page 22: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Entity Consolidation

■Urgent need to disambiguate / consolidate URIs■URI directories■semantic DNS ?

■Pairwise, decentralized mechanisms

22

...

<rdfs:Class rdf:ID="Entity"/>

<rdf:Property rdf:ID="idMeshProperty"> <rdfs:domain rdf:resource="#Entity" /> <rdfs:range rdf:resource="#Entity" />

</rdf:Property>

<rdf:Property rdf:ID="LinkConfidence"> <rdfs:domain rdf:Statement /> <rdfs:range rdf:datatype="&xsd;decimal" />

</rdf:Property>

<rdf:Property rdf:ID="EquivalentTo"> <rdfs:subPropertyOf rdf:resource="#idMeshProperty" />

</rdf:Property>

<rdf:Property rdf:ID="NotEquivalentTo"> <rdfs:subPropertyOf rdf:resource="#idMeshProperty" />

</rdf:Property>

<rdf:Property rdf:ID="Predates"> <rdfs:subPropertyOf rdf:resource="#EquivalentTo" />

</rdf:Property>

<rdf:Property rdf:ID="Postdates"> <rdfs:subPropertyOf rdf:resource="#EquivalentTo" />

</rdf:Property>

<rdf:Property rdf:ID="Equidates"> <rdfs:subPropertyOf rdf:resource="#EquivalentTo" />

</rdf:Property>

<rdf:Description rdf:about="http://www.epfl.ch/"><idMesh: NotEquivalentTo rdf:ID="link0001"

rdf:resource="http://www.ethz.ch"/></rdf:Description>

<rdf:Description rdf:about="http://www.epfl.ch/"><idMesh:EquivalentTo rdf:ID="link0002"

rdf:resource="http://en.wikipedia.org/wiki/EPFL"/></rdf:Description>

<rdf:Description rdf:about="#link0002"><idMesh:LinkConfidence

rdf:datatype="&xsd;decimal"> 0.9 </idMesh:LinkConfidence></rdf:Description>

Page 23: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

idMesh: Social Disambiguation

lk1-2

gc2 ( ) gc3 ( )gc1 ( )

lk1-3 lk1-4 lk2-4 lk3-4

vc1

ts1ts2

Trust Values

for Sources

Initial

Link Values

Combined

Value Functions /

Priors for Links

Trust

Constraint

Inferred

Link Values

Graph

Constraints

iii)

Rep

uta

tio

n-B

ase

d T

rust

Man

ag

emen

tC

on

stra

int-

Sa

tisf

act

ion

lk1-2

e1

e4

lk1-3

lk3-4lk2-4

lk1-4

i)

ii)

e2 e3

lk1-2

s1 s2

lk1-3 lk1-4 lk2-4 lk3-4

cv1 ( ) cv2 ( ) cv3 ( ) cv4 ( ) cv5 ( )

tc1 ( )

vc2vc3

vc4vc5

vc6

c1 c2 c3 c5 c6c4

Source Graph

Entity Graph

Probabilistic, social inference based on graphscf. World Wide Web 2009 paper

23

Page 24: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Research Topic 2: GraphDB

■RDF has a data model that is very different from relational data or XML■Urgent need to develop highly scalable, native

stores

■GraphDB, a native graph-based RDF store■on-the-fly schemas■novel graph indexing ■highly-compressed template-based data■dedicated query optimizer■extensions for collaborative cloud storage

➡ 2 orders of magnitude faster than Oracle RDF on most queries!

24

Page 25: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Conclusions

■Semantic Web provides machine-processable data to the Web■Standardized■Currently growing exponentially■Major companies are jumping on the bandwagon■Many exciting research / R&D opportunities

■To go one step further: invited tutorial @ ISWChttp://people.csail.mit.edu/pcm/SemWebTutorial.html

25

Page 26: An Introduction to the Semantic Web and the Web of Datapeople.csail.mit.edu/pcm/Media/IntroSemWeb2011.pdf · 2011-05-25 · An Introduction to the Semantic Web and the Web of Data

UNI V ER S I T Y O F FR IBOURG / FACULT Y O F SC IENCE

Thanks for you Attention!

■Q&A

26