linked opendata parisemantique.fr - 24062011
TRANSCRIPT
Linked Data
for Open Cities
Loc DIAS DA SILVA
[email protected]
http://parisemantique.fr
is your data 5 ?
Amsterdam Barcelona Berlin Paris
- deeply passionate about the web, its history, its future..- software engineer, GNU/Linux addict- responsible for the API team in Viadeo (http://www.viadeo.com)- not a designer (as you will see soon..)- semantic web evangelist, open-source software user- created some weeks ago the webapp http://parisemantique.fr
The semantic web paradigm
Paris Semantique feedbacks
Linked Open Cities
WEB before ~2000the document web
webmasters
readers
HTML
Netscape
unidirectional
to link documents
WEB '2.0'the social web
user-centric
prosumers
crowd sourcing
participation
friends
sharing
media
to link people
API
Web of Datause-case
http://www.dltk-teach.com/rhymes/sleeping-beauty/story.htm
The nicer story i've never read !I want more !
OK !
I want more !
http://www.imdb.com/title/tt1588398/
Highly ranked !
?!?!?!
http://www.rcpsych.ac.uk/mentalhealthinfoforall/problems/sleepproblems/sleepingwell.aspx
Recommended to you
????
:~(
=
?
ok !
media suitable for little girlssearching for princess stories...
Web of Data
to link knowledge
a unit of information can be atomicized :
as of RDF vocabulary :
an RDF triple
[] [] []
lorem ipsum
lang
type
all is URI
express the semanticsusing ontologies
FRBR
[] [] []
Loc
is a
person
viadeoprofile:002l8n979djxjeg
rdf:type
foaf:Person
viadeoprofile:loic.diasdasilvafacebook:loic.diasdasilva
owl:sameAs
foaf:Person
rdf:type
wikipedia:Semantic_Webfoaf:Personfoaf:Person
foaf:interests
(foaf:interests)
w3cpeople:Berners-Lee
dc:creator
(rdf:type)
viadeoprofile:jean-philippe.clement1
foaf:knows
twitter:opendataParis
owl:sameAs
(foaf:knows)
Loc DIAS DA SILVA
foaf:name
Mglcel
foaf:nickname
RDF expresses graphes, this is not about formatseveral serializations
Loc DIAS DA SILVA
RDF/XML
@prefix foaf: .@prefix viadeoprofile: .
viadeoprofile:002l8n979djxjeg a foaf:Person ; foaf:name Loc Dias Da Silva .
TURTLE
{ "http://www.viadeo.com/r/profile/002l8n979djxjeg" : { "http://www.w3.org/1999/02/22-rdf-syntax-ns#type": [ { "type" : "uri" , "value" : "http://xmlns.com/foaf/0.1/Person" } ], "http://xmlns.com/foaf/0.1/name": [ { "type" : "literal" , "value" : "Loc Dias Da Silva" } ] }}
JSON
RDFa : RDF in HTML attributes
Loc Dias Da Silva
xHTML
Loc Dias Da Silva
xHTML + RDFa
SPARQL : query RDF graphes
viadeoprofile:loic.diasdasilvardf:type
foaf:PersonLoc DIAS DA SILVAfoaf:name
@prefix foaf: @prefix wikipedia:
SELECT ?theName WHERE {?person a foaf:Person .?person foaf:interests .?person foaf:name ?theName .}
wikipedia:Semantic_Webfoaf:interests
SPARQL
PREFIX pswcity: SELECT DISTINCT ?name ?add_number ?add_waynameWHERE { ?s pswcity:name ?name. ?s pswcity:address ?address. ?address pswcity:address_number ?add_number. ?address pswcity:in_way ?way. ?way pswcity:fullName ?add_wayname. FILTER ( ?s = ) .}
Linked data principles
Use URIs to identify things.
Use HTTP URIs so that these things can be referred to and looked up ("dereferenced") by people and user agents.
Provide useful information about the thing when its URI is dereferenced, using standard formats such as RDF/XML.
Include links to other, related URIs in the exposed data to improve discovery of other related information on the Web.
Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
????
A big work has to be done in order to semantize current data :
- Different values for identifiation, ex Districts: '20e', 'XXe', '75020', 'PARIS 20', street names, etc..- Many datasets not documented, semantization = meaning, is difficult- Some cells containing differently formatted values, multiple lines on multiple cells- ~80% of ESRI geo data without metadata
Two main issues to resolve :
Objects identificationTypes determination
make your stuff available on the Web (whatever format) under an open license make it available as structured data (e.g., Excel instead of image scan of a table) use non-proprietary formats (e.g., CSV instead of Excel) use open standards from W3C (RDF and SPARQL) to identify things, so that people can point at your stuff link your data to other data to provide context
http://www.w3.org/DesignIssues/LinkedData.html
TBL :