linked open data : opportunités et défis par makx dekkers

43
Linked (Open) Data Opportunities and challenges Makx Dekkers [email protected]

Upload: abes

Post on 29-Jan-2018

2.467 views

Category:

Education


0 download

TRANSCRIPT

Page 1: Linked Open Data : opportunités et défis par Makx Dekkers

Linked (Open) Data

Opportunities and challenges

Makx Dekkers

[email protected]

Page 2: Linked Open Data : opportunités et défis par Makx Dekkers

Outline

• Basic notions

• Recent developments

• Comparing objectives

• Opportunities and risks

• Conclusions

© 2011 Makx Dekkers Journeés ABES 2011 2

Page 3: Linked Open Data : opportunités et défis par Makx Dekkers

BASIC NOTIONS

© 2011 Makx Dekkers Journeés ABES 2011 3

Page 4: Linked Open Data : opportunités et défis par Makx Dekkers

The idea and its history

• 1989: Tim Berners-Lee already talked about linking documents and data together (http://www.w3.org/History/1989/proposal.html)

• 2001: Tim Berners-Lee and Ora Lassila introduced the “Semantic Web” (http://www.scientificamerican.com/article.cfm?id=the-semantic-web)

• 2006: Tim Berners-Lee presented the initial design issues (rules) for Linked Data (http://www.w3.org/DesignIssues/LinkedData.html)

© 2011 Makx Dekkers Journeés ABES 2011 4

Page 5: Linked Open Data : opportunités et défis par Makx Dekkers

W3C Semantic Web initiative

• Objective– to create a universal medium for the exchange of

data […] to smoothly interconnect personal information management, enterprise application integration, and the global sharing of commercial, scientific and cultural data

• Main results– Resource Description Framework (RDF), RDFa

(RDF-in-HTML), SPARQL Query Language

© 2011 Makx Dekkers Journeés ABES 2011 5

Page 6: Linked Open Data : opportunités et défis par Makx Dekkers

Core Linked Data Specifications

• Transport– HTTP Hypertext Transfer Protocol

• Identification– URI Uniform Resource Identifier

• Description and linking– RDF Resource Description Framework

• Search and access– SPARQL Query Language for RDF

© 2011 Makx Dekkers Journeés ABES 2011 6

Page 7: Linked Open Data : opportunités et défis par Makx Dekkers

The four rules of Linked Data

• TBL’s recommendations:1. Use URIs as names for things

2. Use HTTP URIs so that people can look up those names

3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL)

4. Include links to other URIs so that they can discover more things

© 2011 Makx Dekkers Journeés ABES 2011 7

Page 8: Linked Open Data : opportunités et défis par Makx Dekkers

The basic model of RDF

• Resource Description Framework “triple”:– Subject: the “thing” (resource) described

– Predicate: the characteristic of the resource

– Object: the value of the characteristic

Subject ObjectPredicate

© 2011 Makx Dekkers Journeés ABES 2011 8

Page 9: Linked Open Data : opportunités et défis par Makx Dekkers

Complex structures in RDF

This presentation

Makx Dekkers Barcelona

Journées ABES

ABES

Montpellier

17-18 May 2011

presenter

partOf organizer location

hometown

date

location

© 2011 Makx Dekkers Journeés ABES 2011 9

Page 10: Linked Open Data : opportunités et défis par Makx Dekkers

Linked (Open / Enterprise) Data

• Commonalities– Using Semantic Web technologies (RDF)

– Linking information resources, people, places

• Differences– Open Data with open licenses; Enterprise Data

mostly for closed, controlled environments

– Open Data links to other Open Data, available for external use; Enterprise Data may link to external data but not openly available for external use

© 2011 Makx Dekkers Journeés ABES 2011 10

Page 11: Linked Open Data : opportunités et défis par Makx Dekkers

Linked Data -- Open Data

• Linked Data: focus on technology– Semantic Web: Resource Description Framework,

and other Web standards

– Final solutions still under development

• Open Data: focus on strategy– Based on notion that sharing is important and

benefits all

– Technology is secondary

© 2011 Makx Dekkers Journeés ABES 2011 11

Page 12: Linked Open Data : opportunités et défis par Makx Dekkers

The five-star system

Source: http://inkdroid.org/journal/2010/06/04/the-5-stars-of-open-linked-data/

© 2011 Makx Dekkers Journeés ABES 2011 12

Page 13: Linked Open Data : opportunités et défis par Makx Dekkers

The LOD diagram: 2007

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

© 2011 Makx Dekkers Journeés ABES 2011 13

25 datasets

Page 14: Linked Open Data : opportunités et défis par Makx Dekkers

The LOD diagram: 2008

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

© 2011 Makx Dekkers Journeés ABES 2011 14

45 datasets

Page 15: Linked Open Data : opportunités et défis par Makx Dekkers

The LOD diagram: 2009

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

© 2011 Makx Dekkers Journeés ABES 2011 15

95 datasets

Page 16: Linked Open Data : opportunités et défis par Makx Dekkers

The LOD diagram: 2010

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

© 2011 Makx Dekkers Journeés ABES 2011 16

203 datasets

Page 17: Linked Open Data : opportunités et défis par Makx Dekkers

RECENT DEVELOPMENTS

© 2011 Makx Dekkers Journeés ABES 2011 17

Page 18: Linked Open Data : opportunités et défis par Makx Dekkers

W3C communities

• LinkingOpenData SWEO Community Project– Goal: to extend the Web with a data commons by

publishing various open data sets as RDF on the Web and by setting RDF links between data items from different data sources (http://www.w3.org/wiki/SweoIG/TaskForces/CommunityProjects/LinkingOpenData)

• Library Linked Data Incubator Group– to help increase global interoperability of library

data on the Web (http://www.w3.org/2005/Incubator/lld/)

© 2011 Makx Dekkers Journeés ABES 2011 18

Page 19: Linked Open Data : opportunités et défis par Makx Dekkers

More W3C communities

• Government Linking Data Working Group– to provide standards and other information which

help governments around the world publish their data as effective and usable linked data (http://www.w3.org/2011/gld/charter)

• Semantic Web Health Care and Life Sciences (HCLS) Interest Group– to develop, advocate for, and support the use of

Semantic Web technologies for health care and life science (e.g. biology, medicine) (http://www.w3.org/2001/sw/hcls/)

© 2011 Makx Dekkers Journeés ABES 2011 19

Page 20: Linked Open Data : opportunités et défis par Makx Dekkers

Open Knowledge Foundation, okfn.org

• not-for-profit organization promoting open knowledge: any kind of data and content that can be freely used, reused, and redistributed

• Working and Interest Groups, e.g.– Open Data in Science, Open Government Data,

Open Bibliographic Data, Cultural Heritage etc.

• CKAN.net: registry of open datasets and other “knowledge resources”

© 2011 Makx Dekkers Journeés ABES 2011 20

Page 21: Linked Open Data : opportunités et défis par Makx Dekkers

Linked Data initiativesPredicate vocabularies (descriptors)

Research Description and Access (RDA) http://metadataregistry.org/rdabrowse.htm

The Bibliographic Ontology (BIBO) http://bibliontology.com/

Dublin Core http://dublincore.org/

Object vocabularies (values)

Virtual International Authority File (VIAF) http://viaf.org/

Library of Congress authorities http://id.loc.gov/authorities/

AgroVOC (agricultural terminology) e.g. http://aims.fao.org/aos/agrovoc/c_550

DBPedia (based on Wikipedia) e.g. http://dbpedia.org/page/Montpellier

Bibliographic data

LIBRIS Sweden e.g. http://libris.kb.se/library/S

British Library http://www.bl.uk/bibliographic/datasamples.html

CrossRef (DOI metadata) http://www.crossref.org/CrossTech/linked_data/

© 2011 Makx Dekkers Journeés ABES 2011 21

Page 22: Linked Open Data : opportunités et défis par Makx Dekkers

More Linked Data initiativesBroadcasting, publishing

BBC http://www.bbc.co.uk/blogs/bbcinternet/linked_data/

New York Times http://data.nytimes.com/

Governments (small sample)

USA http://data.gov/

France http://opendata.paris.fr/

Finland http://data.suomi.fi/

UK http://data.gov.uk/

Spain (Cataluña) http://dadesobertes.gencat.cat/

Norway http://data.norge.no

Netherlands http://www.overheid.nl/opendata

Australia http://data.gov.au/

© 2011 Makx Dekkers Journeés ABES 2011 22

Page 23: Linked Open Data : opportunités et défis par Makx Dekkers

COMPARING OBJECTIVES

© 2011 Makx Dekkers Journeés ABES 2011 23

Page 24: Linked Open Data : opportunités et défis par Makx Dekkers

Strategic aspects Linked Data

• Achieving global interoperability with minimal coordination

• Aggregating human knowledge

• Supporting democracy, transparency and accountability

• Enhancing and enriching information

• Enabling user-driven and user-generated applications

© 2011 Makx Dekkers Journeés ABES 2011 24

Page 25: Linked Open Data : opportunités et défis par Makx Dekkers

Strategic aspects libraries

• Organizing information for use by specific users for specific goals

• Ensuring and maintaining quality

• Sustaining services economically

• Preserving information for the long term

• Providing trusted services

© 2011 Makx Dekkers Journeés ABES 2011 25

Page 26: Linked Open Data : opportunités et défis par Makx Dekkers

Functional aspects Linked Data

• Searching distributed collections

• “Following your nose” – navigating links between pieces of content

• Distributing responsibility for making statements about things

• Leaving to the user whom and what to trust

• Leaving development of products and services to an open market (apps)

© 2011 Makx Dekkers Journeés ABES 2011 26

Page 27: Linked Open Data : opportunités et défis par Makx Dekkers

Functional aspects libraries

• Describing information by professionals

• Bringing together and managing aggregations of information

• Selecting relevant information

• Mixing analogue and digital resources

© 2011 Makx Dekkers Journeés ABES 2011 27

Page 28: Linked Open Data : opportunités et défis par Makx Dekkers

Technical aspects Linked Data

• Publishing and using machine-readable statements (“data that speak for themselves”)

• Focusing on Semantic Web technology

• Enabling inferences across large distributed data sets

• (Still to be done) Solving issues around harvesting, caching and real-time updating

© 2011 Makx Dekkers Journeés ABES 2011 28

Page 29: Linked Open Data : opportunités et défis par Makx Dekkers

Technical aspects libraries

• Using proven technology to provide high-quality services

• Managing production systems and services

• Guaranteeing performance, uptime, consistency across data

© 2011 Makx Dekkers Journeés ABES 2011 29

Page 30: Linked Open Data : opportunités et défis par Makx Dekkers

Agility versus sustainability

• In the Linked Data space:– Things move fast– Trial-and-error– Lots of development by volunteers (hackers)

• In the library domain:– Operational systems need to evolve– Need to handle legacy data– Development by professionals in managed

projects

© 2011 Makx Dekkers Journeés ABES 2011 30

Page 31: Linked Open Data : opportunités et défis par Makx Dekkers

Data versus services

• In the Linked Data space:– Focus on availability of “raw data”

– Quality is secondary

– Data and technology should lead to useful results

• In the library domain:– Focus on services

– Quality is essential

– Data and technology in support of the service

© 2011 Makx Dekkers Journeés ABES 2011 31

Page 32: Linked Open Data : opportunités et défis par Makx Dekkers

Economic aspects

• In the Linked Data space:– “Information wants to be free” – a human right?

– Short-term thinking: today is hot, yesterday is not

– Focus on applications to create value out of data

• In the library domain:– Long-term view: sustainability is crucial

– Public money to provide community services

– Expected to do more with less money

© 2011 Makx Dekkers Journeés ABES 2011 32

Page 33: Linked Open Data : opportunités et défis par Makx Dekkers

OPPORTUNITIES AND RISKS

© 2011 Makx Dekkers Journeés ABES 2011 33

Page 34: Linked Open Data : opportunités et défis par Makx Dekkers

Strong points Linked Data

• Attempt to create a common technical platform for machine-readable data

• Lots of enthusiasm in publishing open data

• Promise of global interoperability

• Mix of researchers, user communities, hackers, professional data providers

• High visibility on political level

© 2011 Makx Dekkers Journeés ABES 2011 34

Page 35: Linked Open Data : opportunités et défis par Makx Dekkers

Risks Linked Data

• Driven by technology, not by requirements

• Technology may not (yet) be stable – RDF 2.0?

• Operational issues far from solved (reliability, performance, quality, security, trust)

• Hope for general agreement across domains may not be realistic

• Promise may turn into disappointment

© 2011 Makx Dekkers Journeés ABES 2011 35

Page 36: Linked Open Data : opportunités et défis par Makx Dekkers

Strong points libraries

• Long time operational experience in managing information

• Professional intermediaries between users and information needs

• Sustainable business models (albeit with eternally shrinking budgets)

• Long-term perspective: the past (legacy data) as well as the future (preservation)

© 2011 Makx Dekkers Journeés ABES 2011 36

Page 37: Linked Open Data : opportunités et défis par Makx Dekkers

Risks libraries

• Technologies change rapidly

• New skills difficult to spread through the organization

• Some people see libraries as a thing of the past (“the book museum”)

• Underestimation of information handling skills

• Information overload, human intervention does not scale, need for better tools

© 2011 Makx Dekkers Journeés ABES 2011 37

Page 38: Linked Open Data : opportunités et défis par Makx Dekkers

Meeting both worlds

• An example: Europeana.eu– Started out with domain perspectives (libraries,

archives, museums, audiovisual archives)

– “Traditional” approach (metadata mappings) works but insufficient

– Using Linked Data approach preserves domain specifics but allows for generalization to support common services

– Cross-domain (but co-ordinated) interoperability

© 2011 Makx Dekkers Journeés ABES 2011 38

Page 39: Linked Open Data : opportunités et défis par Makx Dekkers

Europeana Data ModelClasses Properties

Simple example

Complex example

Source at:http://version1.europeana.eu/web/europeana-project/technicaldocuments/

© 2011 Makx Dekkers Journeés ABES 2011 39

Page 40: Linked Open Data : opportunités et défis par Makx Dekkers

CONCLUSION

© 2011 Makx Dekkers Journeés ABES 2011 40

Page 41: Linked Open Data : opportunités et défis par Makx Dekkers

Libraries and Linked Data

• Using Linked Data technology as the next step in connecting services

• Offering information management skills to the technology domain

• Creating a quality hub in the Linked Data space

© 2011 Makx Dekkers Journeés ABES 2011 41

Page 42: Linked Open Data : opportunités et défis par Makx Dekkers

Best of both worlds

• Libraries providing stability and sustainability to Linked Data spaces

• Library professionals helping to manage the distributed collections

• Libraries delivering high-quality linked data to the Web

• Technologists to provide the next generation of systems and tools

© 2011 Makx Dekkers Journeés ABES 2011 42

Page 43: Linked Open Data : opportunités et défis par Makx Dekkers

Linked (Open) Data: opportunity for libraries!

Thank you!

Makx [email protected]