from structured data to linked open governmental data

Post on 12-Apr-2017

598 Views

Category:

Technology

3 Downloads

Preview:

Click to see full reader

TRANSCRIPT

?

Dongpo Deng

dongpo@iis.sinica.edu.tw

5

http://5stardata.info/tw/

MS EXCEL XLS

CSV

(Web of Data)

:

(Resource Description Framework, RDF)

64

siteAddress

Subject Predicate Object

TripleURL

URL

URL

HTTP URL and URL

• HTTP URIs, in the web architecture, have been used to denote documents -- "web pages" informally, or "information resources" more formally.

• However, with the growth of the Semantic Web, which uses URIs to denote anything at all, the urge to use and practice of using HTTP URIs for arbitrary things grew steadily.

https://www.w3.org/DesignIssues/HTTP-URI.html

(ontology)• 建⽴知識本體

• 定義語彙,清楚表達資料,使資料能夠相互連結

http://lov.okfn.org/dataset/lov/

URI

(Resource Description Framework, RDF)

RDF

Resource URI: http://www.epa.gov.tw/resource/Taiepi_Weather_Station

db:Taipei a lodtw:UV_Site ; rdfs:isDefinedBy <http://www.epa.gov.tw/data/Taiepi_Weather_Station> ; rdfs:label “Taipei_Weather_Station” ; <http://lod.tw/ontologies/weather_stations.owl#1st_Administration> “ ” <Taipei_City> ; <http://lod.tw/ontologies/weather_stations.owl#2nd_Administration> “ ” <Zhongzheng_Distract> ; lodtw:Government_Organization “ ” ; lodtw:Weather_Station “ ” ;lodtw:id 6 ;wgs84_pos:lat “25.037583”^^xsd:float ;wgs84_pos:long “121.514861”^^xsd:float ;card:Address “ 64 ” ;foam:page <http://www.epa.gov.tw/page/Taiepi_Weather_Station>

Taiepi_Weather_Station

(Linked Data)

• Tim Berners-Lee

1. URI

2. HTTP URI

3. URI (RDF, SPARQL)

4. URI

https://www.w3.org/DesignIssues/HTTP-URI.html

http://blog.okfn.org/category/working-groups/wg-archaeology/

URIURI

(Linked Open Data, LOD)!!

Resource URI: http://www.epa.gov.tw/resource/Taiepi_Weather_Station

Taiepi_Weather_Station

Resource URI: http://lod.tw/resource/Taiepi_City

DBpedia

• DBpedia is a community effort to extract structured “infobox” information from Wikipedia

Academia Sinica in Wikipedia

http://en.wikipedia.org/wiki/Academia_Sinica

Academia Sinica in DBpedia

http://dbpedia.org/page/Academia_Sinica

LOD Cloud (2007-05-01)12 data sets

LOD Cloud (2008-02-28)32 data sets

LOD Cloud (2009-03-05)93 data sets

LOD Cloud (2010-09-22)203 data sets

LOD Cloud (2011-09-19)

LODE-Taiwan Biodiversity Dataset

295 data sets

LOD Cloud (2014-08-30)570 data sets

Government Linked Open Data

Government Linked Open Data

http://hangingtogether.org/?p=5206

90

0 5 10 15 20 25 30 35 40 45

USA

Spain

UK

The Netherlands

Norway

Canada

Australia

France

Germany

Italy

Switzerland

Austria

Czech Republic

Hungary

Ireland

Japan

Malaysia

Portugal

Singapore

Sweden

Linked Data Survey Respondents

Karen Smith-Yoshimura (2015) Linked Data Implementations— Who, What and Why?, OCLC Research

0

5

10

15

20

25

Academic library

National library Network Government Scholarly Public Library Museum Other

2014 2015

Karen Smith-Yoshimura (2015) Linked Data Implementations— Who, What and Why?, OCLC Research

2015

Steep learning curve for staff

40

Inconsistency in legacy data

33

Selecting appropriate ontologies to represent our data

31

Establishing the links

27

Little documentation or advice on how to build the systems

21

Karen Smith-Yoshimura (2015) Linked Data Implementations— Who, What and Why?, OCLC Research

?

2015 2014

Expose to larger audience on the Web

67 45

Demonstrate what could be done with datasets as linked data

59 41

Heard about linked data and wanted to try it out by exposing our data as linked data.

43 21

See if publishing linked data would improve our Search Engine Optimization (SEO.)

29 9

Karen Smith-Yoshimura (2015) Linked Data Implementations— Who, What and Why?, OCLC Research

?2015 2014

Provide our users with a richer experience.

51 35

Enhance our own data by consuming linked data from other sources.

50 37

More effective internal metadata management.

32 16

Greater accuracy and scope in our search results

27 12

See if consuming linked data would improve our Search Engine Optimization (SEO).

19 12

Experiment with combining different types of data into a single triple store.

17 15

Heard about linked data and wanted to try it out by using linked data sources.

17 13

2015

VIAF (Virtual International Authority File) 41

DBpedia 36

GeoNames 35

id.loc.gov 35

Resources we convert to linked data ourselves 17

Getty's AAT 16

FAST (Faceted Application of Subject Terminology) 15

WorldCat.org 15

data.bnf.fr 12

Deutsche National Bib Linked Data Service 12

Karen Smith-Yoshimura (2015) Linked Data Implementations— Who, What and Why?, OCLC Research

• URI

• BaseURI http://geo.lod.tw• TBoxURIs http://geo.lod.tw/ontology/

{class|property} • ABoxURIs

http://geo.lod.tw/resource/Name/

(Ontologies)

• OWL RDFs

• Protege

• 要連到其它資料,要清楚資料的脈絡關係,以找到可以連結的資料

• 基本上要梳理資料脈絡關係,可由三個⼤⽅向著⼿,資料的時間特性、資料的空間特性、和資料的主題

• ⼯具 • Silk • http://wifo5-03.informatik.uni-mannheim.de/bizer/silk/

• LIME • http://aksw.org/Projects/LIMES.html

• CSV and spreadsheets

• RDF extension of Google Refine, XLWrap, RDF123, NOR2O

• RDB

• D2R Server, ODEMapster, W3C RDB2RDF WG – R2RML

• XML

• GRDDL, ReDeFer

• http (accessibility)

(derefencability)

• namespace vocabulary

• RDF stores and SPARQL endpoints

• Jena, Virtuoso, Sesame,4Store, OWLIM, BBN Parliament

• linked-data front-end services

• Pubby, TalisPlatform, Fuseki, D2RQ

MySQLRDB

D2R Pubby

VirtuosoRDF store

Web

HTML RDF

RelFinder

dongpo.deng@gmail.comtwitter: @dongpo

facebook: dongpo.deng

Slides are available on http://tinyurl.com/hobd6az

top related