From Structured Data to Linked Open Government Data
5-star open data
http://5stardata.info/tw/
MS Excel (XLS)
CSV
Web of Data
Resource Description Framework (RDF)
64
siteAddress
Subject Predicate Object
Triple
URL
URL
URL
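The subject-predicate-object structure can be sketched in a few lines of Python. This is only an illustration: the subject is the weather-station resource URI that appears in this deck, while the predicate URI under `http://example.org/ontology/` and the address literal are hypothetical placeholders, not values from the source data.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str             # URI of the thing being described
    predicate: str           # URI of the property
    obj: str                 # URI or literal value
    obj_is_uri: bool = False


def escape_literal(value: str) -> str:
    """Escape backslashes, quotes and newlines for an N-Triples literal."""
    return value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def to_ntriples(t: Triple) -> str:
    """Serialize one triple as a single N-Triples line."""
    obj = f"<{t.obj}>" if t.obj_is_uri else f'"{escape_literal(t.obj)}"'
    return f"<{t.subject}> <{t.predicate}> {obj} ."


station = Triple(
    subject="http://www.epa.gov.tw/resource/Taiepi_Weather_Station",
    predicate="http://example.org/ontology/siteAddress",  # hypothetical predicate URI
    obj="example address",                                # placeholder literal
)
print(to_ntriples(station))
```

When the object is itself a resource, pass `obj_is_uri=True` so that it is written in angle brackets rather than as a quoted literal.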
HTTP URL and URL
• HTTP URIs, in the web architecture, have been used to denote documents -- "web pages" informally, or "information resources" more formally.
• However, with the growth of the Semantic Web, which uses URIs to denote anything at all, the urge to use and practice of using HTTP URIs for arbitrary things grew steadily.
https://www.w3.org/DesignIssues/HTTP-URI.html
Ontology
• Build an ontology
• Define vocabularies that express the data clearly, so that datasets can be linked to one another
http://lov.okfn.org/dataset/lov/
URI
Resource Description Framework (RDF)
RDF
Resource URI: http://www.epa.gov.tw/resource/Taiepi_Weather_Station
db:Taipei a lodtw:UV_Site ;
    rdfs:isDefinedBy <http://www.epa.gov.tw/data/Taiepi_Weather_Station> ;
    rdfs:label "Taipei_Weather_Station" ;
    <http://lod.tw/ontologies/weather_stations.owl#1st_Administration> " " , <Taipei_City> ;
    <http://lod.tw/ontologies/weather_stations.owl#2nd_Administration> " " , <Zhongzheng_Distract> ;
    lodtw:Government_Organization " " ;
    lodtw:Weather_Station " " ;
    lodtw:id 6 ;
    wgs84_pos:lat "25.037583"^^xsd:float ;
    wgs84_pos:long "121.514861"^^xsd:float ;
    card:Address " 64 " ;
    foam:page <http://www.epa.gov.tw/page/Taiepi_Weather_Station> .
Taiepi_Weather_Station
Linked Data
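• Tim Berners-Lee's four Linked Data principles
1. Use URIs as names for things
2. Use HTTP URIs so that people can look up those names
3. When someone looks up a URI, provide useful information using the standards (RDF, SPARQL)
4. Include links to other URIs so that more things can be discovered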
https://www.w3.org/DesignIssues/HTTP-URI.html
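Points 2 and 3 of the list above (HTTP URIs, and RDF returned on lookup) can be illustrated with content negotiation. The sketch below builds, but does not send, an HTTP request that asks for an RDF representation of a resource. The `Accept` media types are standard; the DBpedia resource URI follows DBpedia's usual `/resource/` pattern, and whether a given server honours the header is an assumption.

```python
from urllib.request import Request


def rdf_request(resource_uri: str) -> Request:
    """Build a GET request that prefers RDF (Turtle, then RDF/XML) over HTML."""
    return Request(
        resource_uri,
        headers={"Accept": "text/turtle, application/rdf+xml;q=0.9, text/html;q=0.5"},
        method="GET",
    )


req = rdf_request("http://dbpedia.org/resource/Academia_Sinica")
print(req.get_method(), req.full_url)
print("Accept:", req.get_header("Accept"))
# To actually dereference the URI, pass `req` to urllib.request.urlopen
# (this needs network access and is left out of the sketch).
```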
http://blog.okfn.org/category/working-groups/wg-archaeology/
Linked Open Data (LOD)
Resource URI: http://www.epa.gov.tw/resource/Taiepi_Weather_Station
Taiepi_Weather_Station
Resource URI: http://lod.tw/resource/Taiepi_City
DBpedia
• DBpedia is a community effort to extract structured “infobox” information from Wikipedia
Academia Sinica in Wikipedia
http://en.wikipedia.org/wiki/Academia_Sinica
Academia Sinica in DBpedia
http://dbpedia.org/page/Academia_Sinica
LOD Cloud (2007-05-01): 12 data sets
LOD Cloud (2008-02-28): 32 data sets
LOD Cloud (2009-03-05): 93 data sets
LOD Cloud (2010-09-22): 203 data sets
LOD Cloud (2011-09-19): 295 data sets
LODE-Taiwan Biodiversity Dataset
LOD Cloud (2014-08-30): 570 data sets
Government Linked Open Data
http://hangingtogether.org/?p=5206
90
Figure: Linked Data Survey Respondents (bar chart by country, axis 0 to 45). Countries listed: USA, Spain, UK, The Netherlands, Norway, Canada, Australia, France, Germany, Italy, Switzerland, Austria, Czech Republic, Hungary, Ireland, Japan, Malaysia, Portugal, Singapore, Sweden.
Karen Smith-Yoshimura (2015) Linked Data Implementations— Who, What and Why?, OCLC Research
Figure: bar chart comparing 2014 and 2015 (axis 0 to 25) across institution types: Academic library, National library, Network, Government, Scholarly, Public library, Museum, Other.
Karen Smith-Yoshimura (2015) Linked Data Implementations— Who, What and Why?, OCLC Research
2015
• Steep learning curve for staff: 40
• Inconsistency in legacy data: 33
• Selecting appropriate ontologies to represent our data: 31
• Establishing the links: 27
• Little documentation or advice on how to build the systems: 21
Karen Smith-Yoshimura (2015) Linked Data Implementations— Who, What and Why?, OCLC Research
Why publish linked data? (2015, 2014)
• Expose to larger audience on the Web: 67 (2015), 45 (2014)
• Demonstrate what could be done with datasets as linked data: 59 (2015), 41 (2014)
• Heard about linked data and wanted to try it out by exposing our data as linked data: 43 (2015), 21 (2014)
• See if publishing linked data would improve our Search Engine Optimization (SEO): 29 (2015), 9 (2014)
Karen Smith-Yoshimura (2015) Linked Data Implementations— Who, What and Why?, OCLC Research
Why consume linked data? (2015, 2014)
• Provide our users with a richer experience: 51 (2015), 35 (2014)
• Enhance our own data by consuming linked data from other sources: 50 (2015), 37 (2014)
• More effective internal metadata management: 32 (2015), 16 (2014)
• Greater accuracy and scope in our search results: 27 (2015), 12 (2014)
• See if consuming linked data would improve our Search Engine Optimization (SEO): 19 (2015), 12 (2014)
• Experiment with combining different types of data into a single triple store: 17 (2015), 15 (2014)
• Heard about linked data and wanted to try it out by using linked data sources: 17 (2015), 13 (2014)
2015
• VIAF (Virtual International Authority File): 41
• DBpedia: 36
• GeoNames: 35
• id.loc.gov: 35
• Resources we convert to linked data ourselves: 17
• Getty's AAT: 16
• FAST (Faceted Application of Subject Terminology): 15
• WorldCat.org: 15
• data.bnf.fr: 12
• Deutsche National Bib Linked Data Service: 12
Karen Smith-Yoshimura (2015) Linked Data Implementations— Who, What and Why?, OCLC Research
• URI
• Base URI: http://geo.lod.tw
• TBox URIs: http://geo.lod.tw/ontology/{class|property}
• ABox URIs: http://geo.lod.tw/resource/Name/
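The URI scheme above can be sketched as two small helper functions. This is a minimal sketch, not the project's actual implementation: the base URI and path patterns come from the slide, while the function names, the space-to-underscore rule, the percent-encoding step, and the example terms are assumptions made for illustration.

```python
from urllib.parse import quote

BASE_URI = "http://geo.lod.tw"


def tbox_uri(term: str) -> str:
    """URI for a class or property in the ontology (TBox)."""
    return f"{BASE_URI}/ontology/{quote(term, safe='')}"


def abox_uri(name: str) -> str:
    """URI for an individual resource (ABox), following the /resource/Name/ pattern."""
    # Assumption: spaces become underscores, everything else is percent-encoded.
    local_name = quote(name.strip().replace(" ", "_"), safe="")
    return f"{BASE_URI}/resource/{local_name}/"


print(tbox_uri("WeatherStation"))  # hypothetical class name
print(abox_uri("Taipei City"))     # hypothetical resource name
```

Keeping ontology terms and instance data under separate path prefixes means a URI's shape already indicates whether it names a definition or a thing.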
Ontologies
• OWL, RDFS
• Protégé
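• To link to other data, you need a clear picture of your data's context so that you can find data worth linking to
• That context can be worked out along three main dimensions: the data's temporal characteristics, its spatial characteristics, and its subject matter
• Tools
• Silk: http://wifo5-03.informatik.uni-mannheim.de/bizer/silk/
• LIMES: http://aksw.org/Projects/LIMES.html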
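As a toy stand-in for the kind of label matching that link-discovery tools automate, the sketch below compares resource labels from two datasets and proposes a link when the labels are similar enough. It does not reproduce how Silk or LIMES work internally; the `example.org` target URIs, the labels, and the 0.9 threshold are all assumptions for illustration.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity between 0.0 and 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def find_links(source: dict, target: dict, threshold: float = 0.9) -> list:
    """Return (source_uri, target_uri, score) for label pairs at or above threshold."""
    links = []
    for s_uri, s_label in source.items():
        for t_uri, t_label in target.items():
            score = similarity(s_label, t_label)
            if score >= threshold:
                links.append((s_uri, t_uri, round(score, 2)))
    return links


source = {"http://lod.tw/resource/Taiepi_City": "Taipei City"}
target = {  # hypothetical external dataset
    "http://example.org/place/1": "Taipei city",
    "http://example.org/place/2": "Tainan City",
}
for s_uri, t_uri, score in find_links(source, target):
    print(s_uri, "owl:sameAs", t_uri, score)
```

In practice, label similarity alone is rarely enough; combining it with the temporal and spatial dimensions mentioned above (for example, coordinates within some distance) reduces false matches.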
• CSV and spreadsheets
• RDF extension of Google Refine, XLWrap, RDF123, NOR2O
• RDB
• D2R Server, ODEMapster, W3C RDB2RDF WG – R2RML
• XML
• GRDDL, ReDeFer
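The CSV case can be illustrated without any of the tools listed: the sketch below turns each row of a small CSV into N-Triples lines, using one column as the resource identifier and the remaining columns as properties. The `example.org` namespaces and column names are hypothetical; the sample values (id 6, latitude 25.037583) are taken from the weather-station example in this deck.

```python
import csv
import io

BASE = "http://example.org/resource/"   # hypothetical resource namespace
VOCAB = "http://example.org/ontology/"  # hypothetical vocabulary namespace

CSV_TEXT = """id,name,lat
6,Taipei Weather Station,25.037583
"""


def csv_to_ntriples(text: str, key: str = "id") -> list:
    """Map each CSV row to N-Triples lines: the key column names the subject."""
    lines = []
    for row in csv.DictReader(io.StringIO(text)):
        subject = f"<{BASE}{row[key]}>"
        for column, value in row.items():
            if column == key or value == "":
                continue  # the key is already in the subject; skip empty cells
            literal = value.replace("\\", "\\\\").replace('"', '\\"')
            lines.append(f'{subject} <{VOCAB}{column}> "{literal}" .')
    return lines


for line in csv_to_ntriples(CSV_TEXT):
    print(line)
```

Every value is emitted as a plain string literal here; a fuller mapping would assign datatypes (such as `xsd:float` for coordinates) and reuse existing vocabularies instead of minting one property per column.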
• HTTP (accessibility)
(dereferenceability)
• namespace vocabulary
• RDF stores and SPARQL endpoints
• Jena, Virtuoso, Sesame, 4store, OWLIM, BBN Parliament
• linked-data front-end services
• Pubby, Talis Platform, Fuseki, D2RQ
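Once data sits in an RDF store, clients usually reach it through a SPARQL endpoint. The sketch below shows how a query is encoded into a GET request URL using the `query` parameter defined by the SPARQL 1.1 Protocol. The endpoint address is a hypothetical placeholder, and no request is sent.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

ENDPOINT = "http://example.org/sparql"  # hypothetical endpoint address

QUERY = """\
SELECT ?s ?label WHERE {
  ?s <http://www.w3.org/2000/01/rdf-schema#label> ?label .
} LIMIT 10
"""


def sparql_get_url(endpoint: str, query: str) -> str:
    """Encode a query as a SPARQL 1.1 Protocol GET request URL."""
    params = urlencode({"query": query})
    return f"{endpoint}?{params}"


url = sparql_get_url(ENDPOINT, QUERY)
print(url)
# A client would fetch this URL with an Accept header such as
# "application/sparql-results+json" to receive the result bindings as JSON.
```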
Figure: publishing architecture. Components shown: MySQL (RDB), D2R, Pubby, Virtuoso (RDF store), Web (HTML, RDF), RelFinder.