data identification
DESCRIPTION
Data Identification. Open Data Around the World. Before. What data do you have? Have to ask for data through a FIOA request Wasn't always in a digital format Very long time to get and make use of. Why Should I care?. Health (hospital scores, diet/food) Economics (unemployment, CPI) - PowerPoint PPT PresentationTRANSCRIPT
1
Data Identification
2
Open Data Around the World
3
Before
What data do you have?
Have to ask for data through a FIOA request
Wasn't always in a digital format
Very long time to get and make use of
4
Why Should I care?
Health (hospital scores, diet/food)
Economics (unemployment, CPI)
Crime (rates, geo/temporal)
Environment (air quality, weather)
Education (rates, school districts)
So much more....
5
Data.gov
6
Raw Data
7
Data.gov Dataset Page
8
Other Raw Datasets
9
Challenges
Machine-readability
Metadata
Provenance
Discovery
Mashing/linking
10Linked Data
decentralized - sources may be spread out and referenced across the Web
modular - linked without advance planning or coordination
scalable - once stored in place, it’s easy to extend
advantages hold even when definitions and structure of the data changes over time.
11Linked Open Data Cloud
12
13Linking Open Government Data
14Catalog
15Dataset Page
16
Data Understaing
17Conversion:
From Raw Tabular Data to RDF
18Enhancement:
Linking Open Government Data
ID year PHSY_ST site-id cost
1998 10.0
1999 site123 11.3
2000 NY 8.3
2001 20
site-id Latitude longitude
site123 43.993 -70.326
Year claims
2000 382
PHSY_ST: state abbreviationID: unique id
cost: unit is million US dollarsyear: 1975-2008
Correlated dataset Complement dataset
Metadata (field definition) Metadata (value definition)
owl:sameAs
DS123:NY