Linked Data: Forming Partnerships at the Data Layer
IN12A-03
AGU Fall Meeting 2015
Monday, 14 December
the BILLION DOLLAR QUESTION
How do we prevent our high-quality digital data from becoming dark data
when shared with our partners?
Five Star Open Data
http://5stardata.info/en/
RDF moves data schema to data layer
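A minimal sketch of what moving the schema into the data layer can look like: every value is stated as a triple whose predicate is a URI, so a partner's software can learn what a field means from the data itself rather than from out-of-band documentation. The `EX` namespace, the cruise URI, the vessel name, and the property names are hypothetical placeholders; only the `rdf:type` URI is a real vocabulary term.

```python
# Sketch only: EX namespace, cruise URI, and property names are hypothetical.
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
EX = "http://example.org/vocab#"


def to_ntriples(subject, record):
    """Serialize a dict of {property URI: value} as N-Triples lines."""
    lines = []
    for predicate, value in record.items():
        if value.startswith("http://") or value.startswith("https://"):
            obj = f"<{value}>"  # URI object
        else:
            escaped = value.replace("\\", "\\\\").replace('"', '\\"')
            obj = f'"{escaped}"'  # plain string literal
        lines.append(f"<{subject}> <{predicate}> {obj} .")
    return lines


cruise = {
    RDF_TYPE: EX + "Cruise",
    EX + "vesselName": "R/V Example",
    EX + "startDate": "2015-12-14",
}
triples = to_ntriples("http://example.org/cruise/123", cruise)
print("\n".join(triples))
```

Because the property names travel with the values, the "schema" is no longer locked inside one partner's database or code.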
Partnerships in Big Data
Big Data Challenges at BCO-DMO
VERACITY: Uncertainty of the data
VARIETY: Heterogeneity of the data
Veracity: Cruise Metadata
Data Partnering = Blind Date
Software: The silent partners
Data Exchange
Assumptions in Software
Partner A: Software Agent X
Partner B: Software Agent Y
Context A
Concept Maps
Data Schema
Data Types
DATA
Community Recommendations for Sustainable Scientific Software
http://dx.doi.org/10.5334/jors.bt
“Science software must be sustainable and reliable to contribute to future science practices….the research community needs to ensure that science software can be relied upon to reproduce research results.”
- R. R. Downs et al., Journal of Open Research Software, 2015
Identify software assumptions about data
& capture provenance
Variety: External Related Datasets
Assumptions
1. LTER data are available at DataONE
2. LTER Site has a specific identifier at DataONE
3. Query DataONE SOLR endpoint with the identifier
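The three assumptions above could be made explicit in code along these lines. This is a sketch, not the actual BCO-DMO implementation: the endpoint URL is a placeholder to be replaced with the real DataONE SOLR address, and the `siteIdentifier` field name and the example identifier are hypothetical. Only `q`, `rows`, and `wt` are standard SOLR query parameters.

```python
from urllib.parse import urlencode

# Assumption: placeholder endpoint; substitute the real DataONE SOLR URL.
SOLR_ENDPOINT = "https://example.org/solr/"


def build_site_query(site_identifier, field="siteIdentifier", rows=10):
    """Build a SOLR query URL for one site identifier (field name is hypothetical)."""
    # Quote the value so characters such as ':' are not read as SOLR syntax.
    escaped = site_identifier.replace("\\", "\\\\").replace('"', '\\"')
    params = {"q": f'{field}:"{escaped}"', "rows": rows, "wt": "json"}
    return SOLR_ENDPOINT + "?" + urlencode(params)


# Writing the assumptions down as data lets them be recorded with each result.
assumptions = [
    "LTER data are available at DataONE",
    "LTER site has a specific identifier at DataONE",
    "DataONE SOLR endpoint is queried with the identifier",
]
url = build_site_query("urn:example:site")
print(url)
```

Keeping the assumptions next to the query means they can be stored alongside whatever the query returns, instead of staying implicit in the software.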
Provenance in RDF
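As an illustration, the provenance of a software-made link between datasets might be written with PROV-O terms roughly as follows. The `EX` identifiers for the agent, activity, and entities are hypothetical; the `prov:` class and property names (`SoftwareAgent`, `Activity`, `wasAssociatedWith`, `used`, `wasGeneratedBy`, `wasDerivedFrom`) are from the W3C PROV-O vocabulary.

```python
# Sketch only: the EX identifiers are hypothetical; PROV terms are from PROV-O.
PROV = "http://www.w3.org/ns/prov#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
EX = "http://example.org/prov/"


def triple(s, p, o):
    """Format one N-Triples statement whose object is a URI."""
    return f"<{s}> <{p}> <{o}> ."


agent = EX + "harvester"           # the software that made the link
activity = EX + "dataone-lookup"   # one run of the lookup
source = EX + "solr-response"      # what the software consumed
result = EX + "related-datasets"   # what the software produced

provenance = [
    triple(agent, RDF_TYPE, PROV + "SoftwareAgent"),
    triple(activity, RDF_TYPE, PROV + "Activity"),
    triple(activity, PROV + "wasAssociatedWith", agent),
    triple(activity, PROV + "used", source),
    triple(result, PROV + "wasGeneratedBy", activity),
    triple(result, PROV + "wasDerivedFrom", source),
]
print("\n".join(provenance))
```

Statements like these travel with the data, so a partner can see which software produced a link and what it relied on.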
Tool: PROV-O-Viz
DataONE: The silent partner
Researchers
Aggregators
Archives
Domain Repositories
Data Scope
[Diagram, repeated across build slides: Researchers, Aggregators, Archives, Domain Repositories; labels: ISO, Today, RDF Data]
Where do we start?
To elevate your business logic to the data layer: TWC Semantic Web Methodology
http://tw.rpi.edu/web/doc/TWC_SemanticWebMethodology
Recap
1. Recognize digital data can still be dark
2. Elevate data schema to data layer
3. Identify data assumptions in software & capture provenance
QUESTIONS?
Related AGU Things
PA13B-02 Improved Access to NSF Funded Ocean Research Data (oral)
IN41C Semantics for the Discovery, Access & Integration of Geoscience Data (session)
IN41C-1710 EarthCube GeoLink: Semantics and Linked Data for the Geosciences (poster)