Linked Data Approach for Integration of Human Health & Environmental Data
DESCRIPTION
Best practices and platforms for access and reuse of scientific data and models. We explore a Linked Data approach for data integration, modeling and interoperability. Delivered by Bernadette Hyland at the EPA & Society of Toxicology Scientific Workshop titled "Building for Better Decisions: Multi-scale Integration of Human Health and Environmental Data." Delivered 8-May-2012 at EPA Research Triangle Park, NC, USA.
TRANSCRIPT
![Page 1: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/1.jpg)
Linked Data Approach for Integration of Human Health and Environmental Data
Building for Better Decisions: Multi-scale Integration of Human Health and Environmental Data, 8-11 May 2012
By: Bernadette Hyland, Chair, W3C Government Linked Data WG
CEO, 3 Round Stones, Inc
Email: [email protected]
Twitter: @BernHyland
This presentation: http://slideshare.net/3roundstones
1Tuesday, May 8, 12
![Page 2: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/2.jpg)
• Linked Data is about publishing and consuming data using international data standards
• Based on a 20-year-old idea
• A system of linked information systems
![Page 3: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/3.jpg)
![Page 4: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/4.jpg)
Photo credit: http://www.flickr.com/photos/sjungling/5974860/
![Page 5: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/5.jpg)
A HISTORY OF SILOS
• 1970s: A neat little package ($ cat foo.txt | grep blah | sort)
• 1980s: Client-Server
• 1990s: The Early Web
![Page 6: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/6.jpg)
There is a better way to connect data ...
• No one vendor owns it
• It scales ... to Web-scale
• Doesn’t require a super model
• Based on International Data Exchange Standards (RDF, SPARQL)
![Page 7: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/7.jpg)
What is next for Data in the Web?
• What is next for Open Data on the Web
• Structured data on the Web is quickly becoming mainstream
• Authorities are beginning to appreciate a new way to publish and consume content
![Page 8: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/8.jpg)
![Page 9: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/9.jpg)
![Page 10: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/10.jpg)
“Linked Data means
Cooperation without coordination”
-- David Wood, PhD
![Page 11: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/11.jpg)
Governments
Goals: Governmental transparency and/or improved internal efficiencies (data warehouses)
![Page 12: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/12.jpg)
Hardware/Software Vendors
Goal: Improve interoperability between products and product lines
![Page 13: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/13.jpg)
Retailers
Goal: Improve click-throughs on search results
![Page 14: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/14.jpg)
Book Publishers
Goals: Improve internal manuscript pipelines, expose additional ways of finding and using content
![Page 15: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/15.jpg)
New Media
![Page 16: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/16.jpg)
Linked Data in Context
Diagram labels: Web; Universal Client; Universal Connection; Universal Database; Logic and interlinking; Ubiquitous, reusable applications; URL Curation of Data
![Page 17: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/17.jpg)
![Page 18: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/18.jpg)
![Page 19: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/19.jpg)
![Page 20: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/20.jpg)
Why is RDF important?
• It is an international standard for publishing data on the Web (public and private)
• Data exchange model
• Serializations include RDF/XML, N-Triples, N3, Turtle ...
• It is the future of using the Web
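To make the serialization bullet concrete, here is a minimal sketch in Python, using only the standard library, that writes two statements as N-Triples. The example.org URIs, the helper names, and the sample values are illustrative assumptions, not from the talk; a real project would normally rely on an RDF library rather than hand-rolled string formatting.

```python
# Minimal sketch: serialize (subject, predicate, object) triples as N-Triples.
# All example.org URIs below are hypothetical placeholders.

def escape_literal(text):
    """Escape the characters that must be escaped inside a quoted literal."""
    return (text.replace("\\", "\\\\")
                .replace('"', '\\"')
                .replace("\n", "\\n")
                .replace("\r", "\\r"))

def to_ntriples(triples):
    """Return one N-Triples line per triple.

    Subjects and predicates are URI strings. An object is treated as a URI
    when it starts with "http://" or "https://", otherwise as a plain literal.
    """
    lines = []
    for s, p, o in triples:
        if o.startswith(("http://", "https://")):
            obj = f"<{o}>"
        else:
            obj = f'"{escape_literal(o)}"'
        lines.append(f"<{s}> <{p}> {obj} .")
    return "\n".join(lines)

triples = [
    ("http://example.org/station/42",
     "http://example.org/vocab/locatedIn",
     "http://example.org/county/durham"),
    ("http://example.org/station/42",
     "http://www.w3.org/2000/01/rdf-schema#label",
     'Air monitor "A"'),
]
print(to_ntriples(triples))
```

Each output line is one complete statement, which is why N-Triples files can be concatenated or split line by line without breaking them.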
![Page 21: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/21.jpg)
What you can do ...
• Good = Use Data Standards (RDF) to publish metadata about data and models, at a minimum
• Better = Use RDF to publish all your data
• Best = Link your data + models
• Web architecture, Web-scale
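The "Best" step, linking data, can be sketched as follows. This is a toy illustration in plain Python, assuming two hypothetical datasets that were published independently but reuse the same county URI; the vocabulary terms and values are invented for the example.

```python
# Two datasets published independently; both use the same (hypothetical)
# URI for the county, which is what allows them to be linked.
COUNTY = "http://example.org/county/durham"
COUNTY_PREDICATE = "http://example.org/vocab/county"

air_quality = {
    ("http://example.org/reading/1", COUNTY_PREDICATE, COUNTY),
    ("http://example.org/reading/1", "http://example.org/vocab/ozonePpb", "71"),
}
health = {
    ("http://example.org/clinic/9", COUNTY_PREDICATE, COUNTY),
    ("http://example.org/clinic/9", "http://example.org/vocab/asthmaVisits", "12"),
}

# Merging triple sets is a set union: no shared schema is negotiated first.
merged = air_quality | health

def subjects_in(graph, county):
    """Return every subject that points at the given county URI."""
    return sorted(s for s, p, o in graph
                  if p == COUNTY_PREDICATE and o == county)

print(subjects_in(merged, COUNTY))
```

Because both publishers chose the same identifier, the merged graph answers a question neither dataset could answer alone, with no coordination beyond the shared URI.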
![Page 22: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/22.jpg)
![Page 23: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/23.jpg)
![Page 24: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/24.jpg)
WE’VE SEEN THIS BEFORE
![Page 25: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/25.jpg)
![Page 26: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/26.jpg)
![Page 27: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/27.jpg)
![Page 28: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/28.jpg)
![Page 29: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/29.jpg)
![Page 30: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/30.jpg)
![Page 31: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/31.jpg)
![Page 32: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/32.jpg)
![Page 33: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/33.jpg)
![Page 34: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/34.jpg)
• EMR Data: Clinical, Condition Specific
• Internal Portal Data: Physicians, Services, Locations
• Linked Data Cloud: DBpedia, PubMed, NLM
• Open Government Data: CDC, EPA, US Census
• Social Media: Facebook, Twitter
• Clinical Ontology
• Business Ontology
![Page 35: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/35.jpg)
Value Proposition
• Decrease costly emergency department visits
• Reduce hospital re-admissions after treatment
• Improve self-care and medication compliance
• Educate patients about triggers and disease management
![Page 36: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/36.jpg)
Functional Model
1. Define target population and clinical data from electronic medical record
2. Identify sources of open government data related to environmental, weather, and other variables related to chronic pulmonary disease exacerbations
3. Combine open content from NLM, PubMed, Medline to support education
4. Leverage a Linked Data approach, using Open Source and international data exchange standards (RDF)
5. Alert patient of possible hazardous conditions and recommend appropriate actions
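The five steps above can be sketched end to end in a few lines of Python. Everything here is an assumption made for illustration: the patient rows, the air quality values, the example.org URIs, and the alert threshold of 100 are invented, and the sketch is not the system shown in the talk.

```python
from dataclasses import dataclass

@dataclass
class Patient:
    patient_id: str   # anonymized identifier
    condition: str
    county_uri: str   # shared URI that links the patient to open data

# Step 1: target population, as it might be drawn from an EMR (made-up rows).
patients = [
    Patient("p-001", "COPD", "http://example.org/county/durham"),
    Patient("p-002", "COPD", "http://example.org/county/wake"),
    Patient("p-003", "diabetes", "http://example.org/county/durham"),
]

# Step 2: open government data keyed by the same county URIs (made-up values).
air_quality_index = {
    "http://example.org/county/durham": 155,
    "http://example.org/county/wake": 40,
}

# Step 3: educational content to attach to an alert (placeholder link).
education = {"COPD": "http://example.org/education/copd"}

AQI_ALERT_THRESHOLD = 100  # assumed cut-off for this sketch only

def build_alerts(patients, aqi_by_county, threshold=AQI_ALERT_THRESHOLD):
    """Steps 4-5: join on the shared URI and draft alerts for at-risk patients."""
    alerts = []
    for p in patients:
        aqi = aqi_by_county.get(p.county_uri)
        if p.condition in education and aqi is not None and aqi > threshold:
            message = f"Air quality index is {aqi}; see {education[p.condition]}"
            alerts.append((p.patient_id, message))
    return alerts

for patient_id, message in build_alerts(patients, air_quality_index):
    print(patient_id, message)
```

The join in `build_alerts` works only because the clinical and environmental records refer to a place by the same URI, which is the role Linked Data plays in step 4.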
![Page 37: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/37.jpg)
Leverage Linked Data, Open Source & Standards
Diagram labels: CDC, EPA, US Census; DBpedia, PubMed, NLM; Web of Data; EMR; SMS; Web
![Page 38: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/38.jpg)
![Page 39: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/39.jpg)
Shows:
1) Air Quality data from US EPA
2) Anonymized EMR data
3) Doctor’s details from CSV file
Uses Callimachus, a Linked Data Management Platform
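As a rough idea of how tabular input such as the doctors' CSV file can become linkable data, the sketch below turns each CSV row into triples using only the Python standard library. The column names, the sample rows, and the example.org prefixes are assumptions for illustration; this is not how Callimachus itself loads data.

```python
import csv
import io

# Made-up CSV content standing in for a "doctor's details" file.
CSV_TEXT = """id,name,specialty
d1,Dr. Example One,Pulmonology
d2,Dr. Example Two,Family Medicine
"""

BASE = "http://example.org/doctor/"   # hypothetical URI prefix for subjects
VOCAB = "http://example.org/vocab/"   # hypothetical vocabulary prefix

def csv_to_triples(text, key="id"):
    """Turn each row into triples: the key column names the subject URI,
    every other column becomes a predicate with a literal value."""
    triples = []
    for row in csv.DictReader(io.StringIO(text)):
        subject = BASE + row[key]
        for column, value in row.items():
            if column != key:
                triples.append((subject, VOCAB + column, value))
    return triples

for triple in csv_to_triples(CSV_TEXT):
    print(triple)
```

Once each doctor has a URI, other datasets can refer to that doctor directly instead of copying the row.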
![Page 40: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/40.jpg)
Tools & best practices?
• Large and small vendors are involved in Linked Data
• From Oracle and IBM to 3 Round Stones
• For a listing of active projects, companies and research, see http://dir.w3.org/
• For best practices, see http://www.w3.org/2011/gld/charter
![Page 41: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/41.jpg)
• Callimachus is a framework for data-driven applications based on Linked Data principles
• Callimachus allows Web developers to easily create data-driven applications for the Web
• It is Open Source (FLOSS)
• http://callimachusproject.org
![Page 42: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/42.jpg)
http://www.w3.org/2011/gld/charter
![Page 43: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/43.jpg)
DELIVERABLES
• Community Directory
• Best Practices for Publishing Linked Data: procurement, vocabulary selection, URI construction, versioning, stability, legacy data issues
• Cookbook for Linked Open Data
• Standard Vocabularies: metadata, statistical “Cube” data, people, organizational structures
![Page 44: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/44.jpg)
![Page 45: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/45.jpg)
Recommendations
• Be prepared for the scientific community & public to demand that your data be published in re-usable format (RDF)
• Demand your vendors use Open Source whenever possible
• Incentivize industry & STM publishers to do the right thing
• Open vs. proprietary technologies & data formats ... be OPEN
• Beware of semantic “pixie dust” - be “an educated consumer” (and scientist!)
• Solutions must embrace International Standards and published Best Practices (W3C, OMG, IETF)
• Define a URI Policy and Strategy, document it and ensure scientists use it!
• Leverage the work of others and work cooperatively...
• Our future is all connected through your work...
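One way to make a URI policy enforceable is to mint every identifier through a single documented function. The sketch below assumes a hypothetical pattern, `{base}/id/{kind}/{slug}`, and a placeholder domain; the actual pattern is a policy decision for each organization and is not prescribed by the talk.

```python
import re

BASE_URI = "http://example.org"   # hypothetical domain for this sketch

def slugify(label):
    """Lowercase the label and collapse runs of non-alphanumerics to '-'."""
    return re.sub(r"[^a-z0-9]+", "-", label.lower()).strip("-")

def mint_uri(kind, label):
    """Apply one documented pattern everywhere: {base}/id/{kind}/{slug}."""
    return f"{BASE_URI}/id/{slugify(kind)}/{slugify(label)}"

print(mint_uri("Monitoring Station", "Research Triangle Park #3"))
```

Routing all identifier creation through one function keeps URIs predictable, so two teams describing the same thing are more likely to arrive at the same URI.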
![Page 46: Linked Data Approach for Integration of Human Health & Environmental Data](https://reader033.vdocuments.us/reader033/viewer/2022060107/554b8522b4c90574668b4c7b/html5/thumbnails/46.jpg)
This work is Copyright © 2011-2012 3 Round Stones Inc.
It is licensed under the Creative Commons Attribution 3.0 Unported License.
Full details at: http://creativecommons.org/licenses/by/3.0/
You are free:
to Share — to copy, distribute and transmit the work
to Remix — to adapt the work
Under the following conditions:
Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work).
Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under the same or similar license to this one.