TRANSCRIPT
![Page 1: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/1.jpg)
Data Driven Science at NIH:
A Conversation
Philip E. Bourne, PhD, FACMI
Associate Director for Data Science
National Institutes of Health
Federal Demonstration Partnership
May 11, 2015, Washington DC
![Page 2: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/2.jpg)
What Drives Our Strategic Thinking?
Be Prepared - Responding to take advantage of the
opportunities offered by a major disruption in the
biomedical research enterprise arising through
digitization and exponential growth
Accelerating discovery during this time of disruptive
development
Continually catalyzing a cultural shift towards a more
analytical enterprise while managing expectations
![Page 3: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/3.jpg)
Let Me Give You 4 Examples of What
Drives Us …
![Page 4: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/4.jpg)
1. We are at a Point of Deception …
Evidence:
– Google car
– 3D printers
– Waze
– Robotics
– Sensors
From: The Second Machine Age: Work, Progress,
and Prosperity in a Time of Brilliant Technologies
by Erik Brynjolfsson & Andrew McAfee
![Page 5: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/5.jpg)
Example - Photography
Digitization Deception
Disruption
Demonetization
Dematerialization
Democratization
Time
Volume, Velocity, Variety
Digital camera invented by
Kodak but shelved
Megapixels & quality improve slowly;
Kodak slow to react
Film market collapses;
Kodak goes bankrupt
Phones replace
cameras
Instagram,
Flickr become the
value proposition
Digital media becomes bona fide
form of communication
![Page 6: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/6.jpg)
1. We Are At a Point of Deception
The 6D Exponential Framework
Digitization of Basic & Clinical Research & EHRs
Deception
We Are Here
Disruption
Demonetization
Dematerialization
Democratization
Open science
Patient centered health care
![Page 7: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/7.jpg)
2. Democratization Will Follow
The Story of Meredith
http://fora.tv/2012/04/20/Congress_Unplugged_Phil_Bourne
Stephen Friend
![Page 8: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/8.jpg)
47/53 “landmark” publications
could not be replicated
[Begley & Ellis, Nature 483, 2012] [Carole Goble]
3. Disruption Can Occur
![Page 9: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/9.jpg)
4. Demonetization, Democratization?
![Page 10: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/10.jpg)
“And that’s why we’re here today. Because something
called precision medicine … gives us one of the greatest
opportunities for new medical breakthroughs that we
have ever seen.”
President Barack Obama, January 30, 2015
![Page 11: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/11.jpg)
Precision Medicine Initiative
Vision: Build a broad research program to encourage
creative approaches to precision medicine, test them
rigorously, and, ultimately, use them to build the
evidence base needed to guide clinical practice.
Near Term: apply the tenets of precision medicine to a
major health threat – cancer
Longer Term: generate the knowledge base necessary
to move precision medicine into virtually all areas of
health and disease
![Page 12: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/12.jpg)
Precision Medicine Initiative
National Research Cohort
– >1 million U.S. volunteers
– Numerous existing cohorts (many funded by NIH)
– New volunteers
Participants will be centrally involved in design and
implementation of the cohort
They will be able to share genomic data, lifestyle
information, biological samples – all linked to their
electronic health records
![Page 13: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/13.jpg)
An Example of That Promise:
Comorbidity Network for 6.2M Danes
Over 14.9 Years
Jensen et al. 2014, Nat. Commun. 5:4022
![Page 14: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/14.jpg)
Office of Biomedical
Data Science
Mission Statement
To use data science to foster an
open digital ecosystem that will
accelerate efficient, cost-effective
biomedical research
to enhance health, lengthen life, and
reduce illness and disability
Goals expanded from recommendations in the June 2012 DIWG and
BRWWG reports.
![Page 15: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/15.jpg)
Overall Goals by 2020
Enable major scientific discovery through the BD2K
initiative
Establish and provide evidence of a more sustainable,
efficient and productive data science ecosystem
both internal and external to NIH
Establish and provide evidence of a well-trained and
diverse workforce able to use and develop biomedical
data science tools and methods
Build upon NIH’s leadership and reputation in data
science
![Page 16: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/16.jpg)
The BD2K Program is Central
to the Mission
[Bar chart: dollars ($0 to $120,000,000) by fiscal year, FY14 to FY21. Planned: black; Available: green]
![Page 17: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/17.jpg)
Elements of The Digital Enterprise
Communities Policies
Infrastructure
• Intersection:
• Sustainability
• Efficiency
• Collaboration
• Training
![Page 18: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/18.jpg)
Elements of The Digital Enterprise
Communities Policies
Infrastructure
• Intersection:
• Sustainability
• Efficiency
• Collaboration
• Training
Virtuous
Research
Cycle
![Page 19: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/19.jpg)
Consider an example…
![Page 20: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/20.jpg)
Big Data: The study involved
MRI images & GWAS data
from over 30,000 people
Collaboration: Data came
from many different sites
affiliated with the ENIGMA
consortium
Methods: To homogenize
data from different sites, the
group designed standardized
protocols for image analysis,
quality assessment, genetic
imputation, and association
Found five novel genetic
variants
Results provided insight into
the variability of brain
development, and may be
applied to study of
neuropsychiatric dysfunction
![Page 21: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/21.jpg)
Community – Enigma, BD2K
Policy
– Improved consent methods
– Cloud accessibility for human subjects data
– Trusted partners
– Data sharing
Infrastructure
– Standards, compute resources, software
![Page 22: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/22.jpg)
Communities: Thus Far
Visioning workshop convened 9/3/14
Launched BD2K ($32M)
– 12 Centers of data excellence
– Data Discovery Index Coordination Consortium
(DDICC)
– Training awards
First successful consortia meeting 11/3-4
Workshops to inform future funding
– Software indexing and discoverability
– Gaming
![Page 23: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/23.jpg)
Communities: 2015 Activities
New FOAs with outreach to new
communities – math, stats, comp science etc.
Work with e.g. GA4GH, RDA, FORCE11,
NDS ….
IDEAS lab with NSF
Competition with international funders
Software carpentry, hackathons, Pi Day
![Page 24: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/24.jpg)
Communities: Questions?
Societies of the modern age?
How to enable these groups?
How to marry the funding of individuals with
the funding of communities?
![Page 25: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/25.jpg)
Policies: Now & Forthcoming
Data Sharing
– Genomic data sharing announced
– Data sharing plans on all research awards
– Data sharing plan enforcement
• Machine readable plan
• Repository requirements to include grant numbers
http://www.nih.gov/news/health/aug2014/od-27.htm
![Page 26: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/26.jpg)
Policies - Forthcoming
Data Citation
– Goal: legitimize data as a form of scholarship
– Process:
• Machine readable standard for data citation (done)
• Endorsement of data citation for inclusion in NIH
biosketch, grants, reports, etc.
• Example formats for human readable data citations
• Slowly work into NLM/NCBI workflow
dbGaP in the cloud (done!)
![Page 27: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/27.jpg)
Infrastructure - The Commons
[Diagram labels: BD2K Center (six), DDICC, Software, Standards, Labs (four)]
![Page 28: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/28.jpg)
The Commons
Digital Objects (with UIDs)
Search (indexed metadata)
Computing
Platform
The Commons
Vivien Bonazzi
George Komatsoulis
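To make the three layers on this slide concrete, here is a minimal sketch of digital objects that carry unique identifiers and are findable through indexed metadata. It is an illustration only, not the Commons design: the names `DigitalObject` and `ToyCommons`, the use of UUIDs as UIDs, the whitespace-token index, and the sample metadata are all assumptions made for this example.

```python
import uuid
from dataclasses import dataclass, field


@dataclass
class DigitalObject:
    """A dataset, tool, or document registered in a toy Commons (hypothetical)."""
    name: str
    metadata: dict
    # Assumption: a random UUID stands in for whatever UID scheme is adopted.
    uid: str = field(default_factory=lambda: str(uuid.uuid4()))


class ToyCommons:
    """In-memory registry with a simple inverted index over metadata values."""

    def __init__(self):
        self.objects = {}  # uid -> DigitalObject
        self.index = {}    # lowercase metadata term -> set of uids

    def register(self, obj):
        """Store the object and index every word found in its metadata values."""
        self.objects[obj.uid] = obj
        for value in obj.metadata.values():
            for term in str(value).lower().split():
                self.index.setdefault(term, set()).add(obj.uid)
        return obj.uid

    def search(self, term):
        """Return objects whose metadata contains the term, sorted by name."""
        uids = self.index.get(term.lower(), set())
        return sorted((self.objects[u] for u in uids), key=lambda o: o.name)


# Made-up sample records, for illustration only.
commons = ToyCommons()
commons.register(DigitalObject("mri-images", {"type": "imaging", "consortium": "ENIGMA"}))
commons.register(DigitalObject("gwas-summary", {"type": "genotype", "consortium": "ENIGMA"}))
print([o.name for o in commons.search("enigma")])  # ['gwas-summary', 'mri-images']
```

The computing platform layer is left out here; in this sketch it would be whatever environment resolves a UID to the object and runs analysis against it.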
![Page 29: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/29.jpg)
The Commons: Compute Platforms
The Commons
Conceptual Framework
Public Cloud
Platforms
Super Computing
(HPC) Platforms Other
Platforms ?
Google, AWS (Amazon)
Microsoft (Azure), IBM,
other?
In house compute
solutions
Private clouds, HPC
– Pharma
– The Broad
– Bionimbus
Traditionally low access
by NIH
![Page 30: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/30.jpg)
The Commons:
Business Model
[George Komatsoulis]
![Page 31: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/31.jpg)
Infrastructure: Standards
2013 Workshop on Frameworks for Community-
Based Standards
August 2014 Input on Information Resources for
Data-Related Standards Widely Used in Biomedical
Science – 30 responses
Feb 2015 Workshop Community-based Data and
Metadata Standards
Internal CDE Registry project
![Page 32: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/32.jpg)
Elements of The Digital Enterprise
Communities Policies
Infrastructure
• Intersection:
• Sustainability
• Efficiency
• Collaboration
• Training
![Page 34: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/34.jpg)
Sustainability 101
Source: Michael Bell, http://homepages.cs.ncl.ac.uk/m.j.bell1/blog/?p=830
![Page 35: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/35.jpg)
Workforce Training
![Page 36: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/36.jpg)
Strengthening a diverse biomedical workforce to
utilize data science
BD2K funding of Short Courses and Open
Educational Resources
Building a diverse workforce in biomedical
data science
BD2K Training programs and Individual Career
Awards
Fostering Collaborations
BD2K Training Coordination Center, NSF/NIH IDEAs Lab
Expanding NIH Data Science Workforce
Development Center
Local courses, e.g. Software Carpentry
Discovery of Educational Resources
BD2K Training Coordination Center
Goal: To strengthen the ability of a
diverse biomedical workforce to develop
and benefit from data science
![Page 37: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/37.jpg)
I not only use all the brains
I have, but all I can borrow.
– Woodrow Wilson
![Page 38: Data Driven Science at NIH: A ConversationData Driven Science at NIH: A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes of Health](https://reader033.vdocuments.us/reader033/viewer/2022043022/5f3daa0e45e6447d8613b6d9/html5/thumbnails/38.jpg)
Associate Director for Data Science
Commons BD2K Efficiency
Sustainability Education Innovation Process
• Cloud – Data &
Compute
• Search
• Security
• Reproducibility
Standards
• App Store
• Coordinate
• Hands-on
• Syllabus
• MOOCs
• Community
• Centers
• Training Grants
• Catalogs
• Standards
• Analysis
• Data
Resource
Support
• Metrics
• Best
Practices
• Evaluation
• Portfolio
Analysis
The Biomedical Research Digital Enterprise
Partnerships
Collaboration
Programmatic Theme
Deliverable
Example Features • ICs
• Researchers
• Federal
Agencies
• International
Partners
• Computer
Scientists
Scientific Data Council External Advisory Board
Training