Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH Philip E. Bourne Ph.D.
Associate Director for Data ScienceNational Institutes of Health
http://www.slideshare.net/pebourne
The Backdrop…
To foster an ecosystem that enables biomedical research to be conducted as a digital enterprise that enhances
health, lengthens life and reduces illness and disability
So what can you do for the NIH?First, some background
a) We believe what you are doing in discovery informatics to be very important to the ADDS mission
b) We have defined 5 thematic areas to pursue so how can you help?
Here are the areas….
Associate Director for Data Science
CommonsTrainingCenter
BD2KModifiedReview
Sustainability* Education* Innovation* Process
• Cloud – Data & Compute
• Search• Security • Reproducibility
Standards• App Store
• Coordinate• Hands-on• Syllabus• MOOCs
• Community• Centers• Training Grants• Catalogs• Standards• Analysis
• Data Resource Support
• Metrics• Best
Practices• Evaluation• Portfolio
Analysis
The Biomedical Research Digital Enterprise
Communication
Collaboration
Programmatic Theme
Deliverable
Example Features • IC’s• Researchers• Federal
Agencies• International
Partners• Computer
Scientists
Scientific Data Council External Advisory Board
* Hires made
Lets take each area one at a time
1. Sustainability & the Commons
Source Michael Bell http://homepages.cs.ncl.ac.uk/m.j.bell1/blog/?p=830
What The Commons Is and Is Not
Is Not:– A database
– Confined to one physical location
– A new large infrastructure
– Owned by any one group
Is:– A conceptual framework
– Analogous to the Internet
– A collaboratory
– A few shared rules
• All research objects have unique identifiers
• All research objects have limited provenance
http://video.open-bio.org/video/23/biomedical-research-as-an-open-digital-enterprise
The Commons (Vivien Bonnazi & George Komatsoulis (NCBI))
What we beginning to do:• Establishing public/private partnership
• Working with IC’s, NCBI, CIT, the community to identify and run pilots in the cloud, HPC centers, institutions
• Porting DbGAP to the cloud
• Experimenting with new funding strategies
• Evaluating
Sustainability and Sharing: The Commons
Data
The Long Tail
Core Facilities/HS Centers
Clinical /Patient
The Why:Data Sharing Plans
TheCommons
Government
The How:
DataDiscoveryIndex
SustainableStorage
Quality
Scientific Discovery
Usability
Security/Privacy
Commons == Extramural NCBI == Research Object Sandbox == Collaborative Environment
The End Game:
KnowledgeNIHAwardees
PrivateSector
Metrics/Standards
Rest ofAcademia
Software StandardsIndex
BD2KCenters
Cloud, Research Objects,Business Models
What Does the Commons Enable?
Dropbox like storage
The opportunity to apply quality metrics
Bring compute to the data
A place to collaborate
A place to discover
http://100plus.com/wp-content/uploads/Data-Commons-3-1024x825.png
[Adapted from George Komatsoulis]
One Possible Commons Business Model
HPC, Institution …
1. So What Can You Do for the Commons?
Contribute to the discussion on research object identifiers
Contribute to the discussion on provenance for research objects
Propose and implement pilots in the commons
Apply for FY15 RFAs associated with the commons
Critique it!
2. Training (Michelle Dunn)2. Training (Michelle Dunn)
Training Goals:
– Develop a sufficient cadre of researchers skilled in data science
– Elevate general competencies in data usage and analysis across the biomedical research workforce
– Combat the Google bus
How:
– Traditional training grants
– Non-traditional funding mechanisms
– Work with IC’s on a needs assessment
– Work with institutions on raising awareness
– Training center(s)
2. What Can You Do for Training?
Propose new models for training at the intersection of the disciplines
Address the question, are training centers a good idea?
Propose new funding models for training around prizes, challenges, hackathons etc.
Data Discovery Index Coordination Consortium (U24) (under review; FY14)
Centers (under review; FY14)
Metadata standards (under development; FY15)
Targeted Software Development (FY14;FY15)
Workshops (FY14, FY15)
3. BD2K Innovation (Jennie 3. BD2K Innovation (Jennie Larkin and Mark Guyer)Larkin and Mark Guyer)
3. What Can You Do for BD2K/Innovation?
Apply to the RFAs
Respond to the RFIs
Participate in the workshops
Talk to us about what we should be doing in the extramural community to foster the ecosystem
Other?
4. Process
What this involves:
– Policies and procedures
– Grant management and review
– Communicating with the community …
4. What Can You Do for Process?
– Make research object citation a reality
– Support machine readable data sharing plans?
– Support open review?
– Support micro funding?
– Support standing data committees to explore best practices?
– Support Crowd sourcing?
– [your ideas here….]
5. Collaboration
Between funding agencies
Between different branches of the federal government
Between countries / geographic regions
Between communities / disciplines
5. What Can You Do for Collaboration?
Continue to do what you are doing
Encourage interagency funding initiatives
Suggest workshops that bring folks together
Suggest mechanisms for making the collaborations stick
Suggest ways to stimulate communities
Your ideas here ….
That’s enough for you to do for one day…
Thanks for listening and acting as I know you will (no pressure)
Some Acknowledgements
Eric Green & Mark Guyer (NHGRI)
Jennie Larkin (NHLBI)
Leigh Finnegan (NHGRI)
Vivien Bonazzi (NHGRI)
Michelle Dunn (NCI)
Mike Huerta (NLM)
David Lipman (NLM)
Jim Ostell (NLM)
Andrea Norris (CIT)
Peter Lyster (NIGMS)
All the over 100 folks on the BD2K team
NIHNIH……Turning Discovery Into HealthTurning Discovery Into Health