NBCC Open Notebook Science Talk
DESCRIPTION
Jean-Claude Bradley presents "Accelerating Discovery by Sharing: a case for Open Notebook Science" at the National Breast Cancer Coalition Annual Advocacy Conference in Arlington, VA on May 1, 2011.
TRANSCRIPT
![Page 1: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/1.jpg)
Accelerating Discovery by Sharing: a case for Open
Notebook Science
Jean-Claude Bradley
May 1, 2011
National Breast Cancer Coalition Annual Advocacy Conference
Associate Professor of Chemistry
Drexel University
![Page 2: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/2.jpg)
Outline
1. Trends in sharing for drug discovery
2. ONS for malaria research
3. Crowdsourcing solubility with ONS
4. Leveraging the educational system to contribute new science
5. Open modeling and web services
6. Discovering connections
7. Moving forward: tools and practices
![Page 3: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/3.jpg)
Industry is Sharing More
![Page 4: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/4.jpg)
Opportunities for Competitive Collaboration
![Page 5: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/5.jpg)
Some Initiatives Promoting More Openness in Drug Discovery
![Page 6: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/6.jpg)
Motivation: Faster Science, Better Science
![Page 7: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/7.jpg)
There are NO FACTS, only measurements embedded
within assumptions
Open Notebook Science maintains the integrity of data
provenance by making assumptions explicit
![Page 8: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/8.jpg)
TRUST
PROOF
![Page 9: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/9.jpg)
Strategy for an Open Notebook:
First record then abstract structure
In order to be discoverable use Google friendly formats (simple HTML, no login)
In order to be replicable use free hosted tools (Wikispaces, Google Spreadsheets)
![Page 10: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/10.jpg)
UsefulChem Project: Open Primary Research in Drug Design using Web2.0 tools
Docking
Synthesis
Testing
Rajarshi Guha, Indiana U
JC Bradley, Drexel U
Phil Rosenthal, UCSF (malaria)
Dan Zaharevitz, NCI (tumors)
Tsu-Soo Tan, Nanyang Inst.
![Page 11: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/11.jpg)
Malaria Target: falcipain-2 involved in hemoglobin metabolism
Dana.org
![Page 12: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/12.jpg)
The Ugi Reaction
![Page 13: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/13.jpg)
Outcome of Guha-Bradley-Rosenthal collaboration
![Page 14: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/14.jpg)
References to papers, blog posts, lab notebook pages, raw
data
![Page 15: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/15.jpg)
The Ugi reaction: can we predict precipitation?
Can we predict solubility in organic solvents?
![Page 16: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/16.jpg)
Crowdsourcing Solubility Data
![Page 17: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/17.jpg)
ONS Challenge Judges
![Page 18: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/18.jpg)
ONS Challenge Award Winners
![Page 19: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/19.jpg)
Solubilities collected in a Google Spreadsheet
![Page 20: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/20.jpg)
Rajarshi Guha’s Live Web Query using Google Viz API
![Page 21: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/21.jpg)
Data provenance: From Wikipedia to…
![Page 22: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/22.jpg)
…the lab notebook and raw data
![Page 23: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/23.jpg)
Interactive NMR spectra using JSpecView and JCAMP-DX
![Page 24: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/24.jpg)
(Andy Lang, Tony Williams)
Open Data JCAMP spectra for education
(Andy Lang, Tony Williams, Robert Lancashire)
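JCAMP-DX stores spectra as plain text made of labeled records that begin with `##LABEL=`, which is part of what makes it convenient as an open data format. The following is a minimal sketch of reading only the header records of such a file. The sample text is hypothetical, and the sketch does not decode the spectral data table itself, which real viewers such as JSpecView handle.

```python
def normalize_label(label):
    """JCAMP-DX labels are compared ignoring case, spaces, '-', '/' and '_'."""
    return "".join(ch for ch in label.upper() if ch not in " -/_")


def parse_jcamp_header(text):
    """Collect single-line labeled records up to ##END= into a dict."""
    records = {}
    for raw in text.splitlines():
        # '$$' starts a comment that runs to the end of the line.
        line = raw.split("$$", 1)[0].strip()
        if not line.startswith("##") or "=" not in line:
            continue  # data rows and blank lines are skipped in this sketch
        label, value = line[2:].split("=", 1)
        key = normalize_label(label)
        if key == "END":
            break
        records[key] = value.strip()
    return records


# Hypothetical file contents, for illustration only.
SAMPLE = """##TITLE= hypothetical 1H NMR spectrum
##JCAMP-DX= 5.01
##DATA TYPE= NMR SPECTRUM
##XUNITS= HZ
##YUNITS= ARBITRARY UNITS
##XYDATA= (X++(Y..Y))
1000 12 15 11
##END=
"""

header = parse_jcamp_header(SAMPLE)
print(header["TITLE"], "|", header["DATATYPE"], "|", header["XUNITS"])
```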
![Page 25: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/25.jpg)
Raw Data As Images
Splatter?
Some liquid
![Page 26: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/26.jpg)
YouTube for demonstrating experimental set-up
![Page 27: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/27.jpg)
The importance of raw data availability
Missed in a prior publication on
solubility for this compound
![Page 28: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/28.jpg)
Case study: Chemical Information
Retrieval course at Drexel (Fall 2009/2010)
Leveraging the educational system to contribute new science
![Page 29: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/29.jpg)
The Chemical Information Validation Sheet
567 curated and referenced measurements from Fall 2010 Chemical Information Retrieval course
![Page 30: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/30.jpg)
The Chemical Information Validation Explorer
(Andrew Lang)
![Page 31: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/31.jpg)
Discovering outliers for melting points (stdev/average)
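The ratio named on this slide (standard deviation divided by average) can be sketched as below. This is only an illustration of the idea, not the logic of the actual validation sheet: the compound names, the values, and the 0.05 cutoff are all hypothetical, and the ratio is only meaningful when the average is well away from zero.

```python
from statistics import mean, stdev


def relative_spread(values):
    """Return stdev/|average| for several reported values of one property."""
    if len(values) < 2:
        raise ValueError("need at least two measurements")
    avg = mean(values)
    if avg == 0:
        raise ValueError("average is zero; the ratio is undefined")
    return stdev(values) / abs(avg)


def flag_outliers(reports, threshold):
    """Return (compound, ratio) pairs above the threshold, largest ratio first."""
    scored = [
        (name, relative_spread(vals))
        for name, vals in reports.items()
        if len(vals) >= 2
    ]
    flagged = [(name, ratio) for name, ratio in scored if ratio > threshold]
    return sorted(flagged, key=lambda pair: pair[1], reverse=True)


# Hypothetical melting points collected from different sources.
reports = {
    "compound A": [120.0, 121.0, 119.5],
    "compound B": [140.0, 218.0, 222.0],
}

flagged = flag_outliers(reports, threshold=0.05)
for name, ratio in flagged:
    print(f"{name}: stdev/average = {ratio:.3f}")
```

Compounds that come out on top of such a ranking are the ones worth tracing back to their original sources.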
![Page 32: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/32.jpg)
Investigating the m.p. inconsistencies of EGCG
![Page 33: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/33.jpg)
Investigating the m.p. inconsistencies of cyclohexanone
![Page 34: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/34.jpg)
Sigma-Aldrich, Acros and Wolfram Alpha apparently use the same sources for melting
points
![Page 35: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/35.jpg)
Sigma-Aldrich, Acros and Wolfram Alpha apparently use the same sources for boiling
points
![Page 36: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/36.jpg)
Sigma-Aldrich, Acros and Wolfram Alpha apparently
DO NOT use the same sources for flash points
![Page 37: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/37.jpg)
Most popular data sources
![Page 38: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/38.jpg)
Alfa Aesar donates melting points to the public
![Page 39: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/39.jpg)
Open Melting Point Explorer
![Page 40: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/40.jpg)
Outliers
MDPI dataset
EPI (via ChemSpider)
![Page 41: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/41.jpg)
Outliers
Alfa Aesar
![Page 42: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/42.jpg)
Inconsistencies and SMILES problems within MDPI dataset
![Page 43: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/43.jpg)
MDPI Dataset labeled with High Trust Level
![Page 44: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/44.jpg)
Open Melting Point Datasets
![Page 45: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/45.jpg)
Open Random Forest modeling of Open Melting Point data using CDK descriptors
(Andrew Lang)
R² = 0.78, TPSA and nHdon most important
![Page 46: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/46.jpg)
Melting point prediction service
![Page 47: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/47.jpg)
Other Web Services…
(Andrew Lang)
General Transparent Solubility Prediction
![Page 48: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/48.jpg)
Convenient web services for solubility measurement and
prediction
(Andrew Lang)
![Page 49: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/49.jpg)
Integration of Multiple Web Services to Recommend Solvents
for Reactions
(Andrew Lang)
![Page 50: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/50.jpg)
Using melting point for temperature dependent solubility prediction
![Page 51: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/51.jpg)
Semi-Automated Measurement of solubility via web service analysis of JCAMP-DX files
(Andy Lang)
![Page 52: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/52.jpg)
![Page 53: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/53.jpg)
![Page 54: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/54.jpg)
Solubility Prediction (Andy Lang uses Abraham Model)
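The slide only names the Abraham model. In its general form it is a linear free energy relationship, log P = c + eE + sS + aA + bB + vV, where E, S, A, B and V describe the solute and c, e, s, a, b and v are fitted for each solvent. Taking P as the ratio of solubility in the solvent to solubility in water, a prediction could be sketched as follows. All numbers here are placeholders rather than published coefficients, and this is not the code behind the web service.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SoluteDescriptors:
    E: float  # excess molar refraction
    S: float  # dipolarity/polarizability
    A: float  # hydrogen-bond acidity
    B: float  # hydrogen-bond basicity
    V: float  # McGowan characteristic volume


@dataclass(frozen=True)
class SolventCoefficients:
    c: float
    e: float
    s: float
    a: float
    b: float
    v: float


def log_partition(solute, solvent):
    """log10 of the solvent/water solubility ratio from the Abraham equation."""
    return (
        solvent.c
        + solvent.e * solute.E
        + solvent.s * solute.S
        + solvent.a * solute.A
        + solvent.b * solute.B
        + solvent.v * solute.V
    )


def log_solubility(log_s_water, solute, solvent):
    """log10 solubility in the solvent, given log10 solubility in water."""
    return log_s_water + log_partition(solute, solvent)


# Placeholder values for illustration only (not real descriptors or coefficients).
solute = SoluteDescriptors(E=1.0, S=1.0, A=1.0, B=1.0, V=1.0)
solvent = SolventCoefficients(c=0.1, e=0.2, s=-0.3, a=0.4, b=-0.5, v=0.6)

predicted = log_solubility(-2.0, solute, solvent)
print(f"predicted log S = {predicted:.2f}")
```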
![Page 55: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/55.jpg)
Reaction Attempts Book
![Page 56: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/56.jpg)
Reaction Attempts Book: Reactants listed Alphabetically
![Page 57: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/57.jpg)
![Page 58: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/58.jpg)
Dynamic links to private tagged Mendeley collections
(Andrew Lang)
![Page 59: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/59.jpg)
All ONS web services
![Page 60: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/60.jpg)
For all Formats of ONS Projects
![Page 61: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/61.jpg)
ONS Challenge Solubility Book cited for nanotechnology
application
![Page 62: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/62.jpg)
Visualizing molecule-researcher connection maps reveals link between 2 Open Notebooks (Todd
and Bradley)
(Don Pellegrino)
![Page 63: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/63.jpg)
The Intersection of Open Notebooks (Bradley/Todd) and IP implications
Open Notebook could have blocked patent
if done earlier
![Page 64: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/64.jpg)
Decanoic acid
Water
NaCl
![Page 65: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/65.jpg)
Phrase searching for useful solubility applications
![Page 66: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/66.jpg)
Search for applications of solubility for breast cancer research
![Page 67: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/67.jpg)
Solubility prediction for Taxol using Abraham descriptors
Pred Exp
![Page 68: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/68.jpg)
Predicted temperature dependent solubility of Taxol in water (M)
![Page 69: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/69.jpg)
Current research questions for Taxol solubility
1. Does Taxol have a meaningful solubility in methanol or does it decompose too quickly?
2. Why is methanol reported to decompose Taxol but not ethanol?
3. Can the solubility of Taxol in solvent mixtures be predicted, especially for approved excipients?
4. Can the solubility of Taxol analogs be used to create reliable models for the solubility of this class of compounds?
![Page 70: NBCC Open Notebook Science Talk](https://reader036.vdocuments.us/reader036/viewer/2022062312/554e99d9b4c90526358b5301/html5/thumbnails/70.jpg)
Moving Forward: Tools and Practices
Use free hosted web tools and open data formats
1. Google Spreadsheets (numerical data)
2. Wikispaces (human readable format)
3. YouTube, SlideShare, LuLu, Nature Precedings, etc. (multiple data formats)
4. JCAMP-DX for spectral data
Practices
1. Report all findings immediately – even if tentative
2. Participate in social media to share progress and find collaborators
3. Abstract experiments and findings to machine readable formats and make these easily discoverable