igert drexel open notebook science talk

49
Open Notebook Science as an efficient means for transparency in science April 15, 2011 IGERT Drexel Meeting Jean-Claude Bradley Department of Chemistry Drexel University

Upload: jean-claude-bradley

Post on 24-Dec-2014

629 views

Category:

Education


0 download

DESCRIPTION

Jean-Claude Bradley presents "Open Notebook Science as an efficient means for transparency in science" on April 15, 2011 at the Drexel Nanotechnology Institute IGERT Meeting.

TRANSCRIPT

Page 1: IGERT Drexel Open Notebook Science Talk

Open Notebook Science as an efficient means for transparency in science

April 15, 2011

IGERT Drexel Meeting

Jean-Claude Bradley

Department of ChemistryDrexel University

Page 2: IGERT Drexel Open Notebook Science Talk

The current state of transparency in scientific communication

Case study of melting point data

Page 3: IGERT Drexel Open Notebook Science Talk

The Chemical Information Validation Sheet

567 curated and referenced measurements from Fall 2010 Chemical Information Retrieval course

Page 4: IGERT Drexel Open Notebook Science Talk

The Chemical Information Validation Explorer

(Andrew Lang)

Page 5: IGERT Drexel Open Notebook Science Talk

Discovering outliers for melting points (stdev/average)

Page 6: IGERT Drexel Open Notebook Science Talk

Investigating the m.p. inconsistencies of EGCG

Page 7: IGERT Drexel Open Notebook Science Talk

Investigating the m.p. inconsistencies of cyclohexanone

Page 8: IGERT Drexel Open Notebook Science Talk

Sigma-Aldrich, Acros and Wolfram Alpha apparently use the same sources for melting

points

Page 9: IGERT Drexel Open Notebook Science Talk

Sigma-Aldrich, Acros and Wolfram Alpha apparently use the same sources for boiling

points

Page 10: IGERT Drexel Open Notebook Science Talk

Sigma-Aldrich, Acros and Wolfram Alpha apparently

DO NOT use the same sources for flash points

Page 11: IGERT Drexel Open Notebook Science Talk

Most popular data sources

Page 12: IGERT Drexel Open Notebook Science Talk

Alfa Aesar donates melting points to the public

Page 13: IGERT Drexel Open Notebook Science Talk

Open Melting Point Explorer

Page 14: IGERT Drexel Open Notebook Science Talk

Outliers

MDPI dataset

EPI (via ChemSpider)

Page 15: IGERT Drexel Open Notebook Science Talk

Outliers

Alfa Aesar

Page 16: IGERT Drexel Open Notebook Science Talk

Inconsistencies and SMILES problems within MDPI dataset

Page 17: IGERT Drexel Open Notebook Science Talk

MDPI Dataset labeled with High Trust Level

Page 18: IGERT Drexel Open Notebook Science Talk

Open Melting Point Datasets

Page 19: IGERT Drexel Open Notebook Science Talk

Open Random Forest modeling of Open Melting Point data using CDK descriptors

(Andrew Lang)

R2 = 0.78, TPSA and nHdon most important

Page 20: IGERT Drexel Open Notebook Science Talk

Melting point prediction service

Page 21: IGERT Drexel Open Notebook Science Talk

Using melting point for temperature dependent solubility prediction

Page 22: IGERT Drexel Open Notebook Science Talk

Motivation: Faster Science, Better Science

Page 23: IGERT Drexel Open Notebook Science Talk

There are NO FACTS, only measurements embedded

within assumptions

Open Notebook Science maintains the integrity of data

provenance by making assumptions explicit

Page 24: IGERT Drexel Open Notebook Science Talk

TRUST

PROOF

Page 25: IGERT Drexel Open Notebook Science Talk

First record then abstract structure

In order to be discoverable use Google friendly formats (simple HTML, no login)

In order to be replicable use free hosted tools (Wikispaces, Google Spreadsheets)

Strategy for an Open Notebook:

Page 26: IGERT Drexel Open Notebook Science Talk

Crowdsourcing Solubility Data

Page 27: IGERT Drexel Open Notebook Science Talk

Data provenance: From Wikipedia to…

Page 28: IGERT Drexel Open Notebook Science Talk

…the lab notebook and raw data

Page 29: IGERT Drexel Open Notebook Science Talk

Calculations Made Public on Google Spreadsheets

Page 30: IGERT Drexel Open Notebook Science Talk

Interactive NMR spectra using JSpecView and JCAMP-DX

Page 31: IGERT Drexel Open Notebook Science Talk

Raw Data As Images

Splatter?

Some liquid

Page 32: IGERT Drexel Open Notebook Science Talk

YouTube for demonstrating experimental set-up

Page 33: IGERT Drexel Open Notebook Science Talk

The importance of raw data availability

Missed in a prior publication on solubility

for this compound

Page 34: IGERT Drexel Open Notebook Science Talk

Solubilities collected in a Google Spreadsheet

Page 35: IGERT Drexel Open Notebook Science Talk

Rajarshi Guha’s Live Web Query using Google Viz API

Page 36: IGERT Drexel Open Notebook Science Talk

Web services for summary data

(Andrew Lang)

Page 37: IGERT Drexel Open Notebook Science Talk

Web service calls from within a Google Spreadsheet for solubility measurement and

prediction

(Andrew Lang)

Page 38: IGERT Drexel Open Notebook Science Talk

Integration of Multiple Web Services to Recommend Solvents for Reactions

(Andrew Lang)

Page 39: IGERT Drexel Open Notebook Science Talk
Page 40: IGERT Drexel Open Notebook Science Talk
Page 41: IGERT Drexel Open Notebook Science Talk
Page 42: IGERT Drexel Open Notebook Science Talk

Reaction Attempts Book

Page 43: IGERT Drexel Open Notebook Science Talk

Reaction Attempts Book: Reactants listed Alphabetically

Page 44: IGERT Drexel Open Notebook Science Talk

ONS Challenge Solubility Book cited for nanotechnology application

Page 45: IGERT Drexel Open Notebook Science Talk

Lulu.com Data Disks

Page 46: IGERT Drexel Open Notebook Science Talk

Visualizing molecule-researcher connection maps reveals link between 2 Open Notebooks (Todd and

Bradley)

(Don Pellegrino)

Page 47: IGERT Drexel Open Notebook Science Talk

All ONS web services

Page 48: IGERT Drexel Open Notebook Science Talk

For all Formats of ONS Projects

Page 49: IGERT Drexel Open Notebook Science Talk

Conclusions

•Our current system of publication is not as transparent as it could be

•Open Notebook Science offers an efficient way to make research transparent and discoverable