![Page 1: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/1.jpg)
The Possibilities and Pitfalls of Internet-
Based Chemical Data
Antony WilliamsRoyal Society of Chemistry
![Page 2: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/2.jpg)
I’ve performed a few dozen chemical syntheses
I’ve run thousands of analytical spectra I’ve generated thousands of NMR
assignments I’ve probably published <5% of all work But things can be different today….
About Me…as a Chemist
![Page 3: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/3.jpg)
My Early Scientific Computing
![Page 4: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/4.jpg)
If it was not just about me…
![Page 5: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/5.jpg)
If it was not just about me…
Together we might: build an encyclopedia …and rate restaurants …provide book reviews to each other …or movie reviews …or reviews of service providers …organize sit-ins and social action …and more data might just be Open
![Page 6: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/6.jpg)
If it was not just about me…
Together we might: build an encyclopedia …and rate restaurants …provide book reviews to each other …or movie reviews …or reviews of service providers …organize sit-ins and social action …and more data might just be Open …more Chemists might share rather than
just take!
![Page 7: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/7.jpg)
A hobby-project to connect chemistry data on the web Three servers – one purchased, two hand-built Software begged and borrowed – and thanks to Microsoft! Some late nights – 10pm to 2am for over a year Some survival of the naysayers in the community …and taking advantage of a changing world of data availability
and the crowdsourcing of willing participants
NO formal funding. Simply passion and abilities lining up.
A story of a hobby gone wild…
Years 1 and 2
![Page 8: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/8.jpg)
ChemSpider(Year 2-present)
Building a Free Chemical Database
A central hub for chemists to source information >28 million unique chemical records Aggregated from >400 data sources Chemicals, analytical data, movies, images,
podcasts, links to patents, publications, predictions Web services for integration Daily updates of data
![Page 9: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/9.jpg)
Answer Questions for Chemists
Questions a chemist might ask… What is the melting point of n-heptanol? What is the chemical structure of Xanax? Chemically, what is phenolphthalein? What are the stereocenters of cholesterol? Where can I find publications about xylene? What are the different trade names for
Ketoconazole? What is the NMR spectrum of Aspirin? What are the safety handling issues for Thymol
Blue?
![Page 10: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/10.jpg)
A LITTLE Chemistry First
![Page 11: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/11.jpg)
Structural Diagrams
![Page 12: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/12.jpg)
Structural Diagrams
![Page 13: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/13.jpg)
Analytical Data
![Page 14: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/14.jpg)
Does Stereochemistry Matter?
![Page 15: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/15.jpg)
Does one stereocenter matter?
Distaval, Talimol, Nibrol, Sedimide, Quietoplex, Contergan, Neurosedyn, Softenon, Thalidomide
![Page 16: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/16.jpg)
Structural Representations
![Page 17: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/17.jpg)
The InChI Standard
![Page 18: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/18.jpg)
InChIKeysSearch the Web by
Structure
![Page 19: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/19.jpg)
I want to know about “Vincristine”
![Page 20: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/20.jpg)
![Page 21: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/21.jpg)
Vincristine: Identifiers and Properties
![Page 22: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/22.jpg)
Vincristine: Vendors and Sources
![Page 23: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/23.jpg)
Vincristine: Patents
![Page 24: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/24.jpg)
Chemical Names and Synonyms
VALIDATION OF NAMES
![Page 25: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/25.jpg)
Validated Names for Searching…
![Page 26: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/26.jpg)
Information System Architecture
Input FilteringCuratio
nArchival
StorageIndexin
g
Processing
Search Browse
Presentation
API
![Page 27: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/27.jpg)
The Quality of Chemical Data Online
What is the Structure of Vitamin K?
A lipid cofactor that is required for normal blood clotting. Several forms of vitamin K have been identified: VITAMIN K1 (phytomenadione) derived from plants, VITAMIN K2 (menaquinone) from bacteria & synthetic naphthoquinone provitamins, VITAMIN K3 (menadione).
![Page 28: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/28.jpg)
What is the Structure of Vitamin K1?
![Page 29: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/29.jpg)
What is the Structure of Vitamin K1?
![Page 30: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/30.jpg)
CAS’s Common Chemistry
![Page 31: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/31.jpg)
Wikipedia
![Page 32: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/32.jpg)
Wolfram Alpha
![Page 33: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/33.jpg)
DailyMed
![Page 34: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/34.jpg)
![Page 35: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/35.jpg)
People Use Trusted Resources…
![Page 36: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/36.jpg)
Just Yesterday…
![Page 37: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/37.jpg)
How will it improve?
Participation and
contribution
![Page 38: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/38.jpg)
ALL Different, ALL “Domoic Acids”
![Page 39: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/39.jpg)
ALL Different, ALL “Domoic Acids”
![Page 40: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/40.jpg)
The EXPERTS must get it right?!
![Page 41: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/41.jpg)
Question Everything Online:
www.dhmo.org
![Page 42: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/42.jpg)
ANYBODY can annotate a record on ChemSpider
Registered users can deposit new data
Registered users can validate existing data
Deposition, Annotation and Validation
![Page 43: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/43.jpg)
CURATION Search “Vitamin H”
![Page 44: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/44.jpg)
“Curate” Identifiers
![Page 45: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/45.jpg)
“Curate” Identifiers
![Page 46: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/46.jpg)
ChemSpider Web Services
![Page 47: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/47.jpg)
ChemSpider via web service access For structure identification for mass
spectrometry For name and structure resolution For structure and substructure searching For an “innovative medicines initiative”
semantic web project…
Open APIs for Science
![Page 48: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/48.jpg)
Open PHACTS Project Develop a set of robust standards Integrate Chemistry and Biology data by implementing
the standards in a semantic integration hub Deliver services to support drug discovery programs in
pharma and public domain INITIALLY 22 partners, 8 pharmaceutical companies, 3
biotechs 36 months project – first public release version is
imminentGuiding principle is open access, open usage, open source
- Key to standards adoption -
![Page 49: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/49.jpg)
Using RDF permalinks http://www.chemspider.com/Chemical-Structure
.7787.rdf
Using a Search Term http://www.chemspider.com/rdf.ashx?q=cyclohe
xane http://rdf.chemspider.com/cyclohexane
RDF and the semantic web
![Page 50: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/50.jpg)
RDF and the semantic web
![Page 51: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/51.jpg)
www.SpectralGame.comhttp://www.jcheminf.com/content/1/1/9
![Page 52: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/52.jpg)
Times have changed Immediacy of social networks Commenting on articles/data is here The “participating scientist” has high
profile And who can be a scientist now???
The World of Contribution
![Page 53: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/53.jpg)
A Ten Year Old Scientist
![Page 54: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/54.jpg)
![Page 55: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/55.jpg)
Challenging a Publication
![Page 56: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/56.jpg)
![Page 57: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/57.jpg)
Oops…
![Page 58: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/58.jpg)
>2 Years to Resolution
![Page 59: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/59.jpg)
What of Hexacyclinol?
![Page 60: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/60.jpg)
The Blogosphere “Discusses”…
![Page 61: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/61.jpg)
Oxidation by Sodium Hydride?
![Page 62: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/62.jpg)
The Blogosphere Analyzes…
![Page 63: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/63.jpg)
The Blogosphere Analyzes…
![Page 64: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/64.jpg)
How much is in the archives?
![Page 65: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/65.jpg)
Open Notebook Science Analysis
![Page 66: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/66.jpg)
Motivation Faster Science, Better
Science
![Page 67: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/67.jpg)
Openness – Still Carries Licensing
Openness may be hard..
Open Access flavors Open Source licenses Open Data licenses Open Notebook
Science
![Page 68: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/68.jpg)
License data based on GOALS: scientific, commercial, or mixed
Explore the benefits of open licensing and drawbacks of enclosure
Provide simple explanations terms of use If you can't make the data public domain,
make the metadata public domain.
We SuggestRules for Licensing Data
![Page 69: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/69.jpg)
We SuggestRules for Licensing Data
![Page 70: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/70.jpg)
Challenged in the Twittersphere
![Page 71: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/71.jpg)
Annotating Articles Today…
![Page 72: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/72.jpg)
Attribution to me…
![Page 73: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/73.jpg)
Other Publications to Annotate…
![Page 74: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/74.jpg)
Other Publications to Annotate…
![Page 75: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/75.jpg)
Publications to Annotate…
“We then established a collaboration with professor Sum Ting Wong, a fugitive from the North Korean University Hu Yu Hai Ding”
“..identified as the new protein Wai So Dim”
![Page 76: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/76.jpg)
A New World for Publishing?
![Page 77: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/77.jpg)
An Adventure into the World of Small
but significant contribution..
![Page 78: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/78.jpg)
ChemSpider SyntheticPages
![Page 79: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/79.jpg)
Micropublishing with Peer Review
(a chemical synthesis blog?)
![Page 80: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/80.jpg)
Multi-Step Synthesis
![Page 81: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/81.jpg)
Interactive Data
![Page 82: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/82.jpg)
A New Route for Scientific Recognition?
![Page 83: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/83.jpg)
How do “we” measure a scientist? The funding bodies, department heads etc. use
Publication profile Impact factors An index – h, m, g, i10, c, s … Grants brought in
Scientists are notable in different ways – technology can help measure different types of “impact”
The Measure of a Scientist?
![Page 84: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/84.jpg)
What makes a Scientist Notable?
![Page 85: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/85.jpg)
Online tools track activities of scientistsSome are totally opt-in, an increasing
number are about you and need checking!Take responsibility for your profile online Actively BUILD your online profile
Public Profiles of Scientists
![Page 86: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/86.jpg)
Microsoft Academic Search
![Page 87: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/87.jpg)
My Academic Search Profile
![Page 88: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/88.jpg)
My Co-author Graph
![Page 89: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/89.jpg)
How many times do you see errors where: 1) You have not been able to annotate
or curate 2) You have chosen not to annotate or
curate
Q: How Often Do You Contribute?
Annotation and Validation
![Page 90: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/90.jpg)
My Co-author Graph
![Page 91: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/91.jpg)
Contribute when you can!
![Page 92: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/92.jpg)
Contribute when you can!
![Page 93: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/93.jpg)
![Page 94: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/94.jpg)
Scientists and Orcids?
![Page 95: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/95.jpg)
A unique identifier for a scientist – a Scientists InChI !
Will enable aggregation of a scientists activities
ORCIDs associated with publications, data, blog comments, other contributions (Wikipedia, reviews etc.) will be a way to measure their impact
![Page 96: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/96.jpg)
The Alt-Metrics Manifesto
http://altmetrics.org/manifesto/
![Page 97: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/97.jpg)
ImpactStory
![Page 98: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/98.jpg)
ImpactStory
![Page 99: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/99.jpg)
SlideShare
![Page 100: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/100.jpg)
SlideShare via ImpactStory
![Page 101: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/101.jpg)
ImpactStory
![Page 102: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/102.jpg)
Where do I contribute? How might I be measured?
![Page 103: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/103.jpg)
Article Level Metrics
![Page 104: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/104.jpg)
Article Level Metrics
![Page 105: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/105.jpg)
Impact will be an aggregate measure of Publications – classic measures and article level metrics Data, algorithms and code – and its distribution and
reuse Contributions as comments, annotation and curation
activities
New “impact factors” will develop with time
New Measures of Impact
![Page 106: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/106.jpg)
Some challenges are technology based The growth in data – storage and compute speed Ontologies, dictionaries and trusted sources
Many challenges are “about us” Licenses and rights Rewards and recognition Participation, contribution and collaboration
The Challenges
![Page 107: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/107.jpg)
There are many government institutions building public compound databases that should collaborate more: National Cancer Institute (NCI) National Institutes of Health (NIH) Environmental Protection Agency (EPA) Food and Drug Administration (FDA) National Library of Medicine (NLM)
Tear Down Walls between Government Labs
![Page 108: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/108.jpg)
![Page 109: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/109.jpg)
Release STRUCTURES Please!
![Page 110: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/110.jpg)
What Does the Future Hold?
![Page 111: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/111.jpg)
The Linked Network Will Grow
![Page 112: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/112.jpg)
The Data Deluge Will Not Go Away
![Page 113: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/113.jpg)
RSC Activities in Development
Deliver a Global Chemistry Hub “Data enable” the RSC archive back to 1841:
Extract chemistry – chemicals, reactions, experimental data points, complex data
Enrich the articles for interactive viewing and crowdsourced annotation and curation
Enhance queries possible across the archive
![Page 114: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/114.jpg)
Federated Data Segregation
![Page 115: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/115.jpg)
Future System Architecture
Input FilteringCuratio
nArchival
StorageIndexin
g
Processing
Search Bro
PresentationNo more complex
APIComplexity is hidden
InputInput
CurationCuratio
n
StorageStorageElastic,
distributedIndexing
Indexing
New algorithms
Processing
Processing
Distributed
SearchSearchOver
federated systems
ArchivalArchival
FilteringFilteringSmarter
algorithms
BrowseOver
federated systems
![Page 116: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/116.jpg)
Data Validation is Exacting Work
![Page 117: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/117.jpg)
“Challenge” the Community
![Page 118: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/118.jpg)
Chemistry is NOT just small molecules!Data in RSC publications will be “enabled” Data available for validation and curationThe delivery of the “Datument”Data will be fed to models for validation, to
retrain the models, full provenance retainedAlgorithms will be provided to the community
Chemistry Data at RSC
![Page 119: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/119.jpg)
Enhanced Mark-Up?
![Page 120: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/120.jpg)
An Error in my Abstract?
![Page 121: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/121.jpg)
An Error in my Abstract?
Chemists have embraced the web as a rich source of data and knowledge. However, all that glisters is not gold
![Page 122: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/122.jpg)
Thanks Shakespeare
![Page 123: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/123.jpg)
Acknowledgments
RSC and RSC|Cheminformatics team All data source providers, curators and
annotators All software providers: commercial and open
source Contributors, curators, collaborators
Trusted Advisors: Jean-Claude Bradley, Sean Ekins, Lee Harland, Gary Martin, Martin Walker and…
![Page 124: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/124.jpg)
Meet Valery…We’d love to chat…
![Page 125: The Possibilities and Pitfalls of Internet-Based Chemical Data](https://reader036.vdocuments.us/reader036/viewer/2022062418/554e86bfb4c90573338b477c/html5/thumbnails/125.jpg)
Thank you
Email: [email protected] Twitter: ChemConnectorPersonal Blog: www.chemconnector.com SLIDES: www.slideshare.net/AntonyWilliams