the roots - startseite€¦ · emoji2vec: learning emoji representations from their description....
TRANSCRIPT
![Page 1: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/1.jpg)
THE ROOTSLINKED DATA AND THE FOUNDATIONS OF SUCCESSFUL AGRICULUTURE DATA
Dr. Paul Groth | @pgroth | pgroth.com
Disruptive Technology Director
Elsevier Labs | @elsevierlabs
G20 Workshop Linked Open Data and Agriculture
September 27, 2017
![Page 2: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/2.jpg)
QUESTIONS FOR THIS WORKSHOP
1. How can Linked Open Data make a difference in agriculture?
2. What technical obstacles stand in the way?
3. What policies are needed to achieve the potential?
![Page 3: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/3.jpg)
![Page 4: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/4.jpg)
DATA IS CENTRAL IN PRECISION AGRICULTURE
Fig. 2 Precision agriculture
information flow in crop
production [after (19),
modified].
Robin Gebbers, and
Viacheslav I. Adamchuk
Science 2010;327:828-831
Published by AAAS
![Page 5: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/5.jpg)
THE DATA
SUPPLY CHAIN IN
AGRICULTURE
Sjaak Wolfert, Lan Ge, Cor Verdouw, Marc-Jeroen
Bogaardt, Big Data in Smart Farming – A review,
In Agricultural Systems, Volume 153, 2017, Pages
69-80, ISSN 0308-521X,
https://doi.org/10.1016/j.agsy.2017.01.023.
![Page 6: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/6.jpg)
WHERE LINKED
DATA CAN HELP
Sjaak Wolfert, Lan Ge, Cor Verdouw, Marc-Jeroen
Bogaardt, Big Data in Smart Farming – A review,
In Agricultural Systems, Volume 153, 2017, Pages
69-80, ISSN 0308-521X,
https://doi.org/10.1016/j.agsy.2017.01.023.
![Page 7: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/7.jpg)
STARTING FROM THE GROUND UP
![Page 8: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/8.jpg)
![Page 9: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/9.jpg)
FAIR EVERYWHERE
![Page 10: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/10.jpg)
![Page 11: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/11.jpg)
CREATING SUCCESSFUL DATA
![Page 12: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/12.jpg)
ENCOURAGING THE RESEARCHER
![Page 13: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/13.jpg)
![Page 14: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/14.jpg)
HOW DO RESEARCHERS SEARCH FOR DATA?
Gregory, K., Groth, P., Cousijn, H., Scharnhorst, A.,
& Wyatt, S. (2017). Searching Data: A Review of
Observational Data Retrieval Practices. arXiv
preprint arXiv:1707.06937.
Some observations from @gregory_km
survey:
1. The needs and behaviours of specific user groups
(e.g. early career researchers, policy makers,
students) are not well documented.
2. Background uses of observational data are better
documented than foreground uses.
3. Reconstructing data tables from journal articles,
using general search engines, and making direct data
requests are common.
![Page 15: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/15.jpg)
DATA SEARCH
Antony Scerri, John Kuriakose, Amit Ajit Deshmane, Mark Stanger, Peter Cotroneo, Rebekah Moore, Raj Naik, Anita de Waard;
Elsevier’s approach to the bioCADDIE 2016 Dataset Retrieval Challenge, Database, Volume 2017, 1 January 2017,
bax056, https://doi.org/10.1093/database/bax056
![Page 16: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/16.jpg)
ENABLING DATASET DISCOVERY
![Page 17: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/17.jpg)
![Page 18: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/18.jpg)
INTEROPERABILITY & INTEGRATION
![Page 19: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/19.jpg)
![Page 20: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/20.jpg)
MOVING UP THE STACK
![Page 21: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/21.jpg)
INTEGRATION
![Page 22: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/22.jpg)
INTEGRATION ACROSS DOMAINS
Entity
recognitionDictionaries
ConceptScan
IE Patterns
Grammar
Pattern
matching
Processing
PS Mammal
Protein interaction facts
ChemEffect ®
Drug Effects
DiseaseFxTM
Disease State
PS Plant
Interactions in PlantsCartridges
Pathway Studio technology overview
Internal
DocumentsSubscribed Titles*
116,125 full-
text article
Open Access
journals
286,867 abstracts Plant
Pathway
ChemEffect®
DiseaseFx™
Agrochemicals safety
MaizeRice
RiceRice proteins
and processes
MaizeProteins and
processes
Pathway Studio Plant Knowledgebase>778,99 mln unique relations supported by >576,083 references
§ Automatically curated MedScan data§ Compressed and purified by automatic curation
ü removes historical redundancy (>30%)
ü removes false positives (~5%)
§ Entity annotation§ Entrez Gene for proteins
§ Pubchem for chemicals
§ Aliases from MedScan dictionaries
§ Protein functional annotation from Gene Ontology
§ Ontologies§ Pathway Studio Ontology of intracellular signaling
§ Gene Ontology
§ Curated pathways § Cell signaling pathways
§ Metabolic pathways (AraCyc)
5
Before automatic
curator
After automatic
curator
![Page 23: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/23.jpg)
DATA SUSTAINABILITY
![Page 24: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/24.jpg)
THINGS TO THINK ABOUT
![Page 25: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/25.jpg)
ARE WE MISSING A USER?
![Page 26: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/26.jpg)
WHAT CAN MACHINE INTELLIGENCE DO TODAY?
If there’s a task that a normal person can do with
less than one second of thinking, there’s a very
good chance we can automate it with deep
learning.
Andrew Ng, Chief Scientist, Baidu (lecture at Bay Area Deep Learning
School, Stanford, CA, September 24, 2016)
![Page 27: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/27.jpg)
IMAGE RECOGNITION
https://devblogs.nvidia.com/parallelforall/author/czhang/
![Page 28: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/28.jpg)
ADVANCES ARE ENABLED BY MACHINE LEARNING
input
output
algorithm
input
output
model
learning
architecture
data
Programming
Machine learningGPU
CPU
CPU
![Page 29: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/29.jpg)
THESE RESULTS ARE DRIVEN BY DATA
“The paradigm shift of the ImageNet
thinking is that while a lot of people
are paying attention to models, let’s
pay attention to data, …”
– Prof. Fei-Fei Li [1]
[1] The data that transformed AI research—and possibly the world
https://qz.com/1034972/the-data-that-changed-the-direction-of-ai-research-and-
possibly-the-world/
![Page 30: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/30.jpg)
RAW DATA
From: Alain, G. and Bengio, Y. (2016). Understanding intermediate layers using linear classifier probes. arXiv:1610.01644v1.
![Page 31: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/31.jpg)
VOCABULARIES ARE SETS OF VECTOR EMBEDDINGS
From: Eisner, B., Rocktäschel, T., Augenstein, I., Bošnjak, M. and Riedel, S. (2016). Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1.
![Page 32: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/32.jpg)
MODELS AS REUSABLE COMPONENTS
Check out: sujitpal.blogspot.com for more
![Page 33: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/33.jpg)
LINKED DATA & MACHINE LEARNING
• Machines’ proficiency in learning to answer questions from text, audio,
images and video will depend on our ability to train them effectively to read
information from the Web
• How machines read the Web today
• Crawling and indexing Web resources, possibly semantically tagged
(e.g. using schema.org)
• Find-and-follow crawling of open linked data resources for ontology and
data sharing and reuse
• Programmatic access to APIs mediated through HTTP/S and other
Internet protocols
• Need to think about supporting ML oriented data
![Page 34: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/34.jpg)
PROVENANCE FOR DATA
Credits: Curt Tilmes, Peter Fox
Tilmes, C.; Fox, P.; Ma, X.; McGuinness, D.L.; Privette, A.P.; Smith, A.; Waple, A.; Zednik, S.; Zheng, J.G.,
"Provenance Representation for the National Climate Assessment in the Global Change Information System,"
Geoscience and Remote Sensing, IEEE Transactions on , vol.51, no.11, pp.5160,5168, Nov. 2013
![Page 35: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/35.jpg)
NATIONAL CLIMATE CHANGE ASSESSMENT
PROVENANCE
![Page 36: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/36.jpg)
FAIR TRADE + FAIR TRADE DATA?
Groth, Paul, "Transparency and Reliability in the Data Supply
Chain," Internet Computing, IEEE, vol.17, no.2, pp.69,71, March-
April 2013 doi: 10.1109/MIC.2013.41
![Page 37: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/37.jpg)
GOAL: SUCCESSFUL FAIR AGRICULTURE DATA
1. How can Linked Open Data make a
difference in agriculture?
2. What technical obstacles stand in the
way?
3. What policies are needed to achieve
the potential?
![Page 38: THE ROOTS - Startseite€¦ · Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1. MODELS AS REUSABLE COMPONENTS Check out: sujitpal.blogspot.com](https://reader034.vdocuments.us/reader034/viewer/2022050220/5f6590d46fa41c1d252299ae/html5/thumbnails/38.jpg)
THANK YOU
Dr. Paul Groth | @pgroth | pgroth.com
labs.elsevier.com