the roots: linked data and the foundations of successful agriculture data
TRANSCRIPT
![Page 1: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/1.jpg)
THE ROOTSLINKED DATA AND THE FOUNDATIONS OF SUCCESSFUL AGRICULUTURE DATA
Dr. Paul Groth | @pgroth | pgroth.com
Disruptive Technology Director
Elsevier Labs | @elsevierlabs
G20 Workshop Linked Open Data and Agriculture
September 27, 2017
![Page 2: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/2.jpg)
QUESTIONS FOR THIS WORKSHOP
1. How can Linked Open Data make a difference in agriculture?
2. What technical obstacles stand in the way?
3. What policies are needed to achieve the potential?
![Page 3: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/3.jpg)
![Page 4: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/4.jpg)
DATA IS CENTRAL IN PRECISION AGRICULTURE
Fig. 2 Precision agriculture
information flow in crop
production [after (19),
modified].
Robin Gebbers, and
Viacheslav I. Adamchuk
Science 2010;327:828-831
Published by AAAS
![Page 5: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/5.jpg)
THE DATA
SUPPLY CHAIN IN
AGRICULTURE
Sjaak Wolfert, Lan Ge, Cor Verdouw, Marc-Jeroen
Bogaardt, Big Data in Smart Farming – A review,
In Agricultural Systems, Volume 153, 2017, Pages
69-80, ISSN 0308-521X,
https://doi.org/10.1016/j.agsy.2017.01.023.
![Page 6: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/6.jpg)
WHERE LINKED
DATA CAN HELP
Sjaak Wolfert, Lan Ge, Cor Verdouw, Marc-Jeroen
Bogaardt, Big Data in Smart Farming – A review,
In Agricultural Systems, Volume 153, 2017, Pages
69-80, ISSN 0308-521X,
https://doi.org/10.1016/j.agsy.2017.01.023.
![Page 7: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/7.jpg)
STARTING FROM THE GROUND UP
![Page 8: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/8.jpg)
![Page 9: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/9.jpg)
FAIR EVERYWHERE
![Page 10: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/10.jpg)
![Page 11: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/11.jpg)
CREATING SUCCESSFUL DATA
![Page 12: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/12.jpg)
ENCOURAGING THE RESEARCHER
![Page 13: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/13.jpg)
![Page 14: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/14.jpg)
HOW DO RESEARCHERS SEARCH FOR DATA?
Gregory, K., Groth, P., Cousijn, H., Scharnhorst, A.,
& Wyatt, S. (2017). Searching Data: A Review of
Observational Data Retrieval Practices. arXiv
preprint arXiv:1707.06937.
Some observations from @gregory_km
survey:
1. The needs and behaviours of specific user groups
(e.g. early career researchers, policy makers,
students) are not well documented.
2. Background uses of observational data are better
documented than foreground uses.
3. Reconstructing data tables from journal articles,
using general search engines, and making direct data
requests are common.
![Page 15: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/15.jpg)
DATA SEARCH
Antony Scerri, John Kuriakose, Amit Ajit Deshmane, Mark Stanger, Peter Cotroneo, Rebekah Moore, Raj Naik, Anita de Waard;
Elsevier’s approach to the bioCADDIE 2016 Dataset Retrieval Challenge, Database, Volume 2017, 1 January 2017,
bax056, https://doi.org/10.1093/database/bax056
![Page 16: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/16.jpg)
ENABLING DATASET DISCOVERY
![Page 17: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/17.jpg)
![Page 18: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/18.jpg)
INTEROPERABILITY & INTEGRATION
![Page 19: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/19.jpg)
![Page 20: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/20.jpg)
MOVING UP THE STACK
![Page 21: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/21.jpg)
INTEGRATION
![Page 22: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/22.jpg)
INTEGRATION ACROSS DOMAINS
Entity
recognitionDictionaries
ConceptScan
IE Patterns
Grammar
Pattern
matching
Processing
PS Mammal
Protein interaction facts
ChemEffect ®
Drug Effects
DiseaseFxTM
Disease State
PS Plant
Interactions in PlantsCartridges
Pathway Studio technology overview
Internal
DocumentsSubscribed Titles*
116,125 full-
text article
Open Access
journals
286,867 abstracts Plant
Pathway
ChemEffect®
DiseaseFx™
Agrochemicals safety
MaizeRice
RiceRice proteins
and processes
MaizeProteins and
processes
Pathway Studio Plant Knowledgebase>778,99 mln unique relations supported by >576,083 references
§ Automatically curated MedScan data§ Compressed and purified by automatic curation
ü removes historical redundancy (>30%)
ü removes false positives (~5%)
§ Entity annotation§ Entrez Gene for proteins
§ Pubchem for chemicals
§ Aliases from MedScan dictionaries
§ Protein functional annotation from Gene Ontology
§ Ontologies§ Pathway Studio Ontology of intracellular signaling
§ Gene Ontology
§ Curated pathways § Cell signaling pathways
§ Metabolic pathways (AraCyc)
5
Before automatic
curator
After automatic
curator
![Page 23: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/23.jpg)
DATA SUSTAINABILITY
![Page 24: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/24.jpg)
THINGS TO THINK ABOUT
![Page 25: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/25.jpg)
ARE WE MISSING A USER?
![Page 26: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/26.jpg)
WHAT CAN MACHINE INTELLIGENCE DO TODAY?
If there’s a task that a normal person can do with
less than one second of thinking, there’s a very
good chance we can automate it with deep
learning.
Andrew Ng, Chief Scientist, Baidu (lecture at Bay Area Deep Learning
School, Stanford, CA, September 24, 2016)
![Page 27: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/27.jpg)
IMAGE RECOGNITION
https://devblogs.nvidia.com/parallelforall/author/czhang/
![Page 28: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/28.jpg)
ADVANCES ARE ENABLED BY MACHINE LEARNING
input
output
algorithm
input
output
model
learning
architecture
data
Programming
Machine learningGPU
CPU
CPU
![Page 29: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/29.jpg)
THESE RESULTS ARE DRIVEN BY DATA
“The paradigm shift of the ImageNet
thinking is that while a lot of people
are paying attention to models, let’s
pay attention to data, …”
– Prof. Fei-Fei Li [1]
[1] The data that transformed AI research—and possibly the world
https://qz.com/1034972/the-data-that-changed-the-direction-of-ai-research-and-
possibly-the-world/
![Page 30: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/30.jpg)
RAW DATA
From: Alain, G. and Bengio, Y. (2016). Understanding intermediate layers using linear classifier probes. arXiv:1610.01644v1.
![Page 31: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/31.jpg)
VOCABULARIES ARE SETS OF VECTOR EMBEDDINGS
From: Eisner, B., Rocktäschel, T., Augenstein, I., Bošnjak, M. and Riedel, S. (2016). Emoji2vec: learning emoji representations from their description. arXiv:1609.08359v1.
![Page 32: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/32.jpg)
MODELS AS REUSABLE COMPONENTS
Check out: sujitpal.blogspot.com for more
![Page 33: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/33.jpg)
LINKED DATA & MACHINE LEARNING
• Machines’ proficiency in learning to answer questions from text, audio,
images and video will depend on our ability to train them effectively to read
information from the Web
• How machines read the Web today
• Crawling and indexing Web resources, possibly semantically tagged
(e.g. using schema.org)
• Find-and-follow crawling of open linked data resources for ontology and
data sharing and reuse
• Programmatic access to APIs mediated through HTTP/S and other
Internet protocols
• Need to think about supporting ML oriented data
![Page 34: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/34.jpg)
PROVENANCE FOR DATA
Credits: Curt Tilmes, Peter Fox
Tilmes, C.; Fox, P.; Ma, X.; McGuinness, D.L.; Privette, A.P.; Smith, A.; Waple, A.; Zednik, S.; Zheng, J.G.,
"Provenance Representation for the National Climate Assessment in the Global Change Information System,"
Geoscience and Remote Sensing, IEEE Transactions on , vol.51, no.11, pp.5160,5168, Nov. 2013
![Page 35: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/35.jpg)
NATIONAL CLIMATE CHANGE ASSESSMENT
PROVENANCE
![Page 36: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/36.jpg)
FAIR TRADE + FAIR TRADE DATA?
Groth, Paul, "Transparency and Reliability in the Data Supply
Chain," Internet Computing, IEEE, vol.17, no.2, pp.69,71, March-
April 2013 doi: 10.1109/MIC.2013.41
![Page 37: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/37.jpg)
GOAL: SUCCESSFUL FAIR AGRICULTURE DATA
1. How can Linked Open Data make a
difference in agriculture?
2. What technical obstacles stand in the
way?
3. What policies are needed to achieve
the potential?
![Page 38: The Roots: Linked data and the foundations of successful Agriculture Data](https://reader031.vdocuments.us/reader031/viewer/2022030317/5a6700a27f8b9a68588b498d/html5/thumbnails/38.jpg)
THANK YOU
Dr. Paul Groth | @pgroth | pgroth.com
labs.elsevier.com