the i-adopt rda wg: boosting the i in fair through the ...€¦ · the i-adopt rda wg: boosting the...

25
The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi Buttigieg on behalf of Barbara Magagna and the i-ADOPT WG GoFAIR Workshop: “Semantic Interoperability of Metadata for Cross-Domain Research of the Future” Hamburg, November 11 2019

Upload: others

Post on 13-Aug-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

The I-ADOPT RDA WG: Boosting the I in FAIR through the

harmonization of observational data terminologies

Pier Luigi Buttigieg on behalf of Barbara Magagna

and the i-ADOPT WG

GoFAIR Workshop: “Semantic Interoperability of Metadata for Cross-Domain Research of the Future” Hamburg, November 11 2019

Page 2: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Aims of the task group

Core objective: enable and perpetuate testable, machine-centric interoperability of existing terminologies across the semantic gradient

◦ Develop best practices and an interoperability

framework for terminology resources pertinent to observable properties

◦ Test and ensure interoperation through annotation and multi-resource querying/mobilisation of research data

Semantics for biodiversity and ecosystem research, ICEI 2018, Jena

Page 3: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

The challenge: a case from ecology

Analysing ecological phenomena across

geographic, temporal, biological scales

requires a variety of disparate observational data sets

Observational data are often represented in tabular form but differ in:

The number of attributes

the relationships implied or asserted between attributes

the coding conventions used for representing information within data sets

Semantics for biodiversity and ecosystem research, ICEI 2018, Jena

Page 4: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Collected observational data

RDA P11 - March 2018 - VSSIG Harmonize the conceptualization of observation types

Page 5: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Shortcomings of existing schemas

A number of different and incompatible semantic schemas for describing research data exist

Key axes of differentiation

Degree of expressivity along the complexity of properties

Domain-specificity

Attempt to capture the value of attributes

Specification of units

Most schemas conflate attributes along disciplinary conventions and thus are not suitable to describe complex properties to machine agents

Semantics for biodiversity and ecosystem research, ICEI 2018, Jena

Page 6: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Some different models/schemas

Observations and Measurements Complex Property Model SOSA/SSN Ontologies SVO (Scientific Variable Ontology) CF conventions OBOÉ SERONTO EFO OBO Foundry conventions…

Page 7: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Different terminologies used

SDN-Voc ENVO BIPM IUPAC EnvThes CHEBI OM SWEET QUDT SDMX/DDI WORMS ITIS ..

Page 8: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Steps towards true interoperation: unity in diversity? (PS: it’s not the user’s problem)

Page 9: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Define observable property…

Property/quality of an object of interest

Any quantifiable or qualifiable characterstic of an object or subject of reserach or monitorng of a given „feature of interest“

Biological, Chemical, Physical, Administrative

„Observations“, „Traits“, „Variables“, „Parameters“, „Measurements“,…

Page 10: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Define observable property…

Continued….

Observed directly or by proxy (modelling/calibration) ◦ Eg. chlorophyll-a fluorescence > chlorophyll-a

concentration and productivity > biomass of photosynthetic material and primary production

Field observations, Laboratory experiments, Remote sensing, Modelling

Object of interests: Specimen, Populations, Samples, entire Environments

Page 11: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Complex observable properties

Feature of Interest

Procedure

Monitored Property

Monthly mean dissolved lead (ppb) in water taken from the river Thames by

autonomous sampling

RDA P11 - March 2018 - VSSIG Harmonize the conceptualization of observation types

Observable Property

Unit

Page 12: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Complex observable properties

Monitored Property

Monthly mean dissolved lead (ppb) in water taken from the river Thames by

autonomous sampling

Concentration of lead dissolved in water

(ppb)

RDA P11 - March 2018 - VSSIG Harmonize the conceptualization of observation types

Feature of Interest Procedure

Observable Property

River Thames

Sampling & averaging...

Page 13: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Atomisation of complex properties

Monitored Property

Monthly mean dissolved lead (ppb) in water taken from the river Thames by

autonomous sampling

(ppb)

RDA P11 - March 2018 - VSSIG Harmonize the conceptualization of observation types

Feature of Interest

Observable Property

ATOMISATION

Procedure

Concentration of lead dissolved in water

River Thames

Sampling & averaging...

Page 14: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Monitored Property

Monthly mean dissolved lead (ppb) in water taken from the river Thames by

autonomous sampling

concentration

monthly mean

lead dissolved ppb water body

? ? ? unit ? ?

RDA P11 - March 2018 - VSSIG Harmonize the conceptualization of observation types

Procedure

Observable Property

ATOMISATION

Atomisation of complex properties

Concentration of lead dissolved in water

River Thames

Sampling & averaging...

Feature of Interest

Page 15: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

BODC PUV P01 Semantic model

concentration

monthly mean

lead dissolved ppb water body

Property Statistical property

Object of Interest

Matrix Matrix phase

Units

http://vocab.nerc.ac.uk/collection/P06/current/UPPB/

http://vocab.nerc.ac.uk/collection/S06/current/S0600045/

http://vocab.nerc.ac.uk/collection/S21/current/S21S027/

http://vocab.nerc.ac.uk/collection/S07/current/S0700016/

http://vocab.nerc.ac.uk/collection/S27/current/CS002545/

http://vocab.nerc.ac.uk/collection/S23/current/S23C010/

http://purl.obolibrary.org/obo/CHEBI_27889

http://chem.sis.nlm.nih.gov/chemidplus/rn/7439-92-1

Page 16: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Parameter model OBOE used by AquaDiva/AnaEE

RDA P11 - March 2018 - VSSIG Harmonize the conceptualization of observation types

Page 17: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Parameter structure in PANGAEA RDA P11 - March 2018 - VSSIG Harmonize the conceptualization of observation types

Page 18: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

(very rough) OBO Library patterns

Page 19: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Chemicals

Environmental and ecological entities

Biological processes

Food and agriculture lite

AgrO

Links to SDG Agenda

OBO Foundry

Page 20: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Ontologies

Thesauri Glossaries

Controlled vocabularies

A rough illustration of the semantic gradient

Weaker semantics

Stronger semantics

Modified from McCreary D (2006) Patterns of Semantic Integration. CC 2.5

Taxonomies

Data models

Page 21: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

I-ADOPT: building and testing a common framework for persistent alignment across terminology resources

Page 22: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Tasks of the I-ADOPT WG

1. Collect user stories > Formalise into use cases

2. Collect observational data > annotation practices with used terminologies and representation strategies

3. Derive requirements from use cases

4. Check compliance of representation strategies with each requirement, analyse overlaps and gaps between them

5. Develop Interoperability Framework

6. Develop local Mapping Design Patterns

Page 23: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

Deliverables of I-ADOPT

D1 Catalogue of domain-specific terminologies (2020-01-31)

D2 Synthesis report on expressiveness of representation strategies (2020-12-31)

D3 Interoperability Framework for observable properties in environmental research (2021-01-31)

D4 Guidelines on best practices on the implementation and use of the framework (2021-04-30)

Page 24: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

I-ADOPT WG - status

Officially endorsed RDA Working Group

Kick-Off Oct 25 2019 in Helsinki ◦ November 2019 – April 2021 (18 months)

Around 60 members: 20 RIs/Initiatives (eLTER, NERC, LifeWatch, PANGAEA, OBO Foundry, GoFAIR,..)

Official I-ADOPT WG Site

Github repo for I-ADOPT activities

Page 25: The I-ADOPT RDA WG: Boosting the I in FAIR through the ...€¦ · The I-ADOPT RDA WG: Boosting the I in FAIR through the harmonization of observational data terminologies Pier Luigi

The I-ADOPT RDA WG: Boosting the I in FAIR through the

harmonization of observational data terminologies

Pier Luigi Buttigieg on behalf of Barbara Magagna

and the i-ADOPT WG

GoFAIR Workshop: “Semantic Interoperability of Metadata for Cross-Domain Research of the Future” Hamburg, November 11 2019