bid ce workshop 1 - session 11 - basic concepts about biodiversity data quality

38
August 24th, 2016 THE BID PROGRAMME IS FUNDED BY THE EUROPEAN UNION Data publishing concepts and introduction to the IPT Nicolas Noé

Upload: alberto-gonzalez-talavan

Post on 19-Feb-2017

291 views

Category:

Education


0 download

TRANSCRIPT

Page 1: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

August 24th, 2016THE BID PROGRAMME IS FUNDED BY THE EUROPEAN UNION

Data publishing concepts and introduction to the IPT

Nicolas Noé

Page 2: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

August 24th, 2016THE BID PROGRAMME IS FUNDED BY THE EUROPEAN UNION

Publication de données: concepts et introduction à l’IPT Nicolas Noé

Page 3: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Publishing

“Publishing” refers to making biodiversity datasets publicly accessible and discoverable, in a standardized form, via an access point, typically a web address (a URL).

Pub

lishi

ng

Page 4: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Publication

“La publication a pour but de rendre un jeu de données de biodiversité accessible publiquement et découvrables, dans un format standardisé via un point d’accès, typiquement un adresse (URL).”

Pub

licat

ion

Page 5: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Pub

lishi

ng Publishing

Page 6: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Pub

licat

ion

Publication

Page 7: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Classes of dataset: occurrencesD

atas

et c

lass

es

Digital text or multimedia data record detailing facts about the instance of occurrence of an organism, i.e. on the what, where, when, how and by whom of the occurrence and the recording.

Page 8: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Type de jeu de donnée: occurrencesTy

pe d

e je

u de

don

nées

Texte numérisé ou données multimédia détaillants des faits sur l’occurrence d’un organisme: le quoi, où, quand, comment et par qui de l’occurrence et de son enregistrement.

Page 9: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Classes of dataset: checklist

“A catalogue or list of named organisms, or taxa.”

Possibly also: vernacular names, literature, relationships, ...

Typically categorize information along taxonomic, geographic, and thematic lines, or some combination of the three.

Dat

aset

cla

sses

Page 10: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Type de jeu de données: checklist / liste d’espèce

“Un catalogue, ou une liste de taxons.”

Éventuellement aussi: noms vernaculaires, citations, ...

Classent généralement l’information par taxonomie, géographie et statut ou en combinant les trois.Ty

pe d

e je

u de

don

nées

Page 11: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Classes of dataset: sampling-event

Datasets sometimes provide greater detail, not only offering evidence that a species occurred at a given location and date, but also making it possible to assess community composition for broader taxonomic groups or even the abundance of species at multiple times and places. These datasets typically derive from standard protocols for measuring and monitoring biodiversity like vegetation transects, bird censuses and freshwater or marine sampling. By indicating the methods, events and relative abundance of species recorded in a sample, these datasets improve comparisons with data collected using the same protocols at different times and places—in some cases, even leading researchers to infer the absence of particular species from particular sites.

Dat

aset

cla

sses

Page 12: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Type de jeu de données: données d’échantillonage

Parfois, les ensembles de données fournissent de plus amples détails, mettant en évidence non seulement l’enregistrement d’une espèce à un endroit et une date donnée, mais également la possibilité d’évaluer la composition des communautés de groupes taxonomiques plus larges ou même l’abondance des espèces en plusieurs moments et lieux. Ces ensembles de données proviennent généralement des protocoles standards développés pour mesurer et suivre la biodiversité comme les transects, les comptages d’oiseaux ou les prélèvements d’eau de mer ou d’eau douce. En indiquant, lors d’un échantillonnage, les méthodes, événements et l’abondance relative des espèces enregistrées, ces ensembles de données améliorent les comparaisons pouvant être faîtes avec des données collectées en utilisant les mêmes protocoles à différents endroits et moments - dans certains cas, cela permet aux chercheurs d’en déduire l’absence d’espèces particulières sur des sites spécifiques

Type

de

jeu

de d

onné

es

Page 13: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Classes of dataset: metadata-only

● “Data about data”● Always mandatory● Very important to assess

fitness for use

Dat

aset

cla

sses

Page 14: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Type de jeu de données: métadonnées uniquement

● “Données sur les données”

● Toujours obligatoire● Crucial pour l’adéquation

à l’usage

Type

de

jeu

de d

onné

es

Page 15: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Licenses

●Everything at GBIF now has a licenses●Choice between:

• Public domain: CC0• Creative Commons Attribution: CC-BY• Creative Commons Attribution Non Commercial: CC-

BY-NC

Lice

nses

Page 16: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Licences

●Toutes les données GBIF on maintenant une licence claire

●Choix possibles:• Domaine public: CC0• Creative Commons Attribution: CC-BY• Creative Commons Attribution Non Commercial: CC-

BY-NC

Lice

nces

Page 17: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Darwin Core, Simple Darwin Core and Darwin Core Archive

Darwin Core: a list of terms

Dar

win

cor

e

Page 18: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Darwin Core, Simple Darwin Core et Darwin Core Archive

Darwin Core: une liste de termes

Dar

win

cor

e

Page 19: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Darwin Core, Simple Darwin Core and Darwin Core Archive

Simple Darwin Core: Darwin Core expressed in a simple table structure.

Dar

win

cor

e

Page 20: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Darwin Core, Simple Darwin Core et Darwin Core Archive

Simple Darwin Core: Darwin Core exprimé sous forme de structure tabulaire simple

Dar

win

cor

e

Page 21: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Darwin Core, Simple Darwin Core and Darwin Core Archive

Darwin Core Archive: more complex format, allows extensions.E

xten

sion

s

Page 22: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Darwin Core, Simple Darwin Core et Darwin Core Archive

Darwin Core Archive: un format plus avancé, qui permet l’usage d’extensions.E

xten

sion

s

Page 23: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Data Publishing methodP

ublis

hing

Page 24: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Méthodes de publication de donnéesP

ublic

atio

n

Page 25: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Data Publishing method: IPT

●Server-side software, needs a stable connection●One IPT can host many datasets, on behalf of

several institutions, while giving proper credit ●Main (but not only) publishing tool for GBIF●Test mode and production mode●Multilingual

IPT

Page 26: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Méthode de publication: IPT

●Logiciel serveur, nécessite une connexion stable●Un IPT peut héberger plusieurs datasets, pour

plusieurs institutions, et toujours donner crédit/attribution

●Modes test et production

IPT

Page 27: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

STEP 1: Get access and log in an IPT instance

Page 28: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Etape 1: Accédez à l’IPT et connectez-vous !

Page 29: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

STEP 2: Create a new resource

Page 30: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Etape 2: Création d’une nouvelle ressource

Page 31: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

STEP 3: Get familiar with the main resource configuration page

Page 32: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Etape 3: Familiarisez-vous avec la page de configuration de la resource

Page 33: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

STEP 4: Author metadata

Page 34: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Etape 4: Rédigez les métadonnées

Page 35: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

STEP 5: Publish, make visible and register the dataset

Page 36: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

Etape 5: Publiez, rendez visibles et enregistrez l’IPT

Page 37: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

August 24th, 2016THE BID PROGRAMME IS FUNDED BY THE EUROPEAN UNION

Data publishing concepts and introduction to the IPT

Nicolas Noé

Page 38: BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality

August 24th, 2016THE BID PROGRAMME IS FUNDED BY THE EUROPEAN UNION

Publication de données: concepts et introduction à l’IPT

Nicolas Noé