data discovery and data processing for environmental research infrastructures

17
Data discovery and data processing for environmental research infrastructures Roberto Cossu ENVRI WP4 leader ESA

Upload: ginata

Post on 23-Feb-2016

31 views

Category:

Documents


2 download

DESCRIPTION

Data discovery and data processing for environmental research infrastructures. Roberto Cossu ENVRI WP4 leader ESA. Outline. The communities and the data in the project Discover the data Process the data Linked data. Environmental Science. Earth as a single complex and coupled system. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Data discovery and data processing for environmental research infrastructures

Data discovery and data processing for environmental research infrastructures

Roberto CossuENVRI WP4 leaderESA

Page 2: Data discovery and data processing for environmental research infrastructures

Outline

1. The communities and the data in the project2. Discover the data3. Process the data4. Linked data

2

Page 3: Data discovery and data processing for environmental research infrastructures

Environmental Science

oceanic and atmospheric

processes

long-term development of

the climate system

Biological processes

biodiversity

development of the cryosphere and lithosphere

3

Earth as a single complex and coupled system

Page 4: Data discovery and data processing for environmental research infrastructures

Goal

Enable multidisciplinary scientists to access and study data from multiple domains for “system level” research

by providing solutions and guidelines for the RIs common needs

Multiple data producersMultiple data consumers

4

Page 5: Data discovery and data processing for environmental research infrastructures

ESFRI Environmental Research Infrastructures

• Tropospheric research aircraft

COPAL

• Upgrade of incoherent SCATter facility

EISCAT-3D

• Multidisciplinary seafloor observatory

EMSO

• Plate observing system

EPOS

• Global ocean observing infrastructure

EURO-ARGO

• Aircraft for global observing system

IAGOS

• Integrated carbon observation system

ICOS

• Biodiversity and ecosystem research infra

LIFEWATCH

• Svalbard arctic Earth observing system

SIOS

5

Page 6: Data discovery and data processing for environmental research infrastructures

Distributed measurements and monitoring• physical, chemical and biological parameters

Laboratories and experimental facilities• in fixed monitoring stations• on research vehicles, ships, floats and buoys• from aircraft and satellites

A variety of data• heterogeneous in format• primary and processed data

Analytical and modeling platforms• data exchange and integration• high performance computing and Grid services• e-Laboratories

Discover heterogeneous data at different places and in different catalogues.

Page 7: Data discovery and data processing for environmental research infrastructures

First steps - priority areas

Integrated data discovery across various centres / catalogues

(near) Real-time data handling

Federation over existing (national or international) infrastructures / services

7

Page 8: Data discovery and data processing for environmental research infrastructures

8

Approach

discover data which are heterogeneous in format, content, and metadata description

harmonise, integrate and analyse data across domains and RIs Pr

omot

e Ac

cess

ibili

tyPreserve Specificity

PROVIDE SOFTWARE TOOLS TO

Page 9: Data discovery and data processing for environmental research infrastructures

Study cases

They are needed for:Identify dataTune/evolve “basic” services, e.g., discovery, accessDevelop “more complex” services , e.g, visualizationIntegrate Processing services (availability of SW)

Two regions:The iceland Volcano:

ICOS, EISCAT, EuroArgo, satellite images, ( + DLR/”IAGOS-like”)South Italy:

Lifewatch, EPOS, EMSO, EuroArgo, Italian ISPRA environmental dataEMSO, EPOS

Page 10: Data discovery and data processing for environmental research infrastructures

Dataset Discovery

Set the bounding box as desired

Insert Start Date and Stop Date

Insert the text string and set the specific parameters

Click on Search to start the query

Collection of data corresponding to the search criteria are listed here.

Page 11: Data discovery and data processing for environmental research infrastructures

Interferograms computed from data (either on demand computation or discovery

of previously generated products)

In- Situ data

Satellite data

Query of heterogeneous data based on geo-spatial and

temporal criteria defined by the user

Data discovery example

Page 12: Data discovery and data processing for environmental research infrastructures

Study case:Iceland volcanic ash (2010)

12

In situ data from ICOS Demo Atmospheric Network

Measures from airborne sensor (DLR-IAGOS)

Envisat Sciamachy atmospheric data

Page 13: Data discovery and data processing for environmental research infrastructures

Discovery Service: OpenSearch

The discovery services is based on GENESI-DEC approach. The catalogues of the different repositories expose an OpenSearch-based interface by which data can be discovered and accessed through external applicationsOpenSearch is a collection of technologies allowing websites and search engines to publish search results in a standard and accessible format Search engines are described through OpenSearch Description Documents

Page 14: Data discovery and data processing for environmental research infrastructures

Full ENVRI workflow forgeospatial Data Services

Geospatial Repositories

Data Discovery

Data Access Data Process

OGCOpenSearch

Linked Open DataCatalogue Services

OGCWCS

THREDDS

OGCWPS

WPS 52N

P1 P2 P..

WPS Hadoop

Hadoop Cluster

HDFS

Data Pub. /Vis.OGC

WMS, WFS

GeoServer

gCub

e Da

ta

stag

ing

by courtesy of P. Pagano (ISTI-CNR)

Page 15: Data discovery and data processing for environmental research infrastructures

Linked data

DATASETOBSERVATIONSMETADATA

(parameter, unit of measure,instrument, provider, ...)

DIMENSIONS(time, lat/long, elevation)

Page 16: Data discovery and data processing for environmental research infrastructures

Linked Data

Modelling ENVRI data with the Data Cube vocabulary The Data Cube vocabulary provides a generic

framework to encode collections of observations. This vocabulary was developed for the statistical

domain and based on the SDMX standard

Analyze Model Publish Use

Page 17: Data discovery and data processing for environmental research infrastructures

Thank you

http://envri.eu/

17