data e-infrastructures (including the federation of open access repositories)
DESCRIPTION
data e-infrastructures (including the federation of Open Access Repositories). European Infrastructure for Science Governance European Parliament Brussels, 26 March 2013 Carlos Morais Pires, European Commission DG CONNECT, unit C1 carlos.morais-pires[@]ec.europa.eu @CarlosMPires. - PowerPoint PPT PresentationTRANSCRIPT
data e-infrastructures(including the federation of Open Access Repositories)
European Infrastructure for Science GovernanceEuropean Parliament
Brussels, 26 March 2013
Carlos Morais Pires, European CommissionDG CONNECT, unit C1
carlos.morais-pires[@]ec.europa.eu@CarlosMPires
Author’s views do not commit the European Commission
summary
• Data as Infrastructure: Europe is "Riding the Wave"
• Implementing Interoperable Data Infrastructure
• balancing community driven and service driven initiatives
• Data Infrastructure in FP7 (examples of project initiatives)
• Policy framework related with data infrastructures
• Main Messages
data as infrastructure: Europe is Riding the Wave
The High Level Expert Group on Scientific Data presented Riding the Wave in October 2010
Vision: "data e-infrastructure that supports seamless access, use, re-use, and trust of data. In a sense, the physical and technical infrastructure becomes invisible and the data themselves become the infrastructure a valuable asset on which science, technology, the economy and society can advance".
useful definitions
Data: digital recorded factual material commonly accepted in the scientific community as necessary to validate research findings
(not include lab notebooks, preliminary analysis, drafts of scientific papers, plans for future research, peer review reports, communication with peers, physical objects, lab specimens)
[c.f. White House Memo on "Increasing Access to the Results of Federally Funded Scientific Research"]
Data infrastructures: services, applications, tools, knowledge and policies for research data to be discoverable, understandable, accessible, preserved and curated… and available 24/7
implementing interoperable data infrastructure
(a)data generators; research projects, big research infrastructure, installations or medium size laboratories, simulation centres, surveys or individual researchers
(b)discipline-specific data service providers, providing data and workflows as a service
(c) providers of generic common data services (computing centres, libraries)
(d)researchers as users, using the data for science and engineering
community driven data infrastructure, including ESFRI, ESFRI clusters and others
network infrastructure, GÉANT
distributed computing/software infrastructure
scientific data infrastructure
data infrastructure:bridging islands
bridges
Science. Set Free.
Research results. Linked.
Open. Share. Re-use.
ScientificInformation Infrastructure
8
Data driven research across disciplinary and geographical boundariesRegister relevant data objects stored in certified repositoriesVirtually integrate data objects in trusted federations Foster advancements in interoperability of object contentFragmentation and heterogeneity of data require standardization vs. innovation dynamics
European Data Centers
Collaborative Data Infrastructure EUDAT Scenario
DESY
ARGO
MetaNet
INCF
HealtheChild
service-driven data e-infrastructures
community-driven data e-infrastructuresSCIDIP-ES (Earth Observation Long Term Data Preservation )
Adapted from a slide of Dr. Mirco Albani (ESA), project leader of SCIDIP-ES
community-driven data e-infrastructures
The Virtual Observatory concept is a bold community-led response to the challenges the astronomical community faces in data management and storage.
Impressive progress has been made and the momentum of the International Virtual Observatory Alliance will ensure sustained progress.
All ESFRI Life Sciences infrastructures, coordinated by EMBL
Interoperability across data sources and services
EU funding : 10.5 M€, started in 2012
EATRIS
EMBRC
EU-Openscreen
ECRIN
Euro-Bio-imagingBSL4
BIOBANKS-BBMRI
EBI-ELIXIR
INFRAFRONTIER
INSTRUCT
Implementation Cluster for Life Science BioMedBridg
es
Development of common reference model , standards, and common components for data pre-processing and post-processing
Contribution to GEOSS (Global Earth Observation System of Systems) and compliance with INSPIRE EC Directive
Large participation of ICT and e-infrastructures actors (key partners from D4SCIENCE, GENESI, EGI, EUDAT, PRACE…)
EPOS
EURO-ARGO
ICOS
SIOS
EMSOEISCAT
LIFE-WATCH
EU funding : 3.7 M€, started in 2011
Implementation cluster for Environment
ENVRI
Research Data Alliance:Common Infrastructure, Policy and PracticeDrives Data Sharing and Exchange throughout the Data Life Cycle
From Prof. Fran Berman and Prof. John Wood, Members of the RDA Council
Open Infrastructures for Open Science
COM and REC on Scientific Information
Adopted July 2012
Open Access to publications… and data
Long term preservation,
Capacity building with e-infrastructures
On-going discussions with Council and EP
Policy formulated Policy not formulated
Formulated, notimplementing
Formulated, im
plementing, no
outcomes
Formulated, implementing, with outcomes
Policy formulated Policy not formulated
Formulated, notimplementing
Formulated, im
plementing, no
outcomes
Formulated, implementing, with outcomes
Policy formulated
Policy not formulatedFormulated, not
implementing
Formulated, im
plementing, no outcom
es
Formulated, implementing,
with outcomes
Policy formulated Policy not formulatedFormulated, notimplementing
Formulated, im
plementing, no outcom
es
Formulated, implementing, with outcomes
National policy not formulated
National policy formulated but not implemented
National policy formulated, implemented but no outcomes yet
National policy formulated, implemented, and outcomes delivered
OA Publications OA Data Preservation eInfrastructures
Better Access to Scientific Informationslide from Neil Jacobs (JISC) presentation to the EC
consultation towards horizon2020
intelligent access to scientific information
We don't know how scholarly communication
will adapt to new paradigms bringing closer
human and machine readable information...
e-Infrastructure can help realising the
AAIR vision and opportunities for
innovation in publishing
publication + data + software
data e-Infrastructure: framework for action
1. community support data services
2. infrastructure for Open Access
3. store, managing and preserving research data
4. discovery and provenance of research data
5. towards global data e-infrastructures
6. Authentication and Authorisation e-infrastructures
7. skills and new professions for research data
main messages
• Research Data "is" an Infrastructure for modern science
• Data is generated and used by disciplinary communities
• Data is stored, moved and processed by common infrastructures
• Crossing disciplinary and geographic boundaries requires
• exploring the commonalities of data infrastructures
• implement global and interoperable data infrastructures
• Policies for Open Access remove, where possible, barriers to access and share data
• H2020 will make OA to publication the rule
• H2020 will start a pilot on OA to publicly funded research data
Carlos Morais Pires
Thank you!