JPI Climate Symposium 22-23 October 2015 Vienna, Austria · Key BYTE outcomes • Vision. for Big Data for Europe for 2020, incorporating externalities • Amplify positive externalities
22
Open Data Perspectives from the RECODE and BYTE projects Lorenzo Bigagli, CNR-IIA JPI Climate Symposium 22-23 October 2015 Vienna, Austria
“Sharing of Data Leads to Progress on Alzheimer’s”, and Parkinson’s, and…
“No one would own the data. No one could submit patent applications, though private companies would ultimately profit from any drugs or imaging tests developed as a result of the effort” (NYT, 12 Aug. 2010)
Vorführender
Präsentationsnotizen
A new (2010) project led by the Michael J. Fox foundation looks to further develop biomarkers to help understand Parkinsons’s A key element of the project includes the immediate sharing of data The model was a similar initiative on Alzheimer’s, in 2003 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ http://www.nytimes.com/2010/08/13/health/research/13alzheimer.html?emc=eta1 Sharing of Data Leads to Progress on Alzheimer’s
Open Access – a dynamic period for policy & practice
• 2003 Directive 2003/98/EC: on the re-use of public sector information– Revised in June 2013
• 2013 The RDA was launched– It is planning its 6th plenary and currently has more than 2500 members
• April 2013 The RCUK open access policy came into effect• May 2013 White House launches open data policy• July 2013 EC Public Consultation on Open Research Data• December 2013 European Commission launched the Open Data
pilot in H2020• November 2014
– Dutch government announces open access to data as a key policy aim– Gates Foundation announces world’s strongest policy on open access research
Vorführender
Präsentationsnotizen
Open Access is not a new topic, has been an important part of EU policy conversation since 2002. Original directive on re-use of PSI was issued in late 2003. Two years ago, the Directive on PSI was amended, partly to address the exponential growth in the amount and types of data available.
H2020 Pilot on Open Access to Research Data
• Requirements– Detailed data management plan covering individual datasets
(within 6 months)– Deposit the research data, preferably into a research data
repository – Take measures to enable for third parties to access, mine,
exploit, reproduce and disseminate (free of charge for any user) this research data
• E.g., Creative Commons License (CC-BY, CC0)
– Provide information about tools and instruments at the disposal of the beneficiaries and necessary for validating the results
• E.g., specialized software, algorithms, analysis protocols. Where possible, they should provide the tools and instruments themselves
Vorführender
Präsentationsnotizen
Aims to improve and maximize access to and re-use of research data generated by projects. Scope covers 20% of H2020.
The RECODE Projectwww.recodeproject.eu
Policy RECommendations for Open access to research Data in Europe
Global Earth Observation System of Systems (GEOSS)
Archaeology
Open Context
Vorführender
Präsentationsnotizen
Objective - To use five case studies to examine each of these areas across disciplinary boundaries. In each case, we will look at how open access to research data “works” in these contexts (or doesn’t work). These case studies will provide a multidisciplinary platform from which RECODE can examine open access and data preservation grand challenges. The purpose of the disciplinary case studies is to recognise and work with the disciplinary fragmentation associated with open access to research data. Some disciplines have well-developed processes, practices and procedures for providing open access to data, data preservation and data re-use. Other disciplines’ practices are less developed and can learn from these early leaders. Furthermore, some organisations, sub-disciplines and even individual research teams are also more developed, and these may also provide good practice examples and lessons.
Grand challenges across case studies
Stakeholder values & inter-relationships
Infrastructure & technology
Legal and ethical issues
Institutional and policy issuesPh
ysic
s
Hea
lth
Bio
engi
neer
ing
Eart
h Sc
ienc
es
Arc
haeo
logy
Key accomplishments
• 5 stakeholder workshops• 168 workshop participants from 35 countries• 65 Interviews with academics, researchers, policy-makers,
data centres, legal experts, publishers and others• Specific recommendations for 4 stakeholder groups
– Funders,– Research institutions– Data managers– Publishers
• Book upcoming – Mobilising Data in a Knowledge Society
Vorführender
Präsentationsnotizen
Plus: 3 Practical Guides for developing policies for Research Funders, Research Institutions, Publishers Resources (policies, etc.) to expedite the process of policy development and implementation among stakeholders
The BYTE Projectwww.byte-project.eu
• Big data roadmap and cross-disciplinarY community for addressing socieTal Externalities (BYTE) project
• March 2014 – Feb 2017 (36 months)• Funded by DG-CNCT: €2.25 million
• Grant agreement no: 619551• 11 Partners• 10 Countries
Big data externality
• The effect of a decision by stakeholders (e.g., governments, industry, scientists, policy-makers) that have an impact on a third party (especially members of the public)
• May be positive or negative
Economic
•Boost to the economy
• Innovation• Increase efficiency• Smaller actors left
Bullet one – how we define an externality – as an “impact” Public opinion surveys reveal that citizens are concerned about many of these issues, especially privacy and data protection.
Case studies: a sectorial approach
Environment
Energy
Utilities / Smart Cities
Cultural Data
Health
Crisis Informatics
Transport
Capture externalities associated with big data practice to• Identify potential
benefits• Address potential
negative impacts
Vorführender
Präsentationsnotizen
All seven case studies focus on economic costs and benefits (new products and services, restructuring) technology and innovation impacts (improved service delivery) legal issues (privacy, data protection, intellectual property) ethical issues (potential for discrimination, public trust) and political issues (cross-border data flows)
Key BYTE outcomes
• Vision for Big Data for Europe for 2020, incorporating externalities• Amplify positive externalities• Diminish negative ones
• Roadmaps• Research Roadmap• Policy Roadmap
• Charter of a Big Data Community• Will implement the roadmap• Currently under definition, in synergy with the BDVA PPP
Vorführender
Präsentationsnotizen
Define research efforts and policy measures necessary for responsible participation in the big data economy Production of a roadmap outlining a plan of action to enable European scientists and industry to capture a proportionate share of the big data market Provision of assistance to industry in capturing positive externalities (efficiencies, new business models, etc.) and addressing potential negative externalities before beginning a project, initiative or programme A series of clear and precise future research needs and policy steps
D2.3: Open Data vs. Big Data
• Burgeoning relationship, great benefit to society – abundance of opportunities for Europe– threat from negative externalities
• Imbalance between the number of open access policies and initiatives by the public sector, compared with fewer in the private sector
Vorführender
Präsentationsnotizen
“Public vs. private” debate
Key RECODE findings
• Stakeholder Values and Ecosystems – Shared perception of OA of a public good, but conflicting value chains
• Costs of research data
• Infrastructure and Technology Challenges – Less of a concern compared to financial, cultural and legal ones
• Legal and Ethical Challenges– Strategies already adopted to address potential legal and ethical issues,
but stakeholders often subject to conflicted legal obligations– Not all data can be open
• Institutional Challenges– Data quality essential for reuse and long-term preservation– Need clear guidelines and mechanisms that contribute to evaluation,
such as data journals and peer review– Training and rewarding is paramount
BYTE: the Sentinel issue
• Dedicated satellite mission of the Copernicus (fka GMES) space componenet
• Policy jointly decided by EC and ESA– Free of charge – Anybody can access online
• No difference is made between public, commercial and scientific use and in between European or non-European
Vorführender
Präsentationsnotizen
Copernicus is the European Programme for the establishment of a European capacity for Earth Observation Landsat was opened in 2008
BYTE: the Sentinel issue
• Policy supported by a regulation– Both positive and negative implications
• Facilitates consistent implementation of open access in the environmental sector throughout the whole EU
• Not binding for foreign entities, external to the EU
• Non-European entities free access to Copernicus data and information (e.g. Google, China, India) may result in a competitive advantage over European industry– Draft amendment introducing a reciprocity principle– Possible violation of WTO GATS
Vorführender
Präsentationsnotizen
Copernicus is the European Programme for the establishment of a European capacity for Earth Observation Landsat data opened in 2008 On 27 September 2013, the Committee on Industry, Research and Energy submitted a draft proposal amendment on the Copernicus Regulation. In particular, the report submitted that Article 14 of the Copernicus Regulation should be amended as follows: Copernicus data and information shall be made available on a full, open and free-of- charge basis for all participating Member States, for emergency situations and for development aid purposes. In all other cases a policy of pay-for-data shall be adopted or a reciprocity principle shall be applied. However, the issue is very complicated and it is likely that both the industry and the ITRE proposal of potential restrictions in accessing and using Copernicus data and information for non-EU entities could lead to a violation of EU obligations and commitments under the WTO General Agreement on Trade in Services (GATS)
Overarching RECODE recommendation (1/2)
1. Develop aligned and comprehensive policies for open access to research data (aligned policies)
2. Ensure appropriate funding for open access to research data (funding)
3. Develop policies and initiatives that offer researchers rewards for open access to high quality data (rewards)
4. Identify key stakeholders and relevant networks and foster collaborative work for a sustainable ecosystem for open access to research data (collaboration)
5. Plan for the long-term, sustainable curation and preservation of open access data (long-term preservation)
Vorführender
Präsentationsnotizen
direct consensus-building and action towards ten broad areas that were identified by project research as significant in view of enabling open access to research data. Broad recommendations intended to be useful and accessible to both stakeholders with very developed open access policies that could be improved and stakeholders with less developed policies supplemented by more specific recommendations for each category of stakeholder. necessarily geared towards decision-making stakeholders But we encourage decision-makers to consult the research community 1. Develop aligned and comprehensive policies for open access to research data Funder, institutional and publisher policies setting open access to research data as the default practice are necessary in transitioning towards open science. Policies should be consistent with national priorities and aligned with the European framework for open access to research data (2012 Recommendation and Horizon 2020), while also complementing that for open government data. Provisions should be made for the necessary resources that will allow policy implementation. 2. Ensure appropriate funding for open access to research data Policies and mandates for open access will bring the expected results if accompanied by appropriate funds. Particular attention should be directed towards provisions for funding the development and long-term sustainability of necessary infrastructures; training of researchers, librarians and other technical staff; innovative actions. Develop policies and initiatives that offer researchers rewards for open access to high quality data Funder and institutional policies that offer researchers rewards for providing open access to high quality data are central in the transition towards open science. Official measures and processes need to be put in place to include the open sharing of research data in funding and professional advancement decisions. 4. Identify key stakeholders and relevant networks and foster collaborative work for a sustainable ecosystem for open access to research data The open access ecosystem comprises a diverse group of stakeholders with multiple and often overlapping functions. To be sustainable, collaboration is essential as it affords the gradual development of a coherent view among stakeholders, an agreement on their roles and responsibilities, the allocation of resources and alignment of stakeholder policies, while avoiding the duplication of effort and loss of resources, as well as capacity-building. 5. Plan for the long-term, sustainable curation and preservation of open access data Stakeholders should draw their attention specifically to the long-term availability of high-quality research data. A strategy for long-term, sustainable curation and preservation requires leveraging resources as well as developing appropriate services and infrastructure. In doing so, the use of collaborative models should be considered.
Overarching RECODE recommendation (2/2)
6. Develop comprehensive and collaborative technical and infrastructure solutions that afford open access to and long-term preservation of high-quality research data (infrastructure)
7. Develop technical and scientific quality standards for research data (quality standards)
8. Require the use of harmonized open licensing frameworks (licensing)
9. Systematically address legal and ethical issues arising from open access to research data (legal/ethical aspects)
10. Support the transition to open research data through curriculum-development and training (training/education)
Vorführender
Präsentationsnotizen
6. Develop comprehensive and collaborative technical and infrastructure solutions that afford open access to and long-term preservation of high-quality research data Existing infrastructures should be further collaboratively developed to address in a comprehensive way data harmonization, discovery and access, preservation, technological obsolescence, documentation and metadata, quality and relevance indicators and security issues, among others. Approaches should address the diverse disciplinary requirements and data variety, as well as metadata and data standardization. 7. Develop technical and scientific quality standards for research data Stakeholders should collaborate in developing shared quality standards that will ensure the proliferation of high-quality reusable research data. Consensus should be built on the technical quality standards of research data, as well as on their scientific quality in line with disciplinary practices and norms. Appropriate strategies should be developed for the evaluation of the scientific quality of data. 8. Require the use of harmonized open licensing frameworks Open licenses, like creative commons, describe the terms under which research data should be accessed, shared, and re-used. Their popularity is an indication of their utility and efficacy, yet further options for licensing should be examined, along with identifying mechanisms to enforce these licenses and developing new, interoperable licenses. 9. Systematically address legal and ethical issues arising from open access to research data Open access to research data raises important legal and ethical issues, which should be addressed systematically by stakeholders. This can be done through the institutionalization of processes, dedicated fora, training, the use of technological solutions (e.g. machine-readable licenses) and the systematic pursuit for new and more efficient solutions. 10. Support the transition to open research data through curriculum-development and training The transition to an open science paradigm where research data plays a significant role requires training and education for researchers and for data managers who support open science. Courses for getting researchers and data managers up-to date with current relevant issues are necessary, as well as the development of curricula that contribute towards the development of data science and information management as distinct and legitimate career paths.
Stay engaged
• EGU ESSI Division session on Open Access to Research Data and Public Sector Information: perspective, drivers, and barriers– Convener: Lorenzo Bigagli– Co-Conveners: José Miguel Rubio Iglesias, Berit Arheimer,
Stefano Nativi• European Geosciences Union General Assembly
Meeting 2016– Vienna, 17-22 April 2016
• Call for Abstract deadline– Sometimes in January 2016
Vorführender
Präsentationsnotizen
The session will offer a multi-disciplinary space for members interested in Open Access to research data and Public Sector Information. Topics include challenges related to infrastructure and technology, as well as legal, ethical, institutional and policy issues relating to sharing and providing Open Access to research data and PSI from a variety of sources, and in a variety of formats.