the byte project
Post on 12-Apr-2017
Embed Size (px)
Big Data Externalities the BYTE Case StudiesRachel FinnTrilateral Research & Consulting, LLP
Big data roadmap and cross-disciplinarY community for addressing socieTal Externalities
European Data Economy Workshop15 September 2015
@BYTE_EUwww.byte-project.euProject details: BYTEBig data roadmap and cross-disciplinarY community for addressing socieTal Externalities (BYTE) project March 2014 Feb 2017; 36 months Funded by DG-CNCT: 2.25 million (Grant agreement no: 619551) 11 Partners 10 Countries
@BYTE_EUwww.byte-project.euObjectivesThe BYTE project has three main objectives:1. To produce a research and policy roadmap and recommendations to support European stakeholders in increasing their share of the big data market by 2020 and in capturing and addressing the positive and negative societal externalities associated with use of big data.2. To involve all of the European actors relevant to big data in order to identify concrete current and emerging problems to be addressed in the BYTE roadmap. The stakeholder engagement activities will lead to the creation of the Big Data Community, a sustainable platform from which to measure progress in meeting the challenges posed by societal externalities and identify new and emerging challenges.3. To disseminate the BYTE findings, recommendations and the existence of the BYTE Big Data Community to a larger population of stakeholders in order to encourage them to implement the BYTE guidelines and participate in the Big Data Community.
@BYTE_EUwww.byte-project.euCase studies: big data practitioners assist to identify externalities
Understanding externalitiesIn BYTE we consider the externalities or impacts of big data
Positive effects or benefits realised by a third partyNegative costs (or harm) that affects a third party
Externalities relate to social processes linked to big data, as well as the opportunities & risks that may arise as a result of the existence of the data.
Some effects may be unexpected or unintentional
@BYTE_EUwww.byte-project.euPositive externalities occur when a product, activity or decision by an actor causes positive effects or benefits realised by a third party resulting from a transaction in which they had no direct involvement. Negative externalities occur when a product, activity or decision by an actor causes costs (or harm) that is not entirely born by that actor but that affects a third party, e.g., citizens (Business Dictionary, 2014).externalities are related to processes (i.e., production, service, use) and not to the product itself. That is, it is not big data per se that causes a particular externality, but rather, it is the social processes employed via big data that can produce externalities. Furthermore, these externalities may result from the direct collection or processing of data (e.g., privacy infringements), as well as the opportunities and risks that may arise as a result of the existence of the data (e.g., linking data sets). In addition, as externalities may have unexpected effects on third parties, a central task in BYTE is the identification of the involved processes, their effects as well as the potential affected parties.5
Big data concerns: externalities
@BYTE_EUwww.byte-project.euBullet one how we define an externality as an impact
Public opinion surveys reveal that citizens are concerned about many of these issues, especially privacy and data protection.
Select horizontal findings
@BYTE_EUwww.byte-project.euCase study-specific findings: health
Big data in healthcare is quite well developed and widespread across a number of health areas. Genetic data use is maturing and focused on high-grade analytics and the discovery of rare genes and genetic disorders. The key improvements include timely and more accurate diagnosis, the development of personalised medicines, and drug and other treatments/ therapy development, which can save lives. Key innovations include the development of privacy protecting and secure databases for genetic data samples.However, there tends to be a reluctance by public sector initiatives to share data due to legal and ethical constraints.
So in our own consent we never say that data will be fully anonymous. We do everything in our power so that it is deposited in a anonymous fashion and  when we consent we are very careful in saying look its very unlikely that anyone is going to actively identify information about you (Program head, Clinical geneticist )
@BYTE_EUwww.byte-project.euGenerally, data utilisation in the healthcare sector is developed and widespread across a number of health areas, especially in terms of medical research and diagnostic testing that translates into improved, more specialised care for patients. Genetic data use is maturing and focused on high-grade analytics and the discovery of rare genes and genetic disorders. The key improvements include timely and more accurate diagnosis, the development of personalised medicines, and drug and other treatments/ therapy development, which can save lives Key innovations include the development of privacy protecting and secure databases for genetic data samples, which is vital given the highly sensitive nature of the personal data utilised; and new business models focused on big genetic data sequencing However, there tends to be a reluctance by public sector initiatives to share data on open databases or in collaborations with private organisations (big pharma etc.) due to legal/ ethical constraints (e.g. consent/ privacy), and public sector ethos (public good v. profit generation).
Case study-specific findings: crisis informatics
Crisis informatics is in the early stages of integrating big data. Currently, its primary focus is on integrating social media and geographical data. The key improvement is that the analysis of this data improves situational awareness more quickly after an event has occurred. A key innovation is the combination of human computing and machine computing, primarily through digital volunteers, to validate the data collected and determine how trustworthy it is. Stakeholders in this area are making progress in addressing privacy and data protection issues.Some evidence of reliance on US cloud and technology services.
And I have seen this on multiply occasions from  big private companies in this, theyll deal with their own huge amount of data and response to crisis and so on. But [then] become very unpredictable unsustainable outside of an emergency, do a good job of talking about what they do during a crisis but then sort of disappear in-between. (Programme manager, International Governmental Organisation)
@BYTE_EUwww.byte-project.euCrisis informatics is in the early stages of integrating big data into standard operations and is primarily focussed on integrating social media and geographical data (There has not yet been much progress integrating other data types e.g., environmental measurements, meteorological data, etc)The key improvement is that the analysis of this data improves situational awareness more quickly after an event has occurred. A key innovation is the use of human computing, primarily through digital volunteers, to validate the data collected and determine how trustworthy it is. Stakeholders in this area are making progress in addressing privacy and data protection issues, which are significant and complex, given their focus on data from social media sources.
BYTE project key outputs Define research efforts and policy measures necessary for responsible participation in the big data economy Vision for Big Data for Europe for 2020, incorporating externalitiesAmplify positive externalitiesDiminish negative ones RoadmapResearch RoadmapPolicy Roadmap Formation of a Big Data communityImplement the roadmapSustainability plan
@BYTE_EUwww.byte-project.euProduction of a roadmap outlining a plan of action to enable European scientists and industry to capture a proportionate share of the big data market.Provision of assistance to industry in capturing positive externalities (efficiencies, new business models, etc.) and addressing potential negative externalities before beginning a project, initiative or programme. A series of clear and precise future research needs and policy steps
Next eventValidating case study externalitiesDublin14th October 2015, 9am-5pmPresentations by:Sonja Zillner, SIEMENSBig Data in a Digital City
Knut Sebastian Tungland, StatoilBig data in the energy sector
Key contacts:Rachel Finn firstname.lastname@example.org Kush Wadhwa email@example.com