harnessing edge informatics to accelerate collaboration in biopharma (bio-it world 2016)

42
Tom Plasterer, PhD. Research & Development Information (RDI) Director, US Cross-Science Harnessing Edge Informatics to Accelerate Collaboration in BioPharma WINNER

Upload: tom-plasterer

Post on 14-Jan-2017

190 views

Category:

Health & Medicine


1 download

TRANSCRIPT

Page 1: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

Tom Plasterer, PhD.Research & Development Information (RDI)Director, US Cross-Science

Harnessing Edge Informatics to Accelerate Collaboration in

BioPharma

WINNER

Page 2: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

The US Cross-Science Team in Research and

Development Information (RDI) is a group of

informaticians, mathematicians, project/

program managers, developers, architects

dedicated to data science—data discovery, data

reuse, data harmonization, analytics and self-describing data

or Smart Data

We strive to create tangible digital and social

artefacts used to accelerate delivering

medicines to patients and improving their impact

once in the clinical setting.

These artefacts include web-based software,

community-driven data models, data sharing best

practices, data science communities of practice and strong advocacy of

Smart Data inside and out of AstraZeneca.

Page 3: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Sharing & Collaboration: Are you a Data Parasite?

Page 4: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Sharing & Collaboration: Are you a Data Parasite?

‘A second concern held by some is that a new class of research person will emerge — people who had nothing to do with the design and execution of the study but use another group’s data for their own ends,

possibly stealing from the research productivity planned by the data gatherers, or even use the data to try to disprove what

the original investigators had posited. There is concern among some front-line researchers that the system will be taken

over by what some researchers have characterized as “research parasites.”’

Dan Longo and Jeffrey Drazen, the deputy editor and editor-in-chief, NEJM

Page 5: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Sharing & Collaboration: Are you a Data Parasite?

‘A second concern held by some is that a new class of research person will emerge — people who had nothing to do with the design and execution of the study but use another group’s data for their own ends,

possibly stealing from the research productivity planned by the data gatherers, or even use the data to try to disprove what

the original investigators had posited. There is concern among some front-line researchers that the system will be taken

over by what some researchers have characterized as “research parasites.”’

Dan Longo and Jeffrey Drazen, the deputy editor and editor-in-chief, NEJM

Page 6: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Sharing & Collaboration: Are you a Data Parasite?

‘A second concern held by some is that a new class of research person will emerge — people who had nothing to do with the design and execution of the study but use another group’s data for their own ends,

possibly stealing from the research productivity planned by the data gatherers, or even use the data to try to disprove what

the original investigators had posited. There is concern among some front-line researchers that the system will be taken

over by what some researchers have characterized as “research parasites.”’

Dan Longo and Jeffrey Drazen, the deputy editor and editor-in-chief, NEJM

‘The condescension implicit in this statement is deeply

troubling. Drazen and Longo are saying, essentially, that only the people who originally collect a

data set can truly understand it, and anyone else who wants to

take a look is a parasite.’

Steven Salzberg, JHU

Page 7: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Sharing & Collaboration: Are you a Data Parasite?

‘A second concern held by some is that a new class of research person will emerge — people who had nothing to do with the design and execution of the study but use another group’s data for their own ends,

possibly stealing from the research productivity planned by the data gatherers, or even use the data to try to disprove what

the original investigators had posited. There is concern among some front-line researchers that the system will be taken

over by what some researchers have characterized as “research parasites.”’

Dan Longo and Jeffrey Drazen, the deputy editor and editor-in-chief, NEJM

‘The condescension implicit in this statement is deeply

troubling. Drazen and Longo are saying, essentially, that only the people who originally collect a

data set can truly understand it, and anyone else who wants to

take a look is a parasite.’

Steven Salzberg, JHU

Page 8: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Sharing & Collaboration: Are you a Data Parasite?

‘A second concern held by some is that a new class of research person will emerge — people who had nothing to do with the design and execution of the study but use another group’s data for their own ends,

possibly stealing from the research productivity planned by the data gatherers, or even use the data to try to disprove what

the original investigators had posited. There is concern among some front-line researchers that the system will be taken

over by what some researchers have characterized as “research parasites.”’

Dan Longo and Jeffrey Drazen, the deputy editor and editor-in-chief, NEJM

‘The condescension implicit in this statement is deeply

troubling. Drazen and Longo are saying, essentially, that only the people who originally collect a

data set can truly understand it, and anyone else who wants to

take a look is a parasite.’

Steven Salzberg, JHU

‘But the science, data, and research results are trapped in silos,

preventing faster progress and greater reach to patients. It’s not

just about developing game-changing treatments — it’s about

delivering them to those who need them.’

Vice President Biden’s Blog

Page 9: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Public Research, Private Results?

Page 10: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Public Research, Private Results?

Page 11: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Public Research, Private Results?

Page 12: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Public Research, Private Results?

‘Payment of 32 dollars is just insane when you need to skim or read tens or hundreds of these papers to do research. I obtained these papers

by pirating them. Later I found there are lots and lots of researchers (not

even students, but university researchers) just like me, especially

in developing countries. They created online communities

(forums) to solve this problem.’

Alexandra ElbakyanSci-hub operator

Page 13: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Sharing Clinical Trial Results

Page 14: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Sharing Clinical Trial Results

Page 15: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Sharing Clinical Trial Results

Thousands of clinical trials have not reported their results; some have not even

been registered.

Information on what was done and what was found in these trials could be lost

forever to doctors and researchers, leading to bad treatment decisions, missed

opportunities for good medicine, and trials being repeated.

All trials past and present should be registered, and the full methods and the

results reported.

We call on governments, regulators and research bodies to implement measures to

achieve this.

AllTrials.Net Petition(2015)

Page 16: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Sharing Clinical Trial Results

Page 17: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Sharing Clinical Trial Results

• Maximize the benefits while minimizing the risks of sharing clinical trial data

• Respect individual participants whose data are shared

• Increase public trust in clinical trials and the sharing of trial data

• Conduct the sharing of clinical trial data in a fair manner

IOM Report: Sharing Clinical Trial Data: Maximizing Benefits, Minimizing Risk

(2015)

Page 18: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Edge InformaticsInterfaces within the Drug Development Process

Target Discovery

NGS Exome analysis

Pathway Analysis

StructureAnalysis

Lead Discovery

RNAi

Assay Development

HTS

Lead Optimization

SAR

In vivo non-human testing

Exploratory PK

Exploratory Tox

Pre-Clinical Development

GLP Tox

Formulation

ADME

PK

Efficacy

Clinical Development

IND

Safety, Tolerability

Phase I-III

Registration

NDA/BLA

MAA

Marketing & Sales

PMRREMS

PSUR

Observational Research

Pathway Enrichment

Disease Contextualization

Seamless information connectivity (an EDGE) needed across domain NODEs

Page 19: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Integration Quandary: Content Does Not Combine Easily

Fit-for-Purpose to “Standards”

Models

Structured

Triplestores

Semi-StructuredUnstructured

Content

Lack of Compatible

Containers → the ”Plumbing

Problem”

Lack of Compatible

Semantics→ the ”Meaning Problem”

Page 20: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

What’s Needed?Linked Data!

LOD Cloud 2014Schmachtenberg, Bizer, Jentzsch and Cyganiak.http://lod-cloud.net/

Page 21: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

What’s Needed?Linked Data!

LOD Cloud 2014Schmachtenberg, Bizer, Jentzsch and Cyganiak.http://lod-cloud.net/

“Smart Data” means information that actually makes sense.

Wired Magazine, April 2013

Page 22: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Thanks to: Eric Little, VP Data Science, Osthus

The Emergence of Smart DataStandards Driven at Container Interfaces

Page 23: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Competitive Intelligence 360 (CI360) ApproachFlexibly Addressing Key Questions

23

Capture Business Questions and

Sources

Domain Expert Concept Map

Build Formal Ontology

Challenge with Linked Data

Examine with a Faceted Browser

Share insights with a Knowledge

Base

Page 24: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Capture Business QuestionsCapture Business

Questions and Sources

Page 25: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Translate Questions into ConceptsDomain Expert Concept Map

“Where are the key clinical studies in NSCLC and who are the principle investigators?”

Page 26: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Challenge with Data“Where are the key clinical studies in NSCLC and who are the principle investigators?”

(one example)

Challenge with Linked Data

Source: https://clinicaltrials.gov/ct2/show/NCT02027428

Page 27: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Refine the AnswerExamine with a

Faceted Browser

“What are the open trials in metastatic breast cancer and what drugs are being tested?”

Page 28: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Share Insights as a Community“Can a biomarker defined population be added to a trial record?”

Share insights with a Knowledge

Base

Page 29: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Data FAIRport

To be Findable:F1. (meta)data are assigned a globally unique and persistent identifierF2. data are described with rich metadata (defined by R1 below)F3. metadata clearly and explicitly include the identifier of the data it describesF4. (meta)data are registered or indexed in a searchable resource

To be Accessible:A1. (meta)data are retrievable by their identifier using a standardized communications protocolA1.1 the protocol is open, free, and universally implementableA1.2 the protocol allows for an authentication and authorization procedure, where necessaryA2. metadata are accessible, even when the data are no longer available

To be Interoperable:I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation.I2. (meta)data use vocabularies that follow FAIR principlesI3. (meta)data include qualified references to other (meta)data

To be Reusable:R1. meta(data) are richly described with a plurality of accurate and relevant attributesR1.1. (meta)data are released with a clear and accessible data usage licenseR1.2. (meta)data are associated with detailed provenanceR1.3. (meta)data meet domain-relevant community standards

InteroperabilityInvestment

Page 30: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Naming ThingsUse Resolvable URIs

InteroperabilityInvestment

http://purl.uniprot.org/uniprot/P30453

http://www.uniprot.org/uniprot/P30453

http://purl.uniprot.org/uniprot/P30453.ttl

Page 31: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Describing DataReuse, Reuse, Reuse (build only if essential)

InteroperabilityInvestment

Page 32: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Describing DataReuse, Reuse, Reuse (build only if essential)

InteroperabilityInvestment

Page 33: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Describing DataReuse, Reuse, Reuse (build only if essential)

InteroperabilityInvestment

Page 34: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Finding DataVocabulary of Interlinked Datasets (VoID)

InteroperabilityInvestment

Page 35: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Cross-BioPharma Data Standardization EffortsInteroperability

Investment

Page 36: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Open Data, Open Science Efforts

36

Page 37: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Open Data, Open Science Efforts

37

Page 38: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Open Data, Open Science Efforts

38

Page 39: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Open Data, Open Science Efforts

39

Page 40: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Open Data, Open Science Efforts

40

Page 41: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Open Data, Open Science Efforts

41

Page 42: Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT World 2016)

R&D | RDI

Get your plumbing right• And your data won’t be stuck in a silo

Leverage working public solutions• Don’t reinvent the wheel

Use Edge Informatics• Consider handoffs—you don’t know how your data will be used in the

future

Invest in Data Stewardship• Small tax to future-proof your efforts

Data and Collaboration ARE Business Assets