pistoia alliance conference april 2016: big data: mathew woodwark
TRANSCRIPT
![Page 1: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/1.jpg)
Big Biomedical DataPistoia Panel Discussion19th April 2016Mathew Woodwark, Director of Research BioinformaticsMedImmune
![Page 2: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/2.jpg)
![Page 3: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/3.jpg)
3
Goal: Integrative Informatics
![Page 4: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/4.jpg)
Collect, Store and Integrate
Integrated data analysis
Biological Understanding
} Target IDTarget SelectionTarget ValidationBiomarkers
Public Collaborations Internal
Engineering better molecules, enabling better projects
Data Sources
Multiple OmicsData types
Tissue P
henomics
Flow C
ytometry
Screening
Proteom
ics
Transcriptomics
Exom
e
Whole G
enome
Clinical
Phenotype
Data Warehouse
![Page 5: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/5.jpg)
Integrative Informatics Architecture
5
Portal and query layer
Data Marts
Integration Engine
Metadata and business
rules
Structured data
Semi-structured
data
Un-structured
data
Visualisation Layer
Queries to build data marts
Structured data
Semi-structured
data
Un-structured
data
External Internal
Rich viz components – display connected info in multiple formats
Organized by data type or by process. Data standards guide assembly
Data Extraction for further analysis in specialized tools
Reusable data connectors
Drill into underlying detailed data
Persistent and temporary marts
Able to integrate internal and external data
QC and ETL?
![Page 6: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/6.jpg)
6
Genomics Big Data Considerations1: Genomic Data
![Page 7: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/7.jpg)
7
Genomics Big Data Considerations2: Security and Access
![Page 8: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/8.jpg)
8
Genomics Big Data Considerations3: Consent
![Page 9: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/9.jpg)
9
Genomics Big Data Considerations4: Geography
![Page 10: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/10.jpg)
10
Genomics Big Data Considerations5: Phenotypic data
![Page 11: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/11.jpg)
11
Genomics Big Data Considerations6: Integration with other data
Tissue P
henomics
Flow
Cytom
etry
Screening
Proteom
ics
Transcriptomi
cs
![Page 12: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/12.jpg)
12
Genomics Big Data Considerations7: Global Collaboration
Tissue P
henomics
Flow
Cytom
etry
Screening
Proteom
ics
Transcriptomi
cs
![Page 13: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/13.jpg)
13
Genomics Big Data Considerations8: More integration
Tissue P
henomics
Flow
Cytom
etry
Screening
Proteom
ics
Transcriptomi
cs
![Page 14: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark](https://reader035.vdocuments.us/reader035/viewer/2022070514/587e4f6b1a28abeb1a8b5bed/html5/thumbnails/14.jpg)
14
Its complicated!And a work in progress…
…for all of us!