r / bioconductor for 'omics analysis · r / bioconductor for ’omics analysis martin morgan...

39
R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Buffalo, NY, USA [email protected] 1 December 2016 R / Bioconductor for ’Omics Analysis 1 / 26

Upload: others

Post on 06-Nov-2019

10 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

R / Bioconductor for ’Omics Analysis

Martin Morgan

Roswell Park Cancer InstituteBuffalo, NY, USA

[email protected]

1 December 2016

R / Bioconductor for ’Omics Analysis 1 / 26

Page 2: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Introduction

https://bioconductor.orghttps://support.bioconductor.org

Analysis and comprehension ofhigh-throughput genomic data.

Started 2002

1295 packages – developed by‘us’ and user-contributed.

Well-used and respected.

43k unique IP downloads /month.

17,000 PubMedCentralcitations.

R / Bioconductor for ’Omics Analysis 2 / 26

Page 3: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

1 About

2 ’Omics workflows

3 Lessons learned

4 Challenges

5 Opportunities

R / Bioconductor for ’Omics Analysis About 3 / 26

Page 4: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Scope

Based on the R programming language.

Intrinsically statistical nature of data.

Flexible analysis options for new or customized types of analysis.

‘Old-school’ scripts for reproducibility; modern graphical interfaces foreasy use.

Domains of application.

Sequencing: differential expression, ChIP-seq, variants, gene setenrichment, . . .

Microarrays: methylation, expression, copy number, . . .

Flow cytometry, proteomics, . . .

R / Bioconductor for ’Omics Analysis About 4 / 26

Page 5: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

R : base packages

x <- rnorm(100)

y <- x + rnorm(100, sd=.5)

df <- data.frame(X=x, Y=y)

fit <- lm(Y ~ X, df)

anova(fit)

## Analysis of Variance Table

##

## Response: Y

## Df Sum Sq Mean Sq F value Pr(>F)

## X 1 101.459 101.459 444.98 < 2.2e-16 ***

## Residuals 98 22.345 0.228

## ---

## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

R / Bioconductor for ’Omics Analysis About 5 / 26

Page 6: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

R : contributed packages

library(ggplot2)

ggplot(df, aes(x=x, y=y)) +

geom_point() +

stat_smooth(method="lm")

●●

●●

●●

● ●

●●

−2

0

2

−3 −2 −1 0 1 2 3

x

y

R / Bioconductor for ’Omics Analysis About 6 / 26

Page 7: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Bioconductor

Learn & use

biocViews1

Landing pages2

I DescriptionI InstallationI Documentation

Vignettes3

Workflows4, F1000channel

1https://bioconductor.org/packages/release2e.g., https://bioconductor.org/packages/edgeR3e.g., https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/

inst/doc/DESeq2.pdf4http://bioconductor.org/help/workflows

R / Bioconductor for ’Omics Analysis About 7 / 26

Page 8: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Bioconductor

Learn & use

biocViews1

Landing pages2

I DescriptionI InstallationI Documentation

Vignettes3

Workflows4, F1000channel

1https://bioconductor.org/packages/release2e.g., https://bioconductor.org/packages/edgeR3e.g., https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/

inst/doc/DESeq2.pdf4http://bioconductor.org/help/workflows

R / Bioconductor for ’Omics Analysis About 7 / 26

Page 9: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Bioconductor

Learn & use

biocViews1

Landing pages2

I DescriptionI InstallationI Documentation

Vignettes3

Workflows4, F1000channel

1https://bioconductor.org/packages/release2e.g., https://bioconductor.org/packages/edgeR3e.g., https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/

inst/doc/DESeq2.pdf4http://bioconductor.org/help/workflows

R / Bioconductor for ’Omics Analysis About 7 / 26

Page 10: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Bioconductor

Learn & use

biocViews1

Landing pages2

I DescriptionI InstallationI Documentation

Vignettes3

Workflows4, F1000channel

1https://bioconductor.org/packages/release2e.g., https://bioconductor.org/packages/edgeR3e.g., https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/

inst/doc/DESeq2.pdf4http://bioconductor.org/help/workflows

R / Bioconductor for ’Omics Analysis About 7 / 26

Page 11: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Bioconductor

Learn & use

biocViews1

Landing pages2

I DescriptionI InstallationI Documentation

Vignettes3

Workflows4, F1000channel

1https://bioconductor.org/packages/release2e.g., https://bioconductor.org/packages/edgeR3e.g., https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/

inst/doc/DESeq2.pdf4http://bioconductor.org/help/workflows

R / Bioconductor for ’Omics Analysis About 7 / 26

Page 12: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Bioconductor

Learn & use

biocViews1

Landing pages2

I DescriptionI InstallationI Documentation

Vignettes3

Workflows4, F1000channel

1https://bioconductor.org/packages/release2e.g., https://bioconductor.org/packages/edgeR3e.g., https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/

inst/doc/DESeq2.pdf4http://bioconductor.org/help/workflows

R / Bioconductor for ’Omics Analysis About 7 / 26

Page 13: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Bioconductor

Learn & use

biocViews1

Landing pages2

I DescriptionI InstallationI Documentation

Vignettes3

Workflows4, F1000channel

1https://bioconductor.org/packages/release2e.g., https://bioconductor.org/packages/edgeR3e.g., https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/

inst/doc/DESeq2.pdf4http://bioconductor.org/help/workflows

R / Bioconductor for ’Omics Analysis About 7 / 26

Page 14: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Bioconductor

Input: description of experimental design and summary of read countsoverlapping regions of interest.

assay <- read.table("assay.tab") # Plain text files

pdata <- read.table("pdata.tab")

library(DESeq2)

dds <- DESeqDataSetFromMatrix(assay, pdata, ~ cell + dex)

result(DESeq(dds))

Output: top table of differentially expressed genes, log fold change,adjusted P-value, etc.

R / Bioconductor for ’Omics Analysis About 8 / 26

Page 15: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

1 About

2 ’Omics workflows

3 Lessons learned

4 Challenges

5 Opportunities

R / Bioconductor for ’Omics Analysis ’Omics workflows 9 / 26

Page 16: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

A typical work flow: RNA-seq

1 Experimental design

2 Wet-lab

3 Sequencing; QC – FASTQ

4 Alignment – BAM

5 Data reduction – count tables

6 Statistical analysis

7 Comprehesionhttp://bio.lundberg.gu.se/

courses/vt13/rnaseq.html

R / Bioconductor for ’Omics Analysis ’Omics workflows 10 / 26

Page 17: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

A typical work flow: RNA-seq

1 Experimental design

2 Wet-lab

3 Sequencing; QC – FASTQ

4 Alignment – BAM

5 Data reduction – count tables

6 Statistical analysis

7 Comprehesionhttp://bio.lundberg.gu.se/

courses/vt13/rnaseq.html

R / Bioconductor for ’Omics Analysis ’Omics workflows 10 / 26

Page 18: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

A typical work flow: RNA-seq

1 Experimental design

2 Wet-lab

3 Sequencing; QC – FASTQ

4 Psuedo-alignment – counttables

5 Statistical analysis

6 Comprehesion

kallisto5, salmon6, . . .

Very fast

Very memory efficient

Good enough for manyapplications

Bioconductor

tximport

limma voom()

5https://pachterlab.github.io/kallisto/6http://salmon.readthedocs.io/

R / Bioconductor for ’Omics Analysis ’Omics workflows 10 / 26

Page 19: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

A typical work flow: RNA-seq

1 Experimental design

2 Wet-lab

3 Sequencing; QC – FASTQ

4 Psuedo-alignment – count tables

5 Statistical analysis

6 Comprehesion

DESeq2 , edgeR

Gene set / pathway analysis

Annotation & visualization

R / Bioconductor for ’Omics Analysis ’Omics workflows 10 / 26

Page 20: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

1 About

2 ’Omics workflows

3 Lessons learned

4 Challenges

5 Opportunities

R / Bioconductor for ’Omics Analysis Lessons learned 11 / 26

Page 21: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Differential expression

limma, edgeR, DESeq2

library(DESeq2)

dds <- DESeqDataSetFromMatrix(assay, pdata, ~ cell + dex)

result(DESeq(dds))

Batch effects (e.g., surrogate variable analysis)

Library size differences (robust normalization)

Appropriate statistical model (negative binomial)

Moderated, data-driven parameter estimates (shared design; smallsample size)

Multiple testing (independent hypothesis weighting)

R / Bioconductor for ’Omics Analysis Lessons learned 12 / 26

Page 22: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Interoperability & reproducibility: classes

GenomicRanges

Genomic coordinates torepresent data (e.g., alignedreads) and annotations (e.g.,genes, binding sites).

findOverlaps() and friends.

SummarizedExperiment

Coordinate ‘assay’ data withrow (feature) and column(sample) information.

> gr = exons(TxDb.Hsapiens.UCSC.hg19.knownGene); grGRanges with 289969 ranges and 1 metadata column: seqnames ranges strand | exon_id <Rle> <IRanges> <Rle> | <integer> [1] chr1 [11874, 12227] + | 1 [2] chr1 [12595, 12721] + | 2 [3] chr1 [12613, 12721] + | 3 ... ... ... ... ... ... [289967] chrY [59358329, 59359508] - | 277748 [289968] chrY [59360007, 59360115] - | 277749 [289969] chrY [59360501, 59360854] - | 277750 --- seqinfo: 93 sequences (1 circular) from hg19 genome

DataFrame mcols(gr) gr$exon_id

GRanges length(gr); gr[1:5] seqnames(gr) start(gr) end(gr) width(gr) strand(gr)

Seqinfo seqlevels(gr) seqlengths(gr) genome(gr)

R / Bioconductor for ’Omics Analysis Lessons learned 13 / 26

Page 23: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Interoperability & reproducibility: classes

GenomicRanges

Genomic coordinates torepresent data (e.g., alignedreads) and annotations (e.g.,genes, binding sites).

findOverlaps() and friends.

SummarizedExperiment

Coordinate ‘assay’ data withrow (feature) and column(sample) information.

colData(se)se[, se$dex == "trt"]

rowRanges(se)rowData(se)subsetByOverlaps(se, roi)

assays(se)assay(se, n = 2)assay(subsetByOverlaps(se, roi))assay(se[, se$dex == "trt"])

metadata(se)metadata(se)$modelFormula

Samples (Columns)

Sam

ples

Feat

ures

(R

ows)

R / Bioconductor for ’Omics Analysis Lessons learned 13 / 26

Page 24: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Classic, tidy, rich: RNA-seq count data

Classic

Sample x (phenotype + expression) Feature data.frame

Tidy

’Melt’ expression values to two long columns, replicated phenotypecolumns. End result: long data frame.

Rich, e.g., SummarizedExperiment

Phenotype and expression data manipulated in a coordinated fashionbut stored separately.

R / Bioconductor for ’Omics Analysis Lessons learned 14 / 26

Page 25: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Classic, tidy, rich: RNA-seq count data

df0 <- as.data.frame(list(mean=colMeans(classic[, -(1:22)])))

df1 <- tidy %>% group_by(probeset) %>%

summarize(mean=mean(exprs))

df2 <- as.data.frame(list(mean=rowMeans(assay(rich))))

ggplot(df1, aes(mean)) + geom_density()

R / Bioconductor for ’Omics Analysis Lessons learned 15 / 26

Page 26: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Classic, tidy, rich: RNA-seq count data

Vocabulary

Classic: extensive

Tidy: restricted endomorphisms

Rich: extensive, meaningful

Constraints (e.g., probes & samples)

Tidy: implicit

Classic, Rich: explicit

Flexibility

Classic, tidy: general-purpose

Rich: specialized

Programming contract

Classic, tidy: limited

Rich: strict

Lessons learned / best practices

Considerable value insemantically rich structures

Endomorphism, simplevocabulary, consistent paradigmaid use

R / Bioconductor for ’Omics Analysis Lessons learned 16 / 26

Page 27: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

1 About

2 ’Omics workflows

3 Lessons learned

4 Challenges

5 Opportunities

R / Bioconductor for ’Omics Analysis Challenges 17 / 26

Page 28: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Single-cell analysis

Trapnel et al.5

Large & sparseI Outlier detectionI Zero-inflated modelsI E.g., MAST

ChallengingI E.g., developmental

trajectories

5http://bioconductor.org/packages/monocleR / Bioconductor for ’Omics Analysis Challenges 18 / 26

Page 29: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Comprehension

Gene set & pathway analysis

limma fry(); pathview ;ReactomePA

Visualization

Gviz , ComplexHeatmap, . . .

Communication

Reports; interactive apps

Statistical nuance, especiallyuncertainty, multiple testing

R / Bioconductor for ’Omics Analysis Challenges 19 / 26

Page 30: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Comprehension

Gene set & pathway analysis

limma fry(); pathview ;ReactomePA

Visualization

Gviz, ComplexHeatmap, . . .

Communication

Reports; interactive apps

Statistical nuance, especiallyuncertainty, multiple testing

R / Bioconductor for ’Omics Analysis Challenges 19 / 26

Page 31: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Comprehension

Gene set & pathway analysis

limma fry(); pathview ;ReactomePA

Visualization

Gviz , ComplexHeatmap, . . .

Communication

Reports; interactive apps

Statistical nuance, especiallyuncertainty, multiple testing

R / Bioconductor for ’Omics Analysis Challenges 19 / 26

Page 32: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Comprehension

Gene set & pathway analysis

limma fry(); pathview ;ReactomePA

Visualization

Gviz , ComplexHeatmap, . . .

Communication

Reports; interactive apps

Statistical nuance, especiallyuncertainty, multiple testing

R / Bioconductor for ’Omics Analysis Challenges 19 / 26

Page 33: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Multi-’omic integration

Gene differential expression

RNA-seq – DESeq2 , edgeR,limma voom()

Microarray – limma

Single-cell – scde

Gene regulation

ChIP-seq – csaw , DiffBind

Methylation arrays –missMethyl , minfi

Gene sets and pathways –topGO, limma, ReactomePA

Variants

SNPs – VariantAnnotation,VariantFiltering

Copy number

Structural – InteractionSet

Flow cytometry

flowCore & 41 other packages

Proteomics

mzR, xcms, and 90 otherpackages

R / Bioconductor for ’Omics Analysis Challenges 20 / 26

Page 34: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Multi-’omic integration

correlation

Freq

uenc

y

0.00 0.05 0.10 0.15

010

2030

4050

flaggedsamples

MultiAssayExperiment

Easily manage multiple assayson overlapping samples

ExperimentHub

Curated, summarized,large-scale experiment data(e.g., GEO RNA-Seq; HMP,TCGA) for incorporation in localanalysis

R / Bioconductor for ’Omics Analysis Challenges 21 / 26

Page 35: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Big data

Key strategies

Efficient R code

Restriction to data of interest

Chunk-wise iteration throughlarge data

GenomicFiles

Management of file collections,e.g., VCF, BAM, BED.

BiocParallel

Parallel evaluation on cores,clusters, clouds.

HDF5Array

On-disk storage.

Delayed evaluation.

Incorporates intoSummarizedExperiment.

R / Bioconductor for ’Omics Analysis Challenges 22 / 26

Page 36: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

1 About

2 ’Omics workflows

3 Lessons learned

4 Challenges

5 Opportunities

R / Bioconductor for ’Omics Analysis Opportunities 23 / 26

Page 37: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Install, learn, use, develop

Install6

R, RStudio,Bioconductor

Learn

Courses, vignettes,workflows

Use

Vignettes, manuals,support site7

Develop

6https://bioconductor.org7https://support.bioconductor.org

R / Bioconductor for ’Omics Analysis Opportunities 24 / 26

Page 38: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

From student to developer

A common transition

Naive users become proficient while developing domain expertise thatthey share with others in their lab or more broadly

Share via packages

Really easy!

Best practices

devtools create(), build(), check(), install()

Version control – github

Unit tests, e.g., using testthat

‘Continuous integration’

R / Bioconductor for ’Omics Analysis Opportunities 25 / 26

Page 39: R / Bioconductor for 'Omics Analysis · R / Bioconductor for ’Omics Analysis Martin Morgan Roswell Park Cancer Institute Bu alo, NY, USA martin.morgan@roswellpark.org 1 December

Acknowledgments

Core team (current & recent): Yubo Cheng, Valerie Obenchain, HervePages, Marcel Ramos, Lori Shepherd, Dan Tenenbaum, Greg Wargula.

Technical advisory board: Vincent Carey, Kasper Hansen, WolfgangHuber, Robert Gentleman, Rafael Irizzary, Levi Waldron, MichaelLawrence, Sean Davis, Aedin Culhane

Scientific advisory board: Simon Tavare (CRUK), Paul Flicek(EMBL/EBI), Simon Urbanek (AT&T), Vincent Carey (Brigham &Women’s), Wolfgang Huber (EBI), Rafael Irizzary (Dana Farber), RobertGentleman (23andMe)

Research reported in this presentation was supported by the NationalHuman Genome Research Institute and the National Cancer Institute ofthe National Institutes of Health under award numbers U41HG004059 andU24CA180996. The content is solely the responsibility of the authors anddoes not necessarily represent the official views of the National Institutesof Health.

R / Bioconductor for ’Omics Analysis Acknowledgments 26 / 26