genetics variations effects, propagation, and buffering · 2016-04-27 · genetics variations –...

56
Teresa Przytycka NIH / NLM / NCBI Genetics variations effects, propagation, and buffering AlgoCSB Algorithmic Methods in Computational and Systems Biology

Upload: others

Post on 10-May-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Teresa Przytycka

NIH / NLM / NCBI

Genetics variations –

effects, propagation, and

buffering

AlgoCSBAlgorithmic Methods in Computational and Systems Biology

Page 2: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Journal “Wisla” (1902)

Phenotypes

Genotypes Individuals 1 2 3

gen

om

e 1

gen

om

e 2

gen

om

e 3

genetic

variations

Page 3: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Molecular

Phenotypes

Genotypes Gene Expression of gene g

Individuals 1 2 3 Individuals 1 2 3

gen

om

e 1

gen

om

e 2

gen

om

e 3

eQTL analysis

genetic

variations

Page 4: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Molecular

Phenotypes

Genotypes Gene Expression of gene g

Individuals 1 2 3 Individuals 1 2 3

No one-to-one relation in most cases

• Pathways rather than individual genes

• Drivers versus passengers

• Role of context and stochastic variations

gen

om

e 1

gen

om

e 2

gen

om

e 3

genetic

variations

Page 5: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Lessons from

engineered genetic perturbations

in Fly

5

Page 6: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Engineered chromosomal deletions

6

Left arm of chromosome 2

Chromosomal deletions (in one chromosome only)

Page 7: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Engineered chromosomal deletions

7

Left arm of chromosome 2

Chromosomal deletions (in one chromosome only)

Expression of deficiency genes is reduced by nearly half

(but not exactly and not always)

Page 8: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

What is the impact on network neighbors?

1st order neighbors 2nd order neighbors 3rd order neighbors

deficiency gene

Page 9: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

9

Expression changes propagate through the

network up to the second order neighbors

First order neighbors

Second order neighbors

Third order neighbors

Page 10: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Expression variation of deficiency genes is

higher than other genes

10

Measurement of variability

Delta = abs(e1-e2) /ave(e1,e2)

e1,e2 – expression in experiment 1 and 2

exp

ressio

n v

aria

bili

ty

Page 11: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Expression bursts as major contributor to

single cell expression noise

11

Prediction from the model – increased copy copy number reduces

Page 12: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Single cell model does not explain

variability in expression of deficiency genes

12

Technical noise

exp

ressio

n v

aria

bili

ty

Page 13: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

13

-4 -3 -2 -1 0 1

0.0

0.5

1.0

1.5

Female

log2 Fold Change (Df/+ vs. Control)

De

lta

-3 -2 -1 0 1

0.0

0.5

1.0

1.5

Male

log2 Fold Change (Df/+ vs. Control)

De

lta

Deficiency genes with

compensated/collapsed expression are

more noisy

compensated

collapsed

½ of wild-type expression

(expected)

exp

ressio

n v

aria

bili

ty

Page 14: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

14

-4 -3 -2 -1 0 1

0.0

0.5

1.0

1.5

Female

log2 Fold Change (Df/+ vs. Control)

De

lta

-3 -2 -1 0 1

0.0

0.5

1.0

1.5

Male

log2 Fold Change (Df/+ vs. Control)

De

lta

Deficiency genes with

compensated/collapsed expression are

more noisy

compensated

collapsed

½ of wild-type expression

(expected)

exp

ressio

n v

aria

bili

ty

Page 15: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

15

-4 -3 -2 -1 0 1

0.0

0.5

1.0

1.5

Female

log2 Fold Change (Df/+ vs. Control)

De

lta

-3 -2 -1 0 1

0.0

0.5

1.0

1.5

Male

log2 Fold Change (Df/+ vs. Control)

De

lta

Deficiency genes with

compensated/collapsed expression are

more noisy

compensated

collapsed

½ of wild-type expression

(expected)

exp

ressio

n v

aria

bili

ty Hypothesis:

dosage compensation

and expression noise

are related to

interaction context

Page 16: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Expression variations propagate across

regulatory network

16

TFNon-TF

Page 17: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Why this is relevant

17

Page 18: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Copy number variations and

propagation of expression

changes in cancer

18

Page 19: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Positive correlation between gene copy

number and gene expression in cancer (GBM)

19

Page 20: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Molecular Phenotypes Genotypes (gene expression)

Individuals 1 2 3 4

gen

om

e 1

gen

om

e 2

gen

om

e 3

gen

om

e 4

CNV

Using propagation of expression perturbation to

study genotype-phenotype relation

Page 21: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

The eQTL-net method

expression of

gene of interest

eQTL relation

Motivated in part by:

Tu et al Bioinformatics 2006

Suthram et al MSB 2008

Kim et al. PLoS CB 2011

Page 22: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Explaining eQTL associations by propagation

of expression changes

eQTL relation

gene of interest

genes in CNV region

Kim et al. PLoS CB 2011

Page 23: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Explaining eQTL associations by propagation

of expression changes

Kim et al. PLoS CB 2011

gene of interest

Page 24: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Identification of information propagation

pathways and driver genes

driving gene

Kim et al. PolS CB 2011

gene of interest

Page 25: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

How to interpret these pathway in the context

of throughput interaction networks?

Page 26: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Dutch Interior 1, Joan Miro’ (1893–1983) Museum of Modern Art, New York© 2012 Successió Miró / Artists Rights Society (ARS), New York / ADAGP, Paris used with ARS permission).

How to interpret this painting?

Page 27: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

REAL NETWORK

Dutch Interior 1, Joan Miro’ (1893–1983)

Museum of Modern Art, New York© 2012 Successió Miró / Artists Rights Society (ARS), New York / ADAGP, Paris

(used with ARS permission).

The Lute Player, Hendrick Maertensz Sorgh (1610-1670),

Rijksmuseum, Amsterdam(public domain)

Details are perturbed but relationships remain

Page 28: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

eQTL relation

gene of interest

Considering information propagation

pathway as a bag of genes

Page 29: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Typically we use many genes as a molecular

phenotype- repate the process for all of them

eQTL relation

eQTL relation

Page 30: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Results

30

Driving Copy Number alterations

EGFR, PTEN, RB1, GBAS, TP53, CDKN2A,…..

Reoccurring pathways

Cell cycle, EGFR, RAS, Cell proliferation, Insulin Signaling, DNA repair, ….

Splicing, SMAD (nuclear translocation), lipid storage

Hubs

Myc, E2F1, CREBBP, SP1, Jun,…

Page 31: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Utilizing Networks for Understanding

Genotype-Phenotype relations

Network

propagation

Page 32: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

What can we do about perturbations that do not

necessarily cause expression change in network

neighbors?

Page 33: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

mutated /

dysregualted

subnetworks

d networks

Finding mutated/dysregulated

subnetworks

If different perturbations have similar effects the

perturbations should be related – belong to the

same subnetwork

Main principle:

Challenge:

Cancer heterogeneity

Page 34: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Gene cover

Module Cover Approach

Find minimal cost set of

modules so that each

disease case is covered at

least k times

Optimization problem:

Cost is a determined by:

A similarity of genes within modules

(application dependent)

number of modules

Kim et al. 2013

a

Advantage: different patients can be covered by different subnetworks

Motivated in part by: Ulitsky et al 2008

Page 35: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Gene cover

Optimization problem:

Cost is a function of:

Similarity within modules

• Distance in network

• Expression similarity

(similarity of eQTL profiles)

number of modules

(parameterized penalty)

Kim et al. 2013

Application 1: Glioblastoma Analysis

Advantage: different patients can be covered by different subnetowrks

Page 36: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Modules capture patients heterogeneity/subtypes

cases

mo

du

les

Kim et al. PSB 2013

Page 37: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Application 2: Extension to mutual

exlusivity

Page 38: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Mutual exclusivity of cancer drivers

mutations in gene 1

Mutations in gene 2

patients

Thomas et al 2007

Ciriello, et al., 2012;

Vandin, et al., 2012;

Szczuret et.al , 2014, 2015

Leiserson, et al., Vadin et al. 2013,2014,2015;

Kim et al. 2015

Constantinescu et al. 2015

Proposed explanations

• any of the two drivers alone gives sufficient growth advantage

• negative genetic interactions between drivers

Interesting property

Mutual exclusive pairs are often in the same pathway

Page 39: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Application 2: Subnetworks dysregulated

across many cancer types

39

Gene cover

Cost is a function of:

Similarity measure

• Distance in network

• Mutual Exclusivity Score

number of (parameterized

penalty)

Kim et al. 2015

Page 40: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Defining Mutual exclusivity in PanCancer

setting

mutations in gene 1

Mutations in gene 2

patients

patients

gene 1

gene 2

Kim et al. IMSB 2015 – classification of ME types in context of PanCancer and properties of

different ME types

Page 41: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

41

MEMCover (Mutual Exlusivity Module Cover)

Finds subnetwokrs dysregulated across cancer types

Page 42: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

42

Kim et al. IMSB 2015

Page 43: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Kim et al. IMSB 2015

Page 44: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

MEMCover modules are enriched in cancer

drivers?

44

Compared to Module Cover without ME Compared to HotNet2

Page 45: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

mutated /

dysregualted

subnetworks

d networks

Utilizing Networks for Understanding

Genotype-Phenotype relations

Perturbation

flow

Page 46: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

mutated /

dysregualted

subnetworks

d networks

Patient/phenotypic similarity networks

phenotypic

similarity

networks

Perturbation

flow

Page 47: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Motivation – simultaneous utilization of

multiple genotypic/causal and phenotypic

data types

47

Genotypic/ causal factors

mutations,

CNV

methylation,

Sex, age, environment ….

Phenotypic properties

gene expression

response to drugs

survival time

pathology features

Idea: Construct phenotype similarity graph and

explain it connectivity using genotypic features

Page 48: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

48

Topic model: Assuming that each document is a

mixture of topics - identify topics and the words that

define them

Chang J, Blei DM: Hierarchical Relational Models for Document Networks. Ann Appl

Stat 2010, 4(1):124-150.

Document similarity network

Page 49: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Subtype I

EGFR_A 0.45

NF1_M 0.37

PTEN_A 0.21

….

Subtype II

PDGFA_A 0.51

IDH1_M 0.29

M53_M 0.17

….

Subtype III

mirR218_H 0.38

ICDK2_D 0.22

SHC1_M 0.14

….

Subtype IV

CDK2B_D 0.37

EGFR_A 0.25

….

Features:EGFR_A

NF1_M

CDKN2B_D

.

.

;

Topics – disease subtypes

Words – possible causes (mutations, CNV, miRNA)

Documents similarity– phenotypic similarity (gene expression)

Cho et al. NAR 2013/RECOMB 2012

Page 50: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Cho et al. NAR 2013/RECOMB 2012

Similar patients are to be explained by similar

subtypes mixtures

Page 51: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Co-occurrence of patients in the same

subtype (based on 1000 topic models)

Observation: No separate Neural group

Page 52: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Loss of Neural group is not surprising

52

expression similarity network for GMB

Mesenchymal

Classical

Proneural

Neural

Varhaak et al.

Classification

Page 53: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Mesenchymal

EGFR_A

NF1_M

NF1_A

PTEN_A

BRACA2

BRACA1

….

Preneural

PDGFA_A

IDH1_M

Mir 128a/b

Mir 124a….

ErBB1/2

Subtype IIIClassical

TNFRSF

….

(no clearly

dominating one)

Features:EGFR_A

NF1_M

CDKN2B_D

.

.

;

Subtypes are defined by a distribution of words

(mutations) – new patients can be easily

classified

Cho et al. NAR 2013/RECOMB 2012

Page 54: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

mutated /

dysregualted

subnetworks

d networks

Summary – these are complementing

approaches

phenotypic

similarity

networks

Perturbation

flow

eQTL-net

Module-Cover Topic Model

Page 55: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

Przytycka’s group

Phung Dao Jan Hoinka YooAh Kim Damian Wojtowicz Yijie Wang

AlgoCSBAlgorithmic Methods in Computational and Systems Biology

DongYeon Cho Sanna Madam

(alumnae) Poolesville HS

Collaborators

Brian Oliver Hangnoh Lee

Steve Russel, Cambridge

Stefan Wuchty, Univ. Miami (not pictured)

Page 56: Genetics variations effects, propagation, and buffering · 2016-04-27 · Genetics variations – effects, propagation, and buffering AlgoCSB ... Documents similarity–phenotypic

56

Perturbing a systems brings about valuable lessons