
Characterizing the role of miRNAs within gene regulatory networks using integrative genomics techniques

Min Wenwen
2012.04.20


Background: eQTL


Background: eQTL

Expression quantitative trait loci (eQTLs)

Nat. Rev. Cardiol. doi:10.1038/nrcardio.2011.208


Motivation

Previous studies examined the relationship between pairs of correlated quantitative traits, such as mRNA and clinical phenotypes (Mehrabian et al, 2005; Schadt et al, 2005; Yang et al, 2009).

We applied a variation of a previously described statistical procedure (Schadt et al, 2005) to identify mRNAs that respond to changes in miRNA expression levels (miRNA targets), as well as mRNAs that perturb expression levels of miRNAs.


Summary

Integrative genomics and genetics approaches have proven to be useful tools in elucidating the complex relationships often found in gene regulatory networks.

Our analysis reveals that the transcript abundances of miRNAs are subject to regulatory control by many more loci than previously observed for mRNA expression.

Our results:
• miRNAs exist as highly connected hub-nodes and function as key sensors within the transcriptional network.
• miRNAs can act cooperatively or redundantly to regulate a given pathway.
• miRNAs play a subtle role by dampening expression of their target genes through the use of feedback loops.


Idea and data

This approach leverages DNA sequence variation as a causal anchor to identify the best-fitting model that describes the relationship between pairs of traits (miRNA, mRNA) that are linked to the same genetic locus.

Using an F2 mouse cross, we collected both mRNA expression and genotype information from liver.

Expression traits: 39 557 mRNA and 183 miRNA transcripts.

From the panel of 5000 SNP markers, 2804 markers informative for the BXD cross and evenly spaced across all chromosomes, excluding the Y chromosome, were selected for use in all analyses.

MSB2011.SI\msb201123-s5.xls (markers)


Methods

① Linkage analysis techniques were applied to infer regulatory relationships between DNA loci and the two classes of expression traits, that is, mRNAs and miRNAs.

② The miRNA–mRNA relationships were characterized using a simple correlation analysis.

③ A variation of a previously developed statistical inference technique was applied to infer regulatory relationships between mRNAs and miRNAs.


mRNA and miRNA eQTL mapping in the BXD mouse study

Using standard parametric linkage analysis techniques, we treated the expression levels of both mRNAs and miRNAs as quantitative traits to identify regulatory loci generally referred to as expression quantitative trait loci (eQTLs).

LOD score:

\[ \mathrm{LOD} = Z = \log_{10} \frac{P(\text{birth sequence} \mid \text{given linkage value})}{P(\text{birth sequence} \mid \text{no linkage})} \]
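As a rough illustration of the LOD idea for an expression trait, the sketch below compares a regression of expression on genotype at one marker against a mean-only model. It assumes additive genotype coding (0, 1, 2) and Gaussian residuals; the function name `lod_score` and the example values are hypothetical, and this is not necessarily the linkage software used in the study.

```python
import math

def lod_score(genotypes, expression):
    """Sketch: LOD at one marker from a simple linear regression.

    Assumes additive genotype coding (0/1/2) and Gaussian residuals.
    """
    n = len(expression)
    mean_y = sum(expression) / n
    mean_g = sum(genotypes) / n
    # Residual sum of squares under the null model (no linkage): mean only.
    rss0 = sum((y - mean_y) ** 2 for y in expression)
    sxx = sum((g - mean_g) ** 2 for g in genotypes)
    if sxx == 0 or rss0 == 0:
        return 0.0  # uninformative marker or constant trait
    sxy = sum((g - mean_g) * (y - mean_y)
              for g, y in zip(genotypes, expression))
    slope = sxy / sxx
    # Residual sum of squares under the linkage model (genotype effect).
    rss1 = sum((y - mean_y - slope * (g - mean_g)) ** 2
               for g, y in zip(genotypes, expression))
    # log10 of the likelihood ratio between the two Gaussian models.
    return (n / 2.0) * math.log10(rss0 / rss1)

# Hypothetical data: expression rises with the genotype code.
example_lod = lod_score([0, 0, 1, 1, 2, 2], [1.0, 1.2, 2.1, 1.9, 3.0, 3.2])
```

A trait would then be called an eQTL at this marker when the LOD exceeds the chosen threshold.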


mRNA and miRNA eQTL mapping in the BXD mouse study

In contrast, we identified 5293 eQTLs for 5107 of the 39 557 mRNA transcripts (~13%) at a LOD score threshold of >4.9 (corresponding to an FDR <5%).

Of these, 2712 (or 37%) were cis eQTLs. Thus, by percentage, at the 10% FDR threshold, more than three times as many mRNA eQTLs were detected as for the miRNA expression traits.


Decrease the FDR of detecting miRNA eQTLs

For each miRNA, we identified a set of mRNA expression traits whose transcripts contained at least one hexamer (6mer) seed region within the 3’ UTR.

These gene sets were then filtered to contain only genes that were significantly negatively correlated with the corresponding miRNA.
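The two filtering steps can be sketched as follows. This is a minimal illustration, assuming that the seed is nucleotides 2 to 7 of the mature miRNA, that a target site is the reverse complement of that seed, and that a fixed correlation cutoff (`max_r`) stands in for the significance test used in the study. All function names and example sequences are hypothetical.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}

def seed_site(mirna_seq):
    """Reverse complement (RNA) of miRNA positions 2-7, the 6mer seed."""
    seed = mirna_seq.upper().replace("T", "U")[1:7]
    return "".join(COMPLEMENT[base] for base in reversed(seed))

def pearson(x, y):
    """Pearson correlation of two equal-length lists (0.0 if undefined)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5

def candidate_targets(mirna_seq, mirna_expr, utrs, mrna_expr, max_r=-0.5):
    """Genes with a seed site in the 3' UTR and a negative correlation.

    max_r is an illustrative cutoff, not the study's significance test.
    """
    site = seed_site(mirna_seq)
    hits = []
    for gene, utr in utrs.items():
        if site not in utr.upper().replace("T", "U"):
            continue  # step 1: require at least one 6mer seed match
        if pearson(mirna_expr, mrna_expr[gene]) <= max_r:
            hits.append(gene)  # step 2: keep negatively correlated genes
    return hits

# Hypothetical inputs: one miRNA, three genes.
utrs = {"geneA": "AAACTACCTCAAA", "geneB": "AAAAAAAAAA", "geneC": "GGTACCTCGG"}
mrna_expr = {"geneA": [5, 4, 3, 2, 1], "geneB": [5, 4, 3, 2, 1],
             "geneC": [1, 2, 3, 4, 5]}
targets = candidate_targets("UGAGGUAGUAGGUUGUAUAGUU", [1, 2, 3, 4, 5],
                            utrs, mrna_expr)
```

Here geneB is dropped for lacking a seed site and geneC for being positively correlated.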


Decrease the FDR of detecting miRNA eQTLs

[Figure: gene-set filtering by 3’ UTR seed match and negative correlation]


We next sought to determine if there were key loci involved in regulating many miRNAs.


Distribution of eQTLs for mRNA and miRNA


Key loci regulating many miRNAs and mRNAs

We identified a strong miRNA eQTL hotspot on chr 13 and a weaker hotspot on chr 17.

Of the 72 eQTLs identified, 42% mapped to chr 13, suggesting the presence of a key regulator influencing the expression levels of many miRNAs.

Overall, we detected seven mRNA eQTL hotspots, where each hotspot is defined to comprise >1% of the total number of eQTLs (computed using a Poisson distribution with mean 9.52).

These hotspots localize to chr 2, 4, 7, 9, 12, 13, and 17.
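One way to read the Poisson-based hotspot call is as a tail-probability test on the number of eQTLs per genomic bin. The sketch below assumes that reading: it flags bins whose count is improbably high under a Poisson distribution with the stated mean of 9.52. The Bonferroni correction, the function names, and the bin counts are illustrative assumptions, not the study's exact rule.

```python
import math

def poisson_tail(k, mean):
    """P(X >= k) for X ~ Poisson(mean), summing the upper tail directly."""
    if k <= 0:
        return 1.0
    # First tail term, computed in log space to avoid overflow.
    term = math.exp(-mean + k * math.log(mean) - math.lgamma(k + 1))
    total = 0.0
    i = k
    # Keep adding terms while they still grow (i <= mean) or still matter.
    while term > 0.0 and (i <= mean or term > 1e-17 * total):
        total += term
        i += 1
        term *= mean / i
    return min(total, 1.0)

def hotspot_bins(counts, mean, alpha=0.05):
    """Indices of bins whose eQTL count is unexpectedly high."""
    threshold = alpha / len(counts)  # Bonferroni over bins (assumption)
    return [i for i, c in enumerate(counts)
            if poisson_tail(c, mean) < threshold]

# Hypothetical eQTL counts per bin; the mean is the value quoted above.
flagged = hotspot_bins([9, 10, 8, 60, 11], mean=9.52)
```

Only the bin holding 60 eQTLs is flagged in this toy example.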


Overlap of eQTLs for miRNAs and mRNAs

In order to better compare the locations of miRNA eQTL hotspots and mRNA eQTL hotspots, we recomputed the probabilities of an miRNA eQTL hotspot using 2 cM bins (1 cM is approximately 1000 kb).

The eQTL hotspots for miRNAs and mRNAs on chromosome 13 are <4 cM apart.


Correlation analysis between miRNA and mRNA expression levels in mice

Among the 39 557 mRNAs and 183 miRNAs, we identified 465 646 miRNA–mRNA trait pairs that were significantly correlated at an FDR of 0.1% (P-value <3.98e-4).

A number of miRNAs (hub-nodes) were very broadly connected to tens of thousands of mRNAs.

On average, each miRNA was correlated with ~2545 mRNA transcripts, and every miRNA was correlated with at least one mRNA transcript.
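The connectivity figures follow from simple counting over the list of significant pairs. The sketch below shows that counting on a toy pair list and checks the quoted average (465 646 pairs over 183 miRNAs). The function names and the `min_degree` cutoff for calling a hub are hypothetical.

```python
from collections import Counter

def mirna_degrees(pairs):
    """Number of distinct mRNAs paired with each miRNA."""
    return Counter(mirna for mirna, _ in set(pairs))

def hub_mirnas(degrees, min_degree):
    """miRNAs connected to at least min_degree mRNAs (illustrative cutoff)."""
    return sorted(m for m, d in degrees.items() if d >= min_degree)

# Hypothetical significant miRNA-mRNA pairs (one duplicate on purpose).
pairs = [("miR-a", "g1"), ("miR-a", "g2"), ("miR-a", "g2"), ("miR-b", "g1")]
degrees = mirna_degrees(pairs)
hubs = hub_mirnas(degrees, min_degree=2)

# Average connectivity implied by the reported totals.
mean_degree = 465646 / 183
```

The average works out to about 2544.5 mRNAs per miRNA, consistent with the ~2545 quoted.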


miRNA signature set: compute the seed enrichment levels for each set


The distribution of seed enrichment


Distribution of seed enrichment using the full miRNA–mRNA correlation results.

Distribution of seed enrichment using only positive correlations between miRNA–mRNAs.

Distribution of seed enrichment using only negative correlations between miRNA–mRNAs.

Feedback loops


Enrichment analysis using GO and KEGG


We opted to annotate the miRNA signature sets using only genes that contained at least one 6mer seed region in the 3’ UTR of the gene.


Causal associations between miRNAs and mRNAs

First, we identified all miRNA and mRNA trait pairs linked to a common genomic region at a LOD score threshold of 3.4.

Next, we identified 44 370 miRNA–mRNA trait pairs with closely linked eQTLs (<15 cM).

Causal inference:
(a) causal, where an eQTL for miRNA expression leads to changes in mRNA expression (miRNA targets);
(b) reactive, where an eQTL for mRNA levels leads to changes in miRNA expression (miRNA regulators); and
(c) independent, where an eQTL independently drives miRNA and mRNA levels.
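The pairing step can be sketched as a search for miRNA and mRNA eQTLs on the same chromosome within 15 cM of each other. This is an illustrative sketch only: the record layout `(trait, chromosome, position_cM, LOD)`, the function name, and the choice to treat the 3.4 threshold as inclusive are assumptions.

```python
def linked_pairs(mirna_eqtls, mrna_eqtls, max_distance_cm=15.0, min_lod=3.4):
    """miRNA-mRNA trait pairs whose eQTLs are closely linked.

    Each eQTL is a tuple (trait, chromosome, position_cM, LOD).
    """
    pairs = set()
    for mirna, m_chr, m_pos, m_lod in mirna_eqtls:
        if m_lod < min_lod:
            continue  # miRNA eQTL below the LOD threshold
        for mrna, g_chr, g_pos, g_lod in mrna_eqtls:
            if g_lod < min_lod:
                continue  # mRNA eQTL below the LOD threshold
            if m_chr == g_chr and abs(m_pos - g_pos) < max_distance_cm:
                pairs.add((mirna, mrna))
    return sorted(pairs)

# Hypothetical eQTL records.
mirna_eqtls = [("miR-1", "13", 40.0, 6.2), ("miR-2", "17", 10.0, 3.0)]
mrna_eqtls = [("geneA", "13", 50.0, 5.0), ("geneB", "13", 70.0, 8.0),
              ("geneC", "17", 12.0, 9.0)]
closely_linked = linked_pairs(mirna_eqtls, mrna_eqtls)
```

Each pair found this way would then be passed to the causal, reactive, and independent model comparison.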


Inference Method (Schadt et al, 2005)


Data

BXD mice: F2 offspring from C57BL/6J (B6) and DBA/2J (DBA).

• C57BL/6J: the ob mutation in the C57BL/6J mouse background (B6-ob/ob) causes obesity, but only mild and transient diabetes (Coleman and Hummel, 1973).
• DBA/2J: mice show a low susceptibility to developing atherosclerotic aortic lesions.

Gene expression
• Liver extracted at 16 months of age
• Expression of 23,574 genes measured using Agilent arrays

Genetic loci
• 139 autosomal genetic loci (microsatellite markers, 13 cM)

Disease
• Omental fat pad mass (OFPM) traits (>4)


Model


Models for causality

– Causal Model (M1): L → mRNA → Disease

– Reactive Model (M2): L → Disease → mRNA

– Independent Model (M3): L → mRNA and L → Disease


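M1 Likelihood

• Causal Model: L → mRNA → Disease

L: Genotype, R: mRNA level, D: Disease

– Joint probability

\[ p(L, R, D) = p(L)\, p(R \mid L)\, p(D \mid R), \qquad p(D \mid R, L) = p(D \mid R) \]

– Likelihood

\[ L(\theta \mid M_1) = \prod_{i=1}^{N} \sum_{j=1}^{3} p(L_j)\, L(r_i \mid L_j)\, L(d_i \mid r_i) \]

\[ L(r_i \mid L_j) = \frac{1}{\sqrt{2\pi}\,\sigma_R} \exp\left\{ -\frac{(r_i - \mu_{R \mid L_j})^2}{2\sigma_R^2} \right\} \]

\[ L(d_i \mid r_i) = \frac{1}{\sqrt{2\pi}\,\sigma_{D \mid R}} \exp\left\{ -\frac{(d_i - \mu_{D \mid r_i})^2}{2\sigma_{D \mid R}^2} \right\} \]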


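M2 Likelihood

• Reactive Model: L → Disease → mRNA

L: Genotype, R: mRNA level, D: Disease

– Joint probability

\[ p(L, R, D) = p(L)\, p(D \mid L)\, p(R \mid D), \qquad p(R \mid D, L) = p(R \mid D) \]

– Likelihood

\[ L(\theta \mid M_2) = \prod_{i=1}^{N} \sum_{j=1}^{3} p(L_j)\, L(d_i \mid L_j)\, L(r_i \mid d_i) \]

\[ L(d_i \mid L_j) = \frac{1}{\sqrt{2\pi}\,\sigma_D} \exp\left\{ -\frac{(d_i - \mu_{D \mid L_j})^2}{2\sigma_D^2} \right\} \]

\[ L(r_i \mid d_i) = \frac{1}{\sqrt{2\pi}\,\sigma_{R \mid D}} \exp\left\{ -\frac{(r_i - \mu_{R \mid d_i})^2}{2\sigma_{R \mid D}^2} \right\} \]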


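M3 Likelihood

• Independent Model: L → mRNA and L → Disease

L: Genotype, R: mRNA level, D: Disease

– Joint probability

\[ p(L, R, D) = p(L)\, p(R \mid L)\, p(D \mid R, L) \]

– Likelihood

\[ L(\theta \mid M_3) = \prod_{i=1}^{N} \sum_{j=1}^{3} p(L_j)\, L(r_i \mid L_j)\, L(d_i \mid r_i, L_j) \]

\[ L(r_i \mid L_j) = \frac{1}{\sqrt{2\pi}\,\sigma_R} \exp\left\{ -\frac{(r_i - \mu_{R \mid L_j})^2}{2\sigma_R^2} \right\} \]

\[ L(d_i \mid r_i, L_j) = \frac{1}{\sqrt{2\pi}\,\sigma_{D \mid R}} \exp\left\{ -\frac{(d_i - \mu_{D \mid r_i, L_j})^2}{2\sigma_{D \mid R}^2} \right\} \]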


Model Selection

Likelihood-based Causality Model Selection (LCMS)

– Calculate the likelihood of each model based on the data.
– The model best supported by the data is the one with the smallest AIC (Akaike Information Criterion), where p is the number of parameters:

\[ \mathrm{AIC} = -2 \ln L(\hat{\theta}) + 2p \]
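The selection rule can be illustrated with a simplified version of the three models. This sketch assumes the genotype is observed at a marker and coded additively (0, 1, 2), so the sum over genotype probabilities in the slide likelihoods is dropped, and p(L) is omitted because it is common to all three models. Each conditional is fitted as a Gaussian linear regression. Function names and parameter counts are assumptions of this sketch, not the published implementation.

```python
import math

def _sums(x, y):
    """Centered sums (sxx, sxy, syy) for two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxx, sxy, syy

def rss_one(x, y):
    """Residual sum of squares of the least-squares fit y ~ x."""
    sxx, sxy, syy = _sums(x, y)
    return syy - sxy ** 2 / sxx

def rss_two(x1, x2, y):
    """Residual sum of squares of the least-squares fit y ~ x1 + x2."""
    s11, s12, s22 = _sums(x1, x2)
    _, s1y, syy = _sums(x1, y)
    _, s2y, _ = _sums(x2, y)
    det = s11 * s22 - s12 ** 2
    b1 = (s22 * s1y - s12 * s2y) / det
    b2 = (s11 * s2y - s12 * s1y) / det
    return syy - b1 * s1y - b2 * s2y

def gaussian_loglik(rss, n):
    """Maximized Gaussian log-likelihood with ML variance rss / n."""
    return -0.5 * n * (math.log(2 * math.pi * rss / n) + 1)

def aic_by_model(l, r, d):
    """AIC of M1 (L->R->D), M2 (L->D->R), M3 (L->R, L->D with D | R, L)."""
    n = len(l)
    ll = gaussian_loglik
    loglik = {
        "M1": ll(rss_one(l, r), n) + ll(rss_one(r, d), n),
        "M2": ll(rss_one(l, d), n) + ll(rss_one(d, r), n),
        "M3": ll(rss_one(l, r), n) + ll(rss_two(r, l, d), n),
    }
    # Intercept, slope(s), and variance for each fitted conditional.
    n_params = {"M1": 6, "M2": 6, "M3": 7}
    return {m: -2 * loglik[m] + 2 * n_params[m] for m in loglik}

def select_model(l, r, d):
    """Name of the model with the smallest AIC."""
    aic = aic_by_model(l, r, d)
    return min(aic, key=aic.get)
```

As a usage example, a balanced toy data set with `r = l + e1` and `d = r + e2` (noise terms orthogonal to the genotype and to each other) selects M1, and swapping the roles of `r` and `d` selects M2.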


Simulation: simple regression models


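Simulation study

The model with an AIC significantly smaller than the AICs of the competing models was noted.

[Equations and diagram not recoverable from the transcript: an expression relating each trait T_i to the genotype L, the quantities R^2_{L,T_1}, R^2_{T_1,T_2}, and R^2_{L,T_2}, and a diagram beginning L → T1.]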


Application on real data

(A) predicted regulators; (B) predicted targets; (C) log ratio of the number of predicted regulators over the number of predicted targets.


Simulations

The error in T2 (mRNA) is larger than in T1 (miRNA).

The two trait types come from different platforms (microarray data, qPCR), so the number of predicted causal regulators of miRNAs is likely to be an underestimate of the actual number.


Conclusion

• eQTL (miRNAs, mRNAs)
• correlation analysis
• hub-nodes
• cooperatively
• feedback loops
• Positive correlations between miRNA–mRNAs
• Loci -> mRNA -> miRNA


Thank you ☺