location analysis of transcription factor binding
DESCRIPTION
Location Analysis of Transcription Factor Binding. Tommy Computational Biology Seminar Nov. 2005. Background. Immuno Precipitation ChIP - Chromatin Immuno Precipitation Microarray evolution (from promoter arrays to tiling arrays) ChIP-chip (ChIP followed by microarray hybridization). - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/1.jpg)
Location Analysis of Transcription Factor Binding
TommyComputational Biology Seminar
Nov. 2005
![Page 2: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/2.jpg)
2
![Page 3: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/3.jpg)
3
Background• Immuno Precipitation• ChIP - Chromatin Immuno Precipitation• Microarray evolution
(from promoter arrays to tiling arrays)
• ChIP-chip (ChIP followed by microarray hybridization)
![Page 4: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/4.jpg)
4
Things to do with ChIP chip…
General method for identification of– Target genes of transcription factors– Transcribed genes (Pol II)
– Transcribed miRNAs (Pol II)
– Chromatin states (ABs for modified histones)
– etc. – (any protein (mod AB) that binds DNA)
![Page 5: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/5.jpg)
5
Outline• Kim, Ren et al. Nature (2005)
A high-resolution map of active promoters in the human genome.
• Boyer, Young et al. Cell (2005)Core transcriptional regulatory circuitry in human embryonic stem cells.
• Odom, Young et al. Science (2004)Control of pancreas and liver gene expression by HNF transcription factors.
![Page 6: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/6.jpg)
6
General Transcription Factors (GTFs)
TFIIA2-3 subunits
TFIIE
2 subunits
TFIIB
1 subunit
TFIID
15 subunits
TFIIF 2 subunits
Pol II
12 subunits
TFIIH
9 subunits
![Page 7: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/7.jpg)
7
TSS
Formation of Pre-Initiation Complex
1. Localization at the promoter2. DNA melting, initiation and elongation
TATA BRE
IIA TBP IIBTAFs
Pol. IIIIF
IIE IIH
Core promoter
![Page 8: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/8.jpg)
8
Kim, Barrera, Ren et al. Nature (2005)A high-resolution map of active promoters
in the human genome• Accurate mapping of active promoters in
human fibroblast cells (IMR90)– Active genes– Identify transcription start sites
• DNA microarray of Human genomeNimbleGen 50bp probe every 100bp
• ABs for Pol II preinitiation complex (PIC)• Computational aspects
deconvolution of semi-continuous signal
![Page 9: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/9.jpg)
9
Beware, spoiler!
The Titanic drowns and Leo DiCaprio dies
Kim et al. Map of active promoters
![Page 10: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/10.jpg)
10
Kim, Barrera, Ren et al. Nature (2005)A high-resolution map of active promoters
in the human genome
• Found 12,150 bound regions (promoters)– 10,576 belong to 6,763 known genes– 1,196 un-annotated transcriptional units
• Many genes with multiple promoters• Clusters of active promoters• Four classes of promoters• Many novel genes (RNA genes?)
![Page 11: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/11.jpg)
11
Technicalities• Follows similar work on ENCODE
regionsKim et al, Gen. Res. (2005); ENCODE project, Science (2004)
• Chip design: series of DNA microarrays covering 14.5 million (!) 50bp probes, covering all the human genome*
• IP design: Monoclonal AB to TAF1 (TAFII250) of TFIID
Kim et al. Map of active promoters
* Except for genomic repeats
![Page 12: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/12.jpg)
12
Method• Compare IP to control DNA• Identify stretches of 4 bound probes• Re-check using a new array• Computational detection of 12,150
peaks (Mpeak)• Compare to known genes
(DBTSS, RefSeq, GenBank, EnsEMBL)
• 87% matched 5’ ends of known mRNAs (up to 2.5Kb)
Kim et al. Map of active promoters
![Page 13: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/13.jpg)
13
Kim et al. Map of active promoters
![Page 14: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/14.jpg)
14
Validation of results
• Anti-RNAP AB re-found 97% of bound promoters
• Standard ChIP found 27/28 of randomly selected bound promoters
• Bound promoters are enrichment for known TSS elements
• 97% of promoters had chromatin state of active genes – H3Ac, H3K4Me
Kim et al. Map of active promoters
![Page 15: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/15.jpg)
15
Un-annotated promoters
• 1,597 promoters are ≥ 2.5Kb from 5’ of known genes
• 607 of them match EST• 632 of them are also bound by RNAP and in
the “right” chromatin state– Measure mRNA expression of 567 promoters
(50bp probes at 28Kb around each gene) – 35 new transcription units. Rest unstable?– One located 250bp ups to predicted miRNA
Kim et al. Map of active promoters
possible genes
![Page 16: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/16.jpg)
16
Kim et al. Map of active promoters
![Page 17: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/17.jpg)
17
Un-annotated promoters
• 1,239 putative promoters correspond to novel transcription units.– Evolutionary conserved– Enriched with core promoter motifs
• 1,196 outside current gene annotation(13% of promoters)
Kim et al. Map of active promoters
![Page 18: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/18.jpg)
18
Clusters of active genes
• 256 clusters of ≥4 active genes(1,668 EnsEMBL genes)
• 1609 genes had multiple promoters– Most have the same gene product– Some have different 1st exon– Some undergo different splicing
• All at a single cell type!
Kim et al. Map of active promoters
1
10
100
1000
10000
2 3 4 5
![Page 19: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/19.jpg)
19
Transcription machinaryvs.
Gene Expression• 14,437 genes• IMR90 human fibroblast cells• Compare PIC occupancy to expression
Kim et al. Map of active promoters
![Page 20: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/20.jpg)
20
• Classes I and IV are consistent (75% of genes)
• Class II - PIC is bound, no expression– PIC is assembled but not sufficient for TXN
• Contain immediate response genes (stress)
– mRNA transcribes but degraded (miRNA targets?)
• Class III - Expressed with no bound PIC– Test 10 random genes with ChIP (TFIID, RNAP)
– Nearly 60% were weakly bound
![Page 21: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/21.jpg)
21
Kim, Barrera, Ren et al. Nature (2005)A high-resolution map of active promoters
in the human genome
• Found 12,150 bound regions (promoters)• Many genes with multiple promoters• 1,239 novel genes (RNA genes?)• Clusters of active promoters (chromatin)• Four classes of promoters
![Page 22: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/22.jpg)
22
Kim, Barrera, Ren et al. Nature (2005)A high-resolution map of active promoters
in the human genome
• So what have we learned?
![Page 23: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/23.jpg)
23
Odom, Young et al. Science (2004)Control of pancreas and liver gene
expression by HNF transcription factors
• Diabetes is bad.• Uncover the transcriptional regulatory
network that control insulin secretion.
• Human liver and pancreatic islets• Use ChIP for Pol II and 3 TFs• Measure expression of genes
![Page 24: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/24.jpg)
24
Background
• Transcriptional regulation in the liver– HNF1α (homeodomain)– HNF4α (nuclear receptor)– HNF6 (onecut)
• Same with the pancreatic islets?– All three are require for normal function– Mutations maturity-onset diabetes of the
young (MODY3, MODY1)• Understand normal to explain abnormal
Odom et al. HNF regulation in pancreas and liver
![Page 25: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/25.jpg)
25
MODY• maturity-onset diabetes of the young• Genetic disorder of the insulin-secreting
pancreatic β cells• Onset of diabetes mellitus before 25• Autosomal dominant pattern of inheritance
• Not to confuse with type 2 (late-onset) diabetes– early-onset insulin resistance– functional defects in insulin secretion
Odom et al. HNF regulation in pancreas and liver
![Page 26: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/26.jpg)
26
Pancreas β cell
![Page 27: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/27.jpg)
27
Hepatocyte
![Page 28: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/28.jpg)
28
Method
• Identify targets of three TFs in two tissues• Identify transcribed genes (using Pol II)• Promoter array (13K genes)• -700bp to +200bp relatively to TSS
Odom et al. HNF regulation in pancreas and liver
![Page 29: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/29.jpg)
29
Hepatocyte targets of HNF1α
• 222 genes that represent a substantial section of hepatocyte biochemistry– gluconeogenesis and associated pathways– carbohydrate synthesis and storage– Lipid metabolism
(synthesis of cholesterol and apolipoproteins)
– Detoxification(synthesis of cytochrome P450 monooxygenases)
– Serum proteins(synthesis of albumin and coagulation factors).
Odom et al. HNF regulation in pancreas and liver
![Page 30: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/30.jpg)
30
Pancreas targets of HNF1α
• 106 genes, 30% of which bound in liver• Fewer chaperons and enzymes• Receptors and signal transduction genes vary• Many known targets are missing…
– Stringent criteria– Short promoters
Odom et al. HNF regulation in pancreas and liver
![Page 31: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/31.jpg)
31
Targets• HNF6 binds 227 (1.3%) and 189 (1.45%),
incl. important cell-cycle regulators• HNF4α 1575 (12%) and 1423 (11%)
Odom et al. HNF regulation in pancreas and liver
– Two different ABs– Western blots– Standard ChIP (50)
– Other tissues (17)
– Preimmune ABs bind not– 80% (73%) also bound by PolII.
![Page 32: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/32.jpg)
32
The transcriptome• “It is difficult to determine the transcriptome of these
tissues accurately by profiling transcript levels with DNA microarrays.”
• What is the appropriate reference RNA?• 2,984 (23%) are bound by Pol II in hepatocytes• 2,426 (19%) in islets, 81% of which by both• 80% (73%) of HNF4α are bound by Pol II• Three HNFs cover many of transcribed genes
Odom et al. HNF regulation in pancreas and liver
![Page 33: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/33.jpg)
33
Regulatory network
• Some differences between regulation in the two tissues
Odom et al. HNF regulation in pancreas and liver
![Page 34: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/34.jpg)
34
Regulatory network motifs
![Page 35: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/35.jpg)
35
Multi-component loop• Capacity for feedback control and produce
bistable systems that can switch between two alternate states [Milo et al, 2002]
• The multi-component loop of HNF1α and HNF4α is responsible for stabilization of the terminal phenotype in pancreatic beta cells [Ferrer 2002]
Odom et al. HNF regulation in pancreas and liver
![Page 36: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/36.jpg)
36
Feed-forward loop
• A feedforward loop acts as a switch, sensitive to sustained inputs (rather than transient)
• HNF6 serves as a master regulator for feed-forward motifs in hepatocytes and pancreatic islets
• Involves >80 genes in each tissue
Odom et al. HNF regulation in pancreas and liver
![Page 37: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/37.jpg)
37
Regular Chain motifs
• Regulator chain motifs represent the simplest circuit logic for ordering transcriptional events in a temporal sequence
Odom et al. HNF regulation in pancreas and liver
![Page 38: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/38.jpg)
38
Summary
• HNF4α binds almost half of active genes in the liver and pancreas islets
• Crucial for development and function of these tissues
• Might explain why mutations can increase type II diabetes
Odom et al. HNF regulation in pancreas and liver
![Page 39: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/39.jpg)
39
Boyer, Young et al. Cell (2005)Core transcriptional regulatory circuitry
in human embryonic stem cells• Embryonic stem cells are important
– Can be propagated in undifferentiated state – Can differentiate into >200 unique cell types– Great promise for regenerative medicine
• Reveal transcriptional regulatory circuitry controlling pluripotency and self-renewal.
• Early development and cell identity is controlled by several homeodomain TFs
![Page 40: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/40.jpg)
40
Background• Early development and cell identity is
controlled by several homeodomain TFs• OCT4, SOX2, NANOG have central roles in
maintaining the pluripotency of stem cells• KO of each results with differentiation• Over-expression of OCT4 ~ NANOG KO• Why? Identify targets of each and see…
Boyer et al. Regulation in embryonic stem cells
![Page 41: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/41.jpg)
41
Method• Human H9 embryonic stem cells• Agilent promoter arrays
– 60-mer probes– Spaced at ~300bp– Covering -8Kb to +2Kb relatively to TSS
• Including 98% of TRANSFAC binding sites (Wow!!)– 17,917 genes
• Replicate set of ChIP assay
Boyer et al. Regulation in embryonic stem cells
![Page 42: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/42.jpg)
42
OCT4
Analysis of peaks found:• 623 genes (3%)• 5 miRNAs (3%)
Many known targets:• Mouse ES cells• Expressed in ES
Improved protocol• Better than Odom et al• <1% FPR, 20% FNR
Boyer et al. Regulation in embryonic stem cells
![Page 43: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/43.jpg)
43
SOX2 NANOG1271 genes (7%) 1687 genes (9%)
Boyer et al. Regulation in embryonic stem cells
![Page 44: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/44.jpg)
44
Binding in proximity
• Co-binding suggests that OCT4, SOX2 & NANOG function together
Boyer et al. Regulation in embryonic stem cells
![Page 45: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/45.jpg)
45
Function of TFs
• Checked expression these genes in ES cells (published data)
• 1,303/2,260 genes are active, 957 inactive• Of the 353 tri-bound genes, half active• Active include TFs (OCT4, SOX2, NANOG, STAT3, ZIC3),
components of TGF-β and Wnt pathways• Inactive genes include developmental TFs
(important for differentiation)
• Many other homeodomain TFs
Boyer et al. Regulation in embryonic stem cells
![Page 46: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/46.jpg)
46
Putative regulatory circuitryBoyer et al. Regulation in embryonic stem cells
![Page 47: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/47.jpg)
47
Boyer et al. Regulation in embryonic stem cells
![Page 48: Location Analysis of Transcription Factor Binding](https://reader035.vdocuments.us/reader035/viewer/2022081505/56815c0b550346895dc9ebba/html5/thumbnails/48.jpg)
48
Boyer, Young et al. Cell (2005)Core transcriptional regulatory circuitry
in human embryonic stem cells
• So what have we learned?