bi-directional promoters
Post on 22-Jan-2017
244 Views
Preview:
TRANSCRIPT
BI-DIRECTIONALPROMOTERS
By:Sanju SinhaDate: 21-05-2014
Promoters is a region of DNA that initiates the transcription of a particular gene.
What is Promoter?
Promoter is a Important element for gene regulation.
TSS
Assumptions
• Promoters are Usually conceptualized as upstream of the sequences they promote.
Facts
• Scientist do not really know in which direction promoters usually transcribe or if they only transcribe in one direction or not.
Present-Research
• Their Directions possibilities and parts of promoter which plays role in deciding direction.
Promoter is a Important element for gene regulation.
TSS
On Basis of Directions they can transcribe, Promoters can be classified into two sub-classes-
1.Unidirectional 2.Bi-Directional*
Definition-Bidirectional promoters are short (<1 kbp), intergenic regions of DNA between the 5‘ ends of the genes in a bidirectional gene pair.
Head-To-Head Fashion Alignment
1000 BP1200 BP1500 BP
So Lets Increase This Window
10000 BP12000 BP15000 BPVS
0 200 400 600 800 1000 1200 1400 16000
1000
2000
3000
4000
5000
6000
7000
f(x) = 1251.38499631088 ln(x) − 3205.50840193062
Series1 Logarithmic (Series1)
1000 BP1200 BP1500 BP
So Lets Increase This Window
10000 BP12000 BP15000 BPVS
0 5000 10000 15000 20000 250000
2000
4000
6000
8000
10000
12000
14000
16000
18000
Chart Title
what is the promoter length Distribution then?
Promoter Length Histogram (window =1500)
100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 More0
100
200
300
400
500
600
700
800
Frequency
Bin
Freq
uenc
y
0 5000 10000 15000 20000 250000
2000400060008000
1000012000140001600018000
Chart Title
0 200 400 600 800 1000 1200 1400 16000
1000
2000
3000
4000
5000
6000
7000
f(x) = 1251.38499631088 ln(x) − 3205.50840193062
Series1Logarithmic (Series1)
100300
500700
9001100
13001500
0100200300400500600700800
Frequency
Bin
Freq
uenc
y
On Basis of these Results:
Mean Promoter length is : 335 Median of Promoter length is: 189
Conclusion we can draw:
• Clusters near the gene starting position in range of 189.
• The probability of occurrence of another gene at a distance from one gene first increases exponentially till 335 and then decreases and then saturates tending to constant*.*Not sure as second differentiation is still positive and can even change its concavity.
Visions and Logics to verify data:
•Making a artificial gene distribution like system.•A Cyber-Refgene.txt file.•Using the same tunnel and get the
distribution.• Comparing the Distribution.
1. All Further Data is beingtaken from a review paper.
2. All the sources and platforms are mentioned on last slide.
PART - 2
Is it possible to identify consistent pattern that distinguish Bidirectional and Unidirectional ???
What to Look for …..
GC content
INR
TATA Box
BRE
DPE
CCAAT
Location of different elements of PROMOTER(lacking TATA)
GC content
INR
TATA Box
BRE
DPE
CCAAT
Statistical Results of GC content:Average GC content percentage
Bidirectional: 66%
Unidirectional: 53%
• Bidirectional: 66%• Unidirectional: 53%GC content
INR
TATA Box
BRE
DPE
CCAAT
INR-Initiator element• Functionally similar to TATA box.• Accurate transcription initiation ,
INR btw -3 to +5 is necessary.• Increases the strength of TATA
containing promoters.
Bidirectional: 25.3%
Unidirectional:30.8%
• Bidirectional: 66%• Unidirectional: 53%GC content• Bidirectional: 25.3%• Unidirectional: 30.8%INR
TATA Box
BRE
DPE
CCAAT
TATA box:
Bidirectional: Most of them Lacks.
Unidirectional: Comparatively high.
• Located at -30 both in Unidirec and Bi-direc Promoters
• Bidirectional: 66%• Unidirectional: 53%GC content• Bidirectional: 25.3%• Unidirectional: 30.8%INR• Bidirectional: Lacks Mostly(No
data)• Unidirectional: Have more
frequently
TATA Box
BRE
DPE
CCAAT
BRE(B-recognition element)• Located directly in front of TATA• TFIIB recognizes it and binds.
Bidirectional: 16.5%
Unidirectional: 11%
• Bidirectional: 66%• Unidirectional: 53%GC content• Bidirectional: 25.3%• Unidirectional: 30.8%INR• Bidirectional: Lacks mostly(No
Data)• Unidirectional: Have more
frequently
TATA Box • Bidirectional: 16.5%• Unidirectional: 11.1%BRE
DPE
CCAAT
DPE(Downstream Promoter Element)• Located at +30 position• Binds to common transcription
factor(TFIID) in absence of TATA
Bidirectional: 46.6%
Unidirectional: 50.6%
• Bidirectional: 66%• Unidirectional: 53%GC content• Bidirectional: 25.3%• Unidirectional: 30.8%INR• Bidirectional: Lacks mostly(No
Data)• Unidirectional: Have more
frequently
TATA Box • Bidirectional: 16.5%• Unidirectional: 11%BRE• Bidirectional: 46.6%• Unidirectional: 50.6%DPE
CCAAT
CCAAT box:• Located at 75-80 BP before TSS• signals the binding site for the
RNA transcription factor
Bidirectional: 12.9%
Unidirectional: 6.9%
• Bidirectional: 66%• Unidirectional: 53%GC content• Bidirectional: 25.3%• Unidirectional: 30.8%INR• Bidirectional: Lacks mostly(No
Data)• Unidirectional: Have more
frequently
TATA Box • Bidirectional: 16.5%• Unidirectional: 11.1%BRE• Bidirectional: 46.6%• Unidirectional: 50.6%DPE• Bidirectional: 12.9%• Unidirectional: 6.9%CCAAT
CpG islands:The CpG sites or CG sites are regions of DNA where a cytosine nucleotide occurs next to a guanine
Source:Trinklein, N. D., Aldred, S. H., Hartman, S. J., Schroeder, D. I., Otillar, R. P., and Myers, R. M. (2004) Genome Res.,14, 6266.
• 77% B-DP located in CpG islands compared with 38% of U-DP.
• 90% B-DP located in CpG islands compared with 45% of U-DP. Source: Yang, M. Q., and Elnitski, L. L. (2008) BMC Genom., 9 (Suppl. 2), S3.
Bi-Directional promoters enrich with following specific Binding sites of TF.
• GABPA• MYC• E2F1• E2F4• Nrf1• YY1• NFY• SP1
PART- 3Follow the tunnel
So the first thing is- GC CONTENT
Lets check the GC content-
Wait..Wait..Wait..
Where’s the length of Unidirectional Promoters??
No, We Don’t . But we have some Important values which can help us.
1. Mean length of Bidirectional Promoter.
2. Median Length of Bidirectional Promoters.
3. We Know in Paper they take 1000BP
Comparison of GC content between
Unidirectional (Mean Length)Bidirectional VS Unidirectional (Median Length)
Unidirectional (1000bp Length)
GC content Distribution- Unidirectional_MEAN
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 More0
500
1000
1500
2000
2500
3000
3500
4000
Histogram
Frequency
Bin
Freq
uenc
y
AVERAGE: 54.6%
GC content Distribution- Unidirectional_MEDIAN
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 More0
500
1000
1500
2000
2500
3000
3500
Histogram
Frequency
Bin
Freq
uenc
y
AVERAGE: 56.9%
GC content Distribution- Unidirectional_1000 BP
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 More0
500
1000
1500
2000
2500
3000
3500
4000
4500
5000
Histogram
Frequency
Bin
Freq
uenc
y
AVERAGE: 49.7%
Comparison of GC content between
Unidirectional (Mean Length)Bidirectional VS Unidirectional (Median Length)
Unidirectional (1000bp Length)
GC content Distribution- BIDIRECTIONAL
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 More0
100
200
300
400
500
600
700
800
900
1000
Histogram
Frequency
Bin
Freq
uenc
y
AVERAGE: 64%
Statistical Conclusion:
Unidirectional: 53.7%
Bidirectional: 64%
Average GC content :
NOTE**DATA SOURCE and Platforms1. All the Data mentioned in Slides 17-36 are taken from Review:Bidirectional Promoters in the Transcription of Mammalian Genomes.A. S. Orekhova and P. M. Rubtsov*Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, ul. Vavilova 32, 119991 Moscow, Russia; fax: (499) 1351405; Email: rubtsov@eimb.ru
2. All other data in these presentation belong to Sanju Sinha and he have all rights on those. Any copying without mentioning the relevance source shall be considered as plagiarism.
3.twoBitToFa on linux platform is being used to done the calculations.
4. All coding is being done via Python language.
THANKS
top related