a genome-wide perspective on translation of proteins

39
A genome-wide perspective on translation of proteins Dec 2012 Regulatory Genomics Lecturer: Prof. Yitzhak Pilpel

Upload: halil

Post on 23-Feb-2016

33 views

Category:

Documents


0 download

DESCRIPTION

A genome-wide perspective on translation of proteins. Dec 2012 Regulatory Genomics Lecturer: Prof. Yitzhak Pilpel. Teaching assistant: Idan Frumkin. [email protected] Submit Sunday at midnight. The Central Dogma of Molecular Biology Expressing the genome. RNA. Inactive DNA. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: A genome-wide perspective on translation of proteins

A genome-wide perspective on translation of proteins

Dec 2012Regulatory GenomicsLecturer: Prof. Yitzhak Pilpel

Page 2: A genome-wide perspective on translation of proteins

Teaching assistant: Idan Frumkin

[email protected]

Submit Sunday at midnight

Page 3: A genome-wide perspective on translation of proteins

The Central Dogma of Molecular BiologyExpressing the genome

DNA mRNA Protein

f f

Inactive DNA

RNA

Page 4: A genome-wide perspective on translation of proteins

http://esg-www.mit.edu:8001/esgbio/pge/lac.html

In the presence of Lactose

The Lac Operon (Jacob and Monod)

4

Page 5: A genome-wide perspective on translation of proteins

Catabolism (breakdown of molecules, e.g.

lactose)

Anabolism (synthesis of

molecules, e.g. amino acids)

Gene is ON when substrate is present

Gene is OFF when substrate is absent

Gene is ON when substrate is absent

Gene is OFF when substrate is present

The basic logic of metabolic control

Page 6: A genome-wide perspective on translation of proteins

A combined transcription -translation control switch

At the Attenuation mechanism

Charles Yanofsky

Page 7: A genome-wide perspective on translation of proteins

The trp operon in e. coli

Page 8: A genome-wide perspective on translation of proteins

A negative control at the transcription level (similar and different from the lac operon)

Page 9: A genome-wide perspective on translation of proteins

How not to make too much triptophene?

• A fail safe mechanism complements transcription control

• At the translation level!

Page 10: A genome-wide perspective on translation of proteins

The up-stream ORF structure of the trp operon

An uORF

Mutual palindromes

1-2 are complementary

2-3 are complementary

3-4 are complementary

Page 11: A genome-wide perspective on translation of proteins

The various palindromic pairings

1-2, and 3-42-3

Transcription terminator!

Not aterminator!

Page 12: A genome-wide perspective on translation of proteins

High Trp Low Trp

The structure of the Attenuation switch

RibosomeRibosome

RNA pol

RNA pol

Page 13: A genome-wide perspective on translation of proteins

Could that be implemented in eukaryotes as well?

• No! because requires co transcription-translation

Page 14: A genome-wide perspective on translation of proteins

Where does translation take place?

Page 15: A genome-wide perspective on translation of proteins

Spatial organization of the flow of geneticinformation in bacteria (Llopis Nature 2010)

DN

A

=DNA=mRNA=Protein

Page 16: A genome-wide perspective on translation of proteins

Translation consists of initiation, elongation and termination

5’ 3’STOP

Codon

Anti-codon

Page 17: A genome-wide perspective on translation of proteins

The dynamics of translation

Page 18: A genome-wide perspective on translation of proteins

The ribosome reads nucleotide sequence and produces amino acid sequence based on the

genetic codeSome important properties of the code• The code is (almost)

universal• There are 61 amino acid

codons, and 3 STOP codons

• The code is “redundant” - many amino acids have more than one codon

• The genetic code is optimal wrt to many properties, such as error tolerance

Page 19: A genome-wide perspective on translation of proteins

The tRNAThe generic form A specific form In 3D

Page 20: A genome-wide perspective on translation of proteins

Aminoacyl tRNA synthetase:The really “smart” part

20 amino acids, 61 codons, 20 Aminoacyl tRNA synthetases

Error rate: 1/10,000-1/100,000(in-vitro; higher in-vivo)

Page 21: A genome-wide perspective on translation of proteins

The 20 canonical amino acids

Page 22: A genome-wide perspective on translation of proteins

Possible mechanisms of translational regulation

• optimality of ribosomal attachment site• mRNA secondary structure• codon usage

Page 23: A genome-wide perspective on translation of proteins

Multiple codons for the same amino acid

C1 C2 C3 C4 C5 C6Serine: UCU UCC UCA UCG AGC AGUCysteine: UGU UGCMethionine: UGG

STOP: UAA, UAG UGA

Page 24: A genome-wide perspective on translation of proteins

G T R Y E C Q A S F DC1C1C1C1C1C1C1C1C1C1C1C2C2C2C2C2C2C2C2C2C2C2C1C1C2C1C1C2C1C1C2C1C1C2C2C2C2C1C1C1C1C1C1C1C1C1C1C1C1C1C1C2C2C2C2

For a hypothetical protein of 300 amino acids with two-codon each, There are 2^300 possible nucleotide sequences

These variants will code for the same protein, and are thus considered “synonymous”.

Indeed evolution would easily exchange between themBut are they all really equivalent??

Page 25: A genome-wide perspective on translation of proteins

The codon bias in genomes

Page 26: A genome-wide perspective on translation of proteins

Two potential types of sources for codon bias

Mutation pattern(neutral)

Selection

Codon bias

Page 27: A genome-wide perspective on translation of proteins

The effect of (or on?) GC content

Nucleotide composition

Codon bias

Coding CodingInter-genic

Inter-genic composition (esp in bacteria) explain codon bias

Mutation pressure

Selection Amino acidcomposition

Page 28: A genome-wide perspective on translation of proteins

Selection of codons might affect:AccuracyThroughput

CostsFolding

RNA-structure

Page 29: A genome-wide perspective on translation of proteins

AAA CCA GAA UCG AAG … ……

A simple model for translation efficiency

8 2 5 4 1 Average: 4AA Codon AmountLys AAA 8 Asp AAC 6Lys AAG 1Asp AAUThr ACAThr ACC..Phe UUU

5’ 3’

Page 30: A genome-wide perspective on translation of proteins

The same protein can be encoded in many ways…

amino acid sequence: MPKSNFRFGE

ATG

ATGCCT

ATGCCC

ATGCCA

ATGCCG

most efficient

least efficient

intermediate efficiency

intermediate efficiency

relative concentration of tRNA in the cell

1

0

5

0

Page 31: A genome-wide perspective on translation of proteins

Scoring coding sequences for efficiency in translation

ATC CCA AAA TCG AAT

coding sequence translation efficiency score( (geometric) average of all tRNA gene copy numbers)

… ………

Efficient intermediate non-efficient

10 10 7 2 6tRNAGene copies

(dos Reis et al. Nucleic Acids Res, 2004)

Page 32: A genome-wide perspective on translation of proteins

in

jijiji tRNAsW

1

)1(

Wi/Wmax if Wi0wi = wmean else{

tAIg wikk1

g

1/g

dos Reis et al. NAR 2004

The tRNA Adaptation Index (tAI)

ATC CCA AAA TCG AAT … ……

A simple model for translation efficiency

Wobble Interaction

Page 33: A genome-wide perspective on translation of proteins

Correlation of tAI with experimentally determined protein levelsr=0.63

Predicted translation efficiency

Mea

sure

d pr

otei

n ab

unda

nce

(Ghaemmaghami et al. Nature 2003)

Page 34: A genome-wide perspective on translation of proteins

The correlation is quite high, but why not even higher?

• The limitations of the model• tRNA gene copy numbers • Model only capture elongation• Difference in mRNA levels• Protein are also degraded at different rates

gg

kk

ig wtAI /1

1

Page 35: A genome-wide perspective on translation of proteins

The effective number of codons (Nc) - a measure of overall synonymous codon usage bias

AA...

GlyGlyGlyGly

.

.

.

codon...

GGTGGCGGAGGG

.

.

.

Codon count...0

1200...

Highly biased synonymous codon usage (Nc=20)

Gene1AA...

GlyGlyGlyGly

.

.

.

codon...

GGTGGCGGAGGG

.

.

.

Codon count...3333...

No bias in synonymous codon usage (Nc≥61)

Gene2

Wright, F. (1990). "The 'effective number of codons' used in a gene." Gene 87(1): 23-9.

Page 36: A genome-wide perspective on translation of proteins

Codon usage bias is correlated with translation efficiency

r=-0.79 (p<0.001)

Mutation pattern(neutral)

Selection

Codon bias

Page 37: A genome-wide perspective on translation of proteins

But not in all species(e.g. A. gossypii)

r=-0.48 (p=0.218)

Mutation pattern(neutral)

Selection

Codon bias

Page 38: A genome-wide perspective on translation of proteins

S. cerevisiae S. bayanus C. glabrata A. gossypii D. hansenii C. albicans Y. lipolytica S. pombe

r -0.79 -0.73 -0.79 -0.48 -0.75 -0.65 -0.84 -0.66

p <0.001 <0.001 <0.001 0.218 <0.001 0.005 <0.001 <0.001

Translation selection acts in some but not all species (e.g. debate on human…)

Page 39: A genome-wide perspective on translation of proteins

Correlation does not imply causality!!

r=0.63

Predicted translation efficiency

Mea

sure

d pr

otei

n ab

unda

nce

(Ghaemmaghami et al. Nature 2003)

Evolutionary

Physiological

Z