introduction to molecular biology. g-c and a-t pairing

66
Introduction to Molecular Biology

Post on 21-Dec-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Introduction to Molecular Biology. G-C and A-T pairing

Introduction to Molecular Biology

Page 2: Introduction to Molecular Biology. G-C and A-T pairing
Page 3: Introduction to Molecular Biology. G-C and A-T pairing
Page 4: Introduction to Molecular Biology. G-C and A-T pairing

G-C and A-T pairing.

Page 5: Introduction to Molecular Biology. G-C and A-T pairing

A & G =

Purines

C & T = Pyrimidines

Page 6: Introduction to Molecular Biology. G-C and A-T pairing
Page 7: Introduction to Molecular Biology. G-C and A-T pairing

Important terms:

• Nucleotide Pair = Base pair (bp)

• 1000 base pairs = 1 kilobase pairs (kb)

• 1,000,000 base pairs = 1 megabase pairs (Mb)

• 1,000,000,000 base pairs = Gb?

Page 8: Introduction to Molecular Biology. G-C and A-T pairing

Double-stranded DNA is peeled apart to replicate DNA

Page 9: Introduction to Molecular Biology. G-C and A-T pairing

• The 2 daughter molecules are identical to each other and exact duplicates of the original (assuming error-free replication).

Page 10: Introduction to Molecular Biology. G-C and A-T pairing

• One chromosome is one long, twisted, dramatically compacted DNA molecule.

• The average length of a human chromosome is 130 million b.p.

Page 11: Introduction to Molecular Biology. G-C and A-T pairing

Genes are defined segments of DNA

Page 12: Introduction to Molecular Biology. G-C and A-T pairing

•The information content of the DNA molecule consists of the order of bases (A, C, G, and T) along the length of the molecule.

Page 13: Introduction to Molecular Biology. G-C and A-T pairing

Nucleic Acids

DNA

vs.

RNA

Page 14: Introduction to Molecular Biology. G-C and A-T pairing

• RNA is quite similar to DNA, but usually single-stranded. Both are nucleic acids

Page 15: Introduction to Molecular Biology. G-C and A-T pairing

In RNA, “U” replaces

“T “

Page 16: Introduction to Molecular Biology. G-C and A-T pairing
Page 17: Introduction to Molecular Biology. G-C and A-T pairing

Important Concepts

• DNA and RNA have polarity- each strand has a 5’ and a 3’ end. (The 2 strands of DNA are anti-parallel)

• The common convention is to list only one strand of DNA, in a 5’ to 3’ direction:

5’ AGTCGTAGTCGTAGTCGTAGTCTG3’

(3’TCAGCATCAGCATCAGCATCAGAC 5’)

Page 18: Introduction to Molecular Biology. G-C and A-T pairing

How Genes are Expressed- the Central

Dogma.

Page 19: Introduction to Molecular Biology. G-C and A-T pairing

Transcription=

RNA synthesis

Translation

=

Protein synthesis

Page 20: Introduction to Molecular Biology. G-C and A-T pairing

Eukaryotic transcription operates ‘gene by gene’.

Page 21: Introduction to Molecular Biology. G-C and A-T pairing

One strand of DNA is copied (sense strand); the antisense strand is never

transcribed.

Page 22: Introduction to Molecular Biology. G-C and A-T pairing

Transcription produces an RNA ‘copy’ of a gene (DNA)

• animation

Page 23: Introduction to Molecular Biology. G-C and A-T pairing

Important Term:

• Transcription = RNA synthesis

• Quiz question- how does sequence of mRNA compare to sequence of noncoding strand of DNA?

Page 24: Introduction to Molecular Biology. G-C and A-T pairing

The mRNA are translated in the cytoplasm

Page 25: Introduction to Molecular Biology. G-C and A-T pairing

Three consecutive bases in the mRNA form one codon

No exceptions- the genetic code is a triplet code.

Page 26: Introduction to Molecular Biology. G-C and A-T pairing

tRNA are the ‘bilingual’ molecules

Page 27: Introduction to Molecular Biology. G-C and A-T pairing
Page 28: Introduction to Molecular Biology. G-C and A-T pairing

The genetic code is the codon-amino acid conversion table

Page 29: Introduction to Molecular Biology. G-C and A-T pairing
Page 30: Introduction to Molecular Biology. G-C and A-T pairing
Page 32: Introduction to Molecular Biology. G-C and A-T pairing

The immediate product of translation is the primary protein

structure

Page 33: Introduction to Molecular Biology. G-C and A-T pairing

The primary sequence dictates the secondary and tertiary structure of the protein

Page 34: Introduction to Molecular Biology. G-C and A-T pairing

Important Term:

• Translation = Protein synthesis

Page 35: Introduction to Molecular Biology. G-C and A-T pairing

There are 2 basic types of genes:

• Protein-coding genes:

(DNA mRNA protein)

• RNA-specifying genes:

(DNA tRNA) (DNA rRNA)

(DNA small RNA)

Page 36: Introduction to Molecular Biology. G-C and A-T pairing

Genetic information, stored in DNA, is conveyed as proteins

Page 37: Introduction to Molecular Biology. G-C and A-T pairing

Protein sequences are also represented linearly.

• Each of the 20 amino acid is can be represented by a 3 letter code:

Ser Tyr Met Glu His

In bioinformatics, each of the 20 amino acid is commonly represented by a 1 letter code:

MDETSGHLKPWECVGH . . . . .•

Page 38: Introduction to Molecular Biology. G-C and A-T pairing

Genetic information, stored in DNA, is conveyed as proteins

Page 39: Introduction to Molecular Biology. G-C and A-T pairing

In sickle-cell anemia, one nucleotide change is responsible for the one amino acid change.

Page 40: Introduction to Molecular Biology. G-C and A-T pairing

Sickle-cell anemia is caused by one amino acid change.

Page 41: Introduction to Molecular Biology. G-C and A-T pairing

A single base-pair mutation is often the cause of a human

genetic disease.

Page 42: Introduction to Molecular Biology. G-C and A-T pairing

How to find a gene?*

• One way is too search for an open reading frame (ORF).

• An ORF is a sequence of codons in DNA that starts with a Start codon, ends with a Stop codon, and has no other Stop codons inside.

* = inexact science

Page 43: Introduction to Molecular Biology. G-C and A-T pairing

Each strand has 3 possible ORFs.

5'                                3’ atgcccaagctgaatagcgtagaggggttttcatcatttgagtaa

1 atg ccc aag ctg aat agc gta gag ggg ttt tca tca ttt gag taa M   P   K   L   N   S   V   E   G   F   S   S   F   E   * 

2  tgc cca agc tga ata gcg tag agg ggt ttt cat cat ttg agt  C   P   S   *   I   A   *   R   G   F   H   H   L   S   

3   gcc caa gct gaa tag cgt aga ggg gtt ttc atc att tga gta   A   Q   A   E   *   R   R   G   V   F   I   I   *   V     

Page 44: Introduction to Molecular Biology. G-C and A-T pairing

Eukaryotic Genomes

• Finding a gene is much more difficult in eukaryotic genomes than in prokaryotic genomes. WHY??

Page 45: Introduction to Molecular Biology. G-C and A-T pairing

Prokaryotic (bacterial) genomes:

• Are much smaller than eukaryotic genomes

E. coli = 4,639,221 bp, 4.6 Mb

Human = ~~ 3,300 Mb

• Contain a small amount of noncoding DNA

E. coli= ~ 11%

Human = > 95%

Page 46: Introduction to Molecular Biology. G-C and A-T pairing

Eukaryotic transcripts (mRNA) are processed and

leave the nucleus

Page 47: Introduction to Molecular Biology. G-C and A-T pairing
Page 48: Introduction to Molecular Biology. G-C and A-T pairing

Exon

=

Genetic code

Intron

=

Non-essential DNA ? ?

Page 49: Introduction to Molecular Biology. G-C and A-T pairing
Page 50: Introduction to Molecular Biology. G-C and A-T pairing

• The mechanism of splicing is not well understood.

Page 51: Introduction to Molecular Biology. G-C and A-T pairing
Page 52: Introduction to Molecular Biology. G-C and A-T pairing

Alternate Splice sites generate various proteins isoforms (HGP estimate = 35%)

Page 53: Introduction to Molecular Biology. G-C and A-T pairing

Variable mutation rate?

• Most mutations in introns and intergenic DNA are (apparently) harmless

• Consequently, intron and intergenic DNA sequences diverge much quicker than exons.

Page 54: Introduction to Molecular Biology. G-C and A-T pairing

Bacteria cells are different:

• Prokaryotic cells- No splicing (i.e. – no split genes)

• Eukaryotic cells- Intronless genes are rare (avg. # of introns in HG is 3-7, highest # is 234); dystrophin gene is > 2.4 Mb.

Page 55: Introduction to Molecular Biology. G-C and A-T pairing

How to confirm the identification of a gene?

• Possible answer- Identify the gene by identifying its promoter.

Page 56: Introduction to Molecular Biology. G-C and A-T pairing

Promoters are DNA regions that control when genes are activated.

Promoter

[ ]

Page 57: Introduction to Molecular Biology. G-C and A-T pairing

Exons encode the information that determines what product will be produced.

Promoters encode the information that determines when the protein will be produced.

Page 58: Introduction to Molecular Biology. G-C and A-T pairing

Nucleotides of a particular gene are often numbered:

Page 59: Introduction to Molecular Biology. G-C and A-T pairing

Demonstration of a consensus sequence.

• De

Page 60: Introduction to Molecular Biology. G-C and A-T pairing

How to find a gene?

• Look for a substantial ORF and associated ‘features’.

Page 61: Introduction to Molecular Biology. G-C and A-T pairing

• Two nucleic acids, that are exact complements of each other will hybridize.

• Two nucleic acids that are mostly complementary (some mismatchs) will . . .

. . . hybridize under the right conditions.

Page 62: Introduction to Molecular Biology. G-C and A-T pairing

Recombinant DNA techniques?

• Many popular tools of recDNA rely on the principle of DNA hybridization.

• In large mixes of DNA molecules, complementary sequences will pair.

Page 63: Introduction to Molecular Biology. G-C and A-T pairing

Hybridization ‘in silico’

• Algorithms have been written that will compare two nucleic acid sequences. Two similar DNA sequences (they would hybridize in solution) are said ‘to match’ when software determines that they are of significant similarity.

Page 64: Introduction to Molecular Biology. G-C and A-T pairing

Protein- Protein similarity searches?

• Many algorithms have been designed to compare strings of amino acids (single letter amino acid code) and find those of a defined degree of similarity.

Page 65: Introduction to Molecular Biology. G-C and A-T pairing

Significance of sequence similarity

• DNA similarity suggests:

• Similar function

• Similar structure

• Evolutionary relationship

Page 66: Introduction to Molecular Biology. G-C and A-T pairing

The End