biology tutorial

70
Biology Tutorial Aarti Balasubramani Anusha Bharadwaj Massa Shoura Stefan Giovan

Upload: lev

Post on 23-Feb-2016

35 views

Category:

Documents


0 download

DESCRIPTION

Biology Tutorial. Aarti Balasubramani Anusha Bharadwaj Massa Shoura Stefan Giovan. Viruses. A T4 bacteriophage injecting DNA into a cell. Influenza A virus. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Biology Tutorial

Biology Tutorial

Aarti BalasubramaniAnusha Bharadwaj

Massa ShouraStefan Giovan

Page 2: Biology Tutorial

http://stc/istc.nsf/va_WebPages/InfluenzaEngPrint

Influenza A virus

Electron micrograph of HIV. Cone-shaped cores are sectioned in various orientations. Viral genomic RNA is located in the electron-dense wide end of core.

http://pathmicro.med.sc.edu

A T4 bacteriophage injecting DNA into a cell.

Viruses

Page 3: Biology Tutorial

Life Begins with Cells

Page 4: Biology Tutorial

http://course1.winona.edu/

All cells are Prokaryotic or Eukaryotic

Page 5: Biology Tutorial

Eukaryotic Cell

Endothelial cells under the microscope. Nuclei are stained blue with DAPI, microtubules are marked green by an antibody bound to FITC and actin filaments are labeled red with phalloidin bound to TRITC. Bovine pulmonary artery endothelial cells

Page 6: Biology Tutorial

Nucleus= contains the genetic material

Cell Organelles

Mitochondrion= produces energy

Page 7: Biology Tutorial

http://microbewiki.kenyon.edu/

Endoplasmic Reticulum and Ribosomes=protein factory

Golgi complex=protein distribution

Lysosome=degradation

Page 8: Biology Tutorial

Plasma Membrane

Page 9: Biology Tutorial

DNA Replication

http://www.youtube.com/watch?v=teV62zrm2P0&feature=related

Base PairingA=TCG

Page 10: Biology Tutorial

Life Cycle of a Cell

RNA and protein synthesis

DNA Replication

Resting cells

RNA and protein synthesis

Cell division

Page 11: Biology Tutorial

The Central Dogma of Biology

Replication

Page 12: Biology Tutorial

Transcription

http://www.youtube.com/watch?v=ztPkv7wc3yU

Page 13: Biology Tutorial

Translation

http://www.youtube.com/watch?v=-zb6r1MMTkc

Page 14: Biology Tutorial

Outline• Cellular Biology

– Organelle Structure/Function– Central Dogma

• Biochemistry– Energy Storage/Utilization– Macromolecules

• Bioinformatics – Sequences and Databases– Alignments, Tree Building, Modeling

Page 15: Biology Tutorial

}

}}

Small molecules

Macromolecules

Supramolecular complexes

Cells are Composed of a Molecular Hierarchy

Page 16: Biology Tutorial

BONDS, JUST BONDS

• Covalent – nuclei share common electrons– STRONG!!

• Non-Covalent – No common electrons– WEAK!!

• Ionic• Non-Ionic

http://publications.nigms.nih.gov/chemhealth/images/ch1_bonds.gif

Page 17: Biology Tutorial

Macromolecular Structures are Stabilized by Weak Forces

ForceStrength, kJ mol-1

Effective Range, nm

Van der Waals interactions

Hydrogen bonds

Electrostatic interactions (unscreened)

Hydrophobic interactions

0.4 - 4

4 - 48

20 - 50

<40

0.2

0.3

5 - 50

?

DistanceDependence

6r

3r

1r

?

Page 18: Biology Tutorial

Hydrophobic Interactions

Structures formed by amphipathic molecules in H2O

van Holde, Johnson & Ho Principles of Physical Biochemistry Prentice Hall, Upper Saddle River,

NJ (1998)

Vibrational frequencies of O-H bond of H2O in ice, liquid H2O and CCl4

Page 19: Biology Tutorial

What Is DNA Made of?

3’

5’

Page 20: Biology Tutorial

DNA – The Double Helix

Page 21: Biology Tutorial

  Levels of Chromatin Packing

Page 22: Biology Tutorial

  The Human Genome

Page 23: Biology Tutorial

DNA to Amino Acids

Page 24: Biology Tutorial

Amino Acids – Proteins Building Blocks

Page 25: Biology Tutorial

The Making of a Polypeptide Chain

Page 26: Biology Tutorial

The Four Levels of Protein Structure

Linear arrangement of monomeric unit

Local regular structure

3-dimensional folding of molecule

Spatial arrangement of multiple subunits

Page 27: Biology Tutorial

Single Nucleotide Mutations

Page 28: Biology Tutorial

DNA Mutations

Page 29: Biology Tutorial

Experimental Techniques

Page 30: Biology Tutorial

Restriction Digestion

Page 31: Biology Tutorial

Use of Restriction Digestion to Identify Mutations

(a) Wild-type and mutant DNA sequences

Page 32: Biology Tutorial

Gel Electrophoresis

Page 33: Biology Tutorial

Gel Electrophoresis-Visualizing DNA

Page 34: Biology Tutorial

The Polymerase Chain Reaction (PCR) 

Page 35: Biology Tutorial

Cloning a human gene in a bacterial plasmid

Page 36: Biology Tutorial

Outline• Cellular Biology

– Organelle Structure/Function– Central Dogma

• Biochemistry– Energy Storage/Utilization– Macromolecules

• Bioinformatics– Sequences and Databases– Alignments, Tree Building, Modeling

Page 37: Biology Tutorial

Phenotype Tree BuildingHow Related are Organisms?

What do they eat? Where do they live? How do they divide? Move? Etc.Qualitative

http://nai.arc.nasa.gov/seminars/68_Rivera/tree.jpg

Page 38: Biology Tutorial

Genotype Tree BuildingHow Related are Organisms?

How similar is their genome? Proteome?MOLECULAR EVOLUTION

Quantitative

http://nai.arc.nasa.gov/seminars/68_Rivera/tree.jpg

Page 39: Biology Tutorial

Comparison of Genomes

• 1977- Φ-X174 genome sequenced – Only about 5.4 kbp

• 1997- E. coli K-12 genome sequenced– About 4.6x103 kbp

• 2007- Watson’s Genome sequenced! – About 3x106 kbp!

• About 0.1% difference between human genomes and 1% difference between humans and chimps!

Page 40: Biology Tutorial

Bioinformatics is…

• Highly Interdisciplinary– Proteomics and Genomics– Structural and Computational Biology– Systems Biology– Computer Science, Probabilistic Modeling

• Computational Sequence Analysis – What’s in a sequence?

STRUCTURE FUNCTIONSEQUENCE

Page 41: Biology Tutorial

Power of Prediction

• Can we …– predict structural and functional properties of

proteins given its sequence?– predict the consequences of a mutation?– design proteins or drugs with specific functions?

• Every thing we need to know is at our finger-tips, just need a better understanding of the natural world

STRUCTURE FUNCTIONSEQUENCE

Page 42: Biology Tutorial

• Structure adopted is completely determined by sequence of residues

• Compromise between comfort ( or ) and freedom ()

Protein Structure

F U TSG H TS

http://www.news.cornell.edu/stories/Aug06/protein_folding.jpg

Page 43: Biology Tutorial

Secondary Structure Prediction

• 2o structures form beneficial H-bonds (lower E)• -helices, -sheets• Dihedral angles (,)

Source: Wikipedia

Page 44: Biology Tutorial

Tertiary Structure Prediction

• Homology/Comparative Modeling– BEST– Structure of very related protein is known

• Fold Recognition/Threading– OFTEN IS ENOUGH– Similar folds available but no close relative

• Knowledge Based or A Priori Predictions– ONLY POSSIBLE FOR VERY SHORT PROTEINS– Fold prediction but without experimental quality

Page 45: Biology Tutorial

Sequence Alignments

• FASTA Text Format >header – my sequence >header – my thesis THISISMYSEQ THESISTHYSTING• Alignment

T H I S I S – M Y S E – Q – T H E S I S T H Y S T I N G

• What can we learn from this?

Page 46: Biology Tutorial

Alignments

• Pairwise– Dot Plot– Global(N-W) or Local(S-W)

• Simple Database Searches– FASTA/BLAST

• Multiple Alignments– CLUSTAL

• Advanced Strategies– PSI/PHI-BLAST, HMM’s

Dot plot of two subunits inHuman Hemoglobin

Alpha Chain

Beta

Cha

in

Page 47: Biology Tutorial

Databases

• Nucleotide Sequence Database Collaboration– DDBJ, EMBL, GenBank at

NCBI• Amino Acid Databases

– UniProt, SWISS-PROT, TrEMBL

• Structural– PDB, MMDB, MSD

• Very Many Derivations!http://www.ncbi.nlm.nih.gov/Database/

Page 48: Biology Tutorial

Scoring Matrices

• PAM Matrix : Point Accepted Mutation– PAM1 estimates substitution rate if 1% of AA had

changed. Standards: PAM30 and PAM60• BLOSUM : BLOcks of Amino Acid SUbstitution

Matrix– BLOSUM80 “blocks” together sequences with

greater then 80% similarity.

More DivergentLess DivergentPAM1BLOSUM80

PAM250BLOSUM45

Page 49: Biology Tutorial

FASTA and BLAST

• FASTA - FAST All, Rapid AA or NT Alignments• BLAST – Basic Local Alignment Search Tool• Scoring Alignments

– Raw and Bit Scores;

– Significance of Local Alignment;

– Significance of Global Alignment; x uZ

'2 SE mn

ln'ln 2S KS

Page 50: Biology Tutorial

Nucleotide Sequence Distances

• Jukes-Cantor, single parameter

• Kimura, 2 parameter

3 4ln 14 3

d p

1 1 1 1ln ln2 1 2 4 1 2

dp q q

A C

G T

A C

G T

Page 51: Biology Tutorial

Distance Based Tree Building

• Tree Building => UPGMA– Smallest distance element -> nearest neighbors

1 2 120.5t t d 1- 2 0.1- 3 0.8 0.8- 4 0.8 1 0.3- 5 0.9 0.9 0.3 0.2-

1 20.050.05

1 2

345

Page 52: Biology Tutorial

Distance Based Tree Building

• Tree Building => UPGMA– Smallest distance element -> nearest neighbors

6(1,2) - 3 0.8 - 4 0.9 0.3 -5 0.9 0.3 0.2 -

1 2

1 2

345

4 5

4 5 450.5t t d

0.10 0.106

Page 53: Biology Tutorial

Distance Based Tree Building

• Tree Building => UPGMA– Smallest distance element -> nearest neighbors

6(1,2) - 3 0.8 -

7(4,5) 0.9 0.3 -

1 2 4 5

3 370.5t d

1 2

345 6

3

0.15 7

Page 54: Biology Tutorial

Distance Based Tree Building

• Tree Building => UPGMA– Smallest distance element -> nearest neighbors

6(1,2) - -8(3,4,5) 0.85 -

1 2 4 5

6 680.5t d

1 2

345 6

3

0.425

7

89

Page 55: Biology Tutorial

Distance Based Tree Building

• UPGMA is efficient but makes non-biological assumption that rate of substitution is constant for all branches– Useful in a variety of applications such as microarray

data processing• Neighbor-Joining does not make this assumption

and is still efficient– More accurate for use in phylogenetic analyses

• Also -> Maximum Parsimony, Maximum Likelihood, Minimum Evolution, and Bayesian methods

Page 56: Biology Tutorial

Energy Calculations

• Goal: Find Unique Arrangement of Atoms which Maximizes Stability

• Experimental (usually X-ray or NMR)• Monte Carlo

– Explore states – Let T->0 and discover low energy states (Simulated Annealing)

• Molecular Dynamics– Newtonian mechanics to evolve the system

Page 57: Biology Tutorial

Molecular Mechanics

E K V

221 1

2 2i

i ii ii

pK mvm

E : Total energyK : Kinetic energyV : Potential energy

iv: Velocity of particle iip: Momentum of particle i

i

i

i

Vx

Vi i y

Vz

F V

: Force acting on particle i (gradient of potential energy)iF

, ,i bonding i nonbondiV V V Sum of covalent and

noncovalent interactions

Page 58: Biology Tutorial

Fold It!!

http://fold.it/portal/info/science

FOLD IT

Page 59: Biology Tutorial

Pairwise Alignment

• Dot Plot– Visual and Qualitative

• Needleman-Wunsch Global Alignment– Alignment over entire

sequence• Smith-Waterman Local

Alignment– Alignment over sub-

sequenceshttp://lectures.molgen.mpg.de/Pairwise/DotPlots/

Dot plot of two subunits inHuman Hemoglobin

Alpha Chain

Beta

Cha

in

Page 60: Biology Tutorial

N-W Alignment

• Produces Optimal Global Alignment – Without exhaustive pairwise comparison

• Scoring Matrix, S• Simple scoring matrix for these

sequences• Matches get a score of +1• Mismatches (blank) get a score of -2

• One could also use BLOSUM or PAM scoring matrix for example

F M D T P L N EF 1KHM 1E 1D 1P 1L 1E 1

Page 61: Biology Tutorial

N-W Alignment

• Produces Optimal Global Alignment – Without exhaustive pairwise comparison

• Alignment Matrix, FF M D T P L N E

0 -2 -4 -6 -8 -10 -12 -14 -16F -2 +1K -4H -6M -8E -10D -12P -14L -16E -18

1, 1

1,

, 1

maxi j kl

ij i j

i j

F SF F gap

F gap

Match always results in largest , else take the largest score from • mismatch,• gap in sequence 1 , or• gap in sequence 2 .

Page 62: Biology Tutorial

N-W Alignment

• Produces Optimal Global Alignment – Without exhaustive pairwise comparison

• Build Scoring Matrix, FF M D T P L N E

0 -2 -4 -6 -8 -10 -12 -14 -16F -2 +1 -1 -3 -5 -7 -9 -11 -13K -4 -1 -1H -6M -8E -10D -12P -14L -16E -18

1, 1

1,

, 1

maxi j kl

ij i j

i j

F SF F gap

F gap

Page 63: Biology Tutorial

N-W Alignment

• Produces Optimal Global Alignment – Without exhaustive pairwise comparison

• Build Scoring Matrix, FF M D T P L N E

0 -2 -4 -6 -8 -10 -12 -14 -16F -2 +1 -1 -3 -5 -7 -9 -11 -13K -4 -1 -1 -3 -5 -7 -9 -11 -13H -6 -3 -3 -3 -5 -7 -9 -11 -13M -8 -5 -2 -4 -5 -7 -9 -11 -13E -10 -7 -4 -4 -6 -7 -9 -11 -10D -12 -9 -6 -3 -5 -7 -9 -11 -12P -14 -11 -8 -5 -5 -4 -6 -8 -10L -16 -13 -10 -7 -7 -6 -3 -5 -7E -18 -15 -12 -9 -9 -8 -5 -5 -4

Overall alignment score

1, 1

1,

, 1

maxi j kl

ij i j

i j

F SF F gap

F gap

Page 64: Biology Tutorial

N-W Alignment

• Produces Optimal Global Alignment – Without exhaustive pairwise comparison

• Trace Back to Determine Optimum Alignment F M D T P L N E

0 -2 -4 -6 -8 -10 -12 -14 -16F -2 +1 -1 -3 -5 -7 -9 -11 -13K -4 -1 -1 -3 -5 -7 -9 -11 -13H -6 -3 -3 -3 -5 -7 -9 -11 -13M -8 -5 -2 -4 -5 -7 -9 -11 -13E -10 -7 -4 -4 -6 -7 -9 -11 -10D -12 -9 -6 -3 -5 -7 -9 -11 -12P -14 -11 -8 -5 -5 -4 -6 -8 -10L -16 -13 -10 -7 -7 -6 -3 -5 -7E -18 -15 -12 -9 -9 -8 -5 -5 -4

Seq1: F K HME D- P L - ESeq2: F - - M- DT P L NE

Match or MismatchGap in Sequence 1Gap in Sequence 2

Page 65: Biology Tutorial

Smith-Waterman Alignment

• Local alignment, Similar in Nature to N-W– S takes only non-negative values– Highest value in matrix corresponds to end of

alignment, need not be in corner– No penalty for gaps at ends

• Most rigorous method of aligning nucleotide or protein sequence domains

Page 66: Biology Tutorial

Database Searches

• Optimal pairwise alignment produced by S-W, but insufficient in scanning databases

• Scan for likely matches before performing more rigorous alignments– FASTA, BLAST

• Scan for words scoring higher than some threshold, extend alignment until score drops

Page 67: Biology Tutorial

Advanced Database Searches

• When BLAST falls short– Detecting homology between distantly related

proteins– Very long (>20kbp) genome sequences with highly

conserved regions and highly variable regions• PSI-BLAST (Position-Specific Iterated)

– BLAST generates Position Specific Scoring Matrix– PSSM used as query to re-search database

• Also, PHI-BLAST, HMMs…

Page 68: Biology Tutorial

Multiple Sequence Alignments

• Exact Approaches– e.g. N-W alignments– Prohibitive for many or long sequences

• Progressive Approaches– e.g. CLUSTAL

• Iterative Approaches• Consistency-Based Approaches• Structure-Based Methods

Page 69: Biology Tutorial

Distance Between Sequences

• Based on theory of molecular evolution

• Simplest method, Hamming distance, • Multiple substitutions at single site?• Poisson correction,

– Assume: Probability of observing a change is small, but constant across all sites

– Rate of mutation is constant over time– Mutations at different sites occur independently

100d p

ln 1d p

differences distances

Page 70: Biology Tutorial

James Watson, Francis Crick and Rosalind Franklin