2006 lsa lecture 1 fall 2006 - harvard universitysites.fas.harvard.edu/~lsci1a/11-21notes.pdf ·...
TRANSCRIPT
2 I. Perpetual Molecular Warfare: Viral Evolution in HIV
This section of the course has four goals. The first is to bring story of AIDS and HIV to an end by
telling why the initial application of the drugs that David described to you failed and use this failure to
introduce you to the connection between the chemical reactions that make up DNA replication and the
process of evolution. The second is to use the initial beneficial effect of the anti-HIV drugs to introduce
you to the difference between steady states and equilibrium, two different ways that the properties of
biological systems can become constant over some period of time, the third is to discuss how the central
dogma of DNA makes RNA makes protein evolved, and the fourth is to briefly explain the evidence for
evolution by natural selection and to explain its connection to AIDS and cancer, the two diseases that
this course focuses on.
Slide 1: 2I: Disease and Evolution: An Overview
Slide 2: Learning objectives
At the end of this section you should understand,
1) How mutations can arise, and how beneficial ones can alter populations.
2) Why steady states requires the continual input of energy, as distinct from equilibria.
3) How altering the kinetics of processes can greatly alter steady states.
4) Current thoughts on the evolution of the central dogma.
5) How DNA and protein sequences provide the best evidence for Darwinian evolution.
6) That evolution can be observed within individual patients in two human diseases, AIDS and cancer.
1: NATURAL SELECTION AND EVOLVING HOST-PATHOGEN INTERACTIONS IN AIDS
Resistance appears rapidly in patients treated with single anti-viral drugs
The discovery of inhibitors of HIV protease promised help for AIDS patients. But although drug
treatment did decrease the levels of virus in patients circulation and restore their immune systems, these
changes were only temporary. Within 12 months the T cell counts of half the treated patients had fallen,
and viral levels in their circulation had begun to rise. Isolating the virus from their blood and studying
its behavior in cell culture allowed scientists to ask whether the change had occurred in the virus or the
patient. The result was clear. The virus from the drug-treated patients had changed to become resistant
to the drugs; adding protease inhibitors to infected cells no longer reduced the amount of virus they
produced. Biochemical studies showed the protease was now more resistant to the inhibitors when its
activity was assayed in a test tube. Because the change has occurred in the protease gene of the virus,
the genetic material of the virus has been permanently altered and we refer to the change as a mutation.
The ability of these mutant viruses to take over the population is an example of evolution in action
Slide 3: Mutations make HIV drug-resistant
Unfortunately for patients, these mutations arise surprisingly rapidly, appearing roughly once for every
30,000 times a virus infects a cell and is copied into DNA. Because at least this many new cells are
infected each day, an individual cell carrying a resistant virus arises every day. As we will see later,
HIV is extremely careless with how it copies DNA into RNA compared to the care our cells take when
they replicate our DNA.
Slide 4: Types of mutation
Each mistake is a mutation. If the mutation is in the part of the viral genome that encodes one of the
viral proteins, it has a high chance of changing the amino acid sequence of the protein and thus altering
the protein’s three dimensional structure and chemical activities. The effects of the altered amino acid
sequence can be divided into three classes. Deleterious mutations impair or destroy the protein’s
function, neutral mutations have no effect on the protein’s function, and beneficial mutations improve at
least one aspect of the protein’s function. The most severe deleterious mutations kill the organisms that
possess them and are called lethal mutations. The distinction between deleterious, neutral, and
beneficial mutations is illustrated on this slide in a context that is directly relevant to you. Deleterious
and beneficial mutations are roughly equally common and both are far more frequent than advantageous
mutations.
If advantageous mutations are so much rarer than neutral or deleterious ones, why are they so important.
The answer is simple, the advantageous mutations confer a benefit to the viruses that have them,
whereas the neutral ones confer neither cost not benefit, and the deleterious ones impose a cost. Viruses
that have deleterious mutations do less well than the wild type virus and are quickly outcompeted and
eliminated from the viral population. Viruses with neutral mutations do as well as wild type and the
details of what happens to them lie beyond the scope of this course. The summary is that the vast
majority of neutral mutations are eliminated by chance rather than by selection, but over a very long
period a very tiny fraction of them go on to coexist with and occasionally replace the original wild type
virus.
Slide 5: Beneficial mutations take over populations
To see why advantageous mutations matter, imagine a mutant so strong that infected with the mutant
virus produces twice as many infectious virus particles as cells infected with the wild type virus in
patients treated with protease inhibitors. Thus in every round of cellular infection the mutant virus
enters twice as many cells as the wild type one. If we start from a single mutant viral genome, in a
patient with a billion infected cells, it takes about 30 rounds of infection for the mutant virus to
extinguish the wild type one, since the fraction of mutant virus goes from one in billion to two in a
billion in the first round of infection from two in a billion to four in a billion in the second round from
four in a billion to eight in a billion in the third and so on. Even after 20 rounds of viral replication and
cellular infection, the mutant virus accounts for only one a thousand viruses and even by the 25th round
that it accounts for only 3% of the patient’s virus. After that, it rapidly takes over. The fraction of the
viruses that are mutant has been doubling in every generation since its origin, but in large populations,
the fraction of mutant virus only becomes noticeably large after many doublings.
Mutations affect the structure and activity of proteins
Slide 6: HIV protease mutants
This slide shows how HIV’s protease can mutate to become resistant to inhibitors. The slide shows two
versions of the sequence of the HIV protease, depicted using a one letter code in which 20 different
letters of the alphabet represent the 20 different amino acids. The first is the wild type form of the
enzyme, the second a mutant form isolated from an individual who had become resistant to inhibitor
therapy. The mutant contains two amino acid changes relative to the wild type and together they have
two effects: they help the virus by reducing the sensitivity of the enzyme to the inhibitor, but they hurt
by reducing its catalytic activity. But because these mutations interfere more with drug binding than
they do with protease activity, they help HIV circumvent therapy.
Slide 7: HIV protease mutants reduce inhibitor binding
This slide shows the structure of the protease, which is a dimer made up of two identical proteins. The
course of the polypeptide chain is shown as the ribbon of white lines, the catalytic aspartates whose
function David explained to you are shown in green (and there are two such residues because the protein
is a dimer), the inhibitor, saquinavir, is shown in purple, and the two amino acids which have been
mutated are shown in yellow and blue. Note that one of them touches the inhibitor, but the other is far
away showing how perturbations of the protein that are far from the site of substrate binding and
catalysis can have important effects on the protein’s enzymatic activity.
Mutations due to mistakes in DNA synthesis and defenses against them
We now consider the details of how the mutations appear. As Rob told you, when HIV infects cells, its
RNA genome is copied into DNA. This slide reminds you about the basics of this process and reverse
transcriptase, the enzyme that does the copying by starting at one end of the RNA strand and copying
the information all the way down to the other to create the first DNA strand, and then using this new
strand as the template for synthesizing a second to produce a double stranded DNA molecule that is
ultimately integrated into the host cell’s genome. Reverse transcriptase makes occasional mistakes and
puts the wrong base into the DNA (shown as a star in the figure). These mistakes are mutations and will
appear in every RNA molecule that is copied from the viral DNA that contains the initial mistake and
when these RNA molecules are incorporated into viruses they will transmit this mutation to the cell they
infect, and so on.
Slide 8: Reverse transcriptase copies HIV RNA into DNA
Why is HIV’s reverse transcriptase so sloppy compared to the DNA polymerases that replicate our own
DNA? To answer this question we must go back to David’s lectures on the structure of DNA and the
enzymes that replicate it. When reverse transcriptase copies the viral RNA into DNA it is like DNA
polymerase taking one of the two strands of a DNA duplex and using it as the template to produce the
new strand of the daughter double helix. Reverse transcriptase and DNA polymerase both use an
existing nucleic acid strand as a template to synthesize a new DNA strand, but the template is RNA for
reverse transcriptase and DNA is the template for DNA polymerase. In each case, an A in the template
strand tells the polymerase to put a T in the new strand it is synthesizing, a template G tells polymerase
to insert a C, and so on.
Slide 9: A rare isomer of guanine pairs with thymine instead of cytosine
Mutations arise, because at a low frequency the wrong two bases can pair with each other. One reason
that this can happen is that all of the bases can exist in more than one form, an example of what is
referred to as a constitutional isomer, an alternative molecule that contains the same number and same
type of atoms but with the atoms connected to each other in a different way. The rare forms account for
0.1 to 1 % of each base, which gives them the potential to cause many mutations in each newly
polymerized strand of DNA. As an example, this slide shows the normal and the rare form of G. Each
has the same chemical formula, but the position of the atoms differs, with affected atoms being indicated
in red, and you can think of the change as the movement of a hydrogen atom from a nitrogen to an
oxygen atom. The rare form of G fails to base pair with its normal partner C and base pairs with T
instead, with the disastrous consequence that a T gets inserted into the newly synthesized DNA strand
instead of a C. The DNA molecule now contains a G:T base pair and if nothing else happens, when it
replicates it will produce two daughter molecules, one with the correct G:C base pair and the other with
a mutant A:T base pair.
Slide 10: Abnormal base pairing causes mutation
During DNA replication there are two defenses against putting in the wrong base as the DNA
polymerase makes the new strand. The first is called proof-reading, and it means that after inserting
each new base, the DNA polymerase goes back and checks whether it has put the right base in. If it
hasn’t, it cuts the errant base out and tries again. The second is called mismatch repair and David
referred to it when he discussed DNA replication. The basic idea is simple: an abnormal base pair like
the G:T one we discussed above leaves a tell-tale bulge in the DNA helix and the proteins of the
mismatch repair system detect these bulges and cut out the mistakenly incorporated base allowing DNA
polymerase to try again.
Slide 11: Defenses against mutation
These three lines of defense against error illustrate an important biological principle. Many biological
processes work so supremely well that it seems hard to believe that they could have been put together by
gradual improvement. But if we study them carefully we can see the traces of the progressive
refinement that comes from taking an initially error-prone way of doing things and then adding several
layers of checking machinery that catch and then rectify mistakes. In the case of DNA replication, we
will see that all the layers of checking and correction are crucial to preventing cancer in humans, which
contain 10 trillion cells and go through 10,000 trillion rounds of DNA replication in their 70 year life
span.
For HIV, the tragedy is that none of the methods for finding and fixing errors are used; reverse
transcriptase does not proof-read each newly incorporated base as it copies RNA into DNA, and the
mismatch repair system doesn’t work on the DNA strand that is being copied from RNA. The result is
that during reverse transcription of the viral RNA into DNA, the chance of making a mutation at any
given base is roughly 1 in 30,000, whereas for the replication of cellular DNA it is somewhere between
one in a billion and one in 10 billion.
It is HIV’s high rate of mutation that makes AIDS such a difficult disease to cure. It allows the virus to
mutate so that if can infect different cells in the body, explaining how the virus can start by infecting T
cells and then later infect macrophages or vice versa, as Rob discussed. It allows the virus to mutate so
that it becomes resistant to what were originally extremely potent inhibitors of its reverse transcriptase
and protease. And finally, the high rate of mutation has made an effective vaccine against HIV elusive,
because we need to find a part of the virus so crucial to its function that any mutation that allows the
virus to escape from the vaccine-induced immune response will also kill the virus.
It is HIV’s high rate of mutation that makes AIDS such a difficult disease to cure. It allows the virus to
mutate so that if can infect different cells in the body, explaining how the virus can start by infecting T
cells and then later infect macrophages or vice versa, as Rob discussed. It allows the virus to mutate so
that it becomes resistant to what were originally extremely potent inhibitors of its reverse transcriptase
and protease. And finally, it has made an effective vaccine against HIV elusive, because we need to find
a part of the virus so crucial to its function that any mutation that allows the virus to escape from the
vaccine-induced immune response will also kill the virus.
Fighting on two fronts: How combination therapies hold AIDS in check
Mutations in the genome of HIV play two important roles in AIDS. They can alter the cell preference of
the virus and they can interfere with therapy. How can we overcome the rapid mutation of HIV to drug
resistance so that we can cure patients? The answer is familiar to students of military history: attack on
two fronts. For HIV, the two fronts are the reverse transcriptase that copies the viral genome into DNA
as it enters new cells and the virally encoded protease that clips the original polypeptides that are
translated from the virus’s mRNA into the mature proteins. In both cases, scientists have developed
drugs that bind tightly to the active site of the enzymes, preventing them from binding their natural
substrates and Rob and David have described these drugs and their mechanisms of action to you.
Combination therapies attack both targets, the reverse transcriptase and the protease, at once. Patients
are treated with a mixture of two drugs, one inhibiting the protease and the other reverse transcriptase.
Mutations that confer resistance to either drug don’t help, because the other drug still blocks viral
replication. In order to escape the virus has to become resistant to both drugs at the same time.
Slide 12: Combination therapy makes mutating to resistance harder
We can make a rough estimate of how likely such an event is. Assume that we need mutation at one
position in the viral genome to give resistance to protease inhibitors and at another position to give
resistance to reverse transcriptase inhibitors. Each mutation arises at a frequency of 3 x 10–5, so the
frequency of getting both mutations at once is 3 x 10-5 x 3 x 10-5 ≈ 10-9 per round of viral replication.
This means that resistance will arise once in every trillion copyings of the viral RNA into DNA.
Is this rare enough and can combination therapy cure patients to the point where they can stop taking the
drugs? The answers are almost but not quite and no, and we will now discuss both of them in more
detail. We begin with the first and start by using an analogy to games of chance. If you are throwing
one dice, and need to throw a six to stop the game, your chance of doing so on the first throw is 1/6.
Using the laws of probability you can calculate the chance that you will have thrown a six at least once
in any given number of throws, thus constructing the graph that is shown on the slide. It reveals that by
the time you’ve thrown the dice 30 times you are almost sure to have rolled a six. Now imagine that you
have two dice, that you throw them together, and that both must come up six in order to stop the game.
The chance of this happening in the first throw is simply the chance of the first dice showing six time the
chance that the second dice shows six, that is 1/6 x 1/6 = 1/36. Now it take much longer on average to
end the game, and even after 100 throws there is some reasonable chance that you have failed to throw
two sixes.
Slide 13: Mutating once versus mutating twice: a dice analogy
We can make an exactly similar calculation for the generation of singly and doubly mutant viruses, and
focus on combination therapy where we need two mutations to happen at the same time and place. For a
single infected cell, this is the chance of mutating to resistance to protease inhibitors times the chance of
mutating to resistance to reverse transcriptase inhibitors. We have already calculated this chance as
being 9 x 10-10 or roughly one in a billion. For a patient, we need to know what the chance of this event
happening anywhere in their body is. To know the chance that a doubly resistant virus appears on a
given day, we multiply the probability that the infection of one cell leads to resistant virus by the
number of new cells that are being infected each day. This is like our dice analogy in the sense that we
are requiring two events to happen at once, and differs only because we are now doing the equivalent of
shaking the dice many time each day.
Slide 14: HIV can become doubly drug resistant
The graph shows the fraction of patients who will have generated doubly resistant viruses against the
time of treatment in months. One of the curves is for patients where 1,000,000 new cells are infected
every day, despite the combination therapy, and the other is for patients where 100,000 new cells are
infected each day. Both numbers are miniscule compared to the roughly a billion cells which are being
infected each day in an untreated patient, but the two curves make it clear that the difference between
100,000 and 1,000,000 cells being infected every day can be the difference between life and death.
Many factors determine this difference, some known and some unknown. Three are prominent:
different patients do a better or worse job of managing to stick to a demanding and long-term schedule
for when they take their medications, in some patients the drugs cause toxic side effects limiting the
doses that they can tolerate, and some patients metabolize and eliminate the drugs faster than others.
The last two relate to two important issues with medical drugs, toxicity, the need to do as much as
possible to make sure that the drugs do not have dangerous and unintended effects as well as treating the
patient’s illness, and pharmacokinetics, which David explained to you dealt with the metabolism of
drugs after they have been given to patients. In both cases, scientists and doctors have come to realize
that there are substantial differences between individuals in both drug toxicity and pharmacokinetics,
and that some of these differences are due to genetic differences between individuals in the human
population, a topic that you will hear more about if you take LS1b.
2 STEADY STATES PREVAIL IN BIOLOGY
There are two reasons why combination therapy can’t eliminate the virus from HIV-infected patients.
First, and most importantly, the drugs don’t kill infected cells and any new cells they divide to form.
Because the viral genome has been copied into DNA and integrated into the chromosomes of these cells,
they can only lose their potential to secrete virus when they die, either because the virus becomes
activated and kills the cell or the cell reaches the end of its natural life. Since some of the cells are
memory cells of the immune system, they can persist for years in a quiescent state in which they do not
grow, proliferate, or express the viral genome they contain. The second drawback is that although the
drugs enormously reduce the rate at which new cells are infected, they do not eliminate new infection,
so that the virally infected cells that are dying are partially offset by new cells that are being infected.
Slide 15: Steady states prevail in biology
These two features mean that you cannot quit combination therapy. If you do, the combination of the
low level of virus in your blood, and the ability of already infected cells to make new virus leads to a
dramatic rise in the frequency at which new T cells get infected and patients can progress to full-blown
AIDS. What this tells us is that combination therapy has created a state of détente in HIV-positive
patients; the level of virus in the blood and the number of infected cells has fallen dramatically but even
at these low levels, a new cell is infected for every infected cell that dies, meaning that the virus can
never be eliminated.
Slide 16: Equilibrium revisited
This stand off is an example of something that we see again and again in biology, a balance, or steady
state being set between two opposing forces. These stand offs are often referred to as equilibria, but this
is wrong. In its strict sense, and as David explained, equilibrium refers to a situation in which no net
chemical change is occurring because the backwards and forwards rate of a process balance and no
energy is being consumed or released. This slide shows a hypothetical reaction between A and B to
make C and D. The rate constants for the forward and backwards reactions are both 1, meaning that the
equilibrium constant is 1. The graph shows four different quantities over time:
1) The fall in the concentration of A.
2) The rise in the concentration of C
3) The rate at which C is being formed
4) The free energy change for the reaction
All of these quantities change quickly at first, then fall more slowly, and eventually become constant. In
particular at the end of the reaction the concentrations of A and C are equal, the rate at which new C is
being formed has become zero, and the free energy change has fallen to zero. These last two are crucial
points and you should remember them from David’s lectures. At equilibrium, the rates of these the
forwards and backwards reactions must be equal, there can be no net expenditure or consumption of
energy, and ∆G under these conditions is zero.
In biology, a steady state implies three things:
1) That the rates of two opposing processes are exactly equal to each other (as they are at an
equilibrium).
2) The backwards and forwards reactions take different chemical paths and neither is close to
equilibrium.
3) Cells must harvest energy from their environment to keep these reactions from approaching
equilibrium.
Slide 17: Rates of infection and death balance
The progress of AIDS and the response to combination therapy provide an interesting example of a
steady state. When a patient is infected, an acute phase where virus levels are very high is followed by a
long latent phase during which the level of T cells and virus in the blood is substantially lower than it is
during the acute phase. We now describe how this state is reached by considering the balance between
the rate at which cells are born and the rate at which they are infected, and thus become destined to die.
In the acute phase there are many virus particles and many cells to infect, so many T cells get infected.
The crucial point is that the rate at which cells are infected is much higher than the rate at which new T
cells are born. Since more cells are being infected than are being born, the number of uninfected T cells
falls. As it does so, it gets harder to find new cells to infect and since there are less cells to infect, the
rate at which cells are getting infected goes down. But as long as the rate of infection is greater than that
rate at which T cells are being born, the population keeps falling, albeit more slowly. You can see that
as this process continues, the number of uninfected T cells must eventually reach a point at which the
rate of infection precisely matches the rate at which new T cells are born. The number of T cells is now
at steady state, since on average precisely one new T cell is born for every one which is infected and
then dies.
This is not an equilibrium in two obvious senses: the birth and death of cells are clearly different
processes, and energy and raw materials are being used to produce the new, uninfected cells that allow
the steady state to be maintained. Unfortunately for the patients, this concentration of T cells is so low
that their immune system is severely compromised and they are easy prey to a wide variety of infectious
diseases.
Although the latent phase is long, most patients who are not treated will eventually develop full blown
AIDS and succumb to opportunistic infections. Combination therapy prevents this by taking the patients
immune system to a different steady state. By interfering with infection by inhibiting reverse
transcriptase, and with the production of new virus by inhibiting the HIV protease, combination therapy
independently reduces the level of circulating virus and the probability that an individual virus will
succeed in infecting a T cell. This combination dramatically reduces the rate at which cells get infected,
and as a result cells are now being born much faster than they are being infected. This imbalance means
that the number of T cells starts to rise. As it does so, the rate of infection rises, but as long as it is still
lower than the rate at which cells are born, the number of circulating T cells continues to rise, albeit
more slowly. Again, this rise continues until the rate of infection exactly matches the rate of cell birth
and the patient’s immune system has reached a new steady state. Because the virus is still present,
however much its level has been reduced, patients cannot quit combination therapy. If they do, the virus
particles succeed more often in infecting cells, and the number of virus particles each infected cell
releases goes up. As a result the level of circulating virus rapidly rises and the patients will return to
their previous steady state in which virus levels are high and T cell numbers are low.
Thus both before and after combination therapy, the patient is in steady state, but the two steady states
are radically different. This difference reflects the fact that the rate at which T cells are born is
essentially constant, but the rate of infection is far higher in untreated patients. In these individuals, the
rate of infection can only equal the rate of birth when the level of T cells has fallen so low that they are
hard to find and infect. But once the rate of infection has been reduced by therapy, the virus now needs
a much higher level of T cells to achieve the same rate of infection. In the case of T cells in AIDS
patients, it is particularly clear that neither of the opposing process, the death of infected cells or the
creation of new ones, can be described a chemical equilibrium. Death is clearly irreversible, and
creating new cells must require the input of substantial amounts of energy and raw materials. In most
biological steady states, both of the opposing processes require the input of energy, most often as a
result of the hydrolysis of ATP.
Slide 18: Equilibrium versus steady state 1
This slide contrasts the concepts of equilibrium and steady state in more conceptually. The red and blue
circles represent the substrate and product of the forward reaction, and their sizes are proportional to the
concentrations of these compounds. Hopefully, you will now be able to instantly appreciate that we can
turn the tables and think of the blue circle as the substrate for and the red circles as the product of the
back reaction. For simplicity, I have shown an reaction whose equilibrium concentration is 1, but you
should be aware that this is not a typical value. In an equilibrium, the reaction proceeds until the rates of
the forward and back reactions are equal. When they are, as David explained to you, ∆G is zero, and the
system is at thermodynamic equilibrium.
Now contrast this to a steady state system typical of biological systems. In this case the reaction that
converts red to blue is a chemically different reaction from the one that converts blue to red. Both
reactions are coupled to ATP hydrolysis and as a result of this coupling, they have a strongly negative
free energy, meaning that they are far from equilibrium inside the cell. Now what determines when the
rates of the forward and backward balance is not thermodynamics but kinetics. This is good news for
biological systems that want to be able to use information from their environments to alter steady states
inside cells, since you will remember from David’s lectures that enzymes affect the kinetics of reactions
but cannot affect their equilibria. The bad news is that although biological steady states can be
maintained at positions far from thermodynamic equilibria, this only happens because two chemically
distinct and biologically opposing reactions have each been coupled to ATP hydrolysis so that they are
far from equilibrium, this ATP hydrolysis implies that ∆G for the entire system is strongly negative.
The consequence of this is that living systems require the continual input of energy to maintain steady
states, which explains why one of the central requirements of life is the ability of organisms to harvest
energy from their environment.
Slide 19: Equilibrium versus steady state 2
We can summarize the differences between equilibria and steady state as follows:
1) In both situations the rates of the forward and backwards processes are equal, so that, over time, there
is no net change in the part of the system we are observing.
2) In equilibria, the backwards reaction is simply the forward one running backwards, and the reaction
diagram of one can be converted into that of the other simply by reversing the roles of substrates and
products. In steady states, the forwards and backwards reactions are chemically distinct, are catalyzed
by different enzymes, and have different reaction diagrams.
3) In equilibria, enzymes can affect how fast the system reaches equilibrium but cannot affect the ratio
between products and substrates at equilibrium. In steady states, the ratio of substrates and products is
determined by the rate constants of the enzymes that catalyze the forwards and backwards reactions, so
that regulating the activity of one enzyme while the other is left constant affects the balance between
substrates and products.
4) At equilibrium there is no net loss or gain of free energy. At steady state there is a net loss of free
energy, implying that a biological system can only remain at steady state as long as it can harvest energy
from its environment or from internal stores.
3: EVOLUTION OF THE CENTRAL DOGMA (Slide 20)
This is a useful point to look back and see what we have learnt in the course so far. We have used HIV
and its behavior to illustrate what Francis Crick called the central dogma, the idea that information flows
from DNA to RNA to protein. In HIV, the DNA is the viral DNA that is integrated into the host cell’s
genome, the RNA is the mRNA that is copied from it by the cellular RNA polymerase, and the proteins
are the structural proteins of the virus, the reverse transcriptase and various small peptides that regulate
the behavior of the host cell. HIV also illustrates that the rules of the central dogma are not immutable,
since during infection, the reverse transcriptase copies the viral RNA into a DNA copy that is then
integrated into the host genome. Although this behavior is exceptional, it serves as an important
reminder that even though it is much less chemically stable than DNA, RNA can be used as the genome
of an organism.
Slide 21: The central dogma
Slide 22: Evidence for RNA as the original catalyst
Scientists now believe that the most ancient ancestors of today’s organisms used RNA not only as their
genomes but also as catalytically active molecules that were used for genome replication, energy
harvesting, and other essential functions. The best evidence for this idea is the discovery that RNA
molecules exist in nature that can catalyze chemical reactions and that these reactions are exactly the
sort of reaction that might have been needed early in life’s evolution. One example is self-splicing
RNA. Rob discussed the splicing of mRNA in eukaryotic cells and this depends on a complicated
mixture of RNA and protein molecules, but there are a few specialized RNA molecules that are capable
of splicing out their introns without any help from proteins. The slide shows one with the intron shown
in green and the two exons that are about to be joined together appearing in blue and red. Perhaps even
more tellingly, the ribosome, the machine that synthesizes proteins, uses its RNA rather than its protein
components to catalyze the reaction that joins one amino acid to the next. This is exactly what we
would expect if the original protein synthesizing machine was made of RNA, and the protein
components were only added later.
Slide 23: Life began in an RNA world
In this view of the evolution of life, RNA came first both as catalytic and information carrying
molecules. In this era, RNA was responsible for everything, including harvesting energy from the
primitive cell’s environment, making new RNA molecules for the next generation of cells, and the
metabolism of any small molecules that went on in these cells.
Slide 24: Evolving dogma: proteins come next
We believe that the next stage was the use of small proteins as catalysts for various processes and the
selection of a general mechanism for using the information encoded in RNA to instruct the synthesis of
proteins. Today’s descendant of that general mechanism is protein synthesis, which as Rob and Dan
have discussed, uses a combination of transfer RNA and the ribosome to convert a linear string of bases
encoded in the messenger RNA into a linear string of amino acids, which then folds up to form a unique
three dimensional structure. Once a general scheme for protein synthesis had been invented, many of
the catalytic functions of RNA molecules were taken over by proteins whose structure and chemistry is
inherently more flexible, with only the most central functions, such as protein synthesis, retaining an
RNA component. Today the vast majority of enzymes and structural components of the cell are
proteins.