reanalysis and revision of the cambridge reference sequence for human mitochondrial dna
TRANSCRIPT
-
7/30/2019 Reanalysis and Revision of the Cambridge Reference Sequence for Human Mitochondrial DNA.
1/1
nature genetics volume 23 october 1999 147
correspondence
The Human Genome Project, from oneperspective, began in 1981 with the
publication1 of the complete sequence ofhuman mitochondrial DNA (mtDNA).The Cambridge reference sequence (CRS),as it is now designated, continues to beindispensable for studies of human evolu-tion, population genetics and mitochon-drial diseases. It has been recognized forsome time, however, that the CRS differsat several sites from the mtDNA sequencesobtained by other investigators2,3. Thesediscrepancies may reflect either true errorsin the original sequencing analysis or rarepolymorphisms in the CRS mtDNA. A
further complication is that the originalmtDNA sequence was principally derivedfrom a single individual of Europeandescent, although it also contained somesequences from both HeLa and bovinemtDNA (ref. 1). To resolve these uncer-tainties, we have completely resequencedthe original placental mtDNA sample.
The results of the resequencing confirmthat there are both errors and rare poly-morphisms in the CRS (Table 1). Thereare 11 nucleotide positions at which theoriginal sequence contains the incorrectnucleotide (all sequences here refer tothat of the CRS mtDNA L-strand), one ofwhich involves the CC doublet at posi-
tions 3,106 and 3,107 in the originalsequence, which is actually a single cyto-sine residue. The other errors are mistakesin the identification of single base pairsand typically involve the incorrect assign-ment of a guanine residue. Errors at nt14,272 and 14,365 result from the use ofthe bovine mtDNA sequence at ambigu-ous sites1. There are an additional sevennucleotide positions at which the originalCRS is correct and which represent rare(or even private) polymorphic alleles. Sixof these polymorphisms are single basepair substitutions, although one involvesthe simple repeat of cytosine residues at
nt 311315. The CRS has five residues inthis repeat, whereas most other humanmtDNAs have six.
To determine which errors in the CRSresulted from the use of HeLa mtDNAsequence, we sequenced the HeLa mito-chondrial genome in its entirety (data notshown). It is of African origin and diver-gent from the published CRS at a numberof sites. The only error that we were able toexplain using the HeLa sequence was thatat nt 14,766 (T versus C, respectively).
The revised CRS mtDNA belongs toEuropean haplogroup H on the basis of thecytosine at nt 14,766 (instead of thethymine in the original CRS) and the cyto-
sine at nt 7,028 (instead of a thymine46).The assignment of the revised CRS to hap-logroup H is confirmed by the absence ofany of the predicted restriction sitechanges that characterize the other Euro-pean mtDNA haplogroups4,6.
If we include both the technical errors(8) and the errors (3) introduced by the
assumption of the bovine and HeLamtDNA sequences, the overall error fre-quency for the original CRS analysis was0.07%, emphasizing the remarkable tourde force by Sanger and colleagues. Never-theless, because of its widespread use, werecommend that the CRS for humanmtDNA should be revised as follows: (i)the ten simple substitution errors shouldbe corrected; (ii) the rare polymorphicalleles should be retained (that is, therevised CRS (RCRS) should be a true ref-erence sequence and not a consensussequence); and (iii) the original nucleo-tide numbering should be retained. The
last suggestion represents a compromisebetween accuracy (correcting the num-bering to account for the single C residueat nt 3,106 and 3,107) and consistencywith the previous literature. We believethat renumbering all of the previouslyidentified sequence changes beyond nt3,106 would create an unacceptable levelof confusion.
Acknowledgements
We thank A. Coulson and F. Sanger for theirgenerous donation of the original placentalmtDNA sample. This work was supported by the
Medical Research Council (R.M.A. and D.M.T.),Wellcome Trust (D.M.T., P.F.C., R.N.L. andN.H.) and the John Sealy Memorial EndowmentFund (N.H.).
Richard M. Andrews1, Iwona Kubacka2,
Patrick F. Chinnery3, Robert N. Lightowlers3,
Douglass M. Turnbull3 & Neil Howell21Departments of Ophthalmology and
Neurology, The Medical School, University of
Newcastle upon Tyne, NE2 4HH, UK.2Biology
Division 0656, Department of Radiation
Oncology, The University of Texas Medical
Branch, Galveston, Texas 77555-0656, USA.3Department of Neurology, The Medical School,
University of Newcastle upon Tyne, NE2 4HH,
UK. Correspondence should be addressed toD.M.T. (e-mail: [email protected]).
1. Anderson, S. et al.Nature 290, 457465 (1981).2. Brown, M.D. et al. Genetics 130, 163173 (1992).3. Howell, N., McCullough, D.A., Kubacka, I.,
Halvorson, S. & Mackey, D. Am. J. Hum. Genet. 50,13331337 (1992).
4. Torroni, A. et al. Genetics 144, 18351850 (1996).5. Lamminen, T. et al. Eur. J. Hum. Genet. 5, 271279
(1997).6. Torroni, A. et al. Am. J. Hum. Genet. 62, 11371152
(1998).7. Howell, N. & McCullough, D.A.Am. J. Hum. Genet.
47, 629634 (1990).8 . Howell, N. et al. Genetics 140, 285302 (1995).9. Biggin, M.D., Gibson, J.J. & Hong, G.F. Proc. Natl
Acad. Sci. USA 80, 39633965 (1983).
Reanalysis and revision of theCambridge reference sequence forhuman mitochondrial DNA
Table 1 Reanalysis of the Cambridge reference sequence
Nucleotide Previous Reanalysedposition sequence sequence Remarks
263 A A rare polymorphism311315 CCCCC CCCCC rare polymorphism
750 A A rare polymorphism1,438 A A rare polymorphism
3,1063,107 CC C error3,423 G T error4,769 A A rare polymorphism4,985 G A error8,860 A A rare polymorphism9,559 G C error
11,335 T C error13,702 G C error14,199 G T error14,272 G C error (bovine)14,365 G C error (bovine)14,368 G C error14,766 T C error (HeLa)15,326 A A rare polymorphism
We determined the complete mtDNA sequence using a series of 28 overlapping PCR-amplified fragmentsspanning the entire length of the mitochondrial genome. Both strands of each PCR fragment weresequenced with BigDye terminator cycle sequencing chemistry on an Applied Biosystems model 377 auto-mated nucleotide sequencer. In addition, several regions of the CRS mtDNA were sequenced indepen-dently using a different approach7,8. In brief, small PCR-amplified mtDNA fragments of 300 bp werecloned into M13 sequencing vectors and the mtDNA sequence determined using a dideoxy chain termina-tion method9. The two approaches yielded identical results.
1999 Nature America Inc. http://genetics.nature.com
1999N
atureAmericaInc.http://genetics.nature.com