reanalysis and revision of the cambridge reference sequence for human mitochondrial dna

Upload: abidaiqbalansari

Post on 14-Apr-2018

214 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/30/2019 Reanalysis and Revision of the Cambridge Reference Sequence for Human Mitochondrial DNA.

    1/1

    nature genetics volume 23 october 1999 147

    correspondence

    The Human Genome Project, from oneperspective, began in 1981 with the

    publication1 of the complete sequence ofhuman mitochondrial DNA (mtDNA).The Cambridge reference sequence (CRS),as it is now designated, continues to beindispensable for studies of human evolu-tion, population genetics and mitochon-drial diseases. It has been recognized forsome time, however, that the CRS differsat several sites from the mtDNA sequencesobtained by other investigators2,3. Thesediscrepancies may reflect either true errorsin the original sequencing analysis or rarepolymorphisms in the CRS mtDNA. A

    further complication is that the originalmtDNA sequence was principally derivedfrom a single individual of Europeandescent, although it also contained somesequences from both HeLa and bovinemtDNA (ref. 1). To resolve these uncer-tainties, we have completely resequencedthe original placental mtDNA sample.

    The results of the resequencing confirmthat there are both errors and rare poly-morphisms in the CRS (Table 1). Thereare 11 nucleotide positions at which theoriginal sequence contains the incorrectnucleotide (all sequences here refer tothat of the CRS mtDNA L-strand), one ofwhich involves the CC doublet at posi-

    tions 3,106 and 3,107 in the originalsequence, which is actually a single cyto-sine residue. The other errors are mistakesin the identification of single base pairsand typically involve the incorrect assign-ment of a guanine residue. Errors at nt14,272 and 14,365 result from the use ofthe bovine mtDNA sequence at ambigu-ous sites1. There are an additional sevennucleotide positions at which the originalCRS is correct and which represent rare(or even private) polymorphic alleles. Sixof these polymorphisms are single basepair substitutions, although one involvesthe simple repeat of cytosine residues at

    nt 311315. The CRS has five residues inthis repeat, whereas most other humanmtDNAs have six.

    To determine which errors in the CRSresulted from the use of HeLa mtDNAsequence, we sequenced the HeLa mito-chondrial genome in its entirety (data notshown). It is of African origin and diver-gent from the published CRS at a numberof sites. The only error that we were able toexplain using the HeLa sequence was thatat nt 14,766 (T versus C, respectively).

    The revised CRS mtDNA belongs toEuropean haplogroup H on the basis of thecytosine at nt 14,766 (instead of thethymine in the original CRS) and the cyto-

    sine at nt 7,028 (instead of a thymine46).The assignment of the revised CRS to hap-logroup H is confirmed by the absence ofany of the predicted restriction sitechanges that characterize the other Euro-pean mtDNA haplogroups4,6.

    If we include both the technical errors(8) and the errors (3) introduced by the

    assumption of the bovine and HeLamtDNA sequences, the overall error fre-quency for the original CRS analysis was0.07%, emphasizing the remarkable tourde force by Sanger and colleagues. Never-theless, because of its widespread use, werecommend that the CRS for humanmtDNA should be revised as follows: (i)the ten simple substitution errors shouldbe corrected; (ii) the rare polymorphicalleles should be retained (that is, therevised CRS (RCRS) should be a true ref-erence sequence and not a consensussequence); and (iii) the original nucleo-tide numbering should be retained. The

    last suggestion represents a compromisebetween accuracy (correcting the num-bering to account for the single C residueat nt 3,106 and 3,107) and consistencywith the previous literature. We believethat renumbering all of the previouslyidentified sequence changes beyond nt3,106 would create an unacceptable levelof confusion.

    Acknowledgements

    We thank A. Coulson and F. Sanger for theirgenerous donation of the original placentalmtDNA sample. This work was supported by the

    Medical Research Council (R.M.A. and D.M.T.),Wellcome Trust (D.M.T., P.F.C., R.N.L. andN.H.) and the John Sealy Memorial EndowmentFund (N.H.).

    Richard M. Andrews1, Iwona Kubacka2,

    Patrick F. Chinnery3, Robert N. Lightowlers3,

    Douglass M. Turnbull3 & Neil Howell21Departments of Ophthalmology and

    Neurology, The Medical School, University of

    Newcastle upon Tyne, NE2 4HH, UK.2Biology

    Division 0656, Department of Radiation

    Oncology, The University of Texas Medical

    Branch, Galveston, Texas 77555-0656, USA.3Department of Neurology, The Medical School,

    University of Newcastle upon Tyne, NE2 4HH,

    UK. Correspondence should be addressed toD.M.T. (e-mail: [email protected]).

    1. Anderson, S. et al.Nature 290, 457465 (1981).2. Brown, M.D. et al. Genetics 130, 163173 (1992).3. Howell, N., McCullough, D.A., Kubacka, I.,

    Halvorson, S. & Mackey, D. Am. J. Hum. Genet. 50,13331337 (1992).

    4. Torroni, A. et al. Genetics 144, 18351850 (1996).5. Lamminen, T. et al. Eur. J. Hum. Genet. 5, 271279

    (1997).6. Torroni, A. et al. Am. J. Hum. Genet. 62, 11371152

    (1998).7. Howell, N. & McCullough, D.A.Am. J. Hum. Genet.

    47, 629634 (1990).8 . Howell, N. et al. Genetics 140, 285302 (1995).9. Biggin, M.D., Gibson, J.J. & Hong, G.F. Proc. Natl

    Acad. Sci. USA 80, 39633965 (1983).

    Reanalysis and revision of theCambridge reference sequence forhuman mitochondrial DNA

    Table 1 Reanalysis of the Cambridge reference sequence

    Nucleotide Previous Reanalysedposition sequence sequence Remarks

    263 A A rare polymorphism311315 CCCCC CCCCC rare polymorphism

    750 A A rare polymorphism1,438 A A rare polymorphism

    3,1063,107 CC C error3,423 G T error4,769 A A rare polymorphism4,985 G A error8,860 A A rare polymorphism9,559 G C error

    11,335 T C error13,702 G C error14,199 G T error14,272 G C error (bovine)14,365 G C error (bovine)14,368 G C error14,766 T C error (HeLa)15,326 A A rare polymorphism

    We determined the complete mtDNA sequence using a series of 28 overlapping PCR-amplified fragmentsspanning the entire length of the mitochondrial genome. Both strands of each PCR fragment weresequenced with BigDye terminator cycle sequencing chemistry on an Applied Biosystems model 377 auto-mated nucleotide sequencer. In addition, several regions of the CRS mtDNA were sequenced indepen-dently using a different approach7,8. In brief, small PCR-amplified mtDNA fragments of 300 bp werecloned into M13 sequencing vectors and the mtDNA sequence determined using a dideoxy chain termina-tion method9. The two approaches yielded identical results.

    1999 Nature America Inc. http://genetics.nature.com

    1999N

    atureAmericaInc.http://genetics.nature.com