evolution of duplicated reggie genes in zebrafish and goldfish · 2020-01-28 · specific...

11
Evolution of Duplicated reggie Genes in Zebrafish and Goldfish Edward Ma ´ laga-Trillo, Ute Laessing, Dirk M. Lang, Axel Meyer, Claudia A.O. Stuermer Department of Biology, University of Konstanz, 78457 Konstanz, Germany Received: 24 January 2001 / Accepted: 27 July 2001 Abstract. Invertebrates, tetrapod vertebrates, and fish might be expected to differ in their number of gene cop- ies, possibly due the occurrence of genome duplication events during animal evolution. Reggie (flotillin) genes code for membrane-associated proteins involved in growth signaling in developing and regenerating axons. Until now, there appeared to be only two reggie genes in fruitflies, mammals, and fish. The aim of this research was to search for additional copies of reggie genes in fishes, since a genome duplication might have increased the gene copy number in this group. We report the pres- ence of up to four distinct reggie genes (two reggie-1 and two reggie-2 genes) in the genomes of zebrafish and goldfish. Phylogenetic analyses show that the zebrafish and goldfish sequence pairs are orthologous, and that the additional copies could have arisen through a genome duplication in a common ancestor of bony fish. The pres- ence of novel reggie mRNAs in fish embryos indicates that the newly discovered gene copies are transcribed and possibly expressed in the developing and regenerat- ing nervous system. The intron/exon boundaries of the new fish genes characterized here correspond with those of human genes, both in location and phase. An evolu- tionary scenario for the evolution of reggie intron-exon structure, where loss of introns appears to be a distinctive trait in invertebrate reggie genes, is presented. Key words: reggie flotillin — Gene duplication — Microdomains — Axon growth — Axon regeneration — Carassius auratus Danio rerio Introduction Reggie-1 and reggie-2 (homologous to flotillin-2 and flo- tillin-1, see below) are two closely related 48 kD intra- cellular and membrane-associated proteins whose encod- ing genes have been identified so far in goldfish (Schulte et al. 1997), rats (Lang et al. 1998), mice (Cho et al. 1995; Bickel et al. 1997), humans (Schroeder et al. 1994), and fruitflies (Galbiati et al. 1998). Reggies are co-expressed in many cell types and define specific plasma membrane microdomains (Lang et al. 1998) In contrast to what has been suggested by others (Bickel et al. 1997; Volonte et al. 1999), they do not occur in ca- veolae—small plasma membrane microdomains charac- terized by the expression of caveolin—but are expressed clearly outside of these structures (Lang et al. 1998). Moreover, in neurons and T-lymphocytes—which do not possess caveolae—reggie proteins occur in their own specific microdomains (Lang et al. 1998). In goldfish and rats, reggie-1 and -2 are upregulated during axon growth and regeneration of retinal ganglion cells (RGC) (Schulte et al. 1997; Lang et al. 1998). Although the precise nature of their role in the central nervous system is not fully understood, two lines of evidence strongly suggest that reggie-1 and -2 participate in molecular sig- naling across the plasmamembrane: 1) In cultured neu- rons, reggie-1 and -2 occur all along axons and are as- sociated with GPI-linked CAMs and fyn tyrosine kinase (Stuermer et al. 2001); 2) In activated T-lymphocytes, reggie-1 and -2 occur in micropatches along the plasma membrane and are recruited into the T-cell receptor sig- naling complex (along with Thy-1 and GM-1) (Lang et al. 1998; Stuermer et al. 2001). It has long been suggested that two rounds of genome duplication facilitated the appearance of novel mor- phologies early in vertebrate evolution by increasing the Correspondence to: E. Ma ´laga-Trillo; email: edward.malaga@uni- konstanz.de J Mol Evol (2002) 54:235–245 DOI: 10.1007/s00239-001-0005-1 © Springer-Verlag New York Inc. 2002

Upload: others

Post on 26-Jul-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Evolution of Duplicated reggie Genes in Zebrafish and Goldfish · 2020-01-28 · specific microdomains (Lang et al. 1998). In goldfish and rats, reggie-1 and -2 are upregulated during

Evolution of Duplicated reggieGenes in Zebrafish and Goldfish

Edward Ma laga-Trillo, Ute Laessing, Dirk M. Lang, Axel Meyer, Claudia A.O. Stuermer

Department of Biology, University of Konstanz, 78457 Konstanz, Germany

Received: 24 January 2001 / Accepted: 27 July 2001

Abstract. Invertebrates, tetrapod vertebrates, and fishmight be expected to differ in their number of gene cop-ies, possibly due the occurrence of genome duplicationevents during animal evolution.Reggie(flotillin ) genescode for membrane-associated proteins involved ingrowth signaling in developing and regenerating axons.Until now, there appeared to be only tworeggiegenes infruitflies, mammals, and fish. The aim of this researchwas to search for additional copies ofreggie genes infishes, since a genome duplication might have increasedthe gene copy number in this group. We report the pres-ence of up to four distinctreggiegenes (tworeggie-1andtwo reggie-2 genes) in the genomes of zebrafish andgoldfish. Phylogenetic analyses show that the zebrafishand goldfish sequence pairs are orthologous, and that theadditional copies could have arisen through a genomeduplication in a common ancestor of bony fish. The pres-ence of novelreggiemRNAs in fish embryos indicatesthat the newly discovered gene copies are transcribedand possibly expressed in the developing and regenerat-ing nervous system. The intron/exon boundaries of thenew fish genes characterized here correspond with thoseof human genes, both in location and phase. An evolu-tionary scenario for the evolution ofreggie intron-exonstructure, where loss of introns appears to be a distinctivetrait in invertebratereggiegenes, is presented.

Key words: reggie— flotillin — Gene duplication —Microdomains — Axon growth — Axon regeneration —Carassius auratus— Danio rerio

Introduction

Reggie-1 and reggie-2 (homologous to flotillin-2 and flo-tillin-1, see below) are two closely related 48 kD intra-cellular and membrane-associated proteins whose encod-ing genes have been identified so far in goldfish (Schulteet al. 1997), rats (Lang et al. 1998), mice (Cho et al.1995; Bickel et al. 1997), humans (Schroeder et al.1994), and fruitflies (Galbiati et al. 1998). Reggies areco-expressed in many cell types and define specificplasma membrane microdomains (Lang et al. 1998) Incontrast to what has been suggested by others (Bickel etal. 1997; Volonte et al. 1999), they do not occur in ca-veolae—small plasma membrane microdomains charac-terized by the expression of caveolin—but are expressedclearly outside of these structures (Lang et al. 1998).Moreover, in neurons and T-lymphocytes—which do notpossess caveolae—reggie proteins occur in their ownspecific microdomains (Lang et al. 1998). In goldfishand rats, reggie-1 and -2 are upregulated during axongrowth and regeneration of retinal ganglion cells (RGC)(Schulte et al. 1997; Lang et al. 1998). Although theprecise nature of their role in the central nervous systemis not fully understood, two lines of evidence stronglysuggest that reggie-1 and -2 participate in molecular sig-naling across the plasmamembrane: 1) In cultured neu-rons, reggie-1 and -2 occur all along axons and are as-sociated with GPI-linked CAMs and fyn tyrosine kinase(Stuermer et al. 2001); 2) In activated T-lymphocytes,reggie-1 and -2 occur in micropatches along the plasmamembrane and are recruited into the T-cell receptor sig-naling complex (along with Thy-1 and GM-1) (Langet al. 1998; Stuermer et al. 2001).

It has long been suggested that two rounds of genomeduplication facilitated the appearance of novel mor-phologies early in vertebrate evolution by increasing the

Correspondence to:E. Malaga-Trillo; email: [email protected]

J Mol Evol (2002) 54:235–245DOI: 10.1007/s00239-001-0005-1

© Springer-Verlag New York Inc. 2002

Page 2: Evolution of Duplicated reggie Genes in Zebrafish and Goldfish · 2020-01-28 · specific microdomains (Lang et al. 1998). In goldfish and rats, reggie-1 and -2 are upregulated during

levels of genetic complexity (Ohno 1970; Ohno 1999).This hypothesis would explain why vertebrates can haveup to four copies of genes that are present as singlecopies in invertebrates. The recent discovery in the ze-brafish of extra copies of several gene families uncov-ered an additional genome duplication that probably tookplace in the common ancestor of all teleost fish (Amoreset al. 1998; Wittbrodt et al. 1998; Meyer and Schartl1999; Meyer and Ma´laga-Trillo 1999). The generation ofnew gene copies through genome duplications raisesquestions about how much functional redundancy can betolerated in genomes and whether duplicated copies arerandomly deleted, inactivated, or selected to perform anovel function (Malaga-Trillo and Meyer 2001). To an-swer these questions, it is necessary to identify as manyfamilies of duplicated genes as possible, and to examinethe evolution of their functions.

So far only one copy of eachreggie-(flotillin ) genehas been characterized in fruitflies, mammals, and gold-fish, and yet, if the additional fish genome-duplicationhypothesis is correct, then for any given gene family, fishare expected to have up to twice as many copies asmammals and up to eight times as many copies as in-vertebrates, although average gene copy ratios betweenthese groups cannot always be expected to remain wholenumbers because not all duplicated genes are maintainedfor a long evolutionary time. Some duplicated copiesmay be lost as pseudogenes or rendered unrecognizableas “junk DNA.” It is therefore possible that: 1) Only onefunctionalreggie-1or -2 gene was retained in vertebrategenomes, or that, 2) Additionalreggie copies exist (orexisted) in vertebrates but remain undiscovered. Thus,reggiegenes offer a case study for the evolution of ge-netic redundancy and the functional fate of duplicatedgene copies. Given their importance in the developmentof the vertebrate nervous system (Schulte et al. 1997;Lang et al. 1998), it is of interest to know how duplica-tion of these genes affected their function. Until now, theonly fish reggie genes known are those of the goldfishCarassius auratus(Lang et al. 1998), a model systemroutinely employed for the study of retinotectal develop-ment and regeneration of retinal axons (Gaze andSharma 1970). Despite their obvious biomedical interest,reggiegenes have not yet been characterized in the ze-brafish, Danio rerio, the system of choice for the mo-lecular characterization of genes important for diverseaspects of vertebrate development (Haffter et al. 1996).In order to evaluate new evidence for the fish genomeduplication hypothesis, and as a first step towards under-standing the functional evolution of redundant geneproducts in the nervous system, we set out to characterizethe reggiegenes of zebrafish and goldfish.

Materials and Methods

Nucleic acid isolation.Zebrafish and goldfish total genomic DNAswere prepared from fresh muscle tissue of homozygous individuals

using ATL Lysis Buffer (Qiagen, Hilden). We followed standard meth-ods (Sambrook et al. 1989) for the purification steps.

Synthesis of zebrafish 1st strand cDNA for RT-PCR.Total RNAwas isolated from two and three day old zebrafish embryos by theRNAzol B method (WAK Chemie, Bad Soden). First strand cDNA wasgenerated using Oligo-dT primers and the Superscript Reverse Tran-scriptase (Gibco BRL, Eggenstein) or the Ready-to-go T-primed First-strand Synthesis Kit (Amersham Pharmacia), as outlined by the manu-facturer.

Preparation of goldfish cDNA libraries.Two weeks after opticnerve transection, mRNA from goldfish retinae was prepared with theQuick Prep mRNA Purification Kit (Amersham Pharmacia), and re-verse transcribed using Superscript RT (Gibco BRL). After secondstrand synthesis and ligation toEcoRI adaptors (Amersham Pharma-cia), the cDNAs were cloned into Lambda ZapII (Stratagene, Heidel-berg). The titer of the cDNA library was amplified from 2.4 × 106

PFU/ml to 4.5 × 1010 PFU/ml.

Polymerase chain reaction (PCR). Reggie-1and-2 genes fragmentswere amplified with several pairs of degenerate primers located atvarious points along thereggie-1and -2 coding sequence (Table 1).Oneml of template was added to 19ml PCR reaction mix containing0.5 units of Red Taqt Polymerase (Sigma-Aldrich, Taufkirchen), 1×Red Taq Buffer, 800mM dNTPs, and 0.5mM primers. PCR reactionswere performed in GeneAmpt PCR System 9700 thermocyclers (PEApplied Biosystems, Weiterstadt) using the following conditions: Ini-tial denaturation for 2 min at 94°C, followed by 35–40 cycles of:denaturation at 94°C for 1 min, annealing at 50–56°C for 1 min, ex-tension at 72°C for 2 min; and a final extension step at 72°C for 10 min.

Cloning of PCR products.Amplification products were directlyused for cloning, or first excised from 2% agarose gels (Gibco BRL).After purification with the Qiaquick PCR Purification Kit (Qiagen), theDNAs were ligated into the Topo II vector (Invitrogen, Karlsruhe) andthe ligation mixes used to transformE. coli. Transformations wereplated onto LB-agar plates containing the appropriate amounts of am-picillin, X-gal, and IPTG (Sambrook et al. 1989). White recombinantbacteria were grown and tested by PCR for the presence of the correctinsert.

DNA sequencing.PCR products were purified with the aid of theQIAquick PCR purification kit (Qiagen), and plasmid templates fromcloned amplification products were isolated using the QIAprep SpinMiniprep Kit (Qiagen). Sequencing reactions were set up in 10mlvolumes containing up to 7ml of DNA, 1 ml of 10 mM primer and 2ml of Big Dye Terminator Cycle Sequencing Ready Reaction Mix(DNA Sequencing Kit, PE Applied Biosystems). The amount of tem-plate used for cycle sequencing and the thermal cycling profile werechosen according to the kit’s guidelines. Unincorporated dye termina-tors were removed by EtOH/NaOAc precipitation and the purified re-actions were loaded onto 0.2 mm-thick 5% Long Ranger™ gels and ranon an ABI 377 automated DNA sequencer.

DNA sequence analysis and drawing of dendrograms.DNA se-quences were edited and analyzed using PE Applied Biosystems soft-ware (Sequence Analysis and Sequence Navigator) as well as the GCGWisconsin Package Version 9.1 (The Genetics Computer Group 1997).Final sequences have been deposited in the GenBank database underthe accession codes AF315945, AF315946, AF315947, AF315948, andAF315949. Sequence alignments were made with DNA Star’sMegAlign, and phylogenetic analysis was performed using both dis-tance and maximum parsimony algorithms with Paup 4.0b4 (Swofford2000). Previously characterizedreggie sequences were downloaded

236

Page 3: Evolution of Duplicated reggie Genes in Zebrafish and Goldfish · 2020-01-28 · specific microdomains (Lang et al. 1998). In goldfish and rats, reggie-1 and -2 are upregulated during

from Genbank. Accession numbers for theflotillin-1/reggie-2 se-quences of Drosophila, goldfish, mouse, rat, and human are AF044734(genomic: AE003810), U33556, NM_008027, U60976, and NM-005803 (genomic: NT_001520). Accession numbers for theflotillin-2/reggie-1sequences of Drosophila, goldfish, mouse, rat, and human areAF044916 (genomic: AE003497), L36867, NM_008028, AF023303,and NM_004475 (genomic: AC024267). It should be noted that severalreggie 1and 2 cDNA sequences have been reported in the rat. Thisdoes not imply that the rat has morereggiegenes. Based on comparisonwith human genomic sequences (not shown), the additional rat se-quences are rather the result of the isolation of heterogeneous nuclearRNAs and alternative splicing forms. For instance, in the case ofreggie1, a splice variant sequence starts at exon 4 and another one at exon 5.

For distance analysis, genetic distances were estimated with theKimura two-parameter method (Kimura 1980). Dendrograms were pre-pared using the neighbor-joining (NJ) tree constructing method (Saitouand Nei 1987) and reliability of the tree topologies was evaluatedthrough 1000 bootstrap replications (Felsenstein 1985).

Results and Discussion

Homology ofreggieand flotillin Genes

Thereggiegenes of goldfish and rat (Schulte et al. 1997;Lang et al. 1998) were reported independently of the

flotillin genes of fruitflies, humans, and mice (Schroederet al. 1994; Cho et al. 1995; Galbiati et al. 1998). Nev-ertheless, their highly similar DNA sequences and ex-pression patterns strongly suggest that these genes areclosely related and members of the same gene family(above references and unpublished Medline records). Weperformed phylogenetic analysis in order to clarify theirevolutionary relationships. The neighbor-joining tree inFig. 1 defines two major sequence homology groups, onecontaining reggie-1 and flotillin-2 sequences, and theother one includingreggie-2 and flotillin-1 sequences.The same topology was obtained with high bootstrapsupport when the maximum parsimony method was em-ployed (not shown). Therefore, the current nomenclatureis in need of revision: the names “reggie” and “flotillin ”designate homologous genes, but numbering of the loci(1 or 2) is misleading. Here, we will use the names“ reggie-1” and “reggie-2” to refer to reggie-1/flotillin-2and reggie-2/flotillin-1 groups, respectively The pres-ence of pairs of homologousreggie genes in mammalsand arthropods indicates thatreggie-1and -2 arose be-fore the evolutionary split between protostomes and deu-

Fig. 1. Neighbor-joining tree ofpreviously characterizedreggie(Reg) andflotillin (Flo) and DNAsequences. An alignment offull-length coding sequences wasused as a basis for phylogeneticreconstruction. The distance matrixwas generated using the Kimuratwo-parameter method. Thehorizontal length of each branch isproportional to the estimated geneticdistance (number of substitutions)between taxa. Bootstrap values(1000 replications) are indicated oneach branch.

237

Page 4: Evolution of Duplicated reggie Genes in Zebrafish and Goldfish · 2020-01-28 · specific microdomains (Lang et al. 1998). In goldfish and rats, reggie-1 and -2 are upregulated during

terostomes. On average, the intraspecific aminoacid se-quence similarity betweenreggie-1 and -2 genes isapproximately 52.5%, while the similarity between ver-tebrate and invertebratereggiesof the -1 or -2 groups isapproximately 62.8%. The length of thereggie codingregion is 429 aminoacids (aa) and appears to be con-served during evolution, although minor length variationcan be observed at the 38 end of somereggie-2genes. Inall pairwise comparisons, substitutions are randomly dis-tributed along the gene and no mutational hotspots orhighly conserved domains are apparent.

Isolation of NovelreggieSequences From Fish

The goldfish and zebrafish sequences reported here wereobtained by PCR amplification of genomic and cDNAtemplates using several combinations of degenerateprimers. An alignment of all previously availablereg-gie-1and-2 DNA sequences (not shown) was used as thebasis for primer design. In order to increase the speci-ficity of our amplifications towards goldfish/zebrafishsequences, two of the primer pairs (GF-Reg1F/R andGF-Reg2F/R) were constructed specifically for goldfishreggie-1and2 sequences, and used in combination withthe degenerate ones. At the time we started with theseexperiments, no information about the intron-exon orga-nization ofreggiegenes was available. This presented apotential problem for the amplification of genomic tar-gets. Therefore, 20 forward and reverse primers at 13

Table 1. PCR Primers used in this study

Primer ID Sequence

1. Reg-19-43F ACTKKYGGMCCMAAYGARGCVMTSG2. Reg-193-215F GGDGTVSCYWTYWCYGTSACHGG3. Reg-193-215R CCDGTSACRGWRAWRGSBACHCC4. Reg-271-288F SAGMWGTTYCTGGGRAAG5. Reg-271-288R CTTYCCCAGRAACWKCTS6. Reg-360-384F GACKGTGGAGSAGATYTAYMAGGAC7. Reg-271-288R GTCCTKRTARATCTSCTCCACMGTC8. Reg-568-590F GRGATGCNGGGATYMGGGARGC9. Reg-568-590R GCYTCCCKRATCCCNGCATCYC

10. Reg-720-743F SMTGGCCTATCAGCTMCAGGYRGC11. Reg-720-743R GCYRCCTGKAGCTGATAGGCCAKS12. Reg-862-885F GTSMRSMRDCCWGCBGADGCMGAG13. Reg-862-885R CTCKGCHTCVGCWGGHYKSYKSAC14. Reg-1015-1039F GAGGCYGAGCRRATGRSSMWGAAGG15. Reg-1015-1039R CCTTCWKSSYCATYYGCTCRGCCTC16. Reg-1098-1119R GAYCTYYYCWGCMAYCTKGGGC17. GF-Reg1F GARGTIGCIGCICCIGAYGT18. GF-Reg1R GCYTCIGCYTCICCDATYTT19. GF-Reg2F AAYGARGCIATGGTIGTITC20. GF-Reg2R TCYTCDATYTGYTGYTTIGTYTT21. Dare-Reg1aXF ACAGGGATCAGTTTGCCAAG22. Dare-Reg1aXR CTCTCTCTGCTTCAGCCACA23. Dare-Re2aXF CGATGGTGGTGTCAGGTAAG24. Dare-Reg2aXR GGACCCCATGACGTGTGTA25. Dare-Reg2bXF GCTCCTCCTCTCATGATTGC26. Dare-Reg2bXR GCCTTCTTGGTGTTGACCTC

Numbering corresponds to that of Figure 2. Primer orientation is speci-fied by “F” or “R” at the end of the primer’s name.

Fig. 2. PCR strategy for the amplification ofreggie genes in ze-brafish and goldfish. Roman numbers indicate the arbitrary amplifica-tion targets defined by the position of our degenerate primers. Primersare represented by arrows and numbered according to Table 1. Primer

orientation is indicated by the arrows. The final length of the amplifi-cation products obtained is shown for each of the novel genes identifiedin this study.

238

Page 5: Evolution of Duplicated reggie Genes in Zebrafish and Goldfish · 2020-01-28 · specific microdomains (Lang et al. 1998). In goldfish and rats, reggie-1 and -2 are upregulated during

different locations in the least variable segments of thereggie-1and -2 coding region were designed (Table 1).The positions of the PCR primers divide most of thereggie-1and -2 consensus sequence into eight amplifi-cation targets (Fig. 2). All primer combinations encom-passing one, two, or up to three of these segments weretested. The efficiency of the amplifications varied largelydepending on the primer pair and the type of templateused. The strongest and cleanest amplification productsof expected sizes were cloned and sequenced. Primercombinations 271-288F/360-384R, 360-384F/568-590R,and GF-Reg1F/568-590R amplifiedreggie-1 products;primer combinations GF-Reg2F/193-215R, 360-384F/GF-Reg2R, 720-742F/862-885R, and GF-Reg2F/GF-Reg2R amplifiedreggie-2 products. Alignment of theamplified sequences identified four and three differentreggiegenes in goldfish and zebrafish, respectively. Twoof the goldfish sequences had been previously reported(Schulte et al. 1997), but the other two represent novelreggie-1and-2 genes. The three zebrafish sequences arenovel, one of them homologous toreggie-1and the othertwo homologous toreggie-2 (Fig. 4). These new se-quences represent distinct loci and not alleles becausethey were isolated from single homozygous fish. Wedesignated the four fish genes,reggie-1a, -1b, -2a,and-2b,based on their sequence similarity to each other andto reggie-1 genes from other species (Table 2). We

named the newly foundreggie-1from goldfishreggie-1band renamed the previously cloned goldfishreggie-1(Schulte et al. 1997) asreggie-1a.Failure to amplifyreggie-1bfrom the zebrafish could be due to an inde-pendent gene loss in the zebrafish genome or to technicaldifficulties such as inefficient primer annealing duringPCR amplification.

New internal, zebrafish-specific primers were de-signed (Table 1, Fig. 2) and used on zebrafish cDNA incombination with some of the already mentioned degen-erate primers, in order to extend the amount of codingsequence for the transcribed zebrafish genes. As a result,a total of 1188, 681, and 810 bp of coding sequence wereobtained forreggie-1a, reggie-2a,andreggie-2b,respec-tively (Fig. 2). In addition, thereggie-1and-2 introns ofzebrafish and goldfish contribute 243 and 299 bp ofDNA sequence, respectively.

The new zebrafish and goldfish sequences werealigned and compared to otherreggiegenes, in order toestablish their evolutionary history. Percent aminoacidsimilarity between all known representatives of thereg-gie-1and-2 groups are given in Table 2. For both groupsof sequences, it can be observed that “a” or “ b” genes aremore similar between goldfish and zebrafish than to eachother within either one species. For instance,reggie-2agenes of zebrafish and goldfish are 96.6% similar, whilereggie-2aand-2b in zebrafish or goldfish are only 88.3

Fig. 3. Two phylogenetic scenariosfor the tree topology of duplicatedgenes X in hypothetical species 1 and2, depending on the relative timing oftheir duplication.A: A shared geneduplication before speciation results intrans-specific groupings.B:Independent gene duplications (afterspeciation) result in grouping of genesaccording to species.

239

Page 6: Evolution of Duplicated reggie Genes in Zebrafish and Goldfish · 2020-01-28 · specific microdomains (Lang et al. 1998). In goldfish and rats, reggie-1 and -2 are upregulated during

and 83.5% similar. Thus, the origin of “a” and “b” genesappears to predate the divergence of zebrafish and gold-fish.

The Duplication of FishreggieGenes inTeleost Genomes

The presence of additionalreggiegenes in zebrafish andgoldfish, relative to tetrapods, could be explained by agene/genome duplication in the common ancestor of

these fish. Alternatively, the novel genes could havearisen through independent gene/genome duplicationevents in each of the two species. An ancestral duplica-tion is more parsimonious than two independent dupli-cation events in descending lineages. Phylogeneticanalysis of allreggie genes was undertaken to decidebetween these two scenarios, based on the resulting treetopologies. If—as expected—the novelreggie copiesarose in a teleost ancestor, homologousreggiesequencesshould cluster trans-specifically (Fig. 3A); if they arose

Fig. 4. Distance (Kimura’stwo-parameter) and maximum parsimonyconsensus tree ofreggie-1(A) andreggie-2(B) DNA sequences. Fruitfly,mouse, and human sequences have beenrenamed from their original “flotillin ”designations to the “reggie” names used inthis study (see Results). Trees wereconstructed using the neighbor-joiningalgorithm. The horizontal lengths of eachbranch are proportional to the estimatedgenetic distance (number of substitutions)between taxa. Bootstrap values (1000replications) obtained by distance andparsimony methods are indicated on eachbranch, in that order and separated by a“/”.

240

Page 7: Evolution of Duplicated reggie Genes in Zebrafish and Goldfish · 2020-01-28 · specific microdomains (Lang et al. 1998). In goldfish and rats, reggie-1 and -2 are upregulated during

independently in zebrafish and goldfish, then their se-quences would group by species (Fig. 3B). Thus, the treetopology is indicative of the timing of the duplication.Distance and Maximum Parsimony methods using fruit-fly sequences as outgroup, agree on the same tree to-pologies (Fig. 4A and B). Bothreggie-1and-2 trees aresupported with high bootstrap values and clearly dividefish from mammalian sequences. Within the fish groups,sequences do not cluster according to species, but forminstead clusters of “a” or “ b” semiorthologous genes(Holland 1999) across species. Therefore, the existenceof four reggie genes in zebrafish and goldfish is likelythe result of a duplication event in a common ancestor ofmodern fish, after its separation from the lineage leadingto tetrapods. Based only on our data, we cannot decidewhether this event was a gene or an entire genome du-plication. However, a whole-genome duplication in thecommon ancestor of all teleosts is strongly supported bydata onHoxand other gene families (Amores et al. 1998;Postlethwait et al. 1998; Naruse et al. 2000; Ma´laga-Trillo and Meyer 2001; Ma´laga-Trillo et al. unpublisheddata). Although our data are obviously not sufficient toconfirm the fish genome duplication hypothesis, they doadd experimental evidence in its favor.

In the absence of mapping information, our results donot formally rule out the possibility thatreggie genesmay have duplicated as part of a partial duplication in-volving only Hox chromosomes (as opposed to a wholegenome duplication), However, this can be demonstratedon the basis of genomic data from other organisms show-

ing conserved mapping ofreggieandHox genes to dif-ferent syntenic groups. In humans,reggie-2(Flotillin-1 )maps to 6p2l.3, within the Major HistocompatibilityComplex (Mhc) (MHC Sequencing Consortium 1999).The humanMhc is one of four extense paralogous re-gions that map to 6p21.3, 19p13.1, 9q33-q34, and 1q21-q25, independently ofHox clusters (chromosomes 2, 7,12, and 17). The affiliation ofReggie-2within theMhc isan ancient synteny conserved also in the zebrafish (Mi-chalova et al. 2000). The ancestral linkage ofreggie-2genes to theMhcchromosomes, suggests that duplicationevents in vertebrate history involved more than justHoxchromosomes, and allows us to extend this assumption tothe fish specific duplications. HumanReggie-1(Flotillin-2) does map to aHox chromosome (17q12), but verydistant from theHox Bcluster (17q21). This associationis not representative of an ancestral genetic linkage be-tweenReggieand Hox clusters, because in the fruitfly,neitherReggie-1nor -2 (mapping to chromosomes 1 and2, respectively) are linked to the singleHox complex (inchromosome 3). Thus, the presence ofReggie-1in hu-man chromosome 17 is rather likely to be the conse-quence of an independent translocation event.

The Division Teleostei (bony fish) is made up of 38Orders. One of these Orders—the Cypriniformes—includes the Cyprinidae, the largest family of freshwaterfishes to which zebrafish and goldfish also belong (Nel-son 1994). As these two genera represent only a fractionof the genetic and morphological diversity observed inmodern teleosts, it will be necessary to identify ortholo-

Table 2. (A) Percent similarity between knownReggie-1aminoacid sequences

Drme-1 Mumu-1 Rano-1 Hosa-1 Dare-1a Caau-1a Caau-1b

Drme-1 — 59.7 61.3 61.3 63.6 62.2 64Mumu-1 — — 83 80 68 65 74Rano-1 — — — 89 73 81 73Hosa-1 — — — — 74 81 73Dare-1a — — — — — 85 78Caau-1a — — — — — — 88Caau-1b — — — — — — —

Species are designated by the first two letters of the genus and species name: Drme4 Drosophila melanogaster, Rano4 Rattus norvegicus, Hosa4 Homo sapiens, Dare4 Danio rerio, Caau4 Carassius auratus.

(B) Percent similarity between knownReggie-2aminoacid sequences

Drme-2 Mumu-2 Rano-2 Hosa-2 Dare-2a Dare-2b Caau-2a Caau-2b

Drme-2 — 63.3 63.2 63.4 65.7 63.6 62 64.2Mumu-2 — — 99.5 97.9 88.7 78.3 79.2 79.5Rano-2 — — — 97.7 88.7 78.3 79.4 79.5Hosa-2 — — — — 89.1 79 79.4 80.2Dare-2a — — — — — 83 96.1 84.3Dare-2b — — — — — — 82.2 93.4Caau-2a — — — — — — — 81.9Caau-2b — — — — — — — —

Species are designated by the first two letters of the genus and species name: Drme4 Drosophila melanogaster, Mumu4 Mus musculus; Rano4 Rattus norvegicus, Hosa4 Homo sapiens, Dare4 Danio rerio, Caau4 Carassius auratus.

241

Page 8: Evolution of Duplicated reggie Genes in Zebrafish and Goldfish · 2020-01-28 · specific microdomains (Lang et al. 1998). In goldfish and rats, reggie-1 and -2 are upregulated during

Fig

.5.

Hyp

othe

tical

scen

ario

for

the

evol

utio

nof

the

intr

on-e

xon

orga

niza

tion

ofre

gg

iege

nes,

base

don

the

curr

ent

conf

igur

atio

nof

hum

anan

dfr

uitfl

yge

nes.

The

hum

anan

dfr

uitfl

yge

nest

ruct

ures

wer

eob

tain

edby

alig

ning

the

indi

vidu

alreg

gie

-1an

d-2

part

ials

truc

ture

s.E

xons

are

repr

esen

ted

bybl

uebo

xes;

red

boxe

s4in

tron

loss

essh

ared

byhu

man

and

frui

tfly

gene

s;bl

ack

boxe

s4

conc

erte

din

tron

loss

esin

frui

tfly

gene

s;or

ange

boxe

s4fr

uitfl

yre

gg

ie-1

-spe

cific

intr

onlo

sses

;gre

enbo

xes

4fr

uitfl

yre

gg

ie-

2-sp

ecifi

cin

tron

loss

es.T

hefis

hin

tron

sid

entif

ied

inth

epr

esen

tstu

dyar

ein

dica

ted

byve

rtic

alar

row

son

the

corr

espo

ndin

glo

catio

nsin

the

hum

ange

nes.

Exo

n1

isno

n-co

nser

ved;

itis

only

show

nfo

rill

ustr

ativ

epu

rpos

es.

Exo

n2

isde

lete

din

frui

tflyreg

gie

-1(b

oxm

arke

dw

itha

cros

s).

242

Page 9: Evolution of Duplicated reggie Genes in Zebrafish and Goldfish · 2020-01-28 · specific microdomains (Lang et al. 1998). In goldfish and rats, reggie-1 and -2 are upregulated during

Fig

.6.

Am

inoa

cid

alig

nmen

tofh

uman

and

frui

tflyre

gg

ie-1

and

-2co

ding

sequ

ence

s,in

clud

ing

intr

onlo

catio

nsan

dph

ases

.In

thos

eca

ses

whe

rein

tron

phas

eis

“1”

or“2

”,th

enu

cleo

tide

sequ

ence

ofth

ein

terr

upte

dco

dons

are

show

nfla

nkin

gth

ein

tron

.Das

hes

repr

esen

tind

els.

The

star

tcod

onof

reg

gie

gene

sis

loca

ted

inex

on2.

Sin

ceex

on1

isno

ncod

ing

and

high

lyva

riabl

e,th

eex

on1/

intr

on1

and

intr

on1/

exon

2bo

unda

ries

wer

ele

ftou

tof

our

anal

ysis

.E

xon

2is

dele

ted

infr

uitfl

yre

gg

ie-1

(see

Fig

.5)

.

243

Page 10: Evolution of Duplicated reggie Genes in Zebrafish and Goldfish · 2020-01-28 · specific microdomains (Lang et al. 1998). In goldfish and rats, reggie-1 and -2 are upregulated during

gous reggie genes in other teleost families to confirmthat the duplication of these genes occurred in a bonyfish ancestor.

Evolution of the Intron-Exon Organization ofreggieGenes

All goldfish and zebrafish sequences reported here wereamplified from cDNA. In addition, goldfishreggie-1band-2b,as well as zebrafishreggie-2acould be partiallyamplified from genomic DNA, uncovering the existenceof at least five introns in the upstream half of these genes:A 98 and a 145 bp intron inreggie-1;and an 84, a 92,and a 123 bp inreggie-2(genomic sequences from thedownstream half of fishreggiegenes could not be am-plified due to technical limitations). The degree of con-servation in length, and location and phase between thesefish introns and those of other organisms could not beimmediately evaluated because the intron-exon struc-tures of otherreggie genes were not previously de-scribed. Nevertheless, recent data from genome sequenc-ing projects (records unpublished but available at http://www.ncbi.nlm.nih.gov/Genomes/) provide extensiveinformation on the intron-exon structure of human andfly reggie genes. We used this information to find outwhether the partial gene structure of the fish genes ob-tained here corresponds with that of human and fruitflygenes, as well as to learn more about the mechanismsdriving their intron evolution and gene structure.

We compared the cDNA and genomic sequences ofhuman and fruitflyreggie-1and-2 genes (see Materialsand Methods for accession numbers) and inferred thesize and location of introns relative to the reading frame.In contrast to their high degree of DNA sequence con-servation, human and fruitflyreggiegenes differ greatlyin their intron-exon organization, fruitfly genes havingfewer introns than their human counterparts (Fig. 5).Alignment of fruitfly and human coding regions showsgeneral conservation of intron-exon boundaries and in-tron phases (Fig. 6). Except for two minor sliding events(at intron 4 and possibly at intron 10), the locations of allfruitfly introns correspond to those of the human ones.The use of human and fruitfly sequences for the com-parison allowed us to identify shared (ancestral) intronpositions and phases for nine out of the 11 introns ana-lyzed. The difference in the number of introns betweenthe two organisms is therefore likely to be the result ofloss of introns in the lineage leading to fruitflies ratherthan the gain of introns in the lineage leading to humans.The latter scenario is non-parsimonious because it wouldrequire the repeated and independent de novo formationof humanreggie-1 and -2 introns at exactly the samepositions in eight out of 11 cases.

The fish introns reported here are generally shorterthan their human fruitfly counterparts; they correspondto introns 5 and 6 inreggie-1,and to introns 2, 3 and 8

in reggie-2(Fig. 5, indicated by vertical arrows). In allcases, intron locations and phases are conserved betweenhuman and fish introns (Fig. 6). The presence of introns6 and 8 in fishreggie-1and -2, respectively, is sharedwith human but not with fruitfly genes, supporting thenotion that these introns are ancient and that they werelost in the lineage leading to invertebrates.

Using introns as cladistic characters, we attempted toreconstruct the evolutionary patterns of the intron loss inreggiegenes, based on the intron-exon structure of hu-man and fruitfly genes. We assume that the origin ofreggie-1 and -2 predate the split between protostomesand deuterostomes (see phylogeny in Fig. 1), and thatindependent loss of introns is more parsimonious thantheir independent gain.Reggie-1and-2 must have orig-inated in a common ancestor of protostomes and deu-terostomes, from the duplication of aproto-reggie-1/2gene with 13 exons; a subsequent loss of intron 10 oc-curred in thereggie-2gene of this hypothetical ancestor,as this loss is shared by human and fruitflyreggie-2genes. After the split between protostomes and deutero-stomes, the lineage leading to the fruitfly underwent theconcerted loss of introns 7, 8, and 9 in bothreggie-1and-2 genes, as well as the loss of introns 3, 6, and 12 inreggie-1and 4, 5, and 9 inreggie-2(Fig. 5). In addition,fruitfly reggie-1appears to have lost exon 2 (Fig. 6).

Much remains to be learned about the functional fateof the novel genes identified in this study. Our surveyuncovered the presence of duplicatedreggie transcriptsin two- to three-day-old zebrafish embryos. We are cur-rently examining their developmental expression pat-terns in zebrafish, and assessing their role in axongrowth/regeneration and T-cell activation. A particulararea of interest remains the question whether all fourreggiegenes of fish are expressed simultaneously in onemicrodomain, or rather in a combinatorial manner, assubsets that define distinct microdomains with differentsignaling properties, resulting in the development ofnovel regulatory interactions absent in mammals. Thepossible effects of these changes on axon growth/regeneration could hold much awaited answers about thedeveloping nervous system of vertebrates and its inabil-ity to regenerate neurons at the adult state.

Acknowledgments. This work was supported by grants from the Min-isterium fur Wissenschaft und Kunst of Baden-Wu¨rttemberg to C.S.and A.M. and from the DFG to A.M., and is the result of an ongoingcollaboration between the Stuermer and the Meyer laboratories.

References

Amores A, Force A, Yan YL, Joly L, Amemiya C, Fritz A, Ho RK,Langeland J, Prince V, Wang YL, Westerfield M, Ekker M,Postlethwait JH (1998) Zebrafishhox clusters and vertebrate ge-nome evolution. Science 282:1711–1714

Bickel PE, Scherer PE, Schnitzer JE, Oh P, Lisanti MP, Lodish HF(1997) Flotillin and epidermal surface antigen define a new family

244

Page 11: Evolution of Duplicated reggie Genes in Zebrafish and Goldfish · 2020-01-28 · specific microdomains (Lang et al. 1998). In goldfish and rats, reggie-1 and -2 are upregulated during

of caveolae-associated integral membrane proteins. J Biol Chem272:13793–13802

Cho YJ, Chema D, Moskow JJ, Cho M, Schroeder WT, Overbeek P,Buchberg AM, Duvic M (1995) Epidermal surface antigen(MS17S1) is highly conserved between mouse and human. Genom-ics 27:251–258

Felsenstein J (1985) Confidence limits on phylogenies: an approachusing the bootstrap. Evolution 39:783–791

Galbiati F, Volonte D, Goltz JS, Steele Z, Sen J, Jurcsak J, Stein D,Stevens L, Lisanti MP (1998) Identification, sequence and devel-opmental expression of invertebrate flotillins fromDrosophila me-lanogaster.Gene 210:229–237

Gaze RM, Sharma SC (1970) Axial differences in the reinnervation ofthe goldfish optic tectum by regenerating optic nerve fibres. ExpBrain Res 10:171–181

The Genetics Computer Group I (1997) Wisconsin Package 9.1. Madi-son, WI

Haffter P, Granato M, Brand M, Mullins MC, Hammerschmidt M,Kane DA, Odenthal J, van Eeden FJ, Jiang YJ, Heisenberg CP,Kelsh RN, Furutani-Seiki M, Vogelsang E, Beuchle D, Schach U,Fabian C, Nusslein-Volhard C (1996) The identification of geneswith unique and essential functions in the development of the ze-brafish,Danio rerio. Development 123:1–36

Holland PW (1999) The effect of gene duplication on homology. No-vartis Found Symp 222:226–236

Kimura M (1980) A simple method for estimating evolutionary rates ofbase substitutions through comparative studies of nucleotide se-quences. J Mol Evol 16:111–120

Lang DM, Lommel S, Jung M, Ankerhold R, Petrausch B, Laessing U,Wiechers MF, Plattner H, Stuermer CA (1998) Identification ofreggie-1 and reggie-2 as plasmamembrane-associated proteinswhich cocluster with activated GPI-anchored cell adhesion mol-ecules in non-caveolar micropatches in neurons. J Neurobiol 37:502–523

Malaga-Trillo E, Meyer A (2001) Genome duplications and acceleratedevolution of Hox genes and cluster architecture in teleost fishes.Amer Zool, in press

Meyer A, Malaga-Trillo E (1999) Vertebrate genomics: more fishytales aboutHox genes. Curr Biol 9:R210–

Meyer A, Schartl M (1999) Gene and genome duplications in verte-brates: the one-to-four (-to-eight in fish) rule and the evolution ofnovel gene functions. Curr Opin Cell Biol 11:699–704

Michalova V, Murray BW, Sultmann H, Klein J (2000) A contig map

of the Mhc class I genomic region in the zebrafish reveals ancientsynteny. J Immunol 164:5296–5305

Naruse K, Fukamachi S, Mitani H, Kondo M, Matsuoka T, Kondo S,Hanamura N, Morita Y, Hasegawa K, Nishigaki R, Shimada A,Wada H, Kusakabe T, Suzuki N, Kinoshita M, Kanamori A, TeradoT, Kimura H, Nonaka M, Shima A (2000) A detailed linkage mapof medaka,Oryzias latipes.Comparative genomics and genomeevolution. Genetics 154:1773–1784

Nelson JS (1994) Fishes of the world, 3rd ed. J. Wiley, New YorkOhno S (1970) Evolution by gene duplication. Springer-Verlag, Berlin,

New YorkOhno S (1999) The one-to-four rule and paralogues of sex-determining

genes. Cell Mol Life Sci 55:824–830Postlethwait JH, Yan YL, Gates MA, Horne S, Amores A, Brownlie A,

Donovan A, Egan ES, Force A, Gong Z, Goutel C, Fritz A, KelshR, Knapik E, Liao E, Paw B, Ransom D, Singer A, Thomson M,Abduljabbar TS, Yelick P, Beier D, Joly JS, Larhammar D, Rosa F,et al. (1998) Vertebrate genome evolution and the zebrafish genemap. Nat Genet 18:345–349

Saitou N, Nei M (1987) The neighbor-joining method: a new methodfor reconstructing phylogenetic trees. Mol Biol Evol 4:406–425

Sambrook J, Maniatis T, Fritsch EF (1989) Molecular cloning: a labo-ratory manual, 2nd ed. Cold Spring Harbor Laboratory Press, NewYork

Schroeder WT, Stewart-Galetka S, Mandavilli S, Parry DA, GoldsmithL, Duvic M (1994) Cloning and characterization of a novel epider-mal cell surface antigen (ESA). J Biol Chem 269:19983–19991

Schulte T, Paschke KA, Laessing U, Lottspeich F, Stuermer CA (1997)Reggie-1 and reggie-2, two cell surface proteins expressed by reti-nal ganglion cells during axon regeneration. Development 124:577–587

Stuermer CAO, Lang DM, Kirsch F, Deininger S, Wiechers MF,Plattner H (2001) GPI-anchored proteins and fyn kinase assemblein non-caveolar plasmamembrane microdomains defined by reg-gie-1 and reggie-2. Mol Biol Cell (in press)

Swofford DL (2000) PAUP*. Phylogenetic analysis using parsimony(*and other methods). Sinauer Associates, Sunderland, MA

Volonte D, Galbiati F, Li S, Nishiyama K, Okamoto T, Lisanti MP(1999) Flotillins/cavatellins are differentially expressed in cells andtissues and form a hetero-oligomeric complex with caveolins invivo. Characterization and epitope-mapping of a novel flotillin-1monoclonal antibody probe. J Biol Chem 274:12702–12709

Wittbrodt J, Meyer A, Schartl M (1998) More genes in fish? Bioessays20:511–515

245