two cl and c8: structure and in - pnas · hho.cl.95andhho.c8.5111andpartofthecorresponding genomic...

5
Proc. Natl. Acad. Sci. USA Vol. 84, pp. 4914-4918, July 1987 Developmental Biology Two human homeobox genes, cl and c8: Structure analysis and expression in embryonic development ANTONIO SIMEONE*, FULVIO MAVILIOt, DARIO ACAMPORA*, ADELE GIAMPAOLOt, ANTONIO FAIELLA*, VINCENZO ZAPPAVIGNAt, MAURIZIo D'EsPosITO*, MARIA PANNESE*, GIOVANNI Russot, EDOARDO BONCINELLI*, AND CESARE PESCHLEt§ *International Institute of Genetics and Biophysics, Via Marconi 10, 80125 Naples, Italy; tDepartment of Hematology, Istituto Superiore di Saniti, Viale Regina Elena 299, 00161 Rome, Italy; and tDivision of Obstetrics and Gynecology, Ospedale di Avellino, 83100 Avellino, Italy Communicated by Frank H. Ruddle, April 14, 1987 (received for review February 18, 1987) ABSTRACT Two human cDNA clones (HHO.cl.95 and HHO.c8.5111) containing a homeobox region have been charac- terized, and the respective genomic regions have been partially analyzed. Expression of the corresponding genes, termed cl and c8, was evaluated in different organs and body parts during human embryonic/fetal development. HHO.cl.95 apparently encodes a 217-amino acid protein containing a class I homeodomain that shares 60 out of 61 amino acid residues with the Antennapedia homeodomain of Drosophila melanogaster. HHO.c8.5111 encodes a 153-amino acid protein containing a homeodomain identical to that of the frog ACI gene. Clones HHO.cl and HHO.c8 detect by blot-hybridization one and two specific polyadenylylated transcripts, respectively. These are differentially expressed in spinal cord, backbone rudiments, limb buds (or limbs), heart, and skin of human embros and early fetuses in the 5- to 9-week postfertilization period, thus suggesting that the cl and c8 genes play a key role in a variety of developmental processes. Together, the results of the embryonic/fetal expression of cl and c8 and those of two previously analyzed genes (cdO and c13) indicate a coherent pattern of expression of these genes in early human ontogeny. In the fruitfly Drosophila melanogaster, studies on develop- mental mutants have allowed the identification of a number of genes controlling morphogenesis, which are classified in three general groups: (i) maternal-effect genes, which specify the structure and spatial coordinates of the egg, (it) segmen- tation genes, which determine the number and polarity of body segments, and (iii) homeotic genes, which specify the identity of each segment (reviewed in ref. 1). Most homeotic genes identified so far are clustered in the Antennapedia (Antp-C) and the bithorax (bx-C) complex (2, 3). Cloning of some of these genes has led to the identification of a 183-base-pair (bp) sequence, termed the homeobox, which is contained in most segmentation and homeotic genes and encodes a highly conserved protein domain (4, 5). Homeo- box-containing genes have also been cloned from sea urchin (6), frog (7-9), mouse (10-20), and human (21, 22); however, direct evidence for their regulatory role in development is still unavailable (cf. ref. 23). We have reported the isolation of human cDNA clones, termed HHO.cl, HHO.c8, HHO.c1O, and HHO.c13, con- taining homeobox sequences that appear to represent tran- scripts from four different genes (22). HHO.c1O (24) and HHO.c13 (25) contain a conserved class I homeobox and detect by blot hybridization specific transcripts during human embryogenesis. We report here the characterization of clones HHO.cl.95 and HHO.c8.5111 and part of the corresponding genomic regions. HHO.cl and HHO.c8 detect by blot hy- bridization one and two specific polyadenylylated tran- scripts, respectively, which are differentially expressed in a variety of organs or body parts at 5 to 9 weeks of embryon- ic/fetal development. METHODS Human Embryos and Fetuses. Human embryos and early fetuses were obtained virtually intact from legal curettage abortions and were maintained in sterile culture medium at +40C until processing (26). The age of embryos (5-8 weeks) was carefully established by morphologic staging according to multiple criteria (dating error was ± 2 days) (27, 28). Early fetuses (9 weeks) were staged on the basis of standard age/crown-rump length plots. Different organs or body parts were dissected under an inverted microscope and were stored in liquid nitrogen until analyzed. cDNA Clones. Recombinant clones (8 x 105) of a cDNA library prepared from poly(A)+ RNA of SV40-transformed human fibroblasts (29) were originally screened with a 600-bp BamHI-Pvu II fragment of clone 903 (4, 30) containing the entire Antp homeobox region or a 1050-bp Pvu II fragment containing the homeobox and the 3' region of theftz gene (4), both labeled to a high specific activity by nick-translation (31). Clones (2.5 x 105) of the same cDNA library were subsequently screened with the insert of clone HHO.cl or HHO.c8 under stringent conditions. Recombinant positive clones were isolated, and the longest inserts (HHO.cl.95, HHO.c8.5111) were characterized by restriction endonucle- ase mapping and were sequenced (32). RNA and DNA Analysis. Total RNA was extracted from fresh or frozen samples by the guanidinium isothiocyanate technique and was selected for poly(A)+ RNA by one passage on an oligo(dT)-ceilulose column (33, 34). Ten micrograms of total RNA or 2-3 pug of poly(A)+ RNA was electrophoresed on a 1.0% agarose/formaldehyde gel, blotted on nitrocellu- lose or nylon membranes, and hybridized to 107 cpm of DNA probe labeled by nick-translation to a specific activity of 3 X 108 to 8 x 108 dpm per Ag (31). The blots were washed under stringent conditions (cf. ref. 25). Probes were then removed by boiling in a 0.1% NaDodSO4 buffered solution, and the filters were rehybridized to a chicken .8-actin probe for normalization (see refs. 24 and 25). Southern analysis was performed according to standard techniques (31). RESULTS AND DISCUSSION The nucleotide sequence of HHO.cl.95 is shown in Fig. lA. Conceptual translation of its longest open reading frame, starting from the ATG codon at position 100, produces a peptide sequence of 217-amino acid residues that contains a class I homeodomain. The carboxyl-terminus of this protein lies 20-amino acid residues downstream from the homeodo- main and ends in a row of six glutamic acid residues. A 609-bp 3'-untranslated region contains a polyadenylylation signal §To whom reprint requests should be addressed. 4914 The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact. Downloaded by guest on November 8, 2020

Upload: others

Post on 11-Aug-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Two cl and c8: Structure and in - PNAS · HHO.cl.95andHHO.c8.5111andpartofthecorresponding genomic regions. HHO.cl and HHO.c8 detect by blot hy-bridization one and two specific polyadenylylated

Proc. Natl. Acad. Sci. USAVol. 84, pp. 4914-4918, July 1987Developmental Biology

Two human homeobox genes, cl and c8: Structure analysis andexpression in embryonic developmentANTONIO SIMEONE*, FULVIO MAVILIOt, DARIO ACAMPORA*, ADELE GIAMPAOLOt, ANTONIO FAIELLA*,VINCENZO ZAPPAVIGNAt, MAURIZIo D'EsPosITO*, MARIA PANNESE*, GIOVANNI Russot,EDOARDO BONCINELLI*, AND CESARE PESCHLEt§*International Institute of Genetics and Biophysics, Via Marconi 10, 80125 Naples, Italy; tDepartment of Hematology, Istituto Superiore di Saniti, VialeRegina Elena 299, 00161 Rome, Italy; and tDivision of Obstetrics and Gynecology, Ospedale di Avellino, 83100 Avellino, Italy

Communicated by Frank H. Ruddle, April 14, 1987 (received for review February 18, 1987)

ABSTRACT Two human cDNA clones (HHO.cl.95 andHHO.c8.5111) containing a homeobox region have been charac-terized, and the respective genomic regions have been partiallyanalyzed. Expression of the corresponding genes, termed cl andc8, was evaluated in different organs and body parts duringhuman embryonic/fetal development. HHO.cl.95 apparentlyencodes a 217-amino acid protein containing a class Ihomeodomain that shares 60 out of 61 amino acid residues withthe Antennapedia homeodomain of Drosophila melanogaster.HHO.c8.5111 encodes a 153-amino acid protein containing ahomeodomain identical to that of the frog ACI gene. ClonesHHO.cl and HHO.c8 detect by blot-hybridization one and twospecific polyadenylylated transcripts, respectively. These aredifferentially expressed in spinal cord, backbone rudiments, limbbuds (or limbs), heart, and skin of human embros and earlyfetuses in the 5- to 9-week postfertilization period, thus suggestingthat the cl and c8 genes play a key role in a variety ofdevelopmental processes. Together, the results of theembryonic/fetal expression of cl and c8 and those of twopreviously analyzed genes (cdO and c13) indicate a coherentpattern of expression of these genes in early human ontogeny.

In the fruitfly Drosophila melanogaster, studies on develop-mental mutants have allowed the identification of a numberof genes controlling morphogenesis, which are classified inthree general groups: (i) maternal-effect genes, which specifythe structure and spatial coordinates of the egg, (it) segmen-tation genes, which determine the number and polarity ofbody segments, and (iii) homeotic genes, which specify theidentity of each segment (reviewed in ref. 1). Most homeoticgenes identified so far are clustered in the Antennapedia(Antp-C) and the bithorax (bx-C) complex (2, 3). Cloning ofsome of these genes has led to the identification of a183-base-pair (bp) sequence, termed the homeobox, which iscontained in most segmentation and homeotic genes andencodes a highly conserved protein domain (4, 5). Homeo-box-containing genes have also been cloned from sea urchin(6), frog (7-9), mouse (10-20), and human (21, 22); however,direct evidence for their regulatory role in development is stillunavailable (cf. ref. 23).We have reported the isolation of human cDNA clones,

termed HHO.cl, HHO.c8, HHO.c1O, and HHO.c13, con-taining homeobox sequences that appear to represent tran-scripts from four different genes (22). HHO.c1O (24) andHHO.c13 (25) contain a conserved class I homeobox anddetect by blot hybridization specific transcripts during humanembryogenesis. We report here the characterization ofclonesHHO.cl.95 and HHO.c8.5111 and part of the correspondinggenomic regions. HHO.cl and HHO.c8 detect by blot hy-bridization one and two specific polyadenylylated tran-

scripts, respectively, which are differentially expressed in avariety of organs or body parts at 5 to 9 weeks of embryon-ic/fetal development.

METHODSHuman Embryos and Fetuses. Human embryos and early

fetuses were obtained virtually intact from legal curettageabortions and were maintained in sterile culture medium at+40C until processing (26). The age of embryos (5-8 weeks)was carefully established by morphologic staging accordingto multiple criteria (dating error was ± 2 days) (27, 28). Earlyfetuses (9 weeks) were staged on the basis of standardage/crown-rump length plots. Different organs or body partswere dissected under an inverted microscope and were storedin liquid nitrogen until analyzed.cDNA Clones. Recombinant clones (8 x 105) of a cDNA

library prepared from poly(A)+ RNA of SV40-transformedhuman fibroblasts (29) were originally screened with a 600-bpBamHI-Pvu II fragment of clone 903 (4, 30) containing theentire Antp homeobox region or a 1050-bp Pvu II fragmentcontaining the homeobox and the 3' region of theftz gene (4),both labeled to a high specific activity by nick-translation(31). Clones (2.5 x 105) of the same cDNA library weresubsequently screened with the insert of clone HHO.cl orHHO.c8 under stringent conditions. Recombinant positiveclones were isolated, and the longest inserts (HHO.cl.95,HHO.c8.5111) were characterized by restriction endonucle-ase mapping and were sequenced (32).RNA and DNA Analysis. Total RNA was extracted from

fresh or frozen samples by the guanidinium isothiocyanatetechnique and was selected for poly(A)+ RNA by one passageon an oligo(dT)-ceilulose column (33, 34). Ten micrograms oftotal RNA or 2-3 pug of poly(A)+ RNA was electrophoresedon a 1.0% agarose/formaldehyde gel, blotted on nitrocellu-lose or nylon membranes, and hybridized to 107 cpm ofDNAprobe labeled by nick-translation to a specific activity of 3 X108 to 8 x 108 dpm per Ag (31). The blots were washed understringent conditions (cf. ref. 25). Probes were then removedby boiling in a 0.1% NaDodSO4 buffered solution, and thefilters were rehybridized to a chicken .8-actin probe fornormalization (see refs. 24 and 25). Southern analysis wasperformed according to standard techniques (31).

RESULTS AND DISCUSSIONThe nucleotide sequence of HHO.cl.95 is shown in Fig. lA.Conceptual translation of its longest open reading frame,starting from the ATG codon at position 100, produces apeptide sequence of 217-amino acid residues that contains aclass I homeodomain. The carboxyl-terminus of this proteinlies 20-amino acid residues downstream from the homeodo-main and ends in a row of six glutamic acid residues. A 609-bp3'-untranslated region contains a polyadenylylation signal

§To whom reprint requests should be addressed.

4914

The publication costs of this article were defrayed in part by page chargepayment. This article must therefore be hereby marked "advertisement"in accordance with 18 U.S.C. §1734 solely to indicate this fact.

Dow

nloa

ded

by g

uest

on

Nov

embe

r 8,

202

0

Page 2: Two cl and c8: Structure and in - PNAS · HHO.cl.95andHHO.c8.5111andpartofthecorresponding genomic regions. HHO.cl and HHO.c8 detect by blot hy-bridization one and two specific polyadenylylated

Developmental Biology: Simeone et al.

(AATAAA) 20 nucleotides upstream from the poly(A) tail. Aputative glycosylation site (Asn-Xaa-Thr) is present at the60th amino acid residue of the homeodomain (Fig. LA), as isfound in the Antp homeodomain (23). The original cloneHHO.cl is identical to HHO.c1.95 at positions 450-1359.

A CGTTTTCGTCTAAATCTGC.ACTCTAATTCTGTAATATATCAAGCAATCTCCTAAA 55

Met Ser Ser Leu Tyr Tyr Ala Aan AlaACCGACACTMAACGTCCCTGCCTACAAATCATCCGCCCAAATTATG ACT TCA TTC TAT TAT (CG AAT GCT 126

A10 20

Leu Phe Ser Lys Tyr Pro Ala Ser Ser Ser Val Phe Ala Thr Cly Ala Phe Pro Glu GInTTA TTT TCT MA TAT CCA CCC TCA ACT TCG CTT TTC GCT ACC CGA GCC TTC CCA GAA CAA 186

30 40Thr Ser Cys Ala Phe Ala Ser Asn Pro Gln Arg Pro Gly Tyr Gly Ala Gly Ser Gly AlaACT TCT TGT GCC TTT GCT TCC MC CCC CAG CCC CCG GCC TAT CGA GCG CGT TCG GGC GCT 246

50 60Ser Phe Ala Ala Ser cet Gln Gly Leo Tyr Pro Gly Gly Gly Gly Met Ala Cly Gln SerTCC TTC GCC GCC TCG ATG CAG GGC TTC TAC CCC CGC GCC CGG GGC ATG GCG GCC CAG ACC

A

70 80Ala Ala Gly Val Tyr Ala Ala Gly Tyr Gly Leu Glu Pro Ser Ser Phe Asn Met His CysGCG GCC GGC GTC TAC GCC CCC CCC TAT GGG CTC GAG CCC ACT TCC TIC MC ATG CAC TCC

90 100Ala Pro Phe clu Gln Asn Leu Ser Sly Val Cys Pro Sly Acp Ser Ala Lys Ala Ala SlyGCG CCC TmT GAG CAG MC CTC TCC GGG GTG TGT CCC GCC SAC TCC GCC MG GCG GCG GCC

110 120Ala Lys Glu Sln Arg Asp Ser Asp Leoi Ala Ala Clu Ser Acn Phe Arg Ile Tyr Pro TrpGCC MS GAG CAG AGG GAC TCG SAC TTC GCG CCC GAG ACT MC TTC CGC ATC TAC CCC TCG

130 7Met Arg Ser Ser Gly Thr Asp SrgLys Arg Gly Arg Gln Thr Tyr Thr Arg Tyr Gln ThrATG CGA ASC TCA GGA ACT GAC GC MA CGA GGCC CC CAG ACC TAC ACC CGC TAC CAG ACC

306

366

426

486

546A

150 160Leu Clu Leu clu Lys clu Phe ills Tyr Asn Arg Tyr Leu Thr Arg Arg Ar8 Arg lle SiuCTG GAG CTG GAG MG GM TTT CAC TAC MT CGC TAC CTG ACC CCG CGC CGC CGC ATC SAG 606

170 180Ile Ala His Ala Leu Cys Leu Thr Sic Arg GIn Ile Lys Ile Trp Phe Gin Asn Arg ArgAIC GCCGCAC GCG CTC TGC CTC ACG GM AGA CAG ATC MC ATT TCC TTT CAG MC CCG CGC 666

Met Lys Trp Lys Lys Glu Acn Ly6 Thr Ala Gly Pro Gly Thr Thr Gly Cln Asp Arg AlaATC MG TGG MA MG GAG MC MSG ACC GCG GCC CCG GGG ACC ACC GGC CM GAC AGG GCT 726

A A

210Glu Ala Clu Gl Sblu Glu Glu Glu TrmGM GCA GAG GAG GSM GAS GAC TCAGGGATGGAGMAGGGCAGAGGMGAGACATGAGMAGGCAGACG 797

876

AAAAACAAMAAAACCCTATTTMXTMAGMACGAGTTTAAAAACATTTTTTMGGACGGACAACGAGAAATTTTGSTT 955

TTTCMCACTCAAAAAATAGTACCTATAGGMAATCTGTCACTTTCGGTTTTTTTGTACMTATCAAAACGACATTATC 1034

TACCTCTTCTGTAGCTTTCTGGMTTTACCTCCCCTTTTCTATGTTCCTATTGTMGGTCTTTCTAAAATCTTGCAGTT 1113

TTSTMGCCCTCTTTMTGCtCTCTTTGTCGACTCTGGCTCTGGACTMCCCTGTGGTTCCCTGCCCTCCTGTGCCTCC 1192

Proc. Natl. Acad. Sci. USA 84 (1987) 4915

HHO.c1.95 contains a second open reading frame, whichmight ehcode a 156-amino acid protein, starting from themethionine codon at position 116 through a TGA codon atposition 584 (halfway through the homeobox). Further anal-

A TTAMCCCCCAGCACGATGCACGACCAGGGCTTGACCTTGCCAGCTGCACATGCCTCCGCCCGTCCTCTCTCCCA 75

GCCTCTTCCTCCTGCCCCCTTCTTCCAMCTCTCCTTAATTTTCTTTCCCTTTTCCATCATTATMTTATTTTTATTTTT 156

GMTTTATATAMACTATATCTGTCGTCGTCTCCASCTCAGACAGCTCCCCAGCCGCACAGMTCACSCAACGACCACM

-7-TCMTTCCACCCCCTATCATCCACTCAGGCATTTCTCCACCTATGCACCGCCCGTCCCCCAaAACCC(CATCTACTCCAC

A

TCCCTTTTATTCCCCACACCACMTGTCCTGTTCAGTTCCACCCGCGCCCCCTATCACTATGCATCTAATTCCTTTTACA10

Met Leu Ser Asn Cys Arg Gln Acn Thr Leu Cly His Asn Thr Gln Thr SerCAGCACGAAGACATG CTC TCA MC TCC ACA CAA AAC ACC TTA SCA CAT MC ACA CAC ACC TCA

233

391

470

533

20 30Ile Ala Gln Asp Phe Ser Ser Glu GSn Gly Arg Thr Ala Pro Gln Asp Gln Lys Ala SerATC GCT CAG GAT TTT ACT TCT GAG CAG GGC AGG ACT GCG CCC CAG GAC CAG AM GCC AGT 593

40 S0 7Ile Gln Ile Tyr Pro Trp Met Gln Arg Met Asn Ser His Ser Gly Val Gly Tyr Gly AlaATC CAG ATT TAC CCC TCC ATG CAC CGA ATG MT TCG CAC AGT GGG CTC GGC TAC GSA GCC 653

60 70Asp Fr Arg Arg Gly Arg Cln Ile Tyr Ser Arg Tyr Gln Thr Leu Glu Leu Glu Lys GluCAC CGC ASS CCC CGC CCC CAC ATC TAC TCC CCG TAC CAG ACC CTG GM CTG GAG MG GM

'A80 90

Phe His Phe Asn Arg Tyr Leu Thr Arg Arg Arg Arg Ile Glu Ile Ala Asn Ala Leu CysTTT CAC TTC MT CCC TAC CTA ACC CGC CGC CCC CCC ATC GAG ATC CCC MC GCG CTT TGC

100 110Leu Thr Clu Arg Sln Ile Lys Ile Trp Phe Gln Asn Arg Arg Met Lys Trp Lys Lys GluCTG ACC GAG CGA CAC ATC MAA ATC TCG TTC CAG MC CGC CGC ATG MG TGG MA AMA GM

120 130Ser Asn|Leu Thr Ser Thr Leu Ser Gly Gly Gly Gly Sly Ala Thr Ala Asp Ser Met GlyTCT MT CTC ACA TCC ACT CTC TCG GGC GCC CCC GSA GCC CCC ACC CCC GAC AGC ATG CCC

A140 150

Gly Lys Glu Clu Lys Arg Glu Glu Thr GSi Glu Clu Lys Gln Lys Glu TncGGA AM GAG GM MG CGG GM GAG ACA GAA GAG GAG MG CAG AA GAG TGACCAGGACTGTCC

CTGCCACCCCTCTCTCCCTTTCTCCCTCGCTCCCCACCAACTCTCCCCTMTCACACACTCTGTATTTATCACTGGCAC

MTTCATGTGTTTTGATTCCCTAAMCMAATTACGCACTCAAACCTGCACCTCMAGTCACCTCTGGACCCCCTCCCT

CCCACTCCTTCGACGCCCCCCACCCCCGCCCCGCCTGCACACACCCGCCCCCTGCCCGCTCGGGCCTCTSCTCMACGCC

TCACGGCCCATGGCACCCCGCAGCCCCGGAGCGCASCCCCCGCCCTTGGCCCCACACCMCCCCMGGGCCTCCCCGCAG

713

773

833

893

956

1035

1114

1193

1272

1351

1430

GCCTTCCCAGCAGCGGCACCMAGCGCCTTAGGGAGCCCCAMMCCTACCACTCGCGTGTTCCCCMGCCCCTTCCTGC 1271A

TGCTGCTTGCTTCCCCTCCCCCAGCCCCATGCTCCCTTTACATTCTGTGTGTATCTAAAGCATGGAMMTAAAACCCA 1350

ATTAAAJMAT

B X HG 95.1

X HG 125.3

BgBgEEBgH3Bg H3 E H3 Bg

1 kbSTT S AT A

AAA

FIG. 1. (A) Nucleotide sequence of the human cDNA cloneHHO.cl.95 and the conceptual translation of its longest open read-ing frame. The region encoding the homeodomain is boxed. Thesecond conserved homeodomain and the polyadenylylation signal(AATAAA) preceding the poly(A) tail are underlined. -*, Methionine(ATG) codon representing the putative translation start site; *,putative glycosylation site; Trm, translation termination codon; y7,position of the intron; A, Hae III restriction sites used for subcloningHHO.cl.95. (B) Restriction map of the HHO.cl.95 transcription uniton two genomic clones (X95.1 and X125.3) as determined by Southernblot analysis. Open boxes indicate exons, and closed boxes indicatehomeobox sequences. E, EcoRI; Bg, Bgl II; A, Alu I; H3, HindIII;T, Taq I; S, Sma I.

CCCTGCCTAGCCCCTCTGCCCGACCMATGCCCASCCMGCAAATTCTATTTAAAGATCCTAGGGCTCATTATGGCAT 1509

TTTACAAACTGTGACCGTTTCTGTGTGMGATTTTTAGCTGTATTTGTGGTCTCTGTATTTATATTTATGTTTAGCACC 1588

GTCAGTGTTCCTATCCMTTTCAAAAAACCMMAMACAGGGGAAATTACAAAAGAGAGMAAAAAAGTCMTCACGT 1667

TTSTTTAGCCACTACCACAAAATAAATMAATAAAT

B

A HG 65

EEE BE Bg Bg Bg BgEBgE EEBg E

5 ---- _I ---- 3i1 kb

A Gy¶UA A 'AA

100 bp

FIG. 2. (A) Nucleotide sequence of the human cDNA cloneHHO.c8.5111 and the translation of its longest open reading frame.Symbols are as in Fig. LA. (B) Restriction map of the genomic regioncorresponding to HHO.c8.5111, as determined by Southern blotanalysis on two phage clones (X8 and X65). Open and closed boxesindicate exons and homeobox sequences, respectively. E, EcoRI;Bg, Bgl II; A, Alu I.

A HG 8

Dow

nloa

ded

by g

uest

on

Nov

embe

r 8,

202

0

Page 3: Two cl and c8: Structure and in - PNAS · HHO.cl.95andHHO.c8.5111andpartofthecorresponding genomic regions. HHO.cl and HHO.c8 detect by blot hy-bridization one and two specific polyadenylylated

4916 Developmental Biology: Simeone et al.

ysis is clearly necessary to assess the significance of thisobservation.HHO.cl.95 was used as a hybridization probe to screen a

human genomic library in X Charon 28 by standard proce-dures. Two overlapping genomic clones, X95.1 and X125.3,which contain a region of -24 kilobases (kb) correspondingto HHO.cl.95, were characterized by restriction endonucle-ase mapping (Fig. iB) and were partially sequenced (notshown). The HHO.cl.95 coding region is interrupted by anintron of -2400 bp located upstream from the homeobox(Fig. 1 A and B). Two additional homeobox sequences appearto be present in X95.1 upstream from the cl gene (Fig. 1B).Chromosome mapping by standard procedures (somatic cell

hybrids and in situ hybridization) indicates that the cl gene islocalized in the long arm of chromosome 17 (K. Hubner,personal communication), possibly within the gene clusterpreviously identified (13, 14) as containing Hul (termed c1O inrefs. 24), HO2, and HuS (18). Preliminary restriction data oncosmid clones locate the cl gene in a position corresponding toHox2.3 within the Hox2 murine cluster (17).The nucleotide sequence of the HHO.c8.5111 insert is shown

in Fig. 2A. Translation of the longest open reading frame

AEmbryos Tissues (7wk)

*30.b o(

I I IZ02 (0 A%

starting from the ATG codon at position 483 produces a peptideof 153-amino acid residues containing a class I homeodomainfollowed by 34 amino acids before the termination codon. The761-bp 3'-untranslated region contains a polyadenylylationsignal 15 nucleotides upstream from the poly(A) tail, which isfollowed by two additional and partially overlapping AATAAAsequences. The original clone HHO.c8 is identical toHHO.c8.5111 from position 594 to the poly(A) tail.The c8 homeodomain is identical to that found in the cDNA

clone AC1 from Xenopus laevis (7). The homology betweenthe predicted peptide sequences derived from HHO.c8.5111and AC1 is further extended outside the homeodomain (22).In particular, a potential glycosylation site is present at the61st amino acid residue in both homeodomains (Fig. 2A; cf.ref. 7).

Fig. 2B shows the genomic region corresponding toHHO.c8.5111 obtained from two overlapping phage clones(X8 and X65). Restriction analysis and partial sequencing (notshown) indicate that the HHO.c8.5111 coding region isinterrupted by an intron of -800 bp located upstream fromthe homeobox (Fig. 2A and B). An intron of 11 kb is presentfurther upstream in the 5'-untranslated region (Fig. 2 A and

*7t

*c

:A.

1.6- qw p

..~

- 28 S

;t.

-18S

1.6- io

MJ P--Actin

Spinal cord

6 7 8

-28S

-18 S1.6- O

Backbone Limbs

7 8 6 7 8

-28S

-18 S1.6- la 1.6- *W

Skin

7 8 9

-28S-_28S

-18S 16

-.mmXj

FIG. 3. (A) (Left) Blot-hybridizationanalysis of the expression of HHO.cl inpoly(A)+ RNA from whole human embry-os at 5, 6, and 7 weeks after fertilization.The relative mobilities of 28S and 18SrRNA are shown in size markers. Sizes arein kb. (Right) Expression of HHO.cl inpoly(A)+ (2-3 ,ug per lane) and total skin(10 tmg) RNA obtained from pools of organsor body parts dissected from human em-bryos at 7 weeks except for hearts, whichwere pooled from 7- to 8-week embryos,and skin, which was obtained at 9 weeks.Control, poly(A)+ RNA from an embryodeprived of limbs and spinal column. (B)Expression of HHO.cl transcripts inpoly(A)+ RNA (2 ,ug per lane) from spinalcord, backbone, and limbs and also in totalRNA (10 ,ug per lane) from skin in humanembryos from 6 to 9 weeks after fertiliza-tion, as analyzed by blot hybridization.Sizes are in kb. rRNA (28S and 18S) areshown as size markers.

28S-

18S-

B

Act in- A1ML.&fi-&.,dffl_

Proc. Natl. Acad Sci. USA 84 (1987)

_____

Dow

nloa

ded

by g

uest

on

Nov

embe

r 8,

202

0

Page 4: Two cl and c8: Structure and in - PNAS · HHO.cl.95andHHO.c8.5111andpartofthecorresponding genomic regions. HHO.cl and HHO.c8 detect by blot hy-bridization one and two specific polyadenylylated

Proc. Natl. Acad. Sci. USA 84 (1987) 4917

B). The c8 gene has been mapped on the long arm of humanchromosome 12 (K. Hubner, personal communication), in aposition unrelated to previously identified homeobox geneclusters on chromosomes 2 (ref. 25, and K. Hubner, personalcommunication), 7 (35), and 17 (ref. 14 and this paper).The putative proteins of HHO.cl.95 and HHO.c8.5111

contain a conserved pentapeptide Ile-Tyr-Pro-Trp-Met thatstarts at amino acid position -11 and -19, respectively, fromthe homeodomain (25).We have utilized HHO.cl and HHO.c8 as cDNA probes to

analyze the expression of the cl and c8 genes in humanembryonic development.

Blot-hybridization analysis of poly(A)+ RNA obtainedfrom whole embryos at 5, 6, and 7 weeks indicates that cl isexpressed as a single major transcript of 1.6 kb at a roughlyconstant level throughout this period (Fig. 3A). No signal wasdetected by blot hybridization of the corresponding poly(A)-RNAs (not shown).We have then analyzed the expression of cl in different

organs and body parts of 7-week embryos. Samples fromdifferent specimens were pooled when necessary. Heartsfrom 7- and 8-week embryos were pooled. The centralnervous system was obtained free of contaminating tissues;brain was dissected from spinal cord and medulla oblongataat the level of the pontine flexure. Skin was analyzed as total

AEmbryos

*zt *t *4CO

Tissues (7wk)

4.. e0

'b~~~~c~~~~ ~~ cip

A

4*<b

ell,~

RNA. As shown in Fig. 3A, the expression of cl transcriptswas tissue-specific in 7-week embryos: 1.6-kb messages wereabundant in spinal cord and detectable in backbone rudi-ments and limbs. Virtually no hybridization signal wasobserved in brain, heart, and liver, whereas a barely detect-able band was still present in a control embryo deprived oflimbs and spinal column. A low signal at 1.6 kb was alsodetectable in total RNA from skin, together with a broad bandof cross-hybridization to 28S rRNA.The expression of cl in positive tissues was then analyzed

at 6 to 9 weeks by blot-hybridization analysis of poly(A)+(spinal cord, backbone, and limbs) or total (skin) RNA (Fig.3B). The abundance of 1.6-kb transcripts shows a 3- to 5-foldincrease in spinal cord and backbone rudiments at 6 to 8 and7 to 8 weeks, respectively; a nearly constant low level wasobserved in limbs and skin at 6 to 8 and 7 to 9 weeks.The expression of c8 gene was analyzed on the specimens

described above. HHO.c8 detected two transcripts of 2.2 and1.8 kb by blot hybridization of poly(A)+ RNA from wholeembryos at 5, 6, and 7 weeks at a roughly constant level (Fig.4A Left). No signal was detected by blot hybridization ofcorresponding poly(A)- RNAs (not shown). The 2.2- and1.8-kb transcripts are abundantly accumulated in spinal cordwith a 1:2 ratio. The 1.8-kb RNA is predominantly expressedat a high level in limbs and backbone rudiments and at a lower

4;,

28S- *-28S-

18S- . 1.8--22 2.2-

1ota -1.8 1.8- a1 -18S18S-

Wfo t-Actin

BSpinal cord

6 7 8 9

Backbone

7 8

Limbs

6 7 8

28S- -28 S

18S 1 2.2-

- 1.8 -*. s 22- %, -18S1.8-

Actin- ____w as me_

FIG. 4. (A) (Left) Blot-hybridiza-tion analysis of the expression ofHHO.c8 in poly(A)+ RNA (2-3 ,ug perlane) from whole human embryos at

Skin 5, 6, and 7 weeks after fertilization.Skin Sizes are in kb. rRNA (28S and 18S)

are shown as size markers. (Right)7 B 9 Expression of HHO.c8 in poly(A)+

(2-3 ,g) or total skin (10 ,ug) RNAfrom pools of organs or body partsdissected from human embryos at 7

_28S weeks except for hearts, which werepooled from 7- to 8-week embryos,and skin, which was obtained at 8weeks. Control, poly(A)+ RNA from

22_ an embryo deprived of head, limbs,8- _ 18S heart, liver, and spinal column. (B)

Expression of HHO.c8 as analyzedby blot hybridization of poly(A)+RNA (2 ,ug per lane) from spinal cord,backbone rudiments, and limbs andalso from total RNA (10 ,g per lane)from skin obtained from human em-bryos from 6 to 9 weeks after concep-

*Se tion. Sizes are in kb. rRNA (28S and18S) are shown as size markers.

Developmental Biology: Simeone et al.

_ -28 S

Dow

nloa

ded

by g

uest

on

Nov

embe

r 8,

202

0

Page 5: Two cl and c8: Structure and in - PNAS · HHO.cl.95andHHO.c8.5111andpartofthecorresponding genomic regions. HHO.cl and HHO.c8 detect by blot hy-bridization one and two specific polyadenylylated

4918 Developmental Biology: Simeone et al.

level in heart (Fig. 4A Right). No signal was observed in brainand liver. Both bands are clearly detected by blot hybridiza-tion of skin total RNA (together with a prominent band ofcross-hybridization to 28S rRNA) at levels comparable tothose observed in spinal cord or limbs.A 1.8-kb band was still detected in poly(A)+ RNA from a

control embryo upon removal of head, limbs, spinal column,liver, and heart. Although at least part of this signal may beaccounted for by the presence of skin, it cannot be ruled outthat c8 transcripts are expressed in other organs or tissues.

Analysis of c8 expression at different stages of develop-ment indicated that the abundance of both 1.8- and 2.2-kbtranscripts, as well as their relative ratio, is nearly constantfrom 6 to 9 weeks in spinal cord, 6 to 8 weeks in limbs, 7 to8 weeks in backbone rudiments, and 7 to 9 weeks in skin (Fig.4B).

In order to exclude artifacts due to cross-hybridization totranscripts derived from other homeobox genes, we havehybridized blots of spinal cord poly(A)+ RNA to differentsubclones of both inserts. In particular, we utilized twononcontiguous Hae III fragments containing the homeoboxand the 3'-translated and untranslated regions of HHO.cl.95,as well as two contiguous Hae III fragments containing thehomeobox and the 3'-translated and untranslated regions ofHHO.c8.5111 (see Figs. lA and 2A). In both cases, hybrid-ization to all subclones detected exactly the same bands (notshown). Furthermore, these data indicate that both the 2.2-and 1.8-kb transcripts detected by HHO.c8 have essentiallythe same structure in the region spanning the homeobox andthe proximal 3'-untranslated sequences, thus suggesting thatthey might differ in their 5' portions.We conclude that cl and c8 are differentially expressed and

developmentally regulated in human embryogenesis. More-over, their expression patterns differ from those of otherhuman (24, 25) and mouse (12, 15-18, 20) homeobox genesreported so far.The pattern of expression observed for cl and c8, as well

as cdO and c13 (24, 25), allows the formulation of a series ofgeneral concepts: (i) The four genes are differentially ex-pressed according to tissue- and/or stage-specific patterns.Most of them encode multiple transcripts. (ii) All are abun-dantly transcribed in embryonic spinal cord, which repre-sents a typical metameric structure. In particular, strongheterogeneous 2.1- and 2.5-kb bands are detected in spinalcord by HHO.clO (24) and HHO.c13 (25), respectively.Clone HHO.cl detects 1.6-kb transcripts of the same sizeclass, which are also present in other embryonic tissues inlower abundance. Clone HHO.c8 detects two bands of 2.2and 1.8 kb with a specific intensity ratio different from thatobserved in other embryonic tissues. (iii) All genes, with thepossible exception of cdO, are transcribed in backbonerudiments. (iv) cl, c8, and cdO do not detect any transcriptsin embryonic brain throughout the analyzed period. c13 isexpressed as a single 4.2-kb band in mesencephalon andmetencephalon (not in prosencephalon) at 6 to 7 weeks butnot at 8 weeks or later. (v) Although the expression of cl andcdO is, respectively, largely and exclusively restricted tospinal cord (and apparently backbone), c8 and c13 are alsoexpressed in a large spectrum of embryonic structures (limbbuds or limbs, heart, skin, etc.), which follow dramaticallydifferent patterns of morphogenesis. (vi) Homeobox tran-scripts have not been detected so far in a variety of organs orbody parts (e.g., liver).

In conclusion, the concept emerges that in humans genescontaining homeodomains may play a wide spectrum ofcontrol activities in early development of different organs orbody parts. According to a unified model, the homeodomainmight exert a general function (possibly DNA-binding),which is necessarily shared by all homeobox gene products;

more specific functions may be exerted by the nonconservedparts of homeobox gene proteins, which diverged in variousevolutionary lineages.We thank Dr. W. J. Gehring for providing the Drosophila probes,

Dr. H. K. Okayama for the cDNA library, and Dr. G. Cossu forhelpful discussions. The expert assistance of Mrs. P. Alessandri intyping the manuscript is gratefully acknowledged. This work wassupported by grants from Consiglio Nazionale delle Richerche,Rome, Progetti Finalizzati "Ingegneria Genetica e Basi Molecolaridelle Malattie Ereditarie" to E.B. and C.P. (Grant No. 85.01425.51),"Oncologia" to E.B., F.M. (85.02244.44), and C.P. (85.02560.44),"Medicina Preventiva" to C.P. (85.00707.56), and the Italian Asso-ciation for Cancer Research (AIRC) to E.B.1. Gehring, W. (1985) Cold Spring Harbor Symp. Quant. Biol. 50, 243-251.2. Lewis, E. B. (1978) Nature (London) 276, 565-570.3. Kaufman, T. C., Lewis, R. & Wakimoto, B. (1980) Genetics 94, 115-133.4. McGinnis, W., Levine, M. L., Hafen, E., Euroiwa, A. & Gehring, W. J.

(1984) Nature (London) 308, 428-433.5. Scott, M. P. & Weiner, A. J. (1984) Proc. Natl. Acad. Sci. USA 78,

1095-1099.6. Dolecki, G. J., Wannakrairoj, S., Lurn, R., Wang, G., Riley, H. D.,

Carlos, R., Wang, A. & Humphreys, T. (1986) EMBO J. 5, 925-930.7. Carrasco, A. E., McGinnis, W., Gehring, W. J. & De Robertis, E. M.

(1984) Cell 37, 409-414.8. Mueller, M. M., Carrasco, A. E. & De Robertis, E. M. (1984) Cell 39,

157-162.9. Harvey, R. P., Tabin, C. J. & Melton, D. A. (1986) EMBO J. 5,

1237-1244.10. McGinnis, W., Hart, C. P., Gehring, W. J. & Ruddle, F. H. (1984) Cell

38, 675-680.11. Colberg-Poley, A. M., Voss, S. D., Chowdhury, K. & Gruss, P. (1985)

Nature (London) 314, 713-718.12. Colberg-Poley, A. M., Voss, S. D., Chowdhury, K., Stewart, C. L.,

Wagner, E. F. & Gruss, P. (1985) Cell 43, 39-45.13. Rabin, M., Hart, C. P., Ferguson-Smith, A., McGinnis, W., Levine, M.

& Ruddle, F. H. (1985) Nature (London) 314, 175-178.14. Joyner, A. L., Lebo, R. V., Kan, Y. W., Tjian, R., Box, D. R. &

Martin, G. R. (1985) Nature (London) 314, 173-175.15. Joyner, A. L., Kornberg, I., Coleman, K. G., Cox, D. R. & Martin,

G. R. (1985) Cell 43, 29-37.16. Jackson, I. J., Schofield, P. & Hogan, B. (1985) Nature (London) 317,

745-748.17. Hart, C. P., Awgulewitsch, A., Fainsod, A., McGinnis, W. & Ruddle,

F. H. (1985) Cell 43, 9-18.18. Hauser, C. A., Joyner, A. L., Klein, R. D., Learned, T. K., Martin,

G. R. & Tjian, R. (1985) Cell 43, 19-28.19. Awgulewitsch, A., Utset, M. F., Hart, C. P., McGinnis, W. & Ruddle,

F. H. (1986) Nature (London) 320, 328-335.20. Duboule, D., Baron, A., Mihl, P. & Galliot, B. (1986) EMBO J. 5,

1973-1980.21. Levine, M., Rubin, G. N. & Tjian, R. (1984) Cell 38, 667-673.22. Boncinelli, E., Simeone, A., La Volpe, A., Faiella, A., Fidanza, V.,

Acampora, D. & Scotto, L. (1985) Cold Spring Harbor Symp. Quant.Biol. 50, 301-306.

23. McGinnis, W. (1985) Cold Spring Harbor Symp. Quant. Biol. 50,263-270.

24. Simeone, A., Mavilio, F., Bottero, L., Giampaolo, A., Russo, G.,Faiella, A., Boncinelli, E. & Peschle, C. (1986) Nature (London) 320,763-765.

25. Mavilio, F., Simeone, A., Giampaolo, A., Faiella, A., Zappavigna, V.,Acampora, D., Poiana, G., Russo, G., Peschle, C. & Boncinelli, E.(1986) Nature (London) 324, 664-668.

26. Migliaccio, G., Migliaccio, A. R., Petti, S., Mavilio, F., Russo, G.,Lazzaro, D., Testa, U., Marinucci, M. & Peschle, C. (1986) J. Clin.Invest. 78, 51-60.

27. Moore, K. L. (1982) The Developing Human (Saunders, Philadelphia).28. Peschle, C., Mavilio, F., Care, A., Migliaccio, G., Migliaccio, A. R.,

Salvo, G., Samoggia, P., Petti, S., Guerriero, R., Marinucci, M.,Lazzaro, D., Russo, G. & Mastroberardino, G. (1985) Nature (London)313, 235-238.

29. Okayama, H. & Berg, P. (1983) Mol. Cell. Biol. 3, 280-289.30. Garber, R. L., Kuroiwa, A. & Gehring, W. J. (1983) EMBO J. 2,

2027-2036.31. Maniatis, T., Fritsch, E. F. & Sambrook, J. (1982) Molecular Cloning: A

Laboratory Manual (Cold Spring Harbor Laboratory, Cold SpringHarbor, NY).

32. Maxam, A. M. & Gilbert, W. (1977) Proc. Natl. Acad. Sci. USA 74,560-564.

33. Aviv, H. & Leder, P. (1972) Proc. Natl. Acad. Sci. USA 69, 1408-1412.34. Chirgwin, J. M., Przybyla, A. E., MacDonald, R. J. & Rutter, W. J.

(1979) Biochemistry 18, 5294-5300.35. Bucan, M., Yang-Feng, T., Colberg-Poley, A. M., Wolgemuth, D. K.,

Guenet, I. L., Francke, V. & Lehrach, H. (1986) EMBO J. 5, 2899-2905.

Proc. Natl. Acad. Sci. USA 84 (1987)

Dow

nloa

ded

by g

uest

on

Nov

embe

r 8,

202

0