cloning histocompatibility complex dc-3f3eddc13 chain(18) (termeddcaain this paper)show15, 22, and...

5
Proc. Natl. Acad. Sci. USA Vol. 81, pp. 5199-5203, August 1984 Immunology Cloning and sequence analysis of the human major histocompatibility complex gene DC-3f3 (class II antigen/Southern blot analysis) JEREMY M. BOSS AND JACK L. STROMINGER Department of Biochemistry and Molecular Biology, Harvard University, Cambridge, MA 02138 Contributed by Jack L. Strominger, April 11, 1984 ABSTRACT The DC antigen is one of the class II major histocompatibility antigens involved in the regulation of the immune response. This molecule is a heterodimer composed of an a and a 13 chain. Southern blot analysis of several homozy- gous cell lines shows that there are two DC.3 genes. The DC-3,3 gene, corresponding to a polymorphic restriction fragment, was cloned and sequenced and found to exist in five exons spanning 8 kilobase pairs of DNA. These exons correspond to the functional domains of the DCB protein. Comparison of the 131 domains of known DCf3 chains shows that the polymor- phism is clustered in four regions. A similar comparison of the mouse Af3 sequences shows only two prominent diversity re- gions. The DCB chain sequences are eight amino acids shorter than the Af8 chain sequences due to the elimination of a small exon by an aberrant splice acceptor. The class II antigens are encoded on chromosome 6 in the HLA-D region of the major histocompatibility complex (MHC) in man and on chromosome 17 in the Ia region of the mouse. These antigens are normally expressed on the sur- faces of B cells, some human T cells, and macrophages (re- viewed in refs. 1 and 2). They are glycosylated heterodimers consisting of an a (heavy) and a (3 (light) chain of approxi- mately 33,000 and 28,000 daltons, respectively (3). The sec- ond extracellular domain of each of these chains (a2 and (32) shows significant homology to the a3 domain of class I MHC antigens, to 132-microglobulin, and to the constant region of immunoglobulin molecules (1, 3). Although the details of the function of these antigens have yet to be elucidated, they play an integral role in regulation of the immune response and in communication between lymphocytes. The number of genes encoding class II antigens in man is unclear; however, three regions encoding the DR, DC, and SB a and 3 chain genes have been studied. The mouse ho- mologues for the DR and DC antigens are I-E and I-A, re- spectively. cDNAs (reviewed in refs. 1 and 2) and some ge- nomic clones (4-8, 33) corresponding to a and 3 chain genes for each of these families have been isolated and sequenced. The compilation of the DNA sequences of these genes has contributed significantly to the understanding of the primary structure of the proteins they encode and has suggested pos- sible mechanisms for the genesis of the polymorphism seen between alleles of these genes. To determine the structure of the class II 8 genes and to compare DC allotypes, we have cloned and sequenced a DC- 313 gene. The primary structure of this gene and the mapping of the transcription initiation site are discussed here. In addi- tion, we compare the sequences of this gene to those of other DC(3 genes and the homologous I-A,3 genes. MATERIALS AND METHODS Cloning and Sequencing. A partial Mbo I genomic library was constructed in the EMBL3B vector (9) from the lympho- blastoid cell line WT49 (DR3,3), a homozygous line derived from an offspring of a consanguineous marriage. The library was amplified and then screened with the protein-encoding sequences (Pst I/EcoRI fragment) of a DC(3 cDNA clone pIIB1 (formerly pDR(31) (10). Hybridization and washing of the filters were carried out as previously described (9). Sub- clones and deletions (11) were constructed from X-42 DNA by using the pUC9, pUC12 (12), or M13 mp8 (13) cloning vectors to facilitate sequencing. Restriction fragments were sequenced according to the procedures described by Maxam and Gilbert (14) and phage M13 clones essentially according to Sanger et al. (15). RNA Purification and Primer Extensions. Total RNA was prepared by using the guanidinium thiocyanate/cesium chlo- ride procedure (9), and poly(A)+ RNA was isolated by oligo- (dT)-cellulose chromatography (16). Primer extension reac- tions were carried out essentially as previously described, using a 5'-end-labeled synthetic oligonucleotide as a primer (17). This oligonucleotide, d(G-G-A-T-C-C-G-C-A-A-A-G- C-C-T-T-T-T-T-C-C), encodes the reverse complement of seven amino acids (-23 to -30) in the signal sequence. RESULTS AND DISCUSSION Cloning, Identification, and Sequencing. A genomic library from the homozygous lymphoblastoid cell line WT49 (DR3,3) was screened with a DCf3 cDNA clone p11,81 (for- merly pDRf1) (10). Several clones were strongly positive. One of the clones, X-42, was chosen for further analysis and a detailed restriction map was prepared (Fig. 1). The entire gene was sequenced (Fig. 2) in both directions or through overlapping clones (Fig. 1). The comparison of the sequence of X-42 (Figs. 2 and 3A) to the NH2-terminal protein sequence of a DC-1,B chain (18) (Fig. 3F) shows exact identity, indicating that X-42 is a DCB clone. Since WT49 is homozygous at the DR locus (DR3,3) and DR3 is in linkage disequilibrium with DC-3, it is likely that X-42 contains a DC-3,B gene. However, at least two DC- like (3chain genes may occur in the human genome (see be- low), and thus it is not possible to state unequivocally that the present gene encodes the serologically defined DC-3(3 specificity. The appropriate expression of this gene and as- say of its serological reactivity will ultimately define its specificity. Comparison of the amino acid sequence to that of the cDNA clone pIIB1 [isolated from Raji cells (DR3,6), Fig. 3B] (10) shows only one amino acid change, indicating that these two clones are likely to give rise to proteins with Abbreviations: MHC, major histocompatibility complex; kb, kilo- base pair(s); bp, base pair(s), TM, transmembrane. 5199 The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "adv'ertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact. Downloaded by guest on February 28, 2021

Upload: others

Post on 09-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Cloning histocompatibility complex DC-3f3edDC13 chain(18) (termedDCAAin this paper)show15, 22, and 24 amino acid differences, respectively (Fig. 3), which are clustered in four regions

Proc. Natl. Acad. Sci. USAVol. 81, pp. 5199-5203, August 1984Immunology

Cloning and sequence analysis of the human majorhistocompatibility complex gene DC-3f3

(class II antigen/Southern blot analysis)

JEREMY M. BOSS AND JACK L. STROMINGERDepartment of Biochemistry and Molecular Biology, Harvard University, Cambridge, MA 02138

Contributed by Jack L. Strominger, April 11, 1984

ABSTRACT The DC antigen is one of the class II majorhistocompatibility antigens involved in the regulation of theimmune response. This molecule is a heterodimer composed ofan a and a 13 chain. Southern blot analysis of several homozy-gous cell lines shows that there are two DC.3 genes. The DC-3,3gene, corresponding to a polymorphic restriction fragment,was cloned and sequenced and found to exist in five exonsspanning 8 kilobase pairs of DNA. These exons correspond tothe functional domains of the DCB protein. Comparison of the131 domains of known DCf3 chains shows that the polymor-phism is clustered in four regions. A similar comparison of themouse Af3 sequences shows only two prominent diversity re-gions. The DCB chain sequences are eight amino acids shorterthan the Af8 chain sequences due to the elimination of a smallexon by an aberrant splice acceptor.

The class II antigens are encoded on chromosome 6 in theHLA-D region of the major histocompatibility complex(MHC) in man and on chromosome 17 in the Ia region of themouse. These antigens are normally expressed on the sur-faces of B cells, some human T cells, and macrophages (re-viewed in refs. 1 and 2). They are glycosylated heterodimersconsisting of an a (heavy) and a (3 (light) chain of approxi-mately 33,000 and 28,000 daltons, respectively (3). The sec-ond extracellular domain of each of these chains (a2 and (32)shows significant homology to the a3 domain of class I MHCantigens, to 132-microglobulin, and to the constant region ofimmunoglobulin molecules (1, 3). Although the details of thefunction of these antigens have yet to be elucidated, theyplay an integral role in regulation of the immune responseand in communication between lymphocytes.The number of genes encoding class II antigens in man is

unclear; however, three regions encoding the DR, DC, andSB a and 3 chain genes have been studied. The mouse ho-mologues for the DR and DC antigens are I-E and I-A, re-spectively. cDNAs (reviewed in refs. 1 and 2) and some ge-nomic clones (4-8, 33) corresponding to a and 3chain genesfor each of these families have been isolated and sequenced.The compilation of the DNA sequences of these genes hascontributed significantly to the understanding of the primarystructure of the proteins they encode and has suggested pos-sible mechanisms for the genesis of the polymorphism seenbetween alleles of these genes.To determine the structure of the class II 8 genes and to

compare DC allotypes, we have cloned and sequenced a DC-313 gene. The primary structure of this gene and the mappingof the transcription initiation site are discussed here. In addi-tion, we compare the sequences of this gene to those of otherDC(3 genes and the homologous I-A,3 genes.

MATERIALS AND METHODS

Cloning and Sequencing. A partial Mbo I genomic librarywas constructed in the EMBL3B vector (9) from the lympho-blastoid cell line WT49 (DR3,3), a homozygous line derivedfrom an offspring of a consanguineous marriage. The librarywas amplified and then screened with the protein-encodingsequences (Pst I/EcoRI fragment) of a DC(3 cDNA clonepIIB1 (formerly pDR(31) (10). Hybridization and washing ofthe filters were carried out as previously described (9). Sub-clones and deletions (11) were constructed from X-42 DNAby using the pUC9, pUC12 (12), or M13 mp8 (13) cloningvectors to facilitate sequencing. Restriction fragments weresequenced according to the procedures described by Maxamand Gilbert (14) and phage M13 clones essentially accordingto Sanger et al. (15).RNA Purification and Primer Extensions. Total RNA was

prepared by using the guanidinium thiocyanate/cesium chlo-ride procedure (9), and poly(A)+ RNA was isolated by oligo-(dT)-cellulose chromatography (16). Primer extension reac-tions were carried out essentially as previously described,using a 5'-end-labeled synthetic oligonucleotide as a primer(17). This oligonucleotide, d(G-G-A-T-C-C-G-C-A-A-A-G-C-C-T-T-T-T-T-C-C), encodes the reverse complement ofseven amino acids (-23 to -30) in the signal sequence.

RESULTS AND DISCUSSION

Cloning, Identification, and Sequencing. A genomic libraryfrom the homozygous lymphoblastoid cell line WT49(DR3,3) was screened with a DCf3 cDNA clone p11,81 (for-merly pDRf1) (10). Several clones were strongly positive.One of the clones, X-42, was chosen for further analysis anda detailed restriction map was prepared (Fig. 1). The entiregene was sequenced (Fig. 2) in both directions or throughoverlapping clones (Fig. 1).The comparison of the sequence of X-42 (Figs. 2 and 3A) to

the NH2-terminal protein sequence of a DC-1,B chain (18)(Fig. 3F) shows exact identity, indicating that X-42 is a DCBclone. Since WT49 is homozygous at the DR locus (DR3,3)and DR3 is in linkage disequilibrium with DC-3, it is likelythat X-42 contains a DC-3,B gene. However, at least two DC-like (3chain genes may occur in the human genome (see be-low), and thus it is not possible to state unequivocally thatthe present gene encodes the serologically defined DC-3(3specificity. The appropriate expression of this gene and as-say of its serological reactivity will ultimately define itsspecificity. Comparison of the amino acid sequence to thatof the cDNA clone pIIB1 [isolated from Raji cells (DR3,6),Fig. 3B] (10) shows only one amino acid change, indicatingthat these two clones are likely to give rise to proteins with

Abbreviations: MHC, major histocompatibility complex; kb, kilo-base pair(s); bp, base pair(s), TM, transmembrane.

5199

The publication costs of this article were defrayed in part by page chargepayment. This article must therefore be hereby marked "adv'ertisement"in accordance with 18 U.S.C. §1734 solely to indicate this fact.

Dow

nloa

ded

by g

uest

on

Feb

ruar

y 28

, 202

1

Page 2: Cloning histocompatibility complex DC-3f3edDC13 chain(18) (termedDCAAin this paper)show15, 22, and 24 amino acid differences, respectively (Fig. 3), which are clustered in four regions

5200 Immunology: Boss and Strominger

E -0 oE~s

t

0

- EE

w IL; 1 2 0 E co m OL ILL _1_ .1

1kb

4 5 7 8

\ o

x m L XL s~e~a <|-ILE" aI _11

**-A

*.

the same serological reactions-i.e., p111,1 most likely wasderived from the DR3-containing haplotype of Raji cells.

Southern Blot Analysis of the DC/3 Chain Gene in WT49DNA. Southern blots using DNA from homozygous individ-uals and a variety of restriction enzymes have shown two tofour prominent bands when probed with a DC13 cDNA, indi-cating the occurrence of several related genes (21, 22). Oneof the prominent bands was nonpolymorphic, while at leastone other was polymorphic among the different cell lines. Asimilar analysis of the DC,/3 genes in the WT49 (DR3,3) cellline was carried out by Southern blotting (Fig. 4).DNAs from three homozygous cell lines, WT49 (DR3,3),

Priess (DR4,4) and LB (DR6,6) were digested with EcoRI.The Pst I/EcoRI fragment of p11131 hybridizes to a promi-nent 13-kb polymorphic EcoRI fragment in the WT49genome, which corresponds to the DCP3 gene analyzed here.This is the only EcoRI fragment in this gene that will hybrid-ize to the probe. The 18-kb band in Priess also correspondsto a single DC,8 gene (found in cosmid T28L; K. Okada, per-sonal communication) and probably to the gene cloned andsequenced by Larhammer et al. (8). The corresponding genein LB DNA appears to have at least one additional EcoRIsite within the corresponding DC/3 gene, producing a promi-nent 2.4-kb band and two less prominent bands at 11 and 10kb (or, alternatively, LB cells contain more DC/3 genes thanWT49 or Priess). These fragments are obviously those of a

polymorphic gene. A prominent 6.5-kb band is nonpolymor-phic with this enzyme within these three cell lines. In addi-tion, several less strongly hybridizing nonpolymorphicbands are evident, some of which are washed away underhigher stringency conditions. These bands may result fromcross-hybridization to more distantly related DCP3-likegenes, to pseudogenes, or to some other class II MHCgenes, since the probe used contains the entire 132 domainthat is conserved among class II genes. From these resultswe conclude that there are two major DC13-like genes in theWT49 cell line, and the gene sequenced here is derived froma polymorphic locus.Exon and Intron Organization of the DC-3p8 Gene. The DC-

313 gene is divided into five exons. The first exon containsthe 5' leader sequence of the mRNA, a signal sequence, andthe first four residues of the 181 domain of the mature pro-tein. The second and third exons encode the two extracellu-lar domains 131 (amino acid residues 5-94) and P2 (aminoacid residues 95-188), respectively. The fourth exon en-

codes residues 189-228, including the connecting peptide,the transmembrane domain, and the first six residues of thecytoplasmic domain. The fifth and final exon encodes fourresidues of the cytoplasmic domain and the 3' untranslatedsequences. Exon boundaries were determined by compari-son to the cDNA clone p11131. All splice sites are in concor-

dance with the consensus sequences for splice donors andacceptors (24). The arrangement of exons and introns in the

FIG. 1. Restriction map and sequenc-ing strategy. A restriction map of clone X-42 and a detailed map of the sequenced re-gion are shown. Darkened boxes representthe exons of the DC-3,8 gene. (A) Directionof sequencing using the chemical degrada-tion method (14). (B) Direction in whichthe M13 mp8 subclones were sequencedby using the dideoxy method (13, 15). kb,Kilobase pairs; bp, base pairs.

DC13 gene is similar to that in other MHC and immunoglob-ulin genes in that the respective protein encoding domainsare typically defined by their own exons.

(i) Signal sequence. The signal sequence is 32 amino acidslong and 56% of these residues are hydrophobic, consistentwith the composition of most signal sequences. Comparisonof this signal sequence to other DCPs reveals some intraspe-cies homology among allotypes but little homology whencompared to the mouse homologue, IAb13 (Fig. 3).

(ii) ,B1 domain. The 11 domain contains two cysteine resi-dues, which may form an intrachain disulfide bridge, and a

glycosylation sequence Asn-Gly-Thr. Both of these structur-al features are typical of class II light chains. It has previous-ly been demonstrated that the polymorphic regions of theclass II molecules reside in the 1 domain (25). Comparisonof the DC-3 131 residues to those of a probable DC-4 allele(cosII-102, isolated from an individual homozygous for DR4)(8), a second cDNA clone pII32 (isolated from Raji; DR3,6)(5) and a protein-derived sequence from a chemically isolat-ed DC13 chain (18) (termed DCAA in this paper) show 15, 22,and 24 amino acid differences, respectively (Fig. 3), whichare clustered in four regions (residues 28-32, 46-57, 66-77,and 84-90, boxed in Fig. 3).The A13 chains from the d,k, b and bmi2 (5, 20, 31, 32)

haplotypes are also compared to DC-313 in Fig. 3. These clus-tered amino acid changes are much less prominent in themouse A13 sequences and there appear to be fewer ofthem-i.e., less divergence appears to have occurred in the murinesystem. The NH2-terminal sequences in the DC light chainsare nearly identical for the first 24 residues of 11, while thecorresponding residues in the mouse, notably residues 9-14,are very variable. A polymorphism also occurs in I-AP3 atresidues 62-78, also seen in DC13. Moreover, the functionalsignificance of changes in this region has been clearly dem-onstrated by the occurrence of amino acid substitutions inthe functionally altered Apbm12 mutant at residues 67, 70,and 71, resulting from a gene conversion-like event with theE13 gene as the donor (31, 32).The DCa chain genes also display an allele-specific poly-

morphism with a similarly placed cluster at residues 45-56,and what appears to be a DCa/DXa-locus specific diver-gence at residues 63-70 (26). It is interesting to note that inthe cluster at residues 84-90 (Fig. 3) there are no differencesbetween DC-313 and the DC-413 sequence, while six out ofeight differences are found when comparing the DC-31 se-

quence to the pII,32 and DC13AA sequences. Since there ismore than one DC/3 gene, this cluster (residues 84-90) maybe analogous to the locus-specific cluster seen when com-paring the DCa and the DXa genes. Alternatively, this re-

gion (residues 84-90) may be an additional polymorphic re-

gion among certain alleles.(iii) 132, transmembrane, and cytoplasmic domains. The

132, transmembrane, and cytoplasmic domains are more

kb o 1 2.- 3

a,/e3a 0 0 "sa E200bp m x V le o

Proc. NatL Acad Sci. USA 81 (1984)

Dow

nloa

ded

by g

uest

on

Feb

ruar

y 28

, 202

1

Page 3: Cloning histocompatibility complex DC-3f3edDC13 chain(18) (termedDCAAin this paper)show15, 22, and 24 amino acid differences, respectively (Fig. 3), which are clustered in four regions

Immunology: Boss and Strominger Proc. Natl. Acad. Sci. USA 81 (1984) 5201

GGATCCTTTTGTAACATTAAACAAGTCATATTAATCTTAAATTTGTATATGTGA AGATCTAGATGTAAAATGCATGAAACATGATCCACATTTTACAAAGAGAAGCCTGGGCAAAATAAATTCAGTAATTTGTTGACTCTCATAAAGCACATTAGTTGGAACTGCAACTCACCATTATTTCCTCTAAGAACTTTGACTCTTTCACCAAAACTTAAGGCTCCTCAGGGTGTGTCT AAGACAACAGCAGTAAAAATGTCT ATGACAGCAATTTTCTCTC CCCTGAAATATGATC CCCACTTAATTTGCCCTATTGAAAGATC CCAAGTATAAGAACAACTGGTTTTTAATCAATATTACAAAGATGTTTACTGTTGAATGCATTTTTCTTTGGCTTCTTAAAATC CCTTAGGCATTCAATCTTCAGCTCTTC CATAATTGAGAGGAATTTTCAC CTCAAATGTTCATCCAGTGCAATTG

AAAGACGTC CAGTGCAGGCACTGGATTCAGAAC CTTCACAAAAAAAAAATCTGC|CAGGAAGTWGTC CTTCAGCTCCAGGjGATCT TT AA G AC

** * * t t t * * . . . HM S W KTACCACGCATGGAAACATCCACAGT~TTCTGCCAGGTACATCAGATC CATCAGGTC CGAGCTGTGTTGACT ACCACTTTTCCCTTCGTCTCAATT ATG TCT TGG AAA

K A L R I P G G L R A A S V T L M L S M L S T P V A E G R DAAG GCT TTG CGG ATC CCC GGA GGC CTT CGG GCA GCA ACT GTG ACC TTG ATG CTG TCG ATG CTG AGC ACC CCA GTG GCT GAG GGC AGA GAC

................

S~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P

TCT CCC G GTAAGTGCAGGGCAGCTGCTCTCCAGAGCCGCTACTCTGGGAACAGGCTCTC CTTGGGCTGGGGTACGGGGATGGTGATCTC CATAATCTCGGACACAATCTTTTATCAACATTTC CTCTGTTTTGGGAAAGAGAGCT ATGTTGCA TTTC CATTTATCT TTTAATGATGAAGTGAGGACA ATC CAATCCCATC CTACGGACTTAAGCCGAAGAGGAGGAGAGAGGAGAGAAAAGAGGAGACA AAGTGTTCATTTACT ACCAGTGATAGGACAAAGTGAGCA TGGGGTTATTTTTGAAAGATATGAATTTCTCCA AAGACACAG CAGGATTTGCCATTTAGGCGTGTCCCAAGACTTGCCTGACT AAATATTATGATTTC CTGCATTGGGAAATGCAAGGCAGCAATGGTGTCTGTAGTCTC CGTATTTGGGGAAAAGTTGTCTGCA TTCCTGACCCAGTGGAGCGTTTGTGGAGGCAAAATCTTGGTACTGAGGGAAGCTGACTGGCACT CCACAGAAAGAGAGCCTTCAGGTTTCGGCAAATGGTGAC CTGAGTGGGATTC CAGATACCCGAGTTGATGATGGACTAAATTTAGTAGAAAGGAGGATGTAAAGAAGGGAAATAACACA TACTGTGAAACCAACTCATTTCAGACACGGAACA ATACTTTACATAAATTCTCTCTCACTC CTTCTAACA TCCTGTGTGTAGATATCATGATTTTCTTTTACA CAATTATACTTGTAGTATGGATATTCTGTTACA TAAACTGCCCGGGCTGGTGACTGCCACAGTTCA ATGGGAATCTAGTTTATCAAATTCAAAAGCTTGTGCTCTTTCGGTGAATAAATGTTTCTTTCTAGGACTCAGAGATCTAGGACTCCCTTCTTTCTAACACAGCGTGAGTGAAC CTCACAGGGCACTTGGGAGGGTAAATC CAGGCATGGGAAGGAAGGTATTTTAC CCAGGGACCAAGAGAATAGGCGTATCGGAAGAGGACAGGTTTAATTCTGAAC CTGTCTCGTCATTCCCTTGAACTGTCAGGTTTATGTGGATAACTTTATCTCTGAGGTACCCAGGAGCTCCATGGAAAATGAGATTTCA TGCGAGAACGCCCTGATC CCTCTAAGTGCAGAGGTCCATGTAAAATCAG CCCGACTGCCTCTTCACTTGGTTCACAGGCCGAGACAGGGACAGGGCTTTC CTCCCTTTC CTGCCTGTAGGAAGGCGGATTC CCGAAGACCCCGAGAGGGGCGGGCAGGGCTGGCAGAGTTCGGGAGGATC CCAGGTCTGCAG CGCGAGGCACG

E D

GGCCGGCGGGAACTTGTGGTCGCGCGGGCTGTTC CACACGTCCGGGCCGGGTCAGGGTGGCGGCTGCGGGGGCGGACGGGCTGGGCGCA CTGACTGGCCGGTGATTCCTCGCAG AG GATF V Y Q F K G M C Y F T N G T E R V R L V S R S I Y N R E E

TTC GTG TAC CAG TTT AAG GGC ATG TGC TAC TTC ACC AAC GGG ACA GAG CGC GTG CGT CTT GTG AGC AGA AGC ATC TAT AAC CGA GAA GAGI V R F D S D V G E F R A V T L L G L P A A E Y W N S Q K D

ATC GTG CGC TTC GAC AGC GAC GTG GGG GAG TTC CGG GCG GTG ACG CTG CTG GGG CTG CCT GCC GCC GAG TAC TGG AAC AGC CAG AAG GACI L E R K R A A V D R V C R H N Y Q L E L R T T L Q R R

ATC CTG GAG AGG AAA CGG GCG GCG GTG GAC AGG GTG TGC AGA CAC AAC TAC CAG TTG GAG CTC CGC ACG ACC TTG CAG CGG CGA G GTGAGC

GGCGTCGCCCCTCTGCGAGGC CCAC CCTTGGC CCCAAGTCTCTGCGCCAGGAGGGGCGAAGGGTCGTGGC CTCTGGAAC CTGAGCCCCGTTTGTTC CACCCCAGAGGACAGGAGGCAGCGGCGAGAGTGGTGGGGGCAGGGTCATCGGAGGTGCGGGGAC CTAGGCAGAGCAGGGGGACAGGCAGAGTTGGCCAGGCTGCCT AGTGTCGCCCCAGC CTACCCGTTCGTCGGCCTTGTC CTCTGCTCTGCATGTTCTTGCCTCGTGCCTTATGCATTTGCCTCCTTTTGCCTTACCTTTGCTAAGCAGCTCTCTCTGCTCAGAATGCCCGCCCTCTTCCCCTGCCCGCCCGCCCGCCCCA CTAGCACTGCCCCAC CCAGCAAGGCCCATCGTGCACAGCTCTTGCAGCAGGAAGCT TCAGGCTTAGCCTGGTGGAGTTAGGGCTGTTC CACAACTGCGCGCAGGACATTCAG CAATTACAGTTGTGAAATAAGATATTTTAACT TTTGGCTTCAAATCATTATTCATCGTAATTCTGTTTTCTTAAATGGCTCTCA TTCATGGCAGAGATCTTTGAGGTGAGGGTGTTTTAATCATTGCATGCCTAGTAC CTGACACATTGACTGGTATGTG GTGTGAGCTCAATGATCTTCTGTTAAATTAATGAATAAATGTACTCAGCTGCCCA TCCACTTAGGCTCAAGAAAAAAAAAGAGGTAAACTGAGCCTTAAAAATGGACTTTATTAATTATTTTCT ATAATTTTGCTTAATGCTTTAAAGTAAACTCTTATTGACT TGGATCTTAATAGAGTTTGAATACA AAATCTGAGGAAAAAAGTTTTTGCTAAAAATAAAAACA ACGCTTGAAAGATATTGTAAGGCAGTTCAAATTTCTTTTCTTTTCTTTTTTTTTTTTTTTTGAGACGGATTCTCACTCTGTCGCCGAGCCGGAGTGCAGTGGCGCATCTCGGCTCACTGCA AGCTCOGCCTCCCGGGTTCACGCCATTCTC CTGCCTCAGC CTCCTGAGTAGGTGGGATTACAG GCGCGTGCCAC CACGCCCGGCTAATTTTTTTGTATTTTTAGTATGAGCGGGGTTTCAC CGTGTTAGCCAGGATGGTCTGGATCTCCTGACCTCATGATC CGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACAGTGC CCGGCCGGCACTTTAATTTCTTAGAAAAAGCTGAACAAATGGCACA ATGCAAAGAGCAAAAGTTTTGGAATAAATAGATTGAAGCCATTAAATTATTGGATAAAAATAGTTTCGGGTTGCTTTTGGCCTAGGTTCTCCCCTC CCCCCATGACTATC CACTTCAGGAATAAACATTCTGAAAGTCAATTTTAC CCATTTGATGAGACATTTATTTCT AGACAGTTGCCTTATCAAATACCATCTATGTTACGTCATTTAATCT CACAGTTACTTGTGCATCAGAGATTAGCATCAC CACTTTATATATTGGTACA TGATAAACACTTTATTGGTCATGGATGGGGAGATGGTCACTGTAGGCTAATATTGGTACA TGATAAACACTTTAAGTAATCAGCCCATAATTGCTCAC CAAGAC CTTAAGCCT CCCAAAGTACACAACATTCTTTGTGTTCT TCACTACACATC CATAGAGTCTAAGGGACGTAAGCCTCGTTAAAGC CAGTTTTAGAC CGAGAAGCAGCAATGAGTCTATC CTGTGTGATTTC CATGTTAATGGGACAAAATGATACTTTCAAGGCATTGAAAATTCATGATTAATCAATCCCTAGTCTGAC CCCAGTGTTATCTATGCAGGTTTGCA AAACCTTTAGTTCACT AATACTCCCCTTGCCTTCTTTTGATTCA CATCCTAATGCCAGCAAATACTTATTTTTTGCTATTTCAGTTCCATTTC CATAAAATTTATTTTATCATCTTTTCTCA TAAATTTATGCCCTCTATTTTTACT CCCAATCTGTTCAAGATGAACAAATCTTATAAGGCCA CATAGCTGACTGTTATTTCTGTTGGATCTCGAGGTGG0GTAAGGAAAAGTTGAG TGTTATGTGGGOOOOATAAGATCCAGATGATGGCTCATGAGTGCCATTTAATTGGGGTGCCACCTGGTGACCTCAGCAAATCCCAGCTATATTTATGTGTTCA CATTACAGGATCA TTAACCCAGACCGAC CACTGCACAAGATCTCAGAATATTTTCTATGGGAGAACATACATAATAATGCCTGATTTCAGAAGAAGAAAGTAATTCTCAATAGCAAGGGGATGGAGTAGGGTAGACAGCTGTAATTAAACT CACTTGTGTGATAAAAAGATTAATGGGAAAAAGAAAATGAGGAGAACATATTACTAAATAAAGAAAGCATACA ATGTTACTATAGTTTCACACT AAGAGAATAAAGGAAATGCAATAAAGTGGCCGAAAGGTAAAGGATGAGATGTGTAAAGAGGTGTAGTATTTTTCATATGACGACGAACTGAGAAGATAAAGGAATCGAGTTACGGGCAAACATGATGTTTGATCAGTGTTATTTGTTTTCAAGGCCTGCCTAAATTTTTTTCA AATATTACAAACTTTTGAAATAACATTTCTTTTTGTTTTTTGCTGTCTGTTACTAGGTTGCAOTTTCATAAAGGCAGGGACCATGGTATGTA CTTTTTGGATTCTCAGTGATTGTTATATTTAGATTTGOTGTAAGGAACCTTAATCCAAGACTTGGACTCCAAGTATCTTTCCACTOCTGGTTCCAAGGAGGGACCTCCCTCCAGCAGGCATGCTGTGTGGTCTCACATCTCACTCCTATATCTTTCCCTGTCTGTTACTGCCC

V E P T V T I S P S R T E A L N H H N L L V C S V T D F YTCAG TG GAG CCC ACA GTG ACC ATC TCC CCA TCC AGG ACA GAG GCC CTC AAC CAC CAC AAC CTG CTG GTC TGC TCG GTG ACA GAT TTC TATP A Q I K V R W F R N D Q E E T A G V V S T P L I R N G D W

CCA GCC CAG ATC AAA GTC CGG TGG TTT CGG AAT GAC CAG GAG GAG ACA GCT GGC GTT GTG TCC ACC CCC CTT ATT AGG AAT GGT GAC TGGT F Q I L V M L E M T P Q R G D V Y T C H V E H P S L Q S PACC TTC CAG ATC CTG GTG ATG CTG GAA ATG ACT CCC GAG CGT GGA GAC GTC TAC ACC TGC CAC GTG GAG CAC CCC AGC CTC CAG AGC CCCI T V E W

ATC ACC GTG GAG TGG C GTAAGGGGATATTGAGTTTCTGTTACTGTGGGCCCCACAAGACA AAGGACAGAGCTCCTTCTGACCCATCCCTTCCCATCTCTTATC CCTGATGTCACTGCTGAGCTGGGAATCACAGGAGACT AGAGCAC CTCTAGTTCCATGGCGAGTGCATCAGAAGAATC CTGATCTCATCAC CTTTCCAGATGCTAGGGAAATTACTCTACATACTGTTGCT-CTGGATC CAGTCCTGATTGCTCT5AGGAACTfiATTATTAGGGCTGGTGACTGGCATCTTAGGGTCTAAGTTTATGGATGAGTTC CTGACGGAGTGGAGATCTG CTTCCCCACTCTGTCACCTACTCACTGTATC CAAGGACCTATTGGCTGGCCTTTCCCTCCCTTAGGGGTGGTCTGAATGGAGAACTAGGTTCCTTTGATGCCTTCACCTCCTGCATCTCAGACTGGACTTCAAGC

R A Q S E S A Q S K M L S GTCCTCATCAGGGAAACTATGGGGTATGGGGACAAACACTGACACTCAGGCTCTGCTTCTCAG GG GCT CAA TCT GAA TCT GCC CAG AGC AAG ATG CTG AGT GGCI G G F V L G L I F L G L G L I I H H R S Q K

ATT GGA GGC TTC GTG CTG GGG CTG ATC TTC CTC GGG CTG GGC CTT ATC ATC CAT CAC AGG AGT CAG AAA G GTGAGGAACCCCAGGGGAAAAGGGGAAGATGGCCTGTGAC CCAGACCCTCTGTTCAGAGAGGTC CTGTCTCTAGATGTAGCTCTTTC CTCCTGAC CCTGAGAGGAAGAAAGCTGAGCTGGAAGTGGAAGGAGACAGGACA AGGTTGGAGGAGGCATTGGAATCTGATTTTACTAGCTGAAGGGTAG CCCTGTCACAGAGCTGACTGATAGAGCTTATTC CAGGGCATC CTTACCATTCATCATTGTCTCACTGGCTCCTTTCCAAAAACTTCCTCCATTAAGAGGGTCAGAGC CTCGGCCTCCTTACCTTCTAGTGACAATTTC CTTCATTTTAGGGGATTTCAAATTAGGGTGCTCA AGGACTCGAAGAACATGAATGGGAAGAGAATATAACTCTAATTAAGTCACATGTGTCATTTTC CTTTGGGGTGAGAGAGTGACTGTTCATGTAATGAGACCTTTCTCTGCATAACTTC CTTTTGTAAGAC CTCAAGGGCCACCAGCAGGTAATATTTCGAGCCGGCATCCAGTGTGGGGAGGGCACAGGTGTAAGAGGGAAGAGCA TGAGCTGAGTGTACCTGACAGTAGTGGTCTCTGTTCATGGTATATTTGCTGCTATAGGATCAAGACTTAGGGGTGAAGTTTGCCAGTTTCT AGGAATCTCCAGAGGTTGTTCCCCAGAACCAAGCCTTAACTTTGGTGGTATCTTC CTGTTAAATGTGGAG CCAGAACCACGGCTTAAATGTTAGACACTAGGATGATGCCCACTTTGTGC CACATGATGGTGGCT ACTGCCTGTAGGCATTTTCCAGTGACTGAAAGAGGCTGCTAGTGGTAGGGATGAGGTATCATC CAATTTCCTAAAAAGATTGAAC CCTTCATATTCAC CAIGAAGAGTAACAGCTGTTC CCCCACCTCCCACACA TCTGCATCAAGCTGAAGTTCTGTGTCTTCATGAGCTGATTTCTC CTTTGCACAGATCTTGGGGGAGGTGATGACAATACACTCTGGACCTCAGCTTTGTCTGTCTGAAG CTGCAGGAGGCCCCTGAGGGGTGGGGAAGATGGCAGGCCCAC CAGCGTACCCTGTGCTGATCATC CCTCTTCT

G L L H *CTCTTCTTCAG GG CTC CTG CAC TGA CTCCTGAGACTATTTTAACTGGGATTGGTTATCACTTTTCTGTAACGCCTGCTTGTCCCTGCCCAGAATTCCCAGCTGTCTGTGTCA-

GCCTGTCCCCCTGAGATCAGAGTC CTACAGTGGCTGTCACGCAGCCACCAGGTCATCTCCTTTCA TCCCCACCTTGAGGCGGATGGCTGTGACCCTACTTC CTGCACTGACCCACAGCCT

CTGCCTGTGCACGGCCAGCTGCA TCTACTCAGGCCCCAAGGGGTTTCTGTTTC CTATTCTCTCCTCAGACTGCTCAAGAGAAGCACATGAAAAC CATTACCTGACTTTAGAGCTTTTTTA

CAT* ~ATGATCCTGAGTTATCTGTATTCTGAACTTC CTTAATTGAGCAGAGGCiGGAAATCA CTGCAGAATGAiGGAACATACCTAGGCCGCCATTCCG

AGGAGGGTTGTAC CTTGAAAGACA CTGAAAGAATTTGGGGTGCAAAGTCATGGTGGGCAGAGGAGGTAGAAAATCAACTCAGTTGTTGCA TCATTCATGGTTCTTTCATATTGATGTTCAGTGCAGTGGCCTGAGAATATC CCAGCCTCTCTTCTGGTTTGGGAr.TGCTATATAAGTAAACATGGTGGAATTGTTTGGGGGCAGATAG

120240360480

60029

716 5'ut+2

806 Sig+4919

1034115912791399151916391759187919992119

62238

362328

66 Bi2418

942509

262927492869298931093229334934693589370938293949406941894309442945494669478949095029514952695389123

5479153

5569 B21835659188

57695889600961292026232 TM225

6328 Cy6448656866886808692870487168728874082297520CY

76403'ut7760

7880

80008089

FIG. 2. Sequence of the DC-3f3 gene. The nucleotide sequence of DC-3:3is shown, with the translated amino acids (standard one-letter code)above their respective codons. Underlined regions represent exons and dotted underlining at nucleotides 711-738 and 6786-6808 refer to thereverse complement of the oligonucleotide primer used in the primer extension experiments and the missing exon discussed in the text,respectively. Boxed regions 534-547 and 567-574 highlight sequences that are common to the upstream regions in all class 1I genes. Boxedregions for nucleotides 593-597, 625-633, and 7765-7770 highlight the "CAT," "TATA," and "ATTAAA" sequences, respectively. The arrowspoint to the initiation of transcription sites as determined from primer extension experiments. Sig, signal sequence; ut, untranslated sequences;TM, transmembrane sequence; CY, intracytoplasmic sequence.

Dow

nloa

ded

by g

uest

on

Feb

ruar

y 28

, 202

1

Page 4: Cloning histocompatibility complex DC-3f3edDC13 chain(18) (termedDCAAin this paper)show15, 22, and 24 amino acid differences, respectively (Fig. 3), which are clustered in four regions

5202 Immunology: Boss and Strominger

A) DC-3B) pIIBIC) DC-4D) pIIB2E) DCAAF) DC-jG) I-AH) I-AdI) I-AkJ) I-At

-30 -20 -10 1 10 20 30 40 s0l l l l Iv l l

60 70 so

HSWKKALRIPGGLRASVTLHMLSHLSTPVAZGRDSPEDFVYQFKGNCYFTNGTERVr-Ri- y-iR-IVRFDS--_ "TLLGLDA h!NSQnIIiKRArVDICMNIP_-V-VT A- -a-AY- PP C --T--E-

. _- -_ ..............._,j __~~~~~~~~.. ......___7@X-@...@@vs**- @*@@**

.....M-SLLLS-U-

90

AF-G.... .. vIRR

100 110 120 130 140 150 160 170 180 190 200 210 220I X I I I I I v I I I I v

A) VgPTTISPSRTEALNHHNLLVCSVTDFYPAQInVRWFRNDQ88TAGVVSTPLIIRNGNTFQILVML>TPQRGDVYTcHVEHPSLQSPITVEWRIAQszsAQsxmLSGIOGFnLGLIFLGonLIINHHRsaK GLLHB) -------------.-----__ --_-_-__ ------_-_-_.-i _C) -------- -HI-.-.-------T---------------- -----N---------------V------ -QR- -E) - --------------------- --------.................. ............................ ....

G) EQ-N-V-L ------- -----K----I-V--S--Q------V----R--K - W-----C-V-- F-R GPRGPPPA---QH) 8Q--V-L----^---T-- -------V--S--Q-------V--R--8---KC ---- -C-V F-R--G-PROPPA-QI) EQ-N-1-L-- ----T----K-------V--S--Q-----V----H E- K ----R-----C-- ----F-R----PRGMPPPA--QJ) EQ-S-V--L- ---T---- ---.------V-S-- ------V--R--Z------- ----------- --F-R---CPGMPPPA---Q

FIG. 3. Comparison of DC,/-like chains. The DC-3,1 amino acid sequence (A) is shown for comparison with B, pII81 (10); C, DC-413 fromcoslI-102 (8); D, p11(,2 (5); E, DCAA (18); F, DC-1,8 (19); G, Abf3 (5, 20); H, Abml2I3 (31); I, Ad(3 (20); and J, AkI3 (20). Dashes indicate identities;dots represent undetermined residues; and gaps represent deletions in the amino acid sequence placed so as to maximize the homology betweenthe proteins. Arrowheads show the intron-exon boundaries for the DC-3,8 gene. Boxes circumscribe clusters of amino acid polymorphisms.

highly conserved between the allotypes than the (31 domains.The (32 domain is homologous to the a2 domain of class II

heavy chains, a3 domain of the class I MHC antigens, (2-

microglobulin, and the constant region domains of immuno-globulins (1, 2). Comparison of the DC-3(3 chain with otherDC,3 and A(3 light chains for the most part indicates that theamino acid changes in these regions are conservative wherecharged amino acids are involved (Fig. 3).The DC(3 cytoplasmic domain is also eight amino acids

shorter than the domains of A,3, DR(3, and E(p. As noted inearlier reports (5, 7, 8) this could be due to the presence of anaberrant splice site eliminating an exon in the JJC( genes.The intron between the transmembrane exon and the 3' un-translated exon was translated in all three reading frames. Asequence homologous to the exon in the A(3 genes was foundin this intron in the DC-3(3 gene (indicated by a dotted under-

.\ 1A .A lo

n-

-- _ _4.

FIG. 4. Autoradiographs of Southern blots. Genomic DNA fromthree homozygous cell lines WVT49 (DR3,3) (lanes 1), Preiss (DR4,4)(lanes 2), and LB (DRw6,6) (lanes 3) were digested with EcoRI.Each sample was split in half and electrophoresed on either side ofthe gel. Southern blotting was carried out as previously described(23). Hybridization was carried out with a nick-translated Pst I/EcoRI fragment of pII31 (15) as previously described (9). The blotcontaining duplicate samples was washed in 300 mM NaCl/30 mMsodium citrate/0.5 x Denhardt's solution/0.1% sodium dodecyl sul-fate two times for 30 min at 680C. The blot was cut in half. Blot Awas further washed in 300 mM NaCl/30 mM sodium citrate and blotB in 30 mM NaCl/3 mM sodium citrate for 4 hr with four changes oftheir respective buffers at 680C. A darker exposure of lane 2 for blotA and B is shown on the right. Positions of size markers are shownin kb on the left.

line in Fig. 2) and can be compared with the translated AP3sequence and a similar sequence in DC-483: I-At'b, G-P-R-G-P-P-P-A; DC-383, [G-P-E-G-P-P-A]; and DC-483, [G-P-E-G-P-P-P-A] (brackets indicate an untranslated sequence). Thereare two anomalies in the nucleotide sequence of DC-3JP inthis region. First, the splice acceptor usually found as Y,-N-Y-A-G/G (24) is Y,-G-T-A-A/G. An A-A has not been foundto serve as a splice acceptor site (24). Second, there are onlyseven instead of eight amino acids encoded before the splicedonor is encountered. The sequence generated for the DC-4,B gene (cosIl-102) (8) shows a similar pseudo-exon contain-ing the same aberrant splice acceptor, but this exon containsall eight amino atids. Because both alleles have the sameaberrant acceptor ahd the DC-483 sequence has eight aminoacids it seems likely that the loss of the proline codon in theDC-3p3 sequence occurred later in evolution than the loss ofthe splice acceptor.

(iv) The 3' untranslated domain. The 3' untranslated por-tion of the DC-3,3 gene consists of approximately 350 nucleo-tides (underlined in Fig. 2). This sequence is extremely ho-mologous to the 3' ends of both the DC-4(3 and the p11(81genes, differing at 8 and 2 nucleotides respectively. Since thecDNA clone pII31 does not contain a poly(A) sequence weassume that it is just short of the poly(A) sequence and esti-mate the 3' end of the mRNA to be about 20 nucleotidesfrom the polyadenylylation signal, which in this case is A-T-T-A-A-A. A-T-T-A-A-A sequences have been found in sev-eral other genes (27), although A-A-T-A-A-A is more fre-quent. An A-T-T-A-A-A sequence is also present in both theDC-4/3 and pII31 clones as well as in a DR( cDNA clone.The sequencing of an additional 300 nucleotides past thispoint reveals no other polyadenylylation signal.

Nucleotide Polymorphism. A comparison of nucleotide dif-ferences between DC-3p and DC4(3 or pIIP2 shows that in(31 77% and 71% of the differences, respectively, are in thefirst and second positions of the codons (Table 1); in (32, 60%and 44%; and in TM, 0% and 37%. Moreover, for the mostpart the differences in (1 result in nonconservative aminoacid replacements, whereas the (32 and TM domains show

Table 1. Nucleotide differences within codons

Domaincompared 131 132 TMwith DC-3/3 1 2 3 1 2 3 1 2 3

DC4f3 4 11 6 1 2 2 0 0 0pII32 15 16 9 3 1 5 1 2 5

The numbers of nucleotide differences are tabulated for each posi-tion within the codon.

H --M-::-----Q-I-T-T-Y- Y-Y -H---E--H-D -FEF-Q--EL-T EGF-TH-3-H-LH::.-- -Y--O-T---T-Y----Y-Y-

Proc. NatL Acad. Sci. USA 81 (1984)

. .

Dow

nloa

ded

by g

uest

on

Feb

ruar

y 28

, 202

1

Page 5: Cloning histocompatibility complex DC-3f3edDC13 chain(18) (termedDCAAin this paper)show15, 22, and 24 amino acid differences, respectively (Fig. 3), which are clustered in four regions

Proc. Natl. Acad. Sci. USA 81 (1984) 5203

more third-position differences, and these nucleotide differ-ences generally result in silent or conservative amino acidsubstitutions. A similar result was obtained when I-Ab wascompared to I-Ad or I-Ak. Although the sample examined issmall, this observation is consistent with that found for otherMHC genes, such as DCa (26), Aa (28), and the variableregion of immunoglobulin genes (29).Gene conversion, unequal crossing-over events, and/or

point mutations in functionally important areas followed bypopulation selection are proposed as possible mechanismsby which clustered sequence polymorphisms can be generat-ed (29). The former two are capable of causing rapid changebetween members of a gene family. A gene conversion-likeevent has been shown to be the cause of both the class Imutant, bml, and the class II mutant, bmJ2, in the mouse

(30-32). Whatever the mechanism, it is clear that the 31 do-main is subject to different mutagenic and selection pres-sures than the other domains.Mapping of the 5' End of DC-3 Chain mRNA. Primer

extension experiments were carried out to determine the po-sition of the DC-3/3 gene transcription initiation site. Reac-tions were carried out using poly(A)+ RNA and total RNAfrom WT49 cells and 721.82 cells (Fig. 5). The 721.82 cellline, used as a control, contains a double haplotype deletionof the DR and DC loci (21). The primer extension reactionsshow three bands (arrows in Figs. 2 and 5) occurring at 58,55, and 50 nucleotides 5' to the ATG codon in the WT49RNA track. No bands are seen in the 721.82 RNA track,which in this case serves as a control for artifacts. It is notuncommon to see several bands in primer extension experi-ments. These bands may represent multiple transcription ini-tiation sites, regions of secondary structure that reverse

transcriptase has trouble reading through, or transcriptionfrom another DC-like gene (Fig. 4).The Promoter. The DNA sequences 5' to the transcription

initiation site contain the standard signals for transcription.A TATA-like region, A-T-T-T-T-T-A-T-T; and a CAT box,C-C-A-A-T, are found 20 and 51 bp 5' to the start of thetranscription. In addition, the DC-3/3 gene also has addition-al sequences C-C-A-G-A-G-A-C-A-G-A-T-G and C-T-G-A-T-T-G-G located 121 and 78 bp 5' to the start of the transcrip-tion (boxed sequences in Fig. 2). These sequences, as notedpreviously (6, 33), are common to Ea, E/3, and DRa. In addi-tion these sequences are found in both the DCa and DXasequences (J. Lillie, personal communication). It may bethat these sequences are responsible for the specific regula-tion of these genes during the development of an immuneresponse.The DC-3/3 gene presented in this paper has all the fea-

tures characteristic of a functional gene. This statement can

be verified only by transfecting this gene into cells and as-

saying for its expression.

2 3 4 5

hp

-lIN)147

W 1 () FIG. 5. Mapping the 5' end of the

DC-3,3 mRNA. Autoradiograph ofprimer extension experiment reac-

-{ tions carried out on poly(A)+RNAfrom an equivalent of 20 jug of total

7-6 RNA from WT49 (lanes 1 and 3) and

-6f 721.82 cells (lane 2) or 20 Atg of total

RNA from WT49 (lane 4) and 721.82

(lane 5). The arrows point to the ma-

jor bands representing the 5' ends of

the DC-3/3 mRNA.

We thank P. A. Peterson for giving us pI1/31 and R. DeMars forgiving us the cell line 721.82. We thank Rosemarie Mengler and Jo-anne Kuo for technical assistance; J. Lillie, A. Korman, B. Guild,M. Roux-Dosseto, and C. Auffray for helpful discussions; and AliceFurumoto-Dawson for typing the manuscript. J.M.B. received sup-port from Damon Runyon-Walter Winchell and National Institutesof Health (1 F32 Al 06860) fellowship grants. This research was sup-ported by National Institutes of Health Grant AM-30241.

1. Kaufman, J. F., Auffray, C., Korman, A. J., Shackelford,D. A. & Strominger, J. L. (1984) Cell 36, 1-13.

2. Steinmetz, M. & Hood, L. (1983) Science 222, 727-733.3. Shackelford, D. A., Kaufman, J. F., Korman, A. J. & Stro-

minger, J. L. (1982) Immunol. Rev. 66, 133-187.4. Schamboeck, A., Korman, A. J., Kamb, A. & Strominger,

J. L. (1983) Nucleic Acids Res. 11, 8663-8674.5. Larhammar, D., Hammerling, U., Denaro, M., Lund, T., Fla-

vell, R., Rask, L. & Peterson, P. (1983) Cell 34, 179-188.6. Saito, H., Maiko, R. A., Clayton, L. & Tonegawa, S. (1983)

Proc. Natl. Acad. Sci. USA 80, 5520-5524.7. Hyldig-Nielsen, J. J., Schenning, L., Hammerling, U., Wid-

mark, E., Heldin, E., Lind, P., Servenius, B., Lund, T., Fla-vell, R., Lee, J. S., Trowsdale, J., Schreier, P. H., Zahlitzky,F., Larhammar, D., Peterson, P. & Rask, L. (1983) NucleicAcids Res. 11, 5055-5071.

8. Larhammar, D., Hyldig-Nielsen, J. J., Servenius, B., Ander-son, G., Rask, L. & Peterson, P. A. (1983) Proc. Natl. Acad.Sci. USA 80, 7313-7317.

9. Maniatis, T., Fritsch, E. F. & Sambrook, J. (1982) MolecularCloning: A Laboratory Manual (Cold Spring Harbor Labora-tory, Cold Spring Harbor, NY).

10. Larhammar, D., Schenning, L., Gustafsson, K., Wiman, K.,Claesson, L., Rask, L. & Peterson, P. A. (1982) Proc. Natl.Acad. Sci. USA 79, 3687-3691.

11. Frisohauf, A. M., Garoff, H. & Lehrach, H. (1980) NucleicAcids Res. 8, 5541-5549.

12. Vieira, J. & Messing, J. (1982) Gene 19, 259-268.13. Messing, J., Crea, R. & Seelong, P. H. (1981) Nucleic Acids

Res. 29, 309-321.14. Maxam, A. & Gilbert, W. (1980) Methods Enzymol. 65, 499-

560.15. Sanger, F., Coulson, A. R., Barrell, B. G., Smith, A. J. H. &

Roe, B. A. (1980) J. Mol. Biol. 143, 161-178.16. Aviv, H. & Leder, P. (1972) Proc. Natl. Acad. Sci. USA 69,

1408-1412.17. Montgomery, D. C., Boss, J. M., McAndrew, S. J., Marn, L.,

Walthall, D. A. & Zitomer, R. S. (1982) J. Biol. Chem. 253,7756-7761.

18. Gotz, H., Kratzin, H., Thinnes, F. P., Yang, C. Y., Kruse, T.,Pauley, E., Kolbel, S., Egert, G., Wernet, P. & Hilschmann,N. (1983) Hoppe-Seyler's Z. Physiol. Biochem. 364, 749-755.

19. Bono, M. R. & Strominger, J. L. (1983) Immunogenetics (NY)18, 453-459.

20. Choi, E., McIntyre, K., Germain, R. N. & Seidman, J. G.(1983) Science 221, 283-285.

21. Roux-Dosseto, M., Auffray, C., Lillie, J. W., Boss, J., Cohen,D., DeMars, R., Mawas, C., Seidman, J. G. & Strominger,J. L. (1983) Proc. Natl. Acad. Sci. USA 80, 6036-6040.

22. Bohme, J., Owerbach, D., Devaro, M., Lernmark, A., Peter-son, P. A. & Rask, L. (1983) Nature (London) 301, 82-84.

23. Southern, E. M. (1975) J. Mol. Biol. 98, 503-517.24. Mount, S. M. (1982) Nucleic Acids Res. 10, 457-472.25. Kaufman, J. F. & Strominger, J. L. (1982) Nature (London)

297, 694-696.26. Auffray, C., Lillie, J., Arnot, D., Grossberger, D., Kappes, D.

& Strominger, J. L. (1984) Nature (London), in press.27. Boss, J. M., Gillam, S., Zitomer, R. S. & Smith, M. (1982) J.

Biol. Chem. 256, 12958-12961.28. Benoist, C. O., Mathis, D. J., Kanter, M. R., Williams, V. E.

& McDevitt, H. 0. (1983) Cell 34, 169-177.29. Baltimore, D. (1981) Cell 24, 592-594.30. Weiss, E., Mellor, A., Golden, L., Fahrner, K., Simpson, E.,

Hunt, J. & Flavell, R. (1983) Nature (London) 301, 671-674.31. McIntyre, K. R. & Seidman, J. G. (1984) Nature (London)

308, 551-553.32. Widera, G. & Flavell, R. A. (1984) EMBO J. 35, 1221-1225.33. Mathis, J. D., Benoist, C. O., Williams, V. E., Kanter, M. R.

& McDevitt, H. O. (1983) Cell 32, 745-754.

Immunology: Boss and Strorninger

Dow

nloa

ded

by g

uest

on

Feb

ruar

y 28

, 202

1