the structure of the human apolipoprotein genes

5
0270-9139/87/0701-056S$02.00/0 HEPATOLOGY Copyright 0 1987 by the American Association for the Study of Liver Diseases Vol. 7. No. 1. pp. 56S-6OS, 1987 Printed in U. S. A. The Structure of the Human Apolipoprotein Genes LAWRENCE CHAN Departments of Cell Biology and Medicine, Baylor College of Medicine, Houston, Texas 77030 The plasma lipoproteins are water-soluble lipid-pro- tein complexes which transport lipids in the vascular and extravascular compartments. The lipoproteins have been classified according to their flotation densities in the ultracentrifuge. By this technique, the major classes of lipoproteins include the chylomicrons, very low den- sity lipoproteins, intermediate density lipoproteins, low density lipoproteins and high density lipoproteins (HDLs). Of the HDL, the two best-defined subclasses are HDL, and HDL3. The major function of the protein components of the lipoproteins, or apolipoproteins, is lipid transport. However, many apolipoproteins serve other additional functions which have been reviewed in detail (1-3). STRUCTURE OF APOLIPOPROTEINS AND THEIR MRNAS The cDNAs for a number of apolipoproteins have been isolated. These include those for apo A-I, A-11, A-IV, C-I, C-11, C-I11 and E (4-12). Recently, partial cDNAs for the largest of the apolipoproteins, apo B, have also been identified (13-16). In uitro translation as well as DNA sequence analyses have indicated that the apoli- poproteins are similar to other secretory proteins in that they contain a signal peptide as part of the initial nascent polypeptide chain on ribosomes. These are cleaved co- translationally. Furthermore, two of the apolipoproteins, apo A-I and apo A-11, also contain a pro-segment. Apo A-I has a 6-amino acid pro-peptide with the sequence Arg-His-Phe-Trp-Gln-Gln. The Gln-Gln right next to the proteolytic cleavage site is a rather unusual occur- rence (17). Apo A-11, in contrast, has a 5-amino acid pro- peptide with the sequence Ala-Leu-Val-Arg-Arg (18). The two basic amino acids next to the cleavage site is typical of peptides that undergo posttranslational pro- teolytic processing. The major sites of synthesis of the major apolipoproteins are the liver and the intestine. Apo E appears to be unique in that it is also synthesized in many other organs (19-24). Apo B is the largest of the apolipoproteins. There are two different forms of apo B, a hepatic form (apo B-100) which appears to be slightly more than twice as large as the intestinal form (apo B- 48) (25). Apo B-48 appears to share common sequences This study was supported by a grant (HL-16512) from the National Institutes of Health. Address reprint requests to: Lawrence Chan, M.D., Departments of Cell Biology and Medicine, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030. with the NH2-terminal portion of apo B-100. Recent studies using monoclonal antibodies indicate that apo B- 100 and apo B-48 are two distinct products of the same gene (26). The exact structural relationship between these two proteins is presently unknown. GENOMIC STRUCTURES OF HUMAN APOLIPOPROTEINS Recently, the genomic structures of a number of apo- lipoproteins have been reported including those of apo A-I (5, 6), apo E (27, 28), apo C-I11 (29), apo A-I1 (30) and apo C-II(31). It is apparent that these apolipoprotein genes have remarkably similar structures, each consist- ing of 4 exons and 3 intervening sequences (IVS). The exon/IVS structures of these genes have been examined (30). In eukaryotes, exon/IVS junctional structures are well- conserved. IVS are bounded at the 5’ end by a conserved sequence of nine nucleotides with the consensus se- quence ,SAGGTeAGT. They are bounded at the 3’ end by another sequence which is pyrimidine-rich with the consensus sequence, PyPyPyPyPyPy XCAGG (32). On examination of all of the exon/IVS junctions in all the apolipoprotein genes published to date (apo A-I, apo E, apo A-11, apo C-111 and apo C-II), they all conform to these 5’ and 3’ splice signals (data not shown). Recently, Keller and Noon (33) examined the IVS further away from the splice junction. They found an internal con- served sequence TACTAAC that is located 20 to 55 nucleotides from the 3’ IVS boundary which might serve as internal splice signals. They analyzed genes from Drosophila, sea urchin, chicken, duck, mouse, rat and man. The analysis of human genes was confined to the globin family. In this gene family, they found the internal 3‘ IVS signal to have the consensus sequence CTGAC. We have searched for potential internal 3‘ IVS splice signals in all of the apolipoprotein genes with known structure (Table 1). We found that, for the apolipopro- tein gene family, the consensus sequence CTgAN is present between 8 and 78 nucleotides from the 3’ splice junction of all the 15 IVS for the five genes examined. Recent studies indicate that in IVS processing, splicing events involve an intermediate in the form of a lariat structure (34). In this intermediate, the 5‘ end of the IVS has been released from the preceding exon, and the 5’ terminal G is joined in a branched structure to a point in the IVS near the 3’ splice site. Ruskin et al. (34) report that the branch occurs at an A and that the 56s

Upload: lawrence-chan

Post on 11-Jun-2016

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The structure of the human apolipoprotein genes

0270-9139/87/0701-056S$02.00/0 HEPATOLOGY Copyright 0 1987 by the American Association for the Study of Liver Diseases

Vol. 7. No. 1. pp. 56S-6OS, 1987 Printed in U. S. A.

The Structure of the Human Apolipoprotein Genes

LAWRENCE CHAN Departments of Cell Biology and Medicine, Baylor College of Medicine, Houston, Texas 77030

The plasma lipoproteins are water-soluble lipid-pro- tein complexes which transport lipids in the vascular and extravascular compartments. The lipoproteins have been classified according to their flotation densities in the ultracentrifuge. By this technique, the major classes of lipoproteins include the chylomicrons, very low den- sity lipoproteins, intermediate density lipoproteins, low density lipoproteins and high density lipoproteins (HDLs). Of the HDL, the two best-defined subclasses are HDL, and HDL3. The major function of the protein components of the lipoproteins, or apolipoproteins, is lipid transport. However, many apolipoproteins serve other additional functions which have been reviewed in detail (1-3).

STRUCTURE OF APOLIPOPROTEINS AND THEIR MRNAS

The cDNAs for a number of apolipoproteins have been isolated. These include those for apo A-I, A-11, A-IV, C-I, C-11, C-I11 and E (4-12). Recently, partial cDNAs for the largest of the apolipoproteins, apo B, have also been identified (13-16). In uitro translation as well as DNA sequence analyses have indicated that the apoli- poproteins are similar to other secretory proteins in that they contain a signal peptide as part of the initial nascent polypeptide chain on ribosomes. These are cleaved co- translationally. Furthermore, two of the apolipoproteins, apo A-I and apo A-11, also contain a pro-segment. Apo A-I has a 6-amino acid pro-peptide with the sequence Arg-His-Phe-Trp-Gln-Gln. The Gln-Gln right next to the proteolytic cleavage site is a rather unusual occur- rence (17). Apo A-11, in contrast, has a 5-amino acid pro- peptide with the sequence Ala-Leu-Val-Arg-Arg (18). The two basic amino acids next to the cleavage site is typical of peptides that undergo posttranslational pro- teolytic processing. The major sites of synthesis of the major apolipoproteins are the liver and the intestine. Apo E appears to be unique in that it is also synthesized in many other organs (19-24). Apo B is the largest of the apolipoproteins. There are two different forms of apo B, a hepatic form (apo B-100) which appears to be slightly more than twice as large as the intestinal form (apo B- 48) (25). Apo B-48 appears to share common sequences

This study was supported by a grant (HL-16512) from the National Institutes of Health.

Address reprint requests to: Lawrence Chan, M.D., Departments of Cell Biology and Medicine, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030.

with the NH2-terminal portion of apo B-100. Recent studies using monoclonal antibodies indicate that apo B- 100 and apo B-48 are two distinct products of the same gene (26). The exact structural relationship between these two proteins is presently unknown.

GENOMIC STRUCTURES OF HUMAN APOLIPOPROTEINS

Recently, the genomic structures of a number of apo- lipoproteins have been reported including those of apo A-I (5, 6), apo E (27, 28), apo C-I11 (29), apo A-I1 (30) and apo C-II(31). It is apparent that these apolipoprotein genes have remarkably similar structures, each consist- ing of 4 exons and 3 intervening sequences (IVS). The exon/IVS structures of these genes have been examined (30).

In eukaryotes, exon/IVS junctional structures are well- conserved. IVS are bounded at the 5’ end by a conserved sequence of nine nucleotides with the consensus se- quence ,SAGGTeAGT. They are bounded at the 3’ end by another sequence which is pyrimidine-rich with the consensus sequence, PyPyPyPyPyPy XCAGG (32). On examination of all of the exon/IVS junctions in all the apolipoprotein genes published to date (apo A-I, apo E, apo A-11, apo C-111 and apo C-II), they all conform to these 5’ and 3’ splice signals (data not shown). Recently, Keller and Noon (33) examined the IVS further away from the splice junction. They found an internal con- served sequence TACTAAC that is located 20 to 55 nucleotides from the 3’ IVS boundary which might serve as internal splice signals. They analyzed genes from Drosophila, sea urchin, chicken, duck, mouse, rat and man. The analysis of human genes was confined to the globin family. In this gene family, they found the internal 3‘ IVS signal to have the consensus sequence CTGAC. We have searched for potential internal 3‘ IVS splice signals in all of the apolipoprotein genes with known structure (Table 1). We found that, for the apolipopro- tein gene family, the consensus sequence CTgAN is present between 8 and 78 nucleotides from the 3’ splice junction of all the 15 IVS for the five genes examined. Recent studies indicate that in IVS processing, splicing events involve an intermediate in the form of a lariat structure (34). In this intermediate, the 5‘ end of the IVS has been released from the preceding exon, and the 5’ terminal G is joined in a branched structure to a point in the IVS near the 3’ splice site. Ruskin et al. (34) report that the branch occurs a t an A and that the

56s

Page 2: The structure of the human apolipoprotein genes

Vol. 7. No. 1, Suppl. 1987 STRlI(’T1JRK O F T H E H I ’ M A N APOLIPOPROTEIN GENES 57s

sequence around this A is CTGAC, CTAAT, CTGAT, CTAAC and CTCAC in the five cases examined. The consensus sequence we have identified, CTEAN, would then be recognized, and possibly utilized in uiuo for the branch structure. This suggests that this internal 3’ splice signal is operative in the apolipoprotein gene fam- ily. We note that the sequence is highly homologous to that in the globin family (see above), the only other human gene family thus examined to date.

Since the apolipoproteins all function as lipid-binding proteins, there has been considerable speculation that they might be evolutionarily related (35). Indeed, amino acid and DNA sequence comparisons have shown that apo A-I, A-11, A-IV, C-I, C-11, C-I11 and E all contain common modules of 1 1-codon repeating units. A detailed analysis of the homologies between apolipoproteins and the individual modules has been recently published by Luo et al. (36). The study strongly suggests that the 11- codon module is a basic building block for the apolipo- proteins and forms the basic structure of the putative ancester for all of them.

5’ FLANKING SEQUENCE OF APOLIPOPROTEIN GENES

Since apolipoprotein genes are probably evolutionarily related, we have compared the 5’ flanking sequences of all of the apolipoprotein genes published to date. As shown in Table 2, all of these genes exhibit a GC-rich region of high (>70%) homology within 500 nucleotides

TABLE 1. Internal 3’ splice signal sequences in apolipoprotein IVS

Signal Distance

A- I IVS-I C C A C T C A G C C A (; 22 IVS-I1 G G C C T G A T C T (; (; 24 IVS-111 C, C C C T C A A C C C T 21

IVS-I (; (; (’ T T C A G T G T C :i2 IVS-I1 C A G C T G A A A A G A 48 IVS-111 A C T C T A A T C C C C 16

IVS-I ( . C A G T C A G C C T G 3 1 IVS-I1 T A C C T G A C A C A (; 17 IVS-I11 C C T C T A A C C A T C ‘Lo

IVS-I C C C C T C A C A (; G A 17 IVS-I1 C C A C T C A G C C C T 17 IVS-111 C C C C T G A <’ T (; A T 8

IVS-I G T C C T C A C T G G C :1‘L IVS-I1 A C C C T G A C C C G A 2s IVS-111 T G G C T C A T C C C C 78

A-11

C-I1

C-111

E

upstream from the respective CAP sites. The homologous regions are found in nucleotides -234 to -213 of apo C-11, -497 to -471 of apo A-11, -196 to -179 of apo A- I, -409 to -391 of apo E and -116 to -103 of apo C-111. (The positions for apo A-I and apo C-111 are approximate, since the CAP sites have not been identified in these genes). The lengths of homologous sequences between the apo C-I1 gene and the regions identified for apo E, apo A-I1 and apo A-I are quite extensive, involving 19, 27 and 18 nucleotides, respectively. The homologous region between apo C-I1 and apo C-111 was shorter and involved 14 nucleotides (Table 2). However, since only -189 nucleotides of the 5‘ flanking region of the latter gene have been published, it is possible that more exten- sive homology might be located further upstream from the CAP site, as is the case with other apolipoprotein genes. One possible explanation for the conservation of 5’ flanking regions in the apolipoprotein genes might be that they are conserved through evolution from genes that have diverged from an ancestral primordial gene. Another possible role of such sequences is that they might be of functional significance, e.g., they might be regulatory sequences which render the respective genes responsive to changes in cellular lipids, or some lipid- protein complexes or other common intermediates in apolipoprotein gene expression. An additional possibility is that these sequences are homologous by random chance.

GENOMIC STRUCTURE OF HUMAN APOLIPOPROTEIN: RELATIONSHIP TO PROTEIN

STRUCTURE The general structural organization of all the apoli-

poprotein genes published to date is shown in Figure 1. It is evident that these genes share very similar struc- tures. There are 4 exons and 3 IVS to each gene. The IVS appear to have very similar locations: IVS-I inter- rupts the 5’ untranslated region of the gene; IVS-I1 interrupts the translated part of the gene very close to the signal peptidase cleavage site, and IVS-I11 interrupts the part of the gene encoding the mature peptide. The lengths of the first three exons are very similar, and the difference in total length of the mRNAs is accounted for mainly by differences in length of exon 4 (Figure 1). This striking similarity in the gene structures of apo A-I, apo A-11, apo C-11, apo C-I11 and apo E supports the hypoth- esis that these genes have arisen from a common ancester (35, 36), and the individual apolipoprotein genes have evolved through partial and complete gene duplications (36).

Gilbert (37) suggested that the exon/IVS structure of eukaryotic genes might be a record of their evolutionary history: these genes evolved by exploiting RNA splicing to recruit and combine small sements of coding se-

Consensus Structure 0 1 Signals with Flanking Regions quence. Thus, in many instances [e.g., collagen -(38), albumin (39), immunoglobulins (40), ovomucoid (41), rhodopsin (42), 3’-hydroxy-3-methylglutaryl coenzyme A 2 ‘ L : l - 2 1s 2 3 :1

x - 6 9 . 5 reductase (43) and pyruvate kinase (44) genes], the cod- C 7 9 x 13 G 4 3 ‘ L l I 4 - 4 T 2 1 2 1 1 . 5 ~ :{ :{ :{ ing portions of the genes are broken up into pieces

encoding distinct functional domains. We examined the Distance is the number of nucleotides between the 3’ end o f the locations Of IVs-11 and IVs-111 in the apolipoprotein . - -

signal a n d the 3‘ splice point. genes with respect to the secondary structures of the

Page 3: The structure of the human apolipoprotein genes

58s CHAN HEPATOLOGY

TABLE 2. Homologous sequences in the 5' flanking sequence of apolipoprotein genes

Ending nucleotide Reference 6' sequence Beginning

nucleotide

c-I11 -116 A C C C T G C C T C A G G C -103 41 * * * * * * * * * * *

E -409 C T C C C A T C C C A - C T T C T G T C -391 42 * * * * * * * * * * * * * * * *

c-I1 -234 C C C T C C A C A G C C C C C A A C C C A G C C T C T G T C -213 32

A-I1 -497 C C C C C A T A G C C C T C A A C C C T G T C C C T G -471 31 * * * * * * * * * * * * * * * * * * * * *

* * * * * * * * * * * * 1 . 1 .

A-I -196 C C C T G C A C - G C C C C G - - - - C A G C -179 43.44

The asterisks indicate homology to the apo C-I1 sequence only. Numbers indicate distance 5' to the CAP site. For apo A-I and apo (2-111, these are only approximate distances since the CAP sites have not been identified in these genes.

FIG. 1. Structural organization of apolipoprotein genes. Transcrip- tion is from left to right. The heavy bars represent, from left to right, exons 1, 2 ,3 and 4. The thin line represents 5' flanking region, IVS-I, 11, and I11 and 3' flanking region of the respective genes. The numbers h o e the exons indicate the length in nucleotides of the exons. The exact lengths of exon 1 in apo C-I11 and apo A-I are in doubt since the 5' end of the gene has not been determined in these cases. The numbers in parentheses above exon 2 indicate the position of the adenine in the initiation codon, ATG, from the 5' end of exon 2.

individual proteins as predicted by the method of Chou and Fasman (45) (data not shown). Without exception, IVS-I1 interrupts the gene close to the signal peptidase cleavage site, either at regions of random structure (apo C-11, apo C-111), @-turn (apo A-I, apo E) or at the junction between two conformations (i.e., transition between short, probably unstable @-sheet and a-helical region, apo A-11). Similarly, IVS-I11 interrupts the mature pep- tide region of the different genes at regions of random structure (apo C-11) or @-turn (apo A-11, apo C-111). In these instances, therefore, not only do IVS-I1 and IVS- I11 separate the peptides into distinct functional domains as predicted by Gilbert (37), but they also seem to have preference for the surface of proteins as in the case of many other proteins (46). However, by Chou-Fasman analysis, IVS-I11 appears to interrupt a-helical structures in apo A-I and apo E. To obtain a better definition of the functional domains of the various apolipoproteins, we analyzed the conformation of the mature plasma proteins by the mean helical hydrophobic moment, ( p ~ ) , of Eisenberg et al. (47). This parameter not only gives a measure of helicity, but also takes into consideration the amphiphilicity of a helix. Inspection of Figure 2 reveals

Residue Number FIG. 2. Secondary structure analysis of human apolipoproteins by

the method of Eisenberg et al. (47) and IVS locations. The helical hydrophobic moment, ( p ~ ) , was determined as previously described. Each apolipoprotein was analyzed in 11-residue fragments and the ( p ~ ) values were plotted at the midpoint of each fragment. The bar on the scale on the oertical axis represents 1 kcal. The arrows indicate the positions of the IVS. Each arrow is placed at the residue which marks the end of exon 2 (or the first domain in this figure), e.g., for apo A-11, IVS-111 is located at amino acid 39, and the arrow is placed at the midpoint of the 11-residue peptide which marks the end of the domain encompassed by residues 29-39 or residue 34.

that IVS-I11 consistently interrupts the mature peptide at, or very close to, a minimum value of ( p H ) . It should be noted that the arrows in this figure indicate the IVS position at a site which marks the end of the first domain, e.g., for apo A-11, the arrow marks the domain which ends at residue 39 of the mature peptide (i.e., at the midpoint of the 11-residue peptide which defines the conformation of residues 29-39 or residue 34). However, the domain demarcation is relatively poor for apo C-11. For the two longest apolipoproteins, apo A-I and apo E, analyzed by this method, in addition to the minimum values of ( p H ) marked by IVS-111, other minima seem to appear periodically toward the carboxyl-terminal end of the protein. These subsequent domains are not inter- rupted by IVS and appear to define internally repeated sequences in these proteins (36).

FUTURE DIRECTIONS The application of molecular biological techniques to

lipoprotein research is a relatively recent event. It was only in 1980 that the first eukaryotic apolipoprotein cDNA was cloned (48). Since then, the cDNAs for all

Page 4: The structure of the human apolipoprotein genes

Vol. 7, No. 1, Suppl. 1987 S'rRl1CTlJRE OF THE HITMAN APO1,IPOPROTEIN GENES 59s

the major human apolipoproteins have been cloned and sequenced, and the genomic structures for most of them have been published. In the future, a number of areas will be explored.

The cDNA for the most important of the apolipopro- teins, apo B, has been cloned. It is a large cDNA of over 14 kb in length. Its sequence has just been completed (49). The DNA-deduced amino acid sequence of apo B will be the basis for a number of experiments on the structural and functional organization of this unusual protein. Specifically, the low density lipoprotein receptor binding domain and the lipid-binding regions of apo B will be defined, and the relationship between the hepatic apo B (B-100) and intestinal apo B (B-48) will be clari- fied (25).

Another interesting area for exploration is the detailed evolutionary relationship between the individual apoli- poproteins. Luo et al. (36) have presented a detailed analysis of the repeating units in the various apolipopro- teins. Future experiments will further define the time of divergence of each member of the apolipoprotein multi- gene family. The possible evolutionary origin of apo B will also be defined.

Finally, restriction fragment length polymorphisms have been described for a number of apolipoproteins (50- 52). Detailed restriction fragment length polymorphisms and haplotype analysis will prove useful in our under- standing of the genetic background for the various dys- lipoproteinemias.

1.

2.

3.

4.

5.

6.

- I .

8.

9.

10.

11.

12.

l:?.

REFERENCES dackson RL, Morrisett .ID, Gotto AM J r . Lipoprotein structure and metabolism. Physiol Rev 1956; 56259-316. Goldstein JL, Brown MS. In: Stanbury JB, Wyngaarden dB, Fred- erickson DS, et al., eds. The metabolic basis of inherited disease. New York: McGraw-Hill, 1984: 672-712. Osborne JC Jr , Brewer HH Jr . The plasma lipoproteins. Adv Protein Chem 1977; 31:2.53- 337. Cheung P, Chan L. Nucleotide sequence of cloned cDNA of human apolipoprotein A-I. Nucl Acids Res 1983; 11:3703-3715. Karathanasis SK. Zannis VI, Breslow .JL. Isolation and character- ization of the human apolipoprotein A-I gene. Proc Natl Acad Sci

Shoulders CC, Kornblihtt AR, Munro RS, et al. Gene structure of human apolipoprotein A-I. Nucl Acids Res 1983; 11:2827-2837. Sharpe CR, Sidoli A, Shelley (-S. et al. Human apolipoproteins A-I, A-11, C-I1 and C-Ill. cDNA sequences and mRNA abundance. Nucl Acids Res 1984; 12:3917-:3932. Moore MN. Kao FT. Tsao YK. et al. Human apolipoprotein A-11: nucleotide sequence of a cloned cDNA. and localization of its structural gene on human chromosome 1. Biochem Biophys R e s Commun 1984; 123:1-7. Knott TJ , Priestley LM, llrdea M. et al. Isolation and characteri- zation of a cDNA encoding the precursor for human apolipoprotein A-11. Biochem Biophys Res Commun 1984; 120:734-740. Knott Td, Robertson ME, Priestley LM, et al. Characterizations of mRNAs encoding the precursor for human apolipoprotein C-I. Nucl Acids Res 1984; 12:3909-3915. McLean JW, Elshourbagy NA, (:hang D.J. et al. Human apolipo- protein E mRNA. cDNA cloning and nucleotide sequencing of a new variant. J Biol Chem 1984; 259:6498-6504. Elshourbagy NA, Walker DN. Boguski MS, et al. The nucleotide and derived amino acid sequence of human apolipoprotein A-IV mRNA and the close linkage of its gene to the genes of apolipopro- teins A-I and C-111. J Biol Chem 1986; 261:1998-2002. Wei CF, Chen SH, Yang CY. et al. Molecular cloning and expres- sion of partial cDNAs, and deduced aminoacid sequence of a carboxyl-terminal fragment of human apolipoprotein B- 100. Proc

USA 1983; 80:6147-6151.

14.

15

16

17

18

19.

20.

21.

22.

2::(

24

25

' 6

27

28

'9

30

:! 1

:!2

33

:!4

35.

36

37. 38.

Natl Acad Sci USA 1985; 82:7265-7269. Knott TJ, Rall SC Jr, Innerarity TL, et al. Human apolipoprotein B: structure of carboxyl-terminal domains, sites of gene expression, and chromosomal localization. Science 1985; 23037-43. Deeh SS, Motulsky AG, Albers dd, A partial cDNA clone for human apolipoprotein B. Proc Natl Acad Sci USA 1985; 82:4983-4986. Huang LS. Bock SC, Feinstein SI, et al. Human apolipoprotein B cDNA clone isolation and demonstration that liver apolipoprotein B mRNA is 22 kilohases in lenah. Proc Natl Acad Sci USA 1985: R2:6825-68'29. Gordon J I , Sims HF, Lentz SR, et al. Proteolytic processing of human preproapolipoprotein A-I. A proposed defect in the conver- sion of' pro A-1 to A-1 in Tangier's disease. J Biol Chem 1983; 258:4035-4044. Gordon J I , Budelier KA, Sims HE, et al. Biosynthesis of human preproapolipoprotein A-11. J Biol Chem 1983; 25814054-14059. Blue ML. Williams DL. Zucker S, et al. Apolipoprotein E synthesis in human kidney. adrenal gland. and liver. Proc Natl Acad Sci 1JSA 1983; 80283-287. Elshourbagy NA, Liao WS. Mahley RW, et al. Apolipoprotein E mRNA is abundant in the brain and adrenals, as well as in the liver. and is present in other peripheral tissues of rats and mar- mosets. Proc Natl Acad Sci USA 1984; 82:203-207. Driscoll DM. Cetz GS. Extrahepatic synthesis of apolipoprotein E. .J Lipid Res 1984; 25:1368-1379. Williams DL, Dawson PA, Newman TC, et al. Apolipoprotein E synthesis in peripheral tissues of non-human primates. J Biol Chem 1985; 260:2444-2451. Newman TC, Dawson PA, Rude1 LL, et al. Quantitation of apoli- poprotein E mRNA in the liver and peripheral tissues of non- human primates. .I Biol Chem 198.5; 260:2452-2457. Lin-Lee YC, Kao FT. Cheung P, et al. Apolipoprotein E gene mapping and expression: localization of' the structural gene to human chromosome 19, and expression of apo E mRNA in lipo- protein and non-lipoprotein producing tissues. Biochemistry 1985; 24::17.51 -3756. Kane JP . Apolipoprotein B: structural and metabolic heterogene- i ty . Ann Rev Physiol 1983; 45637-650. Young SC. Bertico Sd, Scott TM, et al. Parallel expression of the MR19 genetic polymorphoism in apoprotein B-100 and apoprotein R-48. Evidence that both apoproteins are products of the same gene. J Biol Chem 1986; 261:2995-2998. Das HK, McPherson J , Bruns GAP, et al. Isolation, characteriza- tion and mapping to chromosome 19 of the human apolipoprotein E gene. J Biol Chem 1985; 2606240-6247. Paik YK, Chang DJ, Peardon CA, et al. Nucleotide sequence and structure of the human apolipoprotein E gene. Proc Natl Acad Sci LISA 1985; R2:3445-3449. Protter AA, Levy-Wilson B. Miller d, et al. Isolation and sequence analysis of the human apolipoprotein C-Ill gene and the intergenic region between apo A-I and apo C-Ill genes. DNA 1984; 3:449-456. Tsao YK, Wei CF, Rohberson DL, et al. The isolation and char- acterization of the human apolipoprotein A-11 gene. J Biol Chern

Hreathnach R, Chambon P. Organization and expression of eucar- yotic split genes coding for proteins. Ann Rev Biochem 1981; 50349-:%33. Wei CF, Tsao YK. Rohberson DL. et al. The structure of the human apolipoprotein C-11 gene. .I Hiol Chem 1985; 260:15211- 15221. Keller ZB, Noon WA. Intron splicing: a conserved internal signal in introns of animal pre-mRNAs. Proc Natl Acad Sci USA 1984: 81:7417-7420. Ruskin B, Krainer AR, Maniatis T. et al. Excision of an intact intron as a novel lariat structure during pre-mRNA splicing in citro. Cell 1984; 38317-331. Barker WC, Dayhoff MO. Evolution of lipoproteins deduced from protein sequence data. Comp Biochem Physiol 1977; 57b:309-315. Luo CC, Li WH, Moore MN, et al. The structure and evolution of the apolipoprotein multigene family. J Molec Biol 1986, 187325- 340. Gilbert W. Why genes in pieces? Nature 1978; 271501. Tate V, Finer M, Boedtker H, et al. Procollagen genes: further sequence studies and interspecies comparisons. Cold Spring Harbor Symp Quant Biol 1982; 47:1039-1049.

19%: 260:15222-15231.

Page 5: The structure of the human apolipoprotein genes

60s CHAN HEPATOLOGY

39. Alexander F, Young PR, Tilghman SM. Evolution of the albumin: alpha-fetoprotein ancestral gene from the amplification of a 27 nucleotide sequence. J Molec Biol1984,173159-176.

40. Tonegawa S. Somatic generation of antibody diversity. Nature

41. Stein JP, Catterall J , Kristo P, et al. Ovomucoid intervening sequences specify functional domains and generate protein poly- morphism. Cell 1980; 215331-687.

42. Nathans J , Hogness DS. Isolation, sequence analysis, and intron- exon arrangement of the gene encoding bovine rhodopsin. Cell

43. Liscum L, Finer-Moore J, Stroud RM, et al. Domain structure of 3-hydroxy-3-methylglutaryl coenzyme A reductase, a glycoprotein of the endoplasmic reticulum. J Biol Chem 1985; 260:522-530.

44. Lonberg N, Gilbert W. Intron/exon structure of the chicken py- ruvate kinase gene. Cell 1985,4081-90.

45. Chou PY, Fasman GD. Empirical predictions of protein confor- mations. Ann Rev Biochem 1978; 47:251-276.

46. Craik CS, Sprang S, Sletterick R, et al. Intron-exon splice junctions

1983,302575-581.

1983; 34807-814.

map at protein surfaces. Nature 1982; 299180-182. 47. Eisenberg D, Weiss RM, Tergwillager TC. The helical hydrophobic

moment: a measure of the amphiphilicity of a helix. Nature 1982;

48. Chan L, Dugaiczyk A, Means AR. Molecular cloning of the gene sequences of a major apoprotein in avian very low density lipopro- teins. Biochemistry 1980; 195631-5637.

49. Yang CY, Chen SH. Cianturco SH, et al. sequence, structure, receptor-binding domains and internal repeats of human apotipo- protein B-100. Nature 1986,323738-742.

50. Scott J, Knott TJ, Priestley LM, et al. High-density lipoprotein composition is altered by a common DNA polymorphism adjacent to apoprotein A-I1 gene in man. Lancet 1985,1:771-773.

51. Priestley L, Knott T, Wallis S. et al. RFLP for the human apoli- poprotein B gene: V; Xbal. Nucl Acids Res 1985; 136793.

52. Chan L. VanTuinen P, Ledbetter DH, et al. The human apolipo- protein B-100 gene: a highly polymorphic gene that maps to the short arm of chromosome 2. Biochem Biophys Res Commun 1985,

299371-374.

i33:248-255.