the journal of biological chemistry 269, no. 17, 29, 12475-12481… · 2001-06-28 · 0 1994 by the...

7
0 1994 by The American Society for Biochemistry and Molecular Biology, Inc. THE JOURNAL OF BIOLOGICAL CHEMISTRY Vol. 269, No. 17, Issue of April 29, pp. 12475-12481, 1994 Printed in U.S.A. Yeast Heat Shock Transcription Factor Contains a Flexible Linker between the DNA-binding and Trimerization Domains IMPLICATIONS FOR DNA BINDING BY TRIMERIC PROTEINS* (Received forpublication, January 6, 1994, and in revised form, March 2, 1994) Karen E. Flick*, Lino Gonzalez, Jr.0, Celia J. Harrison$ and Hillary C. M. Nelson11 From the Department of Molecular and Cellular Biology, Division of Biochemistry and Molecular Biology, University of California, Berkeley, California 94720-3206 All heat shock transcription factors (HSFs) share two regions of homology, identified as the DNA binding and trimerization regions. The DNA binding region consists of two parts, an 89-amino-acid minimal DNA-binding do- main and an additional 21 amino acids which are not necessaryfor specific DNA binding of a monomeric DNA-binding domain. These 21 amino acids may act as a flexible linker between the DNA-binding and trimeriza- tion domains. Saccharomyces cerevisiae HSF has an ad- ditional 52 amino acids between the proposed flexible linker and the trimerization domain. Deletion of this unique region has no effect on the structural integrity or essential in vivo functions of HSF. To investigate the role of the 21-amino-acid proposed linker, a series of internal deletions was created in fragments containing the DNA- binding and trimerization domains. The deletions have no effect on the structural integrity of the protein as assayed by circular dichroism spectroscopy. However, alterations of the linker do affect affinity of trimeric HSF binding to its target DNA. In addition, deletion of part or a l of the proposed linker from full-lengthyeast HSF, an essential protein, disrupts growth of yeast. Eukaryotic cells respond to stress throughhighly regulated expression of a small set of proteins known as heat shock pro- teins. All promoters for heat shock proteins contain a multiple number of inverted repeats of a 5-base pair sequence, nGAAn (1, 2). Distribution of these binding sites in different species and different heat shock promoters vary from single arrays of three to eight inverted 5-base-pair units to several short arrays for a single heat shock promoter (1-4). Each array of nGAAn sequences is referred to as a heat shock element (HSE).’ Heat shock transcription factor (HSF) binds to the HSE and subse- quently regulates the cellular response to stress (5, 6). Al- though only two nGAAn sequences are necessary for binding Service Grant GM-44086. The costs of publication of this article were * This research was supported in part by United States Public Health defrayed in part by the payment of page charges. This article must therefore be hereby marked “aduertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. $Supported in part by United States Public Health Service Pre- doctoral Training Grant GM-07232 and by a National Science Founda- tion Graduate Research Fellowship. 8 Supported in part by the Patricia Roberts Harris Fellowship. 7 Supported by a National Science Foundation Graduate Research Fellowship. 11 To whom correspondence should be addressed. Dept. of Molecular and Cellular Biology, StanleyDonnerASU, 229 Stanley Hall, Univer- sity of California, Berkeley, CA 94720-3206. The abbreviations used are: HSF, heat shock transcription factor; HSE, heat shock element; A, residue deletion;EGS,ethylene glycol bis-(succinimidylsuccinate); HPLC, high performance liquid chromatog- raphy; aa, amino acid. HSF in vitro, at least three inverted nGAAn units are neces- sary for binding in vivo (1, 2, 7). The alternating orientation of the five base pair nGAAn units positions each successive bind- ing site on alternate sidesof the DNA thus facilitating binding of the trimeric protein (7, 8). HSF has been identified and cloned from many eukaryotic cells including those of human, mouse, chicken, fruit fly, to- mato, and yeast (9-18). Although the sequences of the tran- scriptional activation regions of HSF are divergent, the DNA bindingandtrimerization regions are conserved amongall identified HSF genes. All amino acids necessary and sufficient for trimerization are found between residues 333 and 424 of Saccharomyces cerevisiae HSF (19). The DNA binding region was originally defined as a 118-amino-acid fragment (residues 167-284) in S. cereuisiae (16).A 110-amino-acid fragment con- taining the DNA binding region, residues 171-280, is capable of sequence-specific DNA binding as a monomer, although the affinity of this region is significantly lower than thatof a frag- ment consisting of both the DNA binding and trimerization regions (see “Results”). The NH,-terminal89 amino acids of the DNA binding region are necessary and sufficient for the se- quence-specific DNA binding, and these residues have been redefined as the minimal DNA-binding domain. In all HSF genes, except S. cereuisiae and tomato HSFs, the extra 21 amino acids, previously included in the DNA binding region, are located adjacent to the trimerization domain. In S. cereui- siae and tomato HSFs there are additional non-conserved se- quences between these 21 conserved amino acids and the tri- merization domain. S. cereuisiae HSF has 52 unique amino acids in this region and each of the three tomato HSFs has regions which vary in length and content (14-16). In order to permit binding of a trimeric protein to DNA, it has been proposed that flexibility is needed between the DNA-bind- ing and trimerization domains (7, 8). To investigate whether the 21 conserved amino acids or the 52 non-conserved amino acids are needed for flexibility, fragments of S. cerevisiae HSF were constructed with deletions in these regions. The resulting proteins were assayed for structural and functional perturba- tions. These results indicate that the 52 non-conserved amino acids, referred to as theSc unique region, are unnecessary for proper structure or function of HSF underphysiological condi- tions. The 21 conserved amino acids are not necessary for the structure of HSF but are important for the DNA binding activ- ity of trimeric HSF. This short conserved region may act as a flexible linker between the DNA-binding andtrimerization domains. MATERIALS AND METHODS Construction of HSF and HSE Derzuatiues”pHF103 was created by ligation of the EcoRI-KpnI of p15.04 (gift of P. Sorger), which encodes amino acids 1-259 of S. cereuisiae HSF, into pBluescript KS+ (Strat- agene). pHN124 was constructed by oligo-mediated site-directed mu- 12475

Upload: others

Post on 17-Mar-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: THE JOURNAL OF BIOLOGICAL CHEMISTRY 269, No. 17, 29, 12475-12481… · 2001-06-28 · 0 1994 by The American Society for Biochemistry and Molecular Biology, Inc. THE JOURNAL OF BIOLOGICAL

0 1994 by The American Society for Biochemistry and Molecular Biology, Inc. THE JOURNAL OF BIOLOGICAL CHEMISTRY Vol. 269, No. 17, Issue of April 29, pp. 12475-12481, 1994

Printed in U.S.A.

Yeast Heat Shock Transcription Factor Contains a Flexible Linker between the DNA-binding and Trimerization Domains IMPLICATIONS FOR DNA BINDING BY TRIMERIC PROTEINS*

(Received for publication, January 6, 1994, and in revised form, March 2, 1994)

Karen E. Flick*, Lino Gonzalez, Jr.0, Celia J. Harrison$ and Hillary C. M. Nelson11 From the Department of Molecular and Cellular Biology, Division of Biochemistry and Molecular Biology, University of California, Berkeley, California 94720-3206

All heat shock transcription factors (HSFs) share two regions of homology, identified as the DNA binding and trimerization regions. The DNA binding region consists of two parts, an 89-amino-acid minimal DNA-binding do- main and an additional 21 amino acids which are not necessary for specific DNA binding of a monomeric DNA-binding domain. These 21 amino acids may act as a flexible linker between the DNA-binding and trimeriza- tion domains. Saccharomyces cerevisiae HSF has an ad- ditional 52 amino acids between the proposed flexible linker and the trimerization domain. Deletion of this unique region has no effect on the structural integrity or essential in vivo functions of HSF. To investigate the role of the 21-amino-acid proposed linker, a series of internal deletions was created in fragments containing the DNA- binding and trimerization domains. The deletions have no effect on the structural integrity of the protein as assayed by circular dichroism spectroscopy. However, alterations of the linker do affect affinity of trimeric HSF binding to its target DNA. In addition, deletion of part or all of the proposed linker from full-length yeast HSF, an essential protein, disrupts growth of yeast.

Eukaryotic cells respond to stress through highly regulated expression of a small set of proteins known as heat shock pro- teins. All promoters for heat shock proteins contain a multiple number of inverted repeats of a 5-base pair sequence, nGAAn (1, 2). Distribution of these binding sites in different species and different heat shock promoters vary from single arrays of three to eight inverted 5-base-pair units to several short arrays for a single heat shock promoter (1-4). Each array of nGAAn sequences is referred to as a heat shock element (HSE).’ Heat shock transcription factor (HSF) binds to the HSE and subse- quently regulates the cellular response to stress (5, 6). Al- though only two nGAAn sequences are necessary for binding

Service Grant GM-44086. The costs of publication of this article were * This research was supported in part by United States Public Health

defrayed in part by the payment of page charges. This article must therefore be hereby marked “aduertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

$Supported in part by United States Public Health Service Pre- doctoral Training Grant GM-07232 and by a National Science Founda- tion Graduate Research Fellowship.

8 Supported in part by the Patricia Roberts Harris Fellowship. 7 Supported by a National Science Foundation Graduate Research

Fellowship. 11 To whom correspondence should be addressed. Dept. of Molecular

and Cellular Biology, StanleyDonner ASU, 229 Stanley Hall, Univer- sity of California, Berkeley, CA 94720-3206.

The abbreviations used are: HSF, heat shock transcription factor; HSE, heat shock element; A, residue deletion; EGS, ethylene glycol bis-(succinimidylsuccinate); HPLC, high performance liquid chromatog- raphy; aa, amino acid.

HSF in vitro, a t least three inverted nGAAn units are neces- sary for binding i n vivo (1, 2, 7). The alternating orientation of the five base pair nGAAn units positions each successive bind- ing site on alternate sides of the DNA thus facilitating binding of the trimeric protein (7, 8).

HSF has been identified and cloned from many eukaryotic cells including those of human, mouse, chicken, fruit fly, to- mato, and yeast (9-18). Although the sequences of the tran- scriptional activation regions of HSF are divergent, the DNA binding and trimerization regions are conserved among all identified HSF genes. All amino acids necessary and sufficient for trimerization are found between residues 333 and 424 of Saccharomyces cerevisiae HSF (19). The DNA binding region was originally defined as a 118-amino-acid fragment (residues 167-284) in S. cereuisiae (16). A 110-amino-acid fragment con- taining the DNA binding region, residues 171-280, is capable of sequence-specific DNA binding as a monomer, although the affinity of this region is significantly lower than that of a frag- ment consisting of both the DNA binding and trimerization regions (see “Results”). The NH,-terminal89 amino acids of the DNA binding region are necessary and sufficient for the se- quence-specific DNA binding, and these residues have been redefined as the minimal DNA-binding domain. In all HSF genes, except S. cereuisiae and tomato HSFs, the extra 21 amino acids, previously included in the DNA binding region, are located adjacent to the trimerization domain. In S. cereui- siae and tomato HSFs there are additional non-conserved se- quences between these 21 conserved amino acids and the tri- merization domain. S. cereuisiae HSF has 52 unique amino acids in this region and each of the three tomato HSFs has regions which vary in length and content (14-16).

In order to permit binding of a trimeric protein to DNA, it has been proposed that flexibility is needed between the DNA-bind- ing and trimerization domains (7, 8). To investigate whether the 21 conserved amino acids or the 52 non-conserved amino acids are needed for flexibility, fragments of S. cerevisiae HSF were constructed with deletions in these regions. The resulting proteins were assayed for structural and functional perturba- tions. These results indicate that the 52 non-conserved amino acids, referred to as the Sc unique region, are unnecessary for proper structure or function of HSF under physiological condi- tions. The 21 conserved amino acids are not necessary for the structure of HSF but are important for the DNA binding activ- ity of trimeric HSF. This short conserved region may act as a flexible linker between the DNA-binding and trimerization domains.

MATERIALS AND METHODS Construction of HSF and HSE Derzuatiues”pHF103 was created by

ligation of the EcoRI-KpnI of p15.04 ( g i f t of P. Sorger), which encodes amino acids 1-259 of S. cereuisiae HSF, into pBluescript K S + (Strat- agene). pHN124 was constructed by oligo-mediated site-directed mu-

12475

Page 2: THE JOURNAL OF BIOLOGICAL CHEMISTRY 269, No. 17, 29, 12475-12481… · 2001-06-28 · 0 1994 by The American Society for Biochemistry and Molecular Biology, Inc. THE JOURNAL OF BIOLOGICAL

12476 Flexible Linker of Yeast HSF tagenesis to create an NdeI site, which changes amino acid 170 to a methionine. The mutation was confirmed by single-stranded DNA se- quencing and restriction digestion. The NdeI-SphI fragment of pHN124 was then cloned into a derivative of the pET3-b expression vector (20) to yield pHN138. The protein, Sc-D, when expressed from pHN138 con- tains amino acids 171-259 plus three amino acids, RHA, at the carboxyl terminus of the protein.

A fragment containing the first 424 amino acids of S. cerevisiae HSF was inserted into pBluescript KS+ (Stratagene) to create pHF158. pHN126 was constructed by oligo-mediated site-directed mutagenesis to create an NdeI site, which changes amino acid 170 to a methionine. The mutation was confirmed by single-stranded DNA sequencing and restriction digestion. pHN423 was constructed by oligo-mediated site- directed mutagenesis to create an SphI site immediately aRer amino acid 280 of pHN126. The mutation was confirmed by single-stranded DNA sequencing and restriction digestion. The NdeI-SphI fragment of pHN423 was then cloned into a derivative of the pET3-b expression vector to make pHN420. The protein, Sc-D(21L), when expressed from pHN420 contains amino acids 171-280 plus five amino acids, ACLIN, at the carboxyl terminus of the protein.

Oligo-mediated site-directed mutagenesis was used to make the de- letions in the linker region of the plasmid pHN126 in order to create the following variations of this pBluescript KS+ derivative: pHN148, Aaa281-332; pHN401, Aaa270-332; pHN402, Aaa260-332; and pHN411, Aaa260-312. The deletions were confirmed by single-stranded DNA sequencing. The NdeI-SphI fragment of pHN126 was cloned into a derivative of the pET3-b expression vector (20) to yield pHN136. The same method was used to create pHN150, pHN403, pHN404, and pHN412 from pHN148, pHN401, pHN402, and pHN411, respectively. All of the proteins expressed from plasmids constructed by this cloning method have an additional four amino acids, GLMN, at the carboxyl terminus. pHN136, pHN150, pHN403, pHN404, and pHN412 express the proteins Sc-DT(73L), Sc-DT(21L), Sc-DT(lOL), Sc-DT(OL), and Sc- DT(lS*L), respectively (Fig. 1).

The BarnHI site of pRS414 (Stratagene) was removed by Klenow fill-in and religation. The resulting plasmid was digested with EcoRI and ligated with the EcoRI fragment of ph3.2 (15) in order to construct pHN1002.Oligo-mediated site-directed mutagenesis was used to create deletions in the linker region in pHN1002 to create the following plas- mids: pHN1015, Aaa281-332; pHN1016, Aaa270-332; pHN1017, Aaa260-332; and pHN1018, Aaa260-312. Sc-(21L), Sc-(lOL), Sc-(OL), and Sc-(19*L) were expressed frompHN1015, pHNlO16, pHN1017, and pHN1018, respectively.

The plasmid pBL3 was generated by cloning BamHI fragments of the plasmid pI3, which contains three nGAAn boxes (7) into pBluescript KS+ (Stratagene). The pBL3 plasmid was digested with the restriction enzymes, SpeI andXhoI, generating a restriction fragment, called TA-3, of approximately 145 base pairs.

Expression and Purification-The pET3-b-derived expression vectors were transformed into the Escherichia coli strain BL2UDE3) contain- ing a derivative of the plasmid pAcYc177 (21) that overexpresses lac repressor. This reduces the read-through level of transcription of T7 RNA polymerase (transcribed from an inducible lac promoter) so that low levels of expression of HSF do not occur before induction. Protein expression was induced as described (20, 22). Cells were stored as a frozen suspension in 50 m HEPES, pH 8.0,600 m NaCI, 2 m EDTA, 5 m CaCl,, 10% glycerol.

The purification protocols for Sc-DT(75L), Sc-DT(lOL), Sc-DT(lOL), and Sc-DT(0L) were identical. Frozen cell suspensions were thawed, supplemented with 1 pg/ml aprotinin, 2 pg/ml pepstatin, and 1 pg/ml leupeptin, incubated on ice for 5 min with 200 pg/ml lysozyme, soni- cated, and centrifuged at 39,000 x g for 15 min at 10 "C. Saturated ammonium sulfate and 50 m HEPES, pH 8.0, were added to the high speed supernatant to a final concentration of 18% saturated ammonium sulfate and 400 m NaCI. The resulting solution was loaded onto a propyl column (SynChrom, Inc.). The column was developed with a reversed ammonium sulfate gradient; the protein eluted between 220 and 360 m ammonium sulfate. The column fractions containing HSF were dialyzed for 4 h at room temperature against 50 rn Tris, pH 6.7, 400 m NaCI, 5% glycerol, 2 m EDTA with one change of buffer after 2 h. The protein solution was diluted with 50 m Tris, pH 6.7, to a final concentration of 50 m NaCl and then loaded onto a propylsulfonic acid column (Waters). The column was developed with a NaCl gradient and the protein eluted between 350 and 450 nm NaCl. The column fractions were diluted with 25 m Tris, pH 8.5, to a final concentration of 50 m NaCl and loaded onto a heparin-agarose column (Sigma). The column was developed with a NaCl gradient and the protein eluted at approxi- mately 675 m NaCI. At this point the protein is 95-99% pure as

estimated by Coomassie Blue-stained SDS-polyacrylmide gel electro- phoresis gels and by reversed-phase HPLC.

Cells containing Sc-DT(lS*L) were thawed, sonicated, and centri- fuged as described above. The high speed pellet was resuspended in 50 m Tris, pH 6.7,150 m NaCI, and 6 M deionized urea and recentrifuged at 39,000 x g for 15 min at 10 "C. The supernatant was diluted with 50 m Tris, pH 6.7, to a final concentration of 50 m NaCl and 2 M deion- ized urea and loaded onto an S-Sepharose Fast Flow column (Pharma- cia LKB Biotechnology, Inc.). The column was developed with a NaCl gradient. Sc-DT(lS*L) elutes between 350 and 450 m NaC1. The col- umn fractions containing Sc-DT( 19*L) were dialyzed overnight against 50 m Tris, pH 6.7, 1 M NaCI, 1 M deionized urea, and then for 8 h each in 50 m Tris, pH 6.7, 1 M NaCl, 0.5 M deionized urea, 50 m Tris, pH 6.7, 1 M NaCl, 0.25 M deionized urea, and 50 m Tris. pH 6.7, 1 M NaCI. After dialysis, the solution was diluted with 50 m Tris, pH 8.5, to a final concentration of 50 m NaCl, and then loaded onto and eluted from a heparin column as described above. Sc-DT(lS*L) eluted at ap- proximately 550 m NaCl. The Sc-DT(19*L) is 80-90% pure as esti- mated by Coomassie Blue-stained SDS-polyacrylamide gel electro- phoresis.

Sc-D and Sc-D(21L) were grown and induced as described above except that the cells were resuspended in a buffered solution containing 1 M NaCl rather than 600 m NaCI. Frozen cell suspensions were thawed, supplemented with 1 p g / d aprotinin, 2 pg/ml pepstatin, and 1 pg/ml leupeptin, incubated on ice for 5 min with 200 pglml lysozyme, sonicated, and centrifuged at 39,000 x g for 10 min at 10 "C. The su- pernatant was recovered and diluted with 50 m HEPES, pH 8.0, to a final concentration of 50 m NaCl and loaded onto a heparin-agarose column (Sigma). The column was washed with a solution containing 50 m Tris, pH 6.7,50 nm NaCI, 5% glycerol, 2 m EDTA. The protein was then eluted with 50 m Tris, pH 6.7, 500 m NaCl, 2 m EDTA. The column fractions containing HSF were diluted with 50 m Tris, pH 6.7, to a final concentration of 50 m NaCl and then loaded onto a propyl- sulfonic acid column (Waters). The column was developed with a NaCl gradient and the protein eluted between 350 and 450 m NaCI. The fractions containing HSF were loaded onto a reversed-phase HPLC column. The column was developed with an aqueous trifluoroacetic acid gradient, and peak fractions were lyophilized. All protein concentra- tions were determined with extinction coefficient values calculated from the tyrosine and tryptophan content of the proteins.

Circular Dichroism Spectroscopy-CD spectra were recorded on an AVIV 60 DS spectropolarimeter at 25 "C. Measurements were taken at 1-nm intervals with a 1-s time constant and 1.5 nm bandwidth, and were averaged over five scans. A path length of 2 mm was used with protein concentrations of approximately 1 mg/ml. The buffer conditions were 25 m sodium phosphate, pH 8.3, and 100 m NaCI. The mean residue ellipticity [e,,] of each protein was calculated using the follow- ing equation:

[e,.] = ODb*(MW/d*c*n) (Eq. 1)

where Oobs = observed ellipticity in millidegrees, MW is the molecular mass in daltons, d = pathlength in mm, c = concentration in mg/ml, and n = number of amino acids (23). The mean residue ellipticity allows direct comparison of the CD spectra independent of the size of the protein. The molar ellipticity [e,,] was calculated using a variant of the former equation:

[epmt] = O,,*(MW/d*C). (Eq. 2)

Cross-linking-stock solutions of ethylene glycol bis-(succinimidyl- succinate) (EGS) were prepared fresh in dimethyl sulfoxide and added to a 20 solution of the protein in 25 m sodium phosphate, pH 8.8, to give final EGS concentrations ranging from 0.5 to 200 w. The reac- tions proceeded for 15 min at room temperature and were then quenched by addition of glycine to final concentration of 40 m. The products were analyzed by SDS-polyacrylamide gel electrophoresis. Bands were visualized with Coomassie Blue.

m HEPES, pH 8.0, 100 m NaCI, 1 m EDTA, 10% glycerol, 0.05 Gel Retardation Assay-DNA-protein complexes were formed in 50

m g / d poly(d1-dC), and 0.1 m g / d bovine serum albumin for Sc- DT(73L), Sc-DT(21L), Sc-DT(lOL), Sc-DT(OL), and Sc-DT(lS*L) or in 25 m Tris, pH 6.7, 25 m NaC1, 2.5 m dithiothreitol, 5% glycerol, 1 mg/ml bovine serum albumin, and 25 pg/ml sheared calf thymus DNA for Sc-D and Sc-D(21L). The DNA fragment, TA-3, was radiolabeled with 32P and used in the concentration range of 1 to 100 PM. The samples were loaded on a 6% polyacrylamide gel and electrophoresed in Tris- borate buffer for 1.5 h at 300 V. The gels were dried and autoradio- graphed.

Page 3: THE JOURNAL OF BIOLOGICAL CHEMISTRY 269, No. 17, 29, 12475-12481… · 2001-06-28 · 0 1994 by The American Society for Biochemistry and Molecular Biology, Inc. THE JOURNAL OF BIOLOGICAL

Flexible Linker of Yeast HSF

DBD L ScU TRI ScHSF >/, , , . . ,. . . . I

sC-D r-----l 170-259

Sc-D(2 1 L) -a 170-280

170-424

170-424 A28 1-332

170-424 A270-332

170-424 A260-332

170-424 A260-312

I sc-(IOL) I I

1 SC-(19*L) I I ' A-EzmmB 1

FIG. 1. A series of internal deletions in the proposed linker and Sc unique regions have been created.

Footprinting-The fragment, TA-3, was radiolabeled at the XhoI with all four [CY-~*PINTPS. DNA-protein complexes were formed in 25 mM Tris, pH 6.7.100 mcl NaC1,5 lll~ MgCl,, 5% glycerol, 1 mg/ml bovine serum albumin, 10 pg/ml sheared calf thymus DNA, and 2 lll~ dithio- threitol. The DNA-protein complexes were incubated with 3.5 pg/ml DNase I for 2 min a t room temperature. The DNase I reaction was stopped with 20% saturated ammonium acetate, 20 pg/ml tRNA, and 2 m EDTA, and the DNA was precipitated with ethanol. The DNA pel- lets were resuspended in formamide and loaded onto an 8% denaturing polyacrylamide gel. The G/A ladder was prepared as described (24). After electrophoresis, the gel was dried and autoradiographed.

Yeast Strains-Plasmids encoding the mutant HSF proteins were transformed into a haploid strain derived From W303 (ade2-1 trpl canl-I00 leu2, 3-112 his3-11, 15 ura3) (25), which camed the HSFA2::LEUB chromosomal disruption of HSF and a wild-type gene on the URA3-containing plasmid YCp50 (15). The yeast colonies were grown on galactose plates (1.7 mg/ml yeast nitrogen base, 2% galactose, and all amino acids except leucine, uracil, and tryptophan). Expression of wild-type HSF was suppressed by growing the colonies on glucose plates (1.7 mg/ml yeast nitrogen base, 2% glucose, and all amino acids except leucine, uracil, and tryptophan).

RESULTS The Minimal DNA-binding Domain Contains Only 89Amino

Acids-HSF contains two areas of homology, the trimerization and DNA binding regions. The conserved DNA binding region contains 110 amino acids (residues 171-280 in S. cereuisiae HSF). In order to explore the boundaries of this region, 21 amino acids were removed from the carboxyl-terminal end. This newly defined minimal DNA-binding domain was com- pared with a fragment containing all of the conserved amino acids of the DNA binding region in order to confirm that the minimal DNA-binding domain contains all of the sequences necessary for specific DNA binding.

The 89-amino-acid DNA-binding domain (Sc-D) and the 110- amino-acid DNA binding region (Sc-D(21L)) have nearly iden- tical apparent DNA binding affinities. Sc-D and Sc-D(21L) were mixed with radioactively labeled DNA containing an HSE with three nGAAn boxes. The mixture was then electrophoresed through a non-denaturing polyacrylamide gel. As shown in Fig. 2, the presence of the additional 21 amino acids has no signifi- cant effect on the ability of an isolated DNA-binding domain to bind DNA. Although both Sc-D and Sc-D(21L) are capable of binding an HSE in the presence of competitor DNA, the gel mobility shift assays must be performed in very low (25 mM) NaCl concentrations and the protein-DNAcomplexes appear as smeared bands. DNase I footprinting was used to confirm that the DNA-binding domain is specifically binding the nGAAn units of the HSE. As seen in Fig. 3, the monomeric DNA- binding domain binds specifically to the three nGAAn units.

12477

"

1

sc-I> Sc-D(2 I I . ) FIG. 2. The linker region does not affect the apparent DNA

binding of an isolated minimal DNA-binding domain of yeast HSF. Sc-D and Sc-D(21L) were bound to the fragment TA-3 and elec- trophoresed through a native polyacrylamide gel as described under "Materials and Methods." From left to right Sc-D: 5 x M, 1.7 x M, 5.6 x lod M, 1.9 x 10" M, 6.7 x lo-? M, and 2.1 x lo-? M; Sc-D(21L): 5 x M, 1.7 x M, 5.6 x 10" M, 1.9 x 1O"j M, 6.7 x M, and 2.1 x 10-7 M.

Further, the cleavage patterns of Sc-D and Sc-D(21L) are in- distinguishable. Thus, the conserved 21 amino acids at the carboxyl-terminal of the DNA binding region have no effect on the ability of the isolated DNA-binding domain to interact with DNA.

Deletions in the Linker Region-In addition to the short (21- amino-acid) conserved region, S. cereuisiae HSF also contains a unique 52-amino-acid region between the minimal DNA-bind- ing domain and the trimerization domain. To study the role of the 73-amino-acid region between the DNA-binding and tri- merization domains of yeast heat shock transcription factor, a set of internal deletions in S. cereuisiae HSF were created (Fig. 1). The resulting series of proteins are named showing that they are derived from a fragment of the S. cereuisiae HSF gene containing the DNA-binding and mmerization domains sepa- rated by an n amino acid Linker (Sc-DT(nL)). The parent pro- tein, containing the DNA-binding and trimerization domains separated by the full-length linker, is called Sc-DT(73L). A se- ries of deletions, with one end at the defined NH,-terminal end of the trimerization domain, have sequentially removed the Sc unique region (Sc-DT(21L)), then part or all of the 21-amino- acid conserved sequence (Sc-DT(1OL) and Sc-DT(OL), respec- tively). An S. cereuisiae HSF derivative (Sc-DT(19*L)), with

Page 4: THE JOURNAL OF BIOLOGICAL CHEMISTRY 269, No. 17, 29, 12475-12481… · 2001-06-28 · 0 1994 by The American Society for Biochemistry and Molecular Biology, Inc. THE JOURNAL OF BIOLOGICAL

12478 Flexible Linker of Yeast HSF

HSE [

1 2 3 4 s

lrii FIG. 3. The minimal DNA-binding domain specifically binds

nGAAn units. Sc-D and Sc-D(21L) were bound to the fragment TA-3 and subjected to DNase I digestion as described under "Materials and Methods." GIA ladder; no HSF; Sc-DT(73L) 5 x lo-' M; Sc-D 2 x M; Sc-D(21L) 2 x M.

deletion of all but the 19 carboxyl-terminal amino acids of the unique sequence, was also assayed for activity and structural integrity.

Deletions in the Linker Region Do Not Disturb the Oligomer- ization Properties of the HSF Derivatives-The ability of the HSF derivatives containing various forms of the linker to form homotrimers was tested by chemical cross-linking with EGS. Other techniques, such as analytical ultracentrifugation, con- firm cross-linking as a valid assay for trimerization activity in fragments of yeast HSF (8, 19). At similar concentrations of EGS, all of the proteins with internal deletions between the DNA-binding and trimerization domains, including Sc- DT(19*L), produce a band with an apparent molecular weight three times that of the monomer on SDS-polyacrylamide gel electrophoresis (data not shown). Thus changes in the linker do not cause any detectable disruption of the function of the tri- merization domain.

Deletions in the Proposed Linker Region Do Not Alter the Secondary Structure of the HSF Derivatives-To test the struc- tural integrity of the mutants, their secondary structures were probed using CD spectroscopy. Fig. 4A shows that the proteins produce very similar CD spectra. Each HSF derivative has the minima a t 208 and 222 nm characteristic of a mostly a-helical secondary structure. This agrees with the predominantly a-hel- ical nature of the trimerization domain (19), in combination with the partial a-helical character of the DNA-binding domain (26). The highly similar CD spectra of the HSF derivatives

A l o r 5

Sc-DT(OL) - 2 0 1 . I ' I ' I -

200 220 240 260 280

wavelength (nrn)

B 750

U ~ ~ - 2 5 0 -

Y a 8 -

U

-750- ""_ "_ linker of Sc-DT(0L)

- linker of Sc-DT(21 L) linker of Sc-D(21L)

-1000 - I * I ' I '

200 220 240 260 280

wavelength (nrn)

FIG. 4. A, circular dichroism spectra of the HSF protein derivatives

pH 8.3.100 m~ NaCI. Five scans were averaged for each CD spectrum. are very similar. 1 mglml of each protein is in 25 m~ sodium phosphate,

The spectra have been corrected for the concentration and size to give the mean residue ellipticity (q,J B , difference circular dichroism spec- tra shows the apparent spectra of the linker region. The molar elliptic- ity (qp,,,J of the Sc-DT series of protein were each subtracted by the molar ellipticity of the isolated DNA-binding domain and the molar ellipticity of the isolated trimerization domain. The resulting difference spectra should approximate the spectra of the linker region alone. The difference circular dichroism spectra of Sc-DT(21L) and Sc-D(21L) show that the linker has no ordered secondary structure. The flatness of the difference circular dichroism spectrum of Sc-DT(OL) shows that deletion of the linker region does not affect the secondary structure of the DNA- binding and trimerization domains.

imply that the overall secondary structure has not been strongly altered by any of the internal deletions. Thus, it is unlikely that the deletions within the proposed linker have caused substantial misfolding of the protein.

The Proposed Linker Region Has No Ordered Secondary Structure-To look directly at the secondary structure of the linker, the CD spectra of the DNA-binding domain (S. cerevi- siae residues 171-159) and the trimerization domain (S. cer- evisiae residues 333424) were subtracted from each of the CD spectra of the HSF derivatives. As shown in Fig. 4 B , it is ap- parent that the difference CD spectrum representing the linker region of Sc-DT(21L) is characteristic of random coil with a minimum at 230 nm and maximum at 220 nm (27). As ex- pected, the remainder spectrum of the linker of Sc-DT(OL) is nearly flat, indicating that deletion of the entire linker is not disturbing the secondary structure of the DNA-binding or tri-

Page 5: THE JOURNAL OF BIOLOGICAL CHEMISTRY 269, No. 17, 29, 12475-12481… · 2001-06-28 · 0 1994 by The American Society for Biochemistry and Molecular Biology, Inc. THE JOURNAL OF BIOLOGICAL

Flexible Linker of Yeast HSF 12479

“”

HSF trimer/DNA b

Sc-DT(73L) sC-DT(21L) Sc-DT(IOI.) Sc-DT(OI.)

FIG. 5. The contents of the linker region affect the apparent DNAbinding affinities of the HSF derivatives. All HSF derivatives were bound to TA-3 and electrophoresed through a native polyacrylam- ide gel as described under “Materials and Methods.” From left to right Sc-DT(73L), 4.8 x 10“ Y, 9.6 x 10.’ Y, and 1.9 x lo” M; Sc-DT(21L), 4.8 x lo-‘ M. 9.6 x lo” M, and 1.9 x 10.‘ M; Se-nT(IOI,,). 1.8 x in-‘ M, 9.6 x lW7 M, and 1.9 x lo-.’ M; Sc-DTfOL), 1.44 x M, 2.8 x M, and 5.8 X lo-’ M.

merization domain. When the CD spectrum of Sc-D is sub- tracted from the CD spectrum of Sc-D(21L), the difference spec- tra is also characteristic of random coil. Therefore, the 21 conserved amino acids have no defined secondary structure either when free at the COOH-terminal end or when tethered to the trimerization domain.

Deletions in the Linker Region Decrease DNA Binding Afinity-Although the deletions do not appear to radically af- fect the structure of the DNA binding and trimerization region, HSF fragments with deletions in the conserved linker region are deficient in DNA binding activity as assayed by gel mobility shift. Several concentrations of each HSF derivative were mixed with a radioactively labeled DNA fragment containing three nGAAn boxes, then electrophoresed through a non-dena- turing polyacrylamide gel and autoradiographed (Fig. 5). The apparent dissociation constants were defined as the concentra- tion of protein required to bind 50% of the target DNA in an HSF trimer-DNA complex. Sc-DT(73L) and Sc-DT(21L) have apparent dissociation constants of approximately 2 x M and 5 x lo-’ M, respectively. Both Sc-DT(10L) and Sc-DT(OL), which have deletions in the conserved linker region and a complete deletion of the Sc unique region, bind DNA with approximately 10-fold lower affinity than the wild-type protein. The extent of the deletion of the proposed linker does not affect the extent of binding, as the dissociation constants of both Sc-DT(10L) and Sc-DT(0L) are both approximately 2 x 10“ M. Finally, Sc- DT(l9*L), with only a portion of the Sc unique region, binds less well than any of the other mutant proteins studied, includ- ing the mutant with no linker sequence at all (data not shown). The extent of secondary structure seen in the CD spectra (Fig. 4A), along with the ability of the HSF derivatives to trimerize, argue against the possibility that the loss of binding is due to misfolded or unfolded proteins.

All of the trimeric HSF derivatives bind DNA with higher affinity than the monomeric DNA-binding domain. The disso- ciation constants of Sc-D and Sc-D(21L) are 2 x M in 25 mM NaCl rather than 100 mM NaCl as was used in the reactions of the trimeric HSF fragments with DNA. The lower salt concen- tration allows stronger interaction between the protein and DNA and is necessary for the monomeric DNA-binding domain to bind DNA.

Deletions in the Conserved Linker Region Are Lethal in Yeast-Yeast HSF is required for viability and has constitutive

transcriptional activity (15, 16). The internal deletions de- scribed above were duplicated in the full-length gene, and the mutant HSFs were expressed in S. cerevisiae from the native HSF promoter on a single copy plasmid. The chromosomal HSF gene was disrupted by LEU2, and wild-type HSF was ex- pressed solely from a lOGALl promoter on a plasmid contain- ing a selectable marker for uracil production (15, 28). Yeasts containing both plasmids were grown on media with either galactose or glucose as the sole carbon source. On galactose media, the wild-type HSF was produced, and all cells grew. When placed on glucose media, however, only the mutant HSF will be expressed. Since HSF is essential, any mutant HSF which is unable to fully function will not be able to support growth on glucose media. Removal of the Sc unique region had no effect on the viability of the yeast cells at any of the tem- peratures examined. All proteins with partial or total removal of the conserved linker sequences were unable to support growth at any temperature assayed (see Table I). Thus any deletion in the 21-amino-acid conserved linker region results in non-functional HSF.

DISCUSSION

All known HSFs have a highly conserved DNA binding re- gion. Although the homologous region includes a t least 110 amino acids, an 89-amino-acid minimal DNA-binding domain is sufficient for specific binding to HSEs. The additional 21 amino acids have no effect on the apparent DNA binding afin- ity nor on the cleavage pattern of the DNase I footprint of the DNA-binding domain. Further, difference CD spectra indicate that the last 21 amino acids of the conserved DNA binding region have no defined secondary structure in solution. Thus, the minimal DNA-binding domain consists of residues 171 to 259 of S. cerevisiae HSF.

The recent x-ray crystal structure of the DNA-binding do- main of HSF from the milk yeast, Kluyveromyces lactis, shows that this domain is a variant of the helix-turn-helix family of DNA-binding proteins (26). The fragment which was crystal- lized is equivalent to residues 171-259 of s. cerevisiae HSF, defined in this paper as the minimal DNA-binding domain. The limits of the DNA-binding domain are also supported by NMR studies of a 130-amino-acid fragment of Drosophila melano- gaster HSF including residues amino- and carboxyl-terminal to the DNA-binding domain (29). These results demonstrate no defined secondary structure either amino-terminal of amino acid 48, which is equivalent to amino acid 174 of S. cerevisiae HSF, or carboxyl-terminal of amino acid 132 of D. melanogaster HSF which is equivalent to amino acid 259 of S. cerevisiae HSF. Finally, limited proteolysis of Sc-D(21L) with trypsin yields two major proteolytic fragments, amino acids 171-242 and amino acids 244-264, as determined by electrospray-ionization mass spectrometry.’ The first proteolytic product represents the ami- no-terminal portion of the protein up to a cleavage site in an extended loop, as seen in the crystal structure (26). The second fragment is created by cleavage in the extended loop and in the proposed linker. The remaining residues in the proposed linker were not recovered, supporting the conclusion that the 21 con- served amino acids are highly flexible and probably solvent exposed.

Unlike most HSFs, S. cerevisiae HSF contains 52 non-con- served amino acids between the proposed linker and the trim- erization domain. Deletion of the Sc unique region has no effect on the structure or function of HSF. Analysis by CD spectros- copy shows that deletion of this region does not substantially change the overall secondary structure of a fragment including

K. E. Flick and D. King, unpublished results.

Page 6: THE JOURNAL OF BIOLOGICAL CHEMISTRY 269, No. 17, 29, 12475-12481… · 2001-06-28 · 0 1994 by The American Society for Biochemistry and Molecular Biology, Inc. THE JOURNAL OF BIOLOGICAL

12480 Flexible Linker of Yeast HSF the DNA binding and trimerization regions. Sc-DT(21L) binds DNA with an apparent affinity similar to the wild-type frag- ment Sc-DT(73L) and can trimerize as well as the non-deleted fragment. Finally, deletion of the Sc unique region from full- length HSF does not alter the viability of yeast in which the mutant HSF has replaced the wild-type HSF. These results imply that the 52-amino-acid Sc unique region does not provide the flexibility that is required for a trimeric protein to bind linear DNA. Although the Sc unique region does not affect the binding of HSF to DNA under physiological conditions, it may play another, as yet unknown, role in transcriptional activation as a response to heat shock.

HSF binds to DNA as a trimer (8) and biochemical studies show that the trimerization domain is a triple-stranded a-hel- ical coiled-coil (19). Such a trimeric coiled-coil would force the DNA-binding domains to be placed at 120" relative to one an- other. DNA, on the other hand, has approximately 180" sym- metry between inverted binding sites. Therefore, some accom- modation must be made by either the protein or the DNA in order to contact nGAAn units with each of the three DNA- binding domains within a trimer. Circular permutations assays show that the DNA is not significantly bent when bound by HSF, implying that the protein must make most of the adjust- ments (22, 30). Fig. 6 illustrates the juxtaposition of a trimeric structure onto a DNA molecule consisting of a series of inverted

nGAAn sequences. The model depicts how a trimeric molecule can not fully occupy three consecutive nGAAn repeats. A flex- ible linker would allow the adjustment of an 3-fold symmetric HSF to one that can bind all three sites in the major groove, where HSF has been shown to bind (7, 30).

Many observations presented in this paper lead to the con- clusion that the 21 conserved amino acids at the carboxyl- terminal end of the DNA-binding domain shares characteris- tics found in other defined linkers and must be the necessary flexible linker. First, linkers have been shown to have little defined secondary structure, allowing great flexibility in the movement of the connected domains. For example, circular di- chroism and nuclear magnetic resonance spectroscopy on syn- thetic peptides of a linker from the dihydrolipoyl acetyltrans- ferase component of the E. coli pyruvate dehydrogenase multienzyme complex shows that the linkers have little sec- ondary structure (31). The CD difference spectra indicate that the 21 conserved amino acids of HSF's DNA binding region have no defined secondary structure when tethered by the tri- merization domain or when free at the carboxyl-terminal end. Thus, the 21 conserved amino acids likely provide the flexibility necessary for a trimeric molecule to bind linear DNA.

Second, linkers provide more than simple flexibility. Al- though linkers have little or no ordered secondary structure,

TABLE I Viability of HSF derivatives in vivo

37 "C 30 "C 25 "C 18 "C Glucose Galactose Glucose Galactose Glucose Galactose Glucose Galactose

- Vector + - ND ND ND ND + Sc HSF + + + + + + + + SC421L) + + + + + + + + SC-(lOL) SC-(OL) Sc-(19*L)

a ND, not determined.

- + + +

- + + +

- + + +

- + + +

- - - - - - - -

FIG. 6. Two different views (at 90" relative to one another) of a model of HSF bound to DNA illustrate the sym- metry problems of a trimer protein. Each ellipse represents a single DNA- binding domain with the arrows indicat- ing the orientation of the binding site in the major groove. The three solid bold lines in each view depict the extension of 3-fold rotational symmetry from the base of the trimerization domains.

Page 7: THE JOURNAL OF BIOLOGICAL CHEMISTRY 269, No. 17, 29, 12475-12481… · 2001-06-28 · 0 1994 by The American Society for Biochemistry and Molecular Biology, Inc. THE JOURNAL OF BIOLOGICAL

Flexible Linker of Yeast HSF 12481

particular sequences may be necessary for proper function or activity. For example, mutations in the "hinge region" between the DNA-binding and dimerization domains of LexA negatively affect the ability of the protein to form stable complexes with DNA (32). For HSF, the DNA-binding and trimerization do- mains must be separated by the conserved amino acid linker for proper function. Although Sc-DT(21L) and Sc-DT(lS*L) have linkers of similar length, the apparent DNA binding affinity of the fragment with the substituted linker (Sc-DT(lS*L)) is sig- nificantly lower. The conserved linker region may have the ability to adopt a particular conformation or set of conforma- tions which other sequences, such as those found in the Sc unique region, cannot.

Finally, linkers have also been implicated in regulating the orientation of the linked domains. Deletion of the linker in endoglucanase A changes the activity of both domains as well as the protease sensitivity of the mutant protein. Structural data support the hypothesis that these changes are due to altered orientation of the two domains caused by deletion of the linker (33). Further, linkers have been implicated in regulating the affinity of DNA-binding proteins for sites on the DNA. For example, deletions and additions in the linker regions of type IC restriction enzymes result in altered DNA sequence speci- ficity (34). The linker of members of the C , zinc cluster family of yeast transcriptional activators directs binding to DNA tar- get sites with different spacings between the inverted binding sites (35). Deletions in the linker region of HSF affect the DNA binding afinity only when the trimerization domain constrains the rotational symmetry of the protein. Thus it is possible that the linker region of HSF assists the high affinity binding to DNA by allowing the proper orientation of the DNA-binding domains that are necessary for the juxtaposition of the trimeric protein with linear DNA.

Acknowledgments-We thank the following people for constlvction of plasmids: Pia Abola (pHN126, pHN124, pHN136, and pHN138); Eva Grotkopp (pHN148 and pHN420); Becky Drees (pBL3); and Susan Hubl (pHN1002). We also thank Susan Hubl for help with the yeast experi- ments, Prof. Susan Marqusee for help with circular dichroism, and Dr. David King for help with the electrospray-ionization mass spectrometry.

REFERENCES

2. Amin, J., Ananthan, J., and Voellmy, R. (1988) Mol. Cell. Biol. 8, 37614769 1. Xiao, H., and Lis, J. T. (1988) Science 239, 1139-1142

3. Bienz, M., and Pelham, H. R. (1987) Adu. Genet. 24,31-72 4. Boorstein, W. R., and Craig, E. A. (1990) J. Biol. Chem. 265,18912-18921 5. Morimoto, R. I., Sarge, K. D., and Abravaya, K. (1992) J. Biol. Chem. 267,

6. Lis, J., and Wu, C. (1993) Cell 74, 1-4 7. Perisic, O., Xiao, H., and Lis, J. T. (1989) Cell 59, 7974306 8. Sorger, P. K, and Nelson, H. C. M. (1989) Cell 59,807413 9. Rabindran, S . K., Giorgi, G., Clos, J., and Wu, C. (1991) P m . Natl. Acad. Sci.

10. Schuetz, T. J., Gallo, G. J., Sheldon, L., Tempst, P., and Kingston, R. E. (1991)

11. Sarge, K. D., Zimarino, V., Holm, K., Wu, C., and Morimoto, R. I. (1991) Genes

12. Nakai, A., and Morimoto, R. I. (1993) Mol. Cell Biol. 13,1983-1997 13. Clos, J., Westwood, J. T., Becker, P. B., Wilson, S . , Lambert, K., and Wu, C.

14. Scharf, K D., Rose, S . , Zott, W., Schoffl, F., Nover, L., and Schoff, F. (1990)

15. Sorger, P. K., and Pelham, H. R. (1988) Cell 64,855-864 16. Wiederrecht, G., Seto, D., and Parker, C. S . (1988) Cell 64,841-853 17. Jakobsen, B. K., and Pelham, H. R. (1991) EMBO J. 10,369-375 18. Gallo, G. J., Prentice, H., and Kingston, R. E. (1993) Mol. Cell. Biol. 13,

19. Peteranderl, R., and Nelson, H. C. M. (1992) Biochemistry 31, 12272-12276 20. Studier, F. W., Rosenberg, A. H., Dunn, J. J., and Dubendorff, J. W. (1989)

21. Stark, M. J. (1987) Gene (Amst.) 51,255-267 22. Rye, H. S. , Drees, B. L., Nelson, H. C., and Glazer, A. N. (1993) J. Biol. Chem.

268, 25229-25238 23. Cantor, C. R., and Schimmel, P. R. (1980) Biophysical Chemistry, pp. 409480,

W. H. Freeman and Company, San Francisco, CA 24. Maxam, A. M., and Gilbert, W. (1980) Methods EnzymoZ. 66,499-560 25. Shore, D., and Nasmyth, K. (1987) Cell 51,721-732 26. Hamson, C. J., Bohm, A. A., and Nelson, H. C. M. (1994) Science 263,224-227 27. Saxena, V. P., and Wetlaufer, D. B. (1971) Proc. Natl. A c d . Sci. U. S. A. 6 8 ,

28. Sorger, P. K. (1990) Cell 62, 7934305 29. Vuister, G. W., Kim, S.J., Wu, C., and Bax, A. (1994) Biochemistry 33, 1&16 30. Gross, D. S. , English, K. E., Collins, K. W., and Lee, S . W. (1990) J. Mol. Biol.

31. Radford, S . E., Laue, E. D., Perham, R. N., Martin, S . R., andAppella, E. (1989)

32. Oertel-Buchheit, P., Schmidt-Dorr, T., Granger-Schnarr, M., and Schnarr, M.

33. Shen, H., Schmuck, M., Pilz, I., Gilkes, N. R., Kilburn, D. G., Miller, R., Jr., and

34. Gubler, M., and Bickle, T. A. (1991) EMBO J. 10,951-957 35. Reece, R. J., and Ptashne, M. (1993) Science 261,909-911

21987-21990

U. S. A. 88,6906-6910

Proc. Nutl. Acad. Sci. U. S. A. 88,69114915

& Deu. 5, 1902-1911

(1990) Cell 63, 1085-1097

EMBO J. 9,4495-4501

749-761

Methcds Enzymol. 186, 60-89

969-972

216,611431

J. Biol. Chem. 264, 767-775

(1993) J. Mol. Biol. 229, 1-7

Warren, R. A. (1991) J. Biol. Chem. 266, 11335-11340