quantitativeanalysisof invivo …yks2 by the following manipulations. yks2 was transformed with...

11
Quantitative Analysis of in Vivo Initiator Selection by Yeast RNA Polymerase II Supports a Scanning Model * Received for publication, February 28, 2006, and in revised form, March 29, 2006 Published, JBC Papers in Press, March 29, 2006, DOI 10.1074/jbc.M601937200 Jason N. Kuehner and David A. Brow 1 From the Department of Biomolecular Chemistry, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin 53706 Initiation of transcription by RNA polymerase II (RNAP II) on Saccharomyces cerevisiae messenger RNA (mRNA) genes typically occurs at multiple sites 40 –120 bp downstream of the TATA box. The mechanism that accommodates this extended and variable pro- moter architecture is unknown, but one model suggests that RNAP II forms an open promoter complex near the TATA box and then scans the template DNA strand for start sites. Unlike most protein- coding genes, small nuclear RNA gene transcription starts predom- inantly at a single position. We identify a highly efficient initiator element as the primary start site determinant for the yeast U4 small nuclear RNA gene, SNR14. Consistent with the scanning model, transcription of an SNR14 allele with tandemly duplicated start sites initiates primarily from the upstream site, yet the downstream site is recognized with equivalent efficiency by the diminished popula- tion of RNAP II molecules that encounter it. A quantitative in vivo assay revealed that SNR14 initiator efficiency is nearly perfect (90%), which explains the precision of U4 RNA 5 end formation. Initiator efficiency was reduced by cis-acting mutations at 8, 7, 1, and 1 and trans-acting substitutions in the TFIIB B-finger. These results expand our understanding of RNAP II initiation pref- erences and provide new support for the scanning model. Eukaryotes rely on RNA polymerase II (RNAP II) 2 to synthesize all messenger RNAs (mRNAs) and most of the small nuclear RNAs (snR- NAs) and small nucleolar RNAs (snoRNAs) encoded within their nuclear genomes. Efficient and accurate transcription initiation is vital to ensure the proper expression and function of these RNAs. The recruitment of RNAP II to gene promoters is mediated through the assembly of a pre-initiation complex (PIC). RNAP II accessory proteins provide promoter specificity and the structural core for assembly of the PIC. These accessory proteins include the general transcription factors TFIID, TFIIB, TFIIF, TFIIE, and TFIIH (1–3). Some transcription fac- tors engage in sequence-specific contacts with core promoter elements (4); one of the most fundamental interactions for PIC assembly is between the TATA-binding protein subunit of TFIID and the TATA box (5– 6). In a stepwise model for PIC assembly, TATA-binding pro- tein binding is followed by the addition of TFIIB, RNAP II-TFIIF, TFIIE, and TFIIH (7). In metazoans, the assembly of a PIC at the TATA box results in start site selection 25–30 bp downstream (4). The architecture of the PIC is such that the transcription start site is placed precisely within the active center of RNAP II (8 –9). In the yeast Saccharomyces cerevisiae, RNAP II initiation typically occurs at multiple sites at variable distances from the TATA box, with most start sites ranging from 40 to 120 bp downstream of the TATA box (10). The initiation mechanism that accommodates this extended and variable promoter architecture is unknown, but it does not appear to be dependent on assembling the yeast PIC in a man- ner different from that of metazoans. Yeast promoter melting has been shown to begin at the same position as in metazoans, 20 bp down- stream of the TATA box (11). In addition, the 30-bp distance between the TATA box and RNAP II active center has been confirmed through structural analysis of yeast PICs (8 –9). A scanning model for start site selection has been proposed for yeast (11). In this scanning model a PIC assembles at a TATA box, the DNA is melted, and RNAP II translocates downstream searching the template strand for acceptable start sites. The initial sequence comparisons and mutational analysis of a rela- tively small set of yeast mRNA genes helped define three related yeast start site consensus sequences, RRYRR, TCRA, and YA(A/T)R in the non-template strand, where the initiation site is underlined, Y is pyrim- idine, and R is purine (12–14). Recently, an alignment of sequences flanking 4637 yeast transcription start sites has provided a more re- fined consensus sequence: A(A rich ) 5 NYA(A/T)NN(A rich ) 6 (15). The DNA sequences encompassing yeast transcription start sites are sometimes termed initiator elements. In metazoans the initiator is defined as a core promoter element distinct from the TATA box that nucleates PIC assembly and is sufficient for accurate transcription (16). Although there is evidence to suggest that some yeast initiators may function in this way (17–18), most appear to play a more limited role in transcriptional control and influence accuracy but not overall efficiency (2). It seems likely that the recognition and efficient utilization of yeast start sites involves a sequence-specific interaction between the yeast initiator element and either RNAP II, an accessory protein, or both. RNAP II and TFIIB have been shown to dictate the distance from TATA boxes to start sites in yeast (19). TFIIB substitutions that confer down- stream shifts in yeast start site selection map to the “B-finger” domain, which encompasses residues 55– 88 of the N-terminal region (9, 20 –22). The promoter sequence immediately upstream of yeast start sites can influence the severity with which TFIIB B-finger substitutions alter start site selection (23). A yeast RNAP II-TFIIB crystal structure model shows the TFIIB B-finger inserted through the RNA exit pore into the polymerase active site, suggesting that start site selection may be mediated by a direct interaction between the B-finger and promoter DNA (9). Here we report the characterization of cis- and trans-acting determi- nants of start site selection at the yeast U4 snRNA gene, SNR14. In contrast to the heterogeneous transcription start site selection exhibited * This work was supported by National Institutes of Health, Public Health Service Grant GM44665 (to D. A. B.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “adver- tisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. The work is dedicated to the memory of Kathryn A. Kuehner (1939 –2006). 1 To whom correspondence should be addressed: Dept. of Biomolecular Chemistry, University of Wisconsin School of Medicine and Public Health, 1300 University Ave., Madison, WI 53706-1532. Tel.: 608-262-1475; Fax: 608-262-5253; E-mail: [email protected]. 2 The abbreviations used are: RNAP II, RNA polymerase II; snRNA, small nuclear RNA; snoRNA, small nucleolar RNA; PIC, pre-initiation complex; TF, transcription factor; RT, reverse transcription. THE JOURNAL OF BIOLOGICAL CHEMISTRY VOL. 281, NO. 20, pp. 14119 –14128, May 19, 2006 © 2006 by The American Society for Biochemistry and Molecular Biology, Inc. Printed in the U.S.A. MAY 19, 2006 • VOLUME 281 • NUMBER 20 JOURNAL OF BIOLOGICAL CHEMISTRY 14119 by guest on October 24, 2020 http://www.jbc.org/ Downloaded from

Upload: others

Post on 05-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: QuantitativeAnalysisof inVivo …YKS2 by the following manipulations. YKS2 was transformed with pRS313-SNR14 and pRS317-SUA7 by the lithium acetate procedure (25). His Lys clones were

Quantitative Analysis of in Vivo Initiator Selection by YeastRNA Polymerase II Supports a Scanning Model*

Received for publication, February 28, 2006, and in revised form, March 29, 2006 Published, JBC Papers in Press, March 29, 2006, DOI 10.1074/jbc.M601937200

Jason N. Kuehner and David A. Brow1

From the Department of Biomolecular Chemistry, University of Wisconsin School of Medicine and Public Health,Madison, Wisconsin 53706

Initiation of transcription by RNA polymerase II (RNAP II) onSaccharomyces cerevisiae messenger RNA (mRNA) genes typicallyoccurs at multiple sites 40–120 bp downstream of the TATA box.Themechanism that accommodates this extended and variable pro-moter architecture is unknown, but one model suggests that RNAPII forms an open promoter complex near the TATA box and thenscans the template DNA strand for start sites. Unlike most protein-coding genes, small nuclear RNA gene transcription starts predom-inantly at a single position. We identify a highly efficient initiatorelement as the primary start site determinant for the yeast U4 smallnuclear RNA gene, SNR14. Consistent with the scanning model,transcriptionof anSNR14 allelewith tandemlyduplicated start sitesinitiates primarily from the upstream site, yet the downstream siteis recognized with equivalent efficiency by the diminished popula-tion of RNAP II molecules that encounter it. A quantitative in vivoassay revealed that SNR14 initiator efficiency is nearly perfect(�90%), which explains the precision of U4 RNA 5� end formation.Initiator efficiency was reduced by cis-acting mutations at �8, �7,�1, and �1 and trans-acting substitutions in the TFIIB B-finger.These results expand our understanding of RNAP II initiation pref-erences and provide new support for the scanning model.

Eukaryotes rely on RNA polymerase II (RNAP II)2 to synthesize allmessenger RNAs (mRNAs) and most of the small nuclear RNAs (snR-NAs) and small nucleolar RNAs (snoRNAs) encoded within theirnuclear genomes. Efficient and accurate transcription initiation is vitalto ensure the proper expression and function of these RNAs. Therecruitment of RNAP II to gene promoters is mediated through theassembly of a pre-initiation complex (PIC). RNAP II accessory proteinsprovide promoter specificity and the structural core for assembly of thePIC. These accessory proteins include the general transcription factorsTFIID, TFIIB, TFIIF, TFIIE, and TFIIH (1–3). Some transcription fac-tors engage in sequence-specific contacts with core promoter elements(4); one of the most fundamental interactions for PIC assembly isbetween the TATA-binding protein subunit of TFIID and the TATAbox (5–6). In a stepwise model for PIC assembly, TATA-binding pro-tein binding is followed by the addition of TFIIB, RNAP II-TFIIF, TFIIE,and TFIIH (7).

In metazoans, the assembly of a PIC at the TATA box results in startsite selection 25–30 bp downstream (4). The architecture of the PIC issuch that the transcription start site is placed precisely within the activecenter of RNAP II (8–9). In the yeast Saccharomyces cerevisiae, RNAP IIinitiation typically occurs at multiple sites at variable distances from theTATA box, withmost start sites ranging from 40 to 120 bp downstreamof the TATA box (10). The initiation mechanism that accommodatesthis extended and variable promoter architecture is unknown, but itdoes not appear to be dependent on assembling the yeast PIC in a man-ner different from that of metazoans. Yeast promoter melting has beenshown to begin at the same position as in metazoans, �20 bp down-stream of the TATAbox (11). In addition, the�30-bp distance betweenthe TATA box and RNAP II active center has been confirmed throughstructural analysis of yeast PICs (8–9). A scanning model for start siteselection has been proposed for yeast (11). In this scanningmodel a PICassembles at a TATAbox, the DNA ismelted, and RNAP II translocatesdownstream searching the template strand for acceptable start sites.The initial sequence comparisons and mutational analysis of a rela-

tively small set of yeast mRNA genes helped define three related yeaststart site consensus sequences, RRYRR, TCRA, and YA(A/T)R in thenon-template strand, where the initiation site is underlined, Y is pyrim-idine, andR is purine (12–14). Recently, an alignmentof sequences flanking4637 yeast transcription start sites has provided a more re-fined consensus sequence: A(Arich)5NYA(A/T)NN(Arich)6 (15). The DNAsequences encompassing yeast transcription start sites are sometimestermed initiator elements. In metazoans the initiator is defined as acore promoter element distinct from the TATA box that nucleatesPIC assembly and is sufficient for accurate transcription (16).Although there is evidence to suggest that some yeast initiators mayfunction in this way (17–18), most appear to play a more limited rolein transcriptional control and influence accuracy but not overallefficiency (2).It seems likely that the recognition and efficient utilization of yeast

start sites involves a sequence-specific interaction between the yeastinitiator element and either RNAP II, an accessory protein, or both.RNAP II andTFIIB have been shown to dictate the distance fromTATAboxes to start sites in yeast (19). TFIIB substitutions that confer down-stream shifts in yeast start site selection map to the “B-finger” domain,which encompasses residues 55–88 of the N-terminal region (9,20–22). The promoter sequence immediately upstream of yeast startsites can influence the severity with which TFIIB B-finger substitutionsalter start site selection (23). A yeast RNAP II-TFIIB crystal structuremodel shows the TFIIB B-finger inserted through the RNA exit poreinto the polymerase active site, suggesting that start site selection maybe mediated by a direct interaction between the B-finger and promoterDNA (9).Here we report the characterization of cis- and trans-acting determi-

nants of start site selection at the yeast U4 snRNA gene, SNR14. Incontrast to the heterogeneous transcription start site selection exhibited

* This work was supported by National Institutes of Health, Public Health Service GrantGM44665 (to D. A. B.). The costs of publication of this article were defrayed in part bythe payment of page charges. This article must therefore be hereby marked “adver-tisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The work is dedicated to the memory of Kathryn A. Kuehner (1939 –2006).1 To whom correspondence should be addressed: Dept. of Biomolecular Chemistry,

University of Wisconsin School of Medicine and Public Health, 1300 UniversityAve., Madison, WI 53706-1532. Tel.: 608-262-1475; Fax: 608-262-5253; E-mail:[email protected].

2 The abbreviations used are: RNAP II, RNA polymerase II; snRNA, small nuclear RNA;snoRNA, small nucleolar RNA; PIC, pre-initiation complex; TF, transcription factor; RT,reverse transcription.

THE JOURNAL OF BIOLOGICAL CHEMISTRY VOL. 281, NO. 20, pp. 14119 –14128, May 19, 2006© 2006 by The American Society for Biochemistry and Molecular Biology, Inc. Printed in the U.S.A.

MAY 19, 2006 • VOLUME 281 • NUMBER 20 JOURNAL OF BIOLOGICAL CHEMISTRY 14119

by guest on October 24, 2020

http://ww

w.jbc.org/

Dow

nloaded from

Page 2: QuantitativeAnalysisof inVivo …YKS2 by the following manipulations. YKS2 was transformed with pRS313-SNR14 and pRS317-SUA7 by the lithium acetate procedure (25). His Lys clones were

at most mRNA genes, yeast snRNAs typically have one major start site,thus providing a model system for the study of accurate initiation. Weidentified a highly efficient initiator element within the SNR14 pro-moter, defined the positions most critical for its function in start siteselection, and quantified its efficiency relative to other initiatorsequences. Substitutions within the TFIIB B-finger genetically interactwith SNR14 initiator mutations in a sequence-dependent manner.Quantitative analysis of the utilization of tandemly duplicated initiatorelements strongly supports the proposed scanningmodel for yeast tran-scription start site selection and demonstrates that scanning by RNAP IIis processive.

EXPERIMENTAL PROCEDURES

Plasmid Construction—SNR14 (positions �224 to �701, relative to�1 transcription start site) was cloned by PCR amplification of agenomic DNA template isolated from yeast strain PJ43�2b and ligatedinto the BamHI site of pRS313 (CEN4, ARS1, HIS3). 5� end truncationconstructs were generated by the same method but using pRS313-SNR14 as the template. pRS313-SNR14-StDup was created by usingQuikChange PCR mutagenesis (Stratagene) to insert 14 bp of DNA(�13 to�1 relative to SNR14 start site) between positions�1 and�2 ofpRS313-SNR14, creating an overlapping 20-bp duplication. pRS317(LYS2)-SUA7 contains the entire SUA7 promoter and coding region(TFIIB gene) and was constructed by ligation of the ClaI/SacI fragmentof pRS314 (TRP1)-yIIBN (kindly provided by A. Ponticelli, State Uni-versity of New York at Buffalo) into pRS317 (LYS2). TFIIB expressedfrom these constructs contains an N-terminal hexahistidine tag. Allmutations within pRS313-SNR14-StDup and pRS317-SUA7 were cre-ated using the QuikChange method (Stratagene). pRS316 (URA3)-SNR14, SUA7 was constructed by ligating a PCR-amplified region ofSNR14 (�224 to � 701) into the SalI/XhoI sites of pRS316 (URA3)-yIIBN (kindly provided by A. Ponticelli, State University of New York atBuffalo). Oligonucleotide sequences are available upon request.

Yeast Strains—Genomic DNA isolated from PJ43�2b (MAT� trp1ura3 can1 leu2 his3 ade2 met2 lys2; kindly provided by P. James and E.Craig, University of Wisconsin at Madison) was used for the cloning offull-length SNR14. YKS2 (MATa trp1 his3 ura3 ade2 lys2 snr14::TRP1YCp50 (URA3)-SNR14; kindly provided by K. Shannon and C. Guthrie,University of California at San Franscisco) has been described previ-ously (24). JNK1 (MATa trp1 his3 ura3 ade2 lys2 snr14::TRP1sua7::KANMX4 pRS316 (URA3)-SNR14, SUA7) was derived fromYKS2 by the following manipulations. YKS2 was transformed withpRS313-SNR14 and pRS317-SUA7 by the lithium acetate procedure(25). His�Lys� clones were transformed with 5 �g of SUA7::KANMX4and plated on yeast extract-peptone-dextrose plus 200 mg/liter G418(Invitrogen). The SUA7::KANMX4 product was obtained by PCR usinggenomic DNA template from a diploid strain heterozygous for aKANMX4-disrupted allele of SUA7 (Invitrogen). Integrants were con-firmed by PCR using genomic DNA isolated from KanR colonies. Inte-grant strains were transformedwith pRS316 (URA3)-SNR14, SUA7 andgrown on medium lacking uracil. Loss of pRS313-SNR14 and pRS317-SUA7 was confirmed by replica plating onto �his and �lys medium,thus creating JNK1. Plasmid-borne alleles of SNR14 and SUA7 inpRS313 and pRS317, respectively, were transformed into JNK1 bystandard plasmid shuffle techniques (26). Loss of pRS316 (URA3)-SNR14, SUA7 was selected for on medium containing 0.75 mg/ml of5-fluoroorotic acid.

DNase I Chromatin Footprinting—Chromatin footprinting was per-formed as previously described (27) using the yeast strain PJ43�2b.After digestion of lysed yeast cells or purified genomicDNAwithDNase

I (Invitrogen), cleavage sites on the non-template strand of the SNR14promoter were mapped by primer extension using 32P-labeled oligoU4-14C, which is complementary to non-template strand residues�32to �51. Sequencing ladders were generated by primer extension ofgenomic DNA using 32P-labeled oligo U4-14C and a dNTP mix con-taining dideoxy-ATP or -GTP.

RNA Analysis—Total cellular RNA was isolated using the guani-dinium thiocyanate method including a 65 °C phenol extraction (28).Reverse transcription for the determination of the Sec3mRNA-processingsites was performed in a 50-�l reaction volume containing 5 �g of totalRNA from strain PJ43�2b, 50 mM Tris-HCl (pH 8.3), 8 mM MgCl2, 50mM NaCl, 11 mM dithiothreitol, 1 mM dNTPs, 40 units of RNasin (Pro-mega), 250 pmol of T16-EcoR1 oligo, and 37.5 units of avianmyeloblas-tosis virus reverse transcriptase (United States Biochemical). cDNAsynthesis proceeded at 42 °C for 1 h. Ten�l of the 50-�l RT reactionwasused as a template for PCR in a 100-�l volume containing 20 mM

(NH4)2SO4, 50 mM Tris-HCl (pH 9.0), 0.75 mM MgCl2, 50 pmol ofT16-EcoR1 oligo, 100 pmol of SEC3-RT-PCR oligo, and 1 unit ofMasterAmpTM Tfl DNA polymerase (Epicentre). Each PCR cycle con-sisted of a denaturation at 94 °C for 30 s, annealing at 42 °C for 30 s, andelongation at 72 °C for 1min. A total of 30 cycles was performedwith anadditional extension at 72 °C for 5 min. RT-PCR products were gel-purified and ligated into the BamHI/EcoRI site of pRS316. Recoveredplasmids were sequenced using an M13F oligo.RNA immunoprecipitations were done as described previously (29).

Each immunoprecipitation mixture contained 20 �l of swollen proteinA-SepharoseCL-4Bbeads (AmershamBiosciences ) that had previouslybeen incubated with 5 �g of 2,2,7-trimethylguanosine Ab-1 antibody(Calbiochem), 50 units of RNasin (Promega), and 5 �g of total cellularRNA in a 200-�l volume. The precedent for Ab-1 recognition of both7-methylguanosine and 2,2,7-trimethylguanosine caps came fromBochnig et al. (30).Primer extension analysis of 5 �g of total cellular RNA was carried

out using 32P-labeled oligonucleotide U4–14B (complementary tonucleotides 140–159 of yeast U4 RNA) or SCR1 (complementary tonucleotides 75–92 of yeast scR1 RNA) (31). Sequencing ladders weregenerated using the SequithermEXCEL IIDNA sequencing kit (Epicen-tre). The cDNA products were electrophoresed on 6% polyacrylamide,8.3 M urea gels. Gels were visualized with a Storm PhosphorImager(Amersham Biosciences), and data were quantitated with AmershamBiosciences ImageQuant software (Version 5.2).

RESULTS

Conserved Sequence Elements Upstream of the Yeast U4 snRNAGene,SNR14—To begin characterizing SNR14 promoter architecture, weused comparative sequence analysis to identify conserved elementsupstream of the transcription start site and downstream of the 5�-adja-cent gene, SEC3 (Fig. 1A). An alignment of sequences upstream ofSNR14 in four different species of Saccharomyces (32) helped identifyseveral conserved elements (Fig. 1B). The most strikingly conservedregions include the sequence immediately upstreamof the transcriptionstart site, a TATA box located 100 base pairs upstream of the start site,a T-stretch just upstream of the TATA box, and a region located 31–44base pairs upstream of the TATA box. The most upstream conservedregion may be an upstream activating sequence and in S. cerevisiaeexactly matches the consensus binding site of the transcriptional acti-vator Abf1 (33–34).In addition to promoting SNR14 transcription, another likely func-

tion for conserved sequences in this intergenic region is to direct cleav-age and polyadenylation of Sec3 mRNA. RT-PCR was used to identify

Initiator Selection by Yeast RNAP II

14120 JOURNAL OF BIOLOGICAL CHEMISTRY VOLUME 281 • NUMBER 20 • MAY 19, 2006

by guest on October 24, 2020

http://ww

w.jbc.org/

Dow

nloaded from

Page 3: QuantitativeAnalysisof inVivo …YKS2 by the following manipulations. YKS2 was transformed with pRS313-SNR14 and pRS317-SUA7 by the lithium acetate procedure (25). His Lys clones were

the predominant Sec3 mRNA 3�-processing sites. The most efficientlyrecovered site, which appeared in 12 of 21 clones, mapped to themiddleof the putative Abf1 binding site (Fig. 1B). Six other nearby sites wererepresented by 1–2 clones each. This result implies that the sites ofSEC3 transcription termination and SNR14 PIC assembly overlap.

DNase I chromatin footprinting was used to complement compara-tive sequence analysis in the search for potential SNR14 promoter ele-ments. This procedure probed in vivo assembled chromatin by lysingyeast cells directly into a solution of DNase I. For comparison, puri-fied genomic DNA was digested with DNase I. Cleavage sites weredetected by primer extension (27). No obvious DNase I footprint wasobserved between the SNR14 TATA box and start site (Fig. 1C) despitethe fact that the gene is single-copy and highly transcribed. This findingsuggests that there is no high occupancy protein binding site in theregion of the promoter that separates the location of PIC recruitmentfrom that of transcription initiation. Rather, the subtle changes inDNase I protection and enhancement suggest partial occupancy. Inaddition, some subtle changes in DNase I sensitivity were observed atthe putative upstream activating sequence, TATA box, and initiatorregion, consistent with partial occupancy. We could not make any con-clusions regarding the protein occupancy of the T-stretch given that itwas not efficiently cleaved by DNase I. Because a scanning RNAP IIcomplex is unlikely to provide sufficient promoter occupancy fordetectable DNase I protection, the footprinting results obtained for the

SNR14 promoter are consistent with this model for transcriptioninitiation.

The Conserved SNR14 TATA Box Is Not a Determinant of Start SitePosition in Vivo—Functional upstream SNR14 promoter elements wereroughly mapped using 5�-truncation analysis of a plasmid-borne allele(Fig. 2A). Inserts were tested in both orientations within the vector tocontrol for effects of plasmid sequences. Primer extension of U4 RNAsynthesized from its chromosomal locus showed the single major tran-scription start site designated as �1 (Fig. 2B, lane 1). Strains bear-ing SNR14 with 224 base pairs of upstream DNA on a plasmid yieldedthe same initiation pattern as the chromosomal locus (Fig. 2B, lanes 2and 9). Deletion of the putative upstream activating sequence andT-stretch had no effect on transcription efficiency or accuracy in thecontext of the “forward” plasmid-borne allele but decreased efficiency2-fold in the “reverse” orientation (Fig. 2B, lanes 3 and 10). The effi-ciency of SNR14 transcription was reduced an additional 4–11-foldupon deletion of the TATA box, but transcription start site selectionwas changed little (Fig. 2B, lanes 4 and 11). Upon further truncation toposition �74 and beyond, transcription start site selection becameincreasingly aberrant, resulting in the appearance of additional U4 RNA5� ends (Fig. 2B, lanes 5-8 and 12-14). Except for the�3 reverse allele, allSNR14 truncation mutant strains are viable. Surprisingly, even strainsbearing deletions of upstream DNA to position �3 expressed tran-scripts initiating from the�1 start site, albeit at a very low level (Fig. 2C).

FIGURE 1. Conserved sequence elements upstream of the yeast U4 snRNA gene, SNR14. A, schematic diagram of the SNR14 chromosomal locus, numbered with respect to thetranscription start site. Boxes delimit the mature U4 RNA sequence or the open reading frames of SEC3 and TMA20. Arrows indicate the direction of transcription. B, alignment ofsequences upstream of SNR14 in four species of the genus Saccharomyces (S. cerevisiae (Scer), S. mikatae (Smik), S. kudriavzevii (Skud), S. bayanus (Sbay)). Sequences were obtainedusing the Saccharomyces Genome Data base Fungal BLAST. All non-template strand sequences between �149 (in S. cerevisiae) and �1 relative to the SNR14 transcription start siteare shown, with invariant nucleotides highlighted in gray. The most conserved sequence blocks are underlined, including a putative upstream activating sequence (UAS), T-stretch,TATA box, and initiator (INR). The major Sec3 cleavage and polyadenylation site for S. cerevisiae is marked with an asterisk. The endpoints of 5�-truncation constructs described in Fig.2 are indicated. C, DNase I chromatin footprint on the SNR14 promoter. Lysed yeast cells (lysate DNA) and deproteinized genomic DNA (naked DNA) were digested with 15 and 5 �g/mlof DNase I, respectively, and cleavage sites were mapped by primer extension.

Initiator Selection by Yeast RNAP II

MAY 19, 2006 • VOLUME 281 • NUMBER 20 JOURNAL OF BIOLOGICAL CHEMISTRY 14121

by guest on October 24, 2020

http://ww

w.jbc.org/

Dow

nloaded from

Page 4: QuantitativeAnalysisof inVivo …YKS2 by the following manipulations. YKS2 was transformed with pRS313-SNR14 and pRS317-SUA7 by the lithium acetate procedure (25). His Lys clones were

The effects of promoter truncations varied somewhat depending on theorientation of SNR14 alleles within the plasmid, with the reverse orien-tation yielding more severe effects on growth and U4 RNA synthesis.Because the differences appeared to be limited to transcription effi-ciency and not transcription start site selection, they are likely due tofunctional promoter elements in the vector sequences.To confirm that the additional U4 RNA 5� ends observed upon trun-

cation of the SNR14 promoter were due to misinitiation and not degra-dation, we tested for the presence of the methylguanosine 5� cap that isadded to nascent RNAP II transcripts (35). Because the 7-methyl-guanosine cap attached co-transcriptionally to RNAP II transcripts ishypermethylated to trimethylguanosine on snRNAs, we used a mono-clonal antibody that recognizes both cap structures. RNAs correspond-ing to all major U4-specific primer extension products were efficientlyimmunoprecipitated with an anti-cap antibody (Fig. 3A), confirmingthat they are primary transcripts. The scR1 RNA is synthesized byRNAP III and was not immunoprecipitated with anti-cap antibody,

showing that immunoprecipitation is specific for capped transcripts.The major alternative start sites resulting from SNR14 promoter trun-cations were mapped to positions �35, �32, and �27, which arelocated within plasmid sequence, and �20, �32, �83, and �88 in theU4 coding region. Interestingly, all of the start sites mapped to a purineresidue, and 5 of the 8 start sites (including the wild-type start site)immediately follow the dinucleotide CC (Fig. 3B).Thus, transcription initiation of SNR14 from the normal�1 start site

is remarkably resistant to deletion of conserved upstream promoterelements.Much likewhat has been observed previously atmRNAgenes,the SNR14 TATA box primarily affects the frequency of RNAP II tran-scription and not the position of initiation (10). Alternative SNR14 ini-tiation sites minimally include a purine preceded by a pyrimidine andmost often by two cytosines. These results suggest that start site selec-tion at SNR14 is directed by an initiator-like element, potentially incombination with a downstream (intragenic) element.

APolar Effect on SNR14 Start Site Selection Supports aUnidirectionalScanning Model—A scanning model for yeast transcription start siteselection posits that after melting DNA near the TATA box, RNAP IItranslocates along downstreamDNAuntil a suitable initiation sequenceis located (11).Wedirectly tested the validity of thismodel by construct-ing an SNR14 allele with tandemly duplicated start sites, SNR14-StDup(Fig. 4A, construct 2). The SNR14-StDup construct has a 14-base pairinsertion that creates two start sites identical in sequence frompositions�13 to�7,which should be indistinguishable to factors binding directlyto these sequences, including RNAP II. Therefore, if start site selection

FIGURE 2. The conserved SNR14 TATA box is not a determinant of start site positionin vivo. A, schematic diagram of SNR14 promoter deletion mutants. Conserved sequenceelements (Fig. 1) are indicated. Deletion construct end points are indicated as base pairsremaining upstream of the �1 transcription start site. An arrow designates the annealingsite for the primer extension probe. B, primer extension of U4 and scR1 (loading control)RNAs in total cellular RNA isolated from an untransformed strain with an intact SNR14chromosomal locus (CHR) or from strains bearing a disrupted SNR14 chromosomal locustransformed with plasmid-borne SNR14 promoter deletion alleles. Orientation of theSNR14 allele within the plasmid is arbitrarily specified as forward (F) or reverse (R). Thepositions of U4 and scR1 cDNAs are indicated on the left. Arrows on the right denoteadditional U4 RNA 5� ends resulting from promoter truncations. C, summary of SNR14promoter truncation effects on growth phenotype and U4 RNA level. Orientation of theSNR14 allele within the plasmid is as specified in panel B. RNA levels were quantitatedusing only U4 transcripts initiated from the �1 start site and are relative to the full-lengthconstruct (�224) normalized to scR1 level, which was defined as 100. Growth pheno-types were determined qualitatively from serial dilution spot tests on solid syntheticcomplete medium containing 5-fluoroorotic acid.

FIGURE 3. Alternate 5� ends resulting from SNR14 promoter truncations are due toaberrant transcription initiation and share similar start site sequences. A, total RNAwas immunoprecipitated with an anti-2,2,7-trimethylguanosine (TMG) cap antibody.Primer extension of total RNA or pellet (P) and supernatant (S) fractions was done with32P-labeled U4- and scR1-specific primers. The scR1 RNA served as a negative controlsince it was synthesized by RNAP III as an uncapped transcript. The positions of alternatestart sites within upstream plasmid sequence and the U4 coding region were mapped tothe nucleotide (data not shown) and are indicated on the right. No IP, no immunopre-cipitation. B, sequence context of alternative SNR14 start sites, with non-template strandDNA aligned from �10 to �10 relative to the site of initiation.

Initiator Selection by Yeast RNAP II

14122 JOURNAL OF BIOLOGICAL CHEMISTRY VOLUME 281 • NUMBER 20 • MAY 19, 2006

by guest on October 24, 2020

http://ww

w.jbc.org/

Dow

nloaded from

Page 5: QuantitativeAnalysisof inVivo …YKS2 by the following manipulations. YKS2 was transformed with pRS313-SNR14 and pRS317-SUA7 by the lithium acetate procedure (25). His Lys clones were

is driven by a random collision of RNAP II or another initiator-bindingprotein with the DNA, we should detect roughly equal utilization of thetwo sites. If the start site is defined by its distance from factors boundupstream or by a polymerase scanning from the TATA box, we shoulddetect primarily upstream starts. In contrast, if the start site is measuredfrom factors bound to intragenic promoter elements, we should detectprimarily downstream starts (36).In fact, the upstream start site (�1u) is preferentially utilized (Fig. 4A,

lane 2), consistent with upstream recruitment and a directional scan-ning model. When the initiating non-template nucleotide of theupstream start is changed from A to T, the downstream start (�1d) ismore heavily utilized, confirming that the start sites are in competitionand that precise spacing from an upstream recruitment site is notrequired (Fig. 4A, lane 3). Changing the downstream start from A to Tessentially abolishes its usage entirely (Fig. 4A, lane 4). When both startsites are changed fromA toT, weak alternative sites are used at�8d and�7d (Fig. 4A, lane 5). Weak initiation at �8 and �7 is also observed inthe wild-type allele. Interestingly, initiation at �8u, the most upstreamobserved start site in the SNR14-StDup allele, does not increase whenthe �1u and �1d start sites are mutated. This finding is also consistent

with the directional scanning model, in which read-though of a siteshould not affect initiation at sites upstream.

Yeast Start Site Efficiency Is an Intrinsic, Quantifiable Property of anInitiator Sequence—To obtain a quantitative estimate of initiationsequence preference, we assumed that a homogeneous population ofinitiation-competent polymerases scan unidirectionally through the�8u to �7d interval of the SNR14-StDup allele in search of a goodmatch to the ideal initiator consensus. Start site efficiencies were calcu-lated by dividing the relative yield of a start site product by RNAP II fluxat that site. The relative yield of a transcript from a given start site wasdetermined by dividing its signal intensity by the total signal intensity ofproducts from all detectable start sites (�8u, �1u, �8d, �1d, �7d).RNAP II flux was defined as the relative number of polymerasesencountering a given start site andwas arbitrarily assigned a value of 100units at the �8u site. Because 2% of the U4 cDNA ends at the �8uposition, 2 units of RNAP II must have initiated at this site, and 98 unitscontinued to scan (assuming there is no loss of RNAP II except bydetectable initiation in the �8u to �7d interval). When start site effi-ciency is determined without considering flux, the �1u and �1d startsite efficiencies differ by about 10-fold (87% versus 8%). When RNAP II

FIGURE 4. A polar effect on SNR14 start siteselection supports a unidirectional scanningmodel. A, primer extension of U4 and scR1 RNAsin total cellular RNA isolated from strains contain-ing wild-type SNR14, SNR14-StDup, or mutantSNR14-StDup alleles. The SNR14-StDup allele (con-struct 2) was made by insertion of 14 bp of DNA(boxed region) between positions �1 and �2 ofwild-type SNR14 (construct 1), creating an overlap-ping 20-bp duplication (underlined). Mutationsare highlighted in gray and underlined. Arrowsshown above each sequence indicate the positionand relative abundance of in vivo 5� ends. Arrowsshown to the left of the lanes indicate SNR14-StDup transcription start sites: u, upstream site;d, downstream site. Lane T represents a sequenc-ing ladder generated from the #2 allele. B, calcu-lating start site efficiency for SNR14-StDup wildtype (panel A, lane 2). The relative yield of a tran-script from a given start site was determined bydividing its signal intensity by the total signalintensity of products from all detectable start sites(�8u, �1u, �8d, �1d, �7d). For simplicity, theefficiency of the �7d start site was assumed toequal 100%. RNAP II flux is defined as the relativenumber of polymerases encountering a givenstart site and was arbitrarily assigned an initialvalue of 100 units (U). C, calculating start site effi-ciency for SNR14-StDup A�1uT (panel A, lane 3).

Initiator Selection by Yeast RNAP II

MAY 19, 2006 • VOLUME 281 • NUMBER 20 JOURNAL OF BIOLOGICAL CHEMISTRY 14123

by guest on October 24, 2020

http://ww

w.jbc.org/

Dow

nloaded from

Page 6: QuantitativeAnalysisof inVivo …YKS2 by the following manipulations. YKS2 was transformed with pRS313-SNR14 and pRS317-SUA7 by the lithium acetate procedure (25). His Lys clones were

flux at the two positions is included in the calculation, the�1u and�1dstart site efficiencies are found to be equal at 89% (Fig. 4B). Even whenthe efficiency of the �1u site is reduced more than 10-fold by theA�1uT mutation, the efficiency of �1d and other downstream sitesremains about the same when flux is considered (Fig. 4C). These resultsindicate that initiator efficiency is an intrinsic property that is largelyindependent of start site position. We can, therefore, use this value toclassify initiator strength. For example, we can deduce that the �1 ini-tiator sequence of the U4 gene is a nearly perfect initiator, with anefficiency of 89–92%.

Mutations in SNR14-StDupDefine Preferred Sequences at�1 and�8of the Initiator—Having developed a quantitative assay for in vivo startsite selection, we next tested the sequence requirements for RNAP IIinitiation through site-directed mutagenesis of phylogenetically con-served base pairs in the SNR14 major initiator. In addition to the �1position, the nucleotide identities of positions�8,�7,�5,�4,�2, and�1 of SNR14 are conserved across the Saccharomyces genus (Fig. 1). Adouble transversion mutation at positions �8u, �7u (A3T) or �2u,�1u (C3G) of the SNR14-StDup allele significantly shifted initiationtoward downstream start sites, reducing efficiency of the �1u site byabout 5-fold (Fig. 5A, lanes 2 and 5). Upon separation of the �8u/�7udouble mutation into single point mutations, it became clear that the�8u mutation contributes more to the initiation defect than the �7umutation (Fig. 5A, lanes 3 and 4). A similar dissection of the �2u/�1udoublemutation showed that the change at position�1u accounted for

all of the downstream shift (Fig. 5A, lanes 6 and 7). Transversions at�5uor�4u on their own had little if any effect on start site selection (Fig. 5B,lanes 8 and 9).The base preference at positions �8u and �1u was explicitly tested

by creating all possible base substitutions. TheA-8uTmutation reducedusage of the �1u start site by about 2-fold, an effect that was slightlygreater than that observed for A-8uG and A-1uC (Fig. 5B, lanes 2-4).The C-1uG mutation reduced usage of the �1u start site by at least5-fold, an effect that was followed closely by C-1uA (Fig. 5B, lanes 5 and6). The C-1uT mutation had a very minor effect on start site selection(Fig. 5B, lane 7). Overall, it appears that the strength of the SNR14 startsite is dependent on a purine at �1, a pyrimidine at �1, and an adenineat �8.

There are other sequences within the RNAP II scanning windowbetween the SNR14TATAbox and�1 start site that resemble start sitesbut at which initiation does not efficiently occur. We predicted thatchanging these sites to match the bases preferred at positions �8, �7,and �1 would contribute to more efficient start site usage. Weak initi-ation occurs at �8u in a wild-type SNR14-StDup allele (Fig. 5C, lane 1).The C-16u/15uA or T-9uC mutations on their own increase usage ofthe �8u site by about 5- or 2-fold, respectively (Fig. 5C, lanes 2 and 3).Combining these mutations has an additive effect, resulting in a 10-foldincrease in �8u start site strength (Fig. 5C, lane 4). The optimization ofthe �8u start site toward a higher efficiency further demonstrates the

FIGURE 5. Mutations in SNR14-StDup define preferred sequences at �8 and �1 of the initiator. Shown is primer extension of U4 and scR1 RNAs in total cellular RNA isolated fromstrains containing wild-type SNR14-StDup or mutant SNR14-StDup alleles. Start site efficiencies were calculated as in Fig. 4 and are shown below each lane. A and B, initiator mutationsthat decrease �1u utilization. WT, wild type. C, initiator mutations that increase �1u utilization. An alignment of the �1u and �8u starts sites (with and without mutations) is shownon the right.

Initiator Selection by Yeast RNAP II

14124 JOURNAL OF BIOLOGICAL CHEMISTRY VOLUME 281 • NUMBER 20 • MAY 19, 2006

by guest on October 24, 2020

http://ww

w.jbc.org/

Dow

nloaded from

Page 7: QuantitativeAnalysisof inVivo …YKS2 by the following manipulations. YKS2 was transformed with pRS313-SNR14 and pRS317-SUA7 by the lithium acetate procedure (25). His Lys clones were

importance of the �8, �7, and �1 positions in SNR14 initiatorfunction.The efficiencies of 26 different wild-type and mutant yeast initiators

calculated from various SNR14-StDup alleles are shown in Table 1, withvalues ranging from about 89 to 4%. The wild-type SNR14 �1 site(A�1d, A�1u) is the most efficient, and divergence from this sequencereduces start site efficiency. Changes at the �8, �7, �1, and �1 posi-tions reduce start site efficiency anywhere from about 2- to 15-fold.Although the efficiency of the wild-type A-8u start was improved10-fold by changing the �8, �7, and �1 positions toward a more pre-ferred initiator sequence, it is interesting to note that the efficiency ofthis site is still around 2-fold less than the wild-type A�1 start site. The�8u and �8d start site efficiencies differ by 6-fold (4 versus 24%) evenwhen the flanking sequences are identical from �9 to �7, as is the casein the StDup-A�1uT allele (Table 1). Taken together, these results indi-

cate that there are positions other than �8, �7, �1, and �1 at whichnucleotide identity influences initiator efficiency.

Substitutions in the TFIIB B-finger Exacerbate the Effect of InitiatorMutations at �8 and �1—One possible cause of the start site selectiondefects exhibited by SNR14 initiator mutants is a disruption of directamino acid/nucleotide contact(s) made between a protein componentof the yeast PIC and the initiator. To analyze the role of TFIIB as thepotential trans-acting component of the PIC that interacts with theyeast initiator element, we generated a double knock-out strain that hasdisrupted chromosomal copies of the SNR14 and SUA7 (TFIIB) genesand carries wild-type copies of these genes on aURA3-marked plasmid.Standard plasmid shuffle protocols were used to introduce mutant alle-les of SNR14 and SUA7. The effect of TFIIB B-finger substitutions pre-viously shown to alter initiation on protein-coding genes (22–23) wastested in the context of the SNR14-StDup allele. For the most part,

TABLE 1Initiation efficiency of yeast RNAP II as a function of start site sequence

a Asterisks indicate alleles analyzed in Fig. 5C. Reverse text indicates positions divergent from the A�1 WT start site.b Efficiencies were calculated as described in Fig. 4 and are shown as an average in cases where n � 1.

Initiator Selection by Yeast RNAP II

MAY 19, 2006 • VOLUME 281 • NUMBER 20 JOURNAL OF BIOLOGICAL CHEMISTRY 14125

by guest on October 24, 2020

http://ww

w.jbc.org/

Dow

nloaded from

Page 8: QuantitativeAnalysisof inVivo …YKS2 by the following manipulations. YKS2 was transformed with pRS313-SNR14 and pRS317-SUA7 by the lithium acetate procedure (25). His Lys clones were

substitutions in residues 63, 64, 66, and 78 of TFIIB all caused a similarlymodest shift in transcription initiation from upstream to downstreamsites, reducing �1u start site efficiency by about 1.5-fold (Fig. 6A). Theeffect of theW63R substitutionwas less severe thanW63P and the otherTFIIB substitutions, consistent with what has been observed at theCYC1 and ADH1 genes (22). Overall, the effect of SUA7 mutations onSNR14 start site selection was not as dramatic as has been observed onsomemRNA genes, but this is not surprising given that the sensitivity ofgenes to TFIIB substitutions is known to be dependent on the sequenceimmediately upstream of the start site (23).The superimposition of the RNA-DNA hybrid from the RNAP II

transcribing complex structure upon a recent crystal structure of yeastRNAP II-TFIIB suggested that template-strandDNA is adjacent to con-served residues of the TFIIB B-finger domain (9). The points of closestcontact included nucleotides �6 to �8 (relative to the nucleotide addi-tion site at �1) and B-finger residues 62–66. These structurally pre-dicted contacts are consistent with results showing that the archaealTFIIB homologue cross-links to template DNA near the transcriptionstart site (37–38). We tested whether mutations in SUA7 geneticallyinteract with mutations in the initiator element at positions that arerequired for accurate start site selection. An analysis of �1u start siteefficiency in the context of the TFIIB R64A substitution revealed a dif-ference in the level of enhancement between �8 and �1 initiator

mutants, suggesting that the effect of R64A on start site selection isinfluenced by the initiator sequence (Fig. 6B). The C-1uG/R64Amutantexhibited a 2-fold reduction in �1u efficiency relative to C-1uG alone,similar to the effect of the R64A substitutionwith thewild-type�1u site(1.5-fold). In contrast, theA-8uT/R64Amutant exhibited about a 4-foldreduction in �1u efficiency relative to A-8uT alone. The differentiallevel of sensitivity of�1u variants to R64Awas not observed at the�1dsite, consistent with the fact that A-8uT and C-1uG specifically alterusage of the upstream site in SNR14-StDup. Of the remaining start sitestested,�8d was very sensitive to R64A (9-fold reduction), whereas�5dwas relatively insensitive. Overall, these data support the notion thatTFIIB B-finger substitutions affect RNAP II start site selection in aman-ner dependent on the sequence of the initiator, an element that extendsupstream to at least the �8 position.

DISCUSSION

The synthesis of non-coding RNAs, especially snRNAs and snoR-NAs, puts strong demands on the accuracy and efficiency of transcrip-tion initiation by RNAP II. The transcription start site of such RNAsusually corresponds to a unique mature 5� end, and its precise place-ment may be required for optimal RNA function. Yeast snRNAs andsnoRNAs typically have steady-state levels of hundreds of copies percell, so their genes must be actively transcribed. Non-coding RNA genepromoters are, therefore, interesting subjects for study of the optimalsequences for directing initiation by RNAP II. Herewe provide evidencethat the S. cerevisiae U4 snRNA gene, SNR14, fulfills these stringentrequirements by coupling a consensus TATA box with a nearly perfectinitiator element. Furthermore, the DNA between the TATA box andinitiator is devoid of initiator-like sequences that might divert RNAP IIfrom the proper start site as it scans downstream from the TATA box.The differential utilization of duplicated initiator elements in artificialvariants of the SNR14 promoter strongly supports the scanning modelof start site selection byRNAP II in budding yeast and demonstrates thatinitiator element efficiency is an intrinsic property dependent primarilyon the sequence at positions �8, �7, �1, and �1 relative to the startsite.

Architecture of the SNR14 Promoter—In terms of their general pro-moter architecture, yeast snRNA genes bear a strong resemblance tomRNAgenes both in the position and function of their core elements. Inagreement with their observed roles in yeast mRNA genes, the con-served TATA box and initiator elements of the SNR14 promoter pri-marily influence RNAP II transcription efficiency and accuracy, respec-tively. Amore distinctive feature of the SNR14 promoter is the presenceof a conserved T-stretch and putative Abf1 binding site. Abf1 sites havepreviously been identified upstream of T-rich stretches in the promot-ers of ribosomal protein genes (39) and snoRNA genes (40). Bothmicroarray-based readout of chromatin immunoprecipitation (ChIP-chip) and protein binding to microarrays (PBM) have identified SNR14as a target gene for Abf1 (33, 41). The similarity in upstream sequencesbetween genes encoding components of the spliceosome, ribosome, andRNA modification machinery raises the possibility of coordinate regu-lation of these activities at the level of transcription.In one orientation of a plasmid-borne SNR14 allele, deletion of the

putative Abf1 site and T-stretch led to a 2-fold reduction in U4 RNAlevel, suggesting a potential role for one or both of these elements intranscription efficiency. In the context of the ribosomal protein-codinggene RPS28A, a mutation that destroys Abf1 binding in vitro reducedtranscription by 10-fold, whereas substitutions in the T-rich elementreduced transcription by 2-fold (39). Other potential functions for Abf1within the SNR14 promoter (e.g. genome partitioning, nucleosome

FIGURE 6. Substitutions within the TFIIB B-finger exacerbate the effect of mutationsat �8 and �1. Primer extension of U4 and scR1 RNAs in total cellular RNA isolated fromstrains containing plasmid-borne alleles of wild-type (A) or mutant (B) SNR14-StDup andwild-type or mutant SUA7 (TFIIB). Start site efficiencies were calculated as in Fig. 4, exceptthat �7d and a site further downstream (indicated with asterisks) were also included inthe total signal intensity.

Initiator Selection by Yeast RNAP II

14126 JOURNAL OF BIOLOGICAL CHEMISTRY VOLUME 281 • NUMBER 20 • MAY 19, 2006

by guest on October 24, 2020

http://ww

w.jbc.org/

Dow

nloaded from

Page 9: QuantitativeAnalysisof inVivo …YKS2 by the following manipulations. YKS2 was transformed with pRS313-SNR14 and pRS317-SUA7 by the lithium acetate procedure (25). His Lys clones were

organization) would likely require amore native chromosomal environ-ment than was provided in our study (42). The potential role for Abf1 ingenome partitioning of this region is made more interesting given theobservation that the sites of SEC3 transcription termination and SNR14PIC assembly overlap.

Refinement of the Yeast RNAP II Initiator Consensus Sequence—Thepreference for particular nucleotides at positions�1 and�1 of the yeastinitiator has been widely reported in the literature. Although a YA ini-tiator consensus applies well to many documented yeast start sites, it istoo minimal to have predictive value. A point mutation analysis of anRRYRR consensus initiator from the TRP4 gene revealed that the cen-tral pyrimidine and at least one of the 3�-flanking purine nucleotideswere essential but alone insufficient to define a functional initiator ele-ment (43). Previous reports have identified regions immediately 5� ofthe original yeast initiator consensus that influence start site selection.Maicas and Friesen (44) identified a region centered at�9 of the TCM1gene and 95 other mRNA genes that they termed the “locator.” Thelocator was defined as a region where the base composition of the non-template strand sharply switched from a preponderance of thymineresidues to predominantly adenine residues. Rathjen and Mellor (45)identified a region from �10 to �4 (ACAGATC) of the major PGK1start site as a “determinator” element. Deletion of the determinatorresulted in a loss of initiation from the normal start site and increaseduse of more downstream sites. Healy and Zitomer (46) were able toshow that insertion of CAAG upstream of the CYC7 gene could directinitiation at a site at which it did not normally occur, and it is interestingto note that their insertion also introduced an adenine at the �8 posi-tion on the non-template strand. Our genetic evidence supporting apreference for adenine at positions �8 and �7 of the SNR14 initiatorexpands the older yeast initiator consensus and can account for theearlier observations described above for the TRP4, TCM1, PGK1, andCYC7 genes.The general importance of the �8, �7, �1, and �1 positions for

initiator efficiency across all RNAP II-transcribed yeast genes is sup-ported by a recent bioinformatics study that compared 4637 yeast tran-scription start sites. Sequence alignment produced the consensusA(Arich)5NYA(A/T)NN(Arich)6, where the underlined A is the initiationsite (15). This yeast initiator consensus is more expansive than thatreported for higher organisms like Drosophila TCA(G/T)T(T/C) ormammals YYAN(T/A)YY (4). Although they all share theminimal con-sensus YA, the latest evidence suggests that yeast initiator sequencepreferences extend beyond �1/�1 to include at least 8 nucleotidesupstream and downstream. Zhang andDietrich (15) could not concludewhether the A-richness of the yeast initiator consensus sequence wasimportant for transcription initiation or a consequence of some otheraspect of genome structure. Here, we present direct evidence indicatingthat the adenine at position �8 and to a lesser extent the adenine at �7contributes to the functionality of the yeast initiator as a start site deter-minant. The fact that substitution of any base besides adenine at �8significantly decreases start site efficiency suggests that the functionalimpairment is notmerely related to themelting potential of an A:T basepair. Rather, it suggests that the�8 position is recognized in a sequence-specific manner.How the yeast initiator sequence determines start site usage is as yet

unknown. It seems likely that the initiator is recognized by a proteincomponent of a scanning pre-initiation complex. Mutations that alteryeast transcription start site selection have been identified in numerousprotein components of the PIC, including RNAP II (Rpb1, Rpb2, Rpb9),TFIIB, andTFIIF (20–21, 47–52). Of these proteins, Rpb1, Rpb2, TFIIB,

and TFIIF (Tfg1, Tfg2) have also been cross-linked to DNA at or near atranscription start site (53).Faitar et al. (23) determined that mutations in yeast initiator

sequences genetically interact with substitutions in the TFIIB B-finger,making certain start sites more or less sensitive to downstream shifts intranscription start site selection. Specifically, they showed that among aset of mutations made from positions �6 to �5 of an ADH1 initiator,changes at �2, �1, and �2 significantly increased or decreased thesensitivity of the �1 transcription start site to TFIIB-V79L. Here, wepresent evidence supporting a genetic interaction between the�8 posi-tion of the SNR14 initiator with the TFIIB B-finger, expanding our viewof what constitutes an initiator element and where potential protein-DNA interactions may occur. Furthermore, the fact that the �8d startsite is highly sensitive to the TFIIB-R64A substitution in both theStDup-A-8uT and -C-1uG alleles whereas the �5d start site is essen-tially insensitive in both contexts indicates that sensitivity correlateswith sequence and not simply initiator strength.

Implications for a Scanning Model of RNAP II Start Site Selection—Ascanning model is currently the best supported explanation for howyeast start sites are selected, but experiments directly testing the basicimplications of themodel are scarce in the literature.We constructed anallele of SNR14 with tandemly duplicated start sites as a means to testthe yeast scanning model in both a qualitative and quantitative fashion.We observed that although the upstream start site had a higher relativeyield than the downstream start site, a reduced level of RNAP II flux fullyaccounted for the lower relative yield from the downstream site. RNAPII flux is an inherent property of a unidirectional scanning model foryeast transcription initiation. Thus, the fact that RNAP II flux can beused to resolve the observed differences in relative utilization of twoidentical start sites is in itself strong support for themodel. The fact thatour estimations of RNAP II flux so closely agree with shifts in utilizationof start sites 14–22 bp apart in response to mutations suggests thatRNAP II scanning is reasonably processive. A scanning mechanism ofstart site selection requires RNAP II to be processive to accommodatethe large and variable distances between yeast TATAboxes and initiatorelements. It will be interesting to test the limits of processive scanningby RNAP II. The identification of initiators with a range of efficiencies(Table 1) should aid such studies.The yeast initiator sequence consensus is readily apparent amongRNAP

II-transcribed snRNA genes (SNR14, SNR19, SNR20, SNR7) and snoRNAgenes. A Weblogo alignment (54) using a pool of 22 yeast snRNA andsnoRNA transcription start sites results in an initiator consensus very sim-ilar to that reported for mRNA genes, A(Arich)3NNYYA(A/T)N(Arich)2.Given that yeast RNAP II transcribes all mRNA genes and most snRNAand snoRNA genes, it is expected that the cis-acting sequence require-ments for their start site selection would be similar. However, the iden-tification of a strong match to the initiator consensus from such a smallsample size of snRNA/snoRNA genes suggests a basis for why yeastsnRNA/snoRNA transcription initiation ismuchmore precise than thatof mRNA genes. Simply stated, genes with a highly efficient initiatorhave fewer start sites because a smaller population of polymerases isavailable to scan further downstream.Nagawa and Fink (55) touched onthis idea when they suggested that one reason that yeast genesHIS1 andCYC1 have multiple weak start sites is because they lack a strong startsite.In addition to snRNA genes evolving highly efficient initiators, the

regions between snRNATATAboxes and coding regionsmay have alsoundergone negative selection against the initiator consensus to mini-mize the usage of non-optimal start sites. Consistent with this idea, ananalysis of all 10 potential YR start sites within the RNAP II scanning

Initiator Selection by Yeast RNAP II

MAY 19, 2006 • VOLUME 281 • NUMBER 20 JOURNAL OF BIOLOGICAL CHEMISTRY 14127

by guest on October 24, 2020

http://ww

w.jbc.org/

Dow

nloaded from

Page 10: QuantitativeAnalysisof inVivo …YKS2 by the following manipulations. YKS2 was transformed with pRS313-SNR14 and pRS317-SUA7 by the lithium acetate procedure (25). His Lys clones were

window between the SNR14 TATA box and the �1 start site revealedthat none contained the preferred adenine at the �8 or �7 positions.The driving force behind this proposed kind of snRNA promoter evo-lution would be to increase the functional capacity of the RNA geneproducts, which are essential for yeast viability. snRNA genes encodestructural RNAs that require precise 5� ends for their function. Forexample, the 5� end of yeast U4 RNA engages in base-pairing interac-tions with the yeast U6 RNA during the splicing cycle. In contrast,mRNA transcripts contain 5�-untranslated regions that typically haveno precise length requirement for proper expression. Consequently, the5� end ofmRNAs typically need not be formed in as precise of fashion asthose of snRNAs, and their gene promoterswould likely not have under-gone the same type of evolutionary selection.The study of transcription initiation on yeast non-coding RNA genes

has provided useful insight into the fundamental process by whichRNAP II initiates RNA synthesis, particularly with regard to its accu-racy. Additional genetic, biochemical, and structural studies are neces-sary to elucidate the underlying mechanism by which both initiatorDNA and PIC proteins function in the process of RNAP II transcriptionstart site selection.

Acknowledgments—We thank E. Craig, P. James, C. Guthrie, K. Shannon, andA. Ponticelli for providing plasmids and/or yeast strains and R. Landick, V.Gerlach, and current members of the Brow laboratory for discussions.

REFERENCES1. Orphanides, G., Lagrange, T., and Reinberg, D. (1996) Genes Dev. 10, 2657–26832. Hampsey, M. (1998)Microbiol. Mol. Biol. Rev. 62, 465–5033. Hahn, S. (2004) Nat. Struct. Mol. Biol. 11, 394–4034. Smale, S. T., and Kadonaga, J. T. (2003) Annu. Rev. Biochem. 72, 449–4795. Buratowski, S., Hahn, S., Sharp, P. A., and Guarente, L. (1988) Nature 334, 37–426. Nakajima, N., Horikoshi, M., and Roeder, R. G. (1988)Mol. Cell. Biol. 8, 4028–40407. Buratowski, S., Hahn, S., Guarente, L., and Sharp, P. A. (1989) Cell 56, 549–5618. Leuther, K. K., Bushnell, D. A., and Kornberg. R. D. (1996) Cell 85, 773–7799. Bushnell, D. A., Westover, K. D., Davis, R. E., and Kornberg, R. D. (2004) Science 303,

983–98810. Struhl, K. (1989) Annu. Rev. Biochem. 58, 1051–107711. Giardina, C., and Lis, J. T. (1993) Science 261, 759–76212. Hahn, S., Hoar, E. T., and Guarente, L. (1985) Proc. Natl. Acad. Sci. U. S. A. 82,

8562–856613. Healy, A. M., Helser, T. L., and Zitomer, R. S. (1987)Mol. Cell. Biol. 7, 3785–379114. Furter-Graves, E. M., and Hall, B. D. (1990)Mol. Gen. Genet. 223, 407–41615. Zhang, Z., and Dietrich, F. S. (2005) Nucleic Acids Res. 33, 2838–285116. Smale, S. T., and Baltimore, D. (1989) Cell 57, 103–11317. Sakurai, H., Ohishi, T., and Fukasawa, T. (1994)Mol. Cell. Biol. 14, 6819–682818. Ohishi-Shofuda, T., Suzuki, Y., Yano, K., Sakurai, H., and Fukasawa, T. (1999) Bio-

chem. Biophys. Res. Commun. 255, 157–16319. Li, Y., Flanagan, P. M., Tschochner, H., and Kornberg, R. D. (1994) Science 263,

805–807

20. Pinto, I., Ware, D. E., and Hampsey, M. (1992) Cell 68, 977–98821. Pinto, I., Wu, W. H., Na, J. G., and Hampsey, M. (1994) J. Biol. Chem. 269,

30569–3057322. Pardee, T. S., Bangur, C. S., and Ponticelli, A. S. (1998) J. Biol. Chem. 273,

17859–1786423. Faitar, S. L., Brodie, S. A., and Ponticelli, A. S. (2001)Mol. Cell. Biol. 21, 4427–444024. Shannon, K. W., and Guthrie, C. (1991) Genes Dev. 5, 773–78525. Gietz, R. D., Schiestl, R.H.,Willems, A. R., andWoods, R. A. (1995)Yeast 11, 355–36026. Boeke, J. D., Trueheart, J., Natsoulis, G., and Fink, G. R. (1987)Methods Enzymol. 154,

164–17527. Hull, M. W., Thomas, G., Huibregtse, J. M., and Engelke, D. R. (1991) Methods Cell

Biol. 35, 383–41528. Wise, J. A. (1991)Methods Enzymol. 194, 405–41529. Kwan, S., Gerlach, V. L., and Brow, D. A. (2000) RNA 6, 1859–186930. Bochnig, P., Reuter, R., Bringmann, P., and Luhrmann, R. (1987) Eur. J. Biochem. 168,

461–46731. Martin, M. P., Gerlach, V. L., and Brow, D. A. (2001)Mol. Cell. Biol. 21, 6429–643932. Cliften, P., Sudarsanam, P., Desikan, A., Fulton, L., Fulton, B., Majors, J., Waterston,

R., Cohen, B. A., and Johnston, M. (2003) Science 301, 71–7633. Mukherjee, S., Berger, M. F., Jona, G., Wang, X. S., Muzzey, D., Snyder, M., Young,

R. A., and Bulyk, M. L. (2004) Nat. Genet. 36, 1331–133934. Beinoraviciute-Kellner, R., Lipps, G., and Krauss, G. (2005) FEBS Lett. 579,

4535–454035. Shuman, S. (1997) Proc. Natl. Acad. Sci. U. S. A. 94, 12758–1276036. Gerlach, V. L., Whitehall, S. K., Geiduschek, E. P., and Brow, D. A. (1995) Mol. Cell.

Biol. 15, 1455–146637. Bartlett, M. S., Thomm, M., and Geiduschek, E. P. (2004) J. Biol. Chem. 279,

5894–590338. Renfrow, M. B., Naryshkin, N., Lewis, L. M., Chen, H. T., Ebright, R. H., and Scott,

R. A. (2004) J. Biol. Chem. 279, 2825–283139. Lascaris, R. F., Groot, E., Hoen, P. B., Mager, W. H., and Planta, R. J. (2000) Nucleic

Acids Res. 28, 1390–139640. Qu, L. H., Henras, A., Lu, Y. J., Zhou, H., Zhou, W. X., Zhu, Y. Q., Zhao, J., Henry, Y.,

Caizergues-Ferrer, M., and Bachellerie, J. P. (1999)Mol. Cell. Biol. 19, 1144–115841. Lee, T. I., Rinaldi, N. J., Robert, F., Odom,D.T., Bar-Joseph, Z., Gerber,G. K.,Hannett,

N. M., Harbison, C. T., Thompson, C. M., Simon, I., Zeitlinger, J., Jennings, E. G.,Murray, H. L., Gordon, D. B., Ren, B.,Wyrick, J. J., Tagne, J. B., Volkert, T. L., Fraenkel,E., Gifford, D. K., and Young, R. A. (2002) Science 298, 799–804

42. Fourel, G., Miyake, T., Defossez, P. A., Li, R., and Gilson, E. (2002) J. Biol. Chem. 277,41736–41743

43. Mosch, H. U., Graf, R., and Braus, G. H. (1992) EMBO J. 11, 4583–459044. Maicas, E., and Friesen, J. D. (1990) Nucleic Acids Res. 18, 3387–339345. Rathjen, J., and Mellor, J. (1990) Nucleic Acids Res. 18, 3219–322546. Healy, A. M., and Zitomer, R. S. (1990) Curr. Genet. 18, 105–10947. Hekmatpanah, D. S., and Young, R. A. (1991)Mol. Cell. Biol. 11, 5781–579148. Furter-Graves, E. M., Hall, B. D., and Furter, R. (1994) Nucleic Acids Res. 22,

4932–493649. Hull, M. W., McKune, K., and Woychik, N. A. (1995) Genes Dev. 9, 481–49050. Sun, Z. W., and Hampsey, M. (1995) Proc. Natl. Acad. Sci. U. S. A. 92, 3127–313151. Chen, B. S., and Hampsey, M. (2004)Mol. Cell. Biol. 24, 3983–399152. Ghazy, M. A., Brodie, S. A., Ammerman, M. L., Ziegler, L. M., and Ponticelli, A. S.

(2004)Mol. Cell. Biol. 24, 10975–1098553. Chen, B. S., Mandal, S. S., and Hampsey, M. (2004) Biochemistry 43, 12741–1274954. Crooks, G. E., Hon, G., Chandonia, J. M., and Brenner, S. E. (2004) Genome Res. 14,

1188–119055. Nagawa, F., and Fink, G. R. (1985) Proc. Natl. Acad. Sci. U. S. A. 82, 8557–8561

Initiator Selection by Yeast RNAP II

14128 JOURNAL OF BIOLOGICAL CHEMISTRY VOLUME 281 • NUMBER 20 • MAY 19, 2006

by guest on October 24, 2020

http://ww

w.jbc.org/

Dow

nloaded from

Page 11: QuantitativeAnalysisof inVivo …YKS2 by the following manipulations. YKS2 was transformed with pRS313-SNR14 and pRS317-SUA7 by the lithium acetate procedure (25). His Lys clones were

Jason N. Kuehner and David A. BrowSupports a Scanning Model

Initiator Selection by Yeast RNA Polymerase IIin VivoQuantitative Analysis of

doi: 10.1074/jbc.M601937200 originally published online March 29, 20062006, 281:14119-14128.J. Biol. Chem. 

  10.1074/jbc.M601937200Access the most updated version of this article at doi:

 Alerts:

  When a correction for this article is posted• 

When this article is cited• 

to choose from all of JBC's e-mail alertsClick here

  http://www.jbc.org/content/281/20/14119.full.html#ref-list-1

This article cites 55 references, 30 of which can be accessed free at

by guest on October 24, 2020

http://ww

w.jbc.org/

Dow

nloaded from