2014 - small proteins can no longer be ignored

Upload: jason-parsons

Post on 16-Feb-2018

215 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    1/28

    Small Proteins Can NoLonger Be Ignored

    Gisela Storz,1 Yuri I. Wolf,2

    and Kumaran S. Ramamurthi3

    1Cell Biology and Metabolism Branch, Eunice Kennedy Shriver National Institute oHealth and Human Development, National Institutes of Health, Bethesda,Maryland 20892-5430; email: [email protected]

    2National Center for Biotechnology Information, National Library of Medicine, NaInstitutes of Health, Bethesda, Maryland 20892-3830

    3Laboratory of Molecular Biology, National Cancer Institute, National Institutes of Bethesda, Maryland 20892-5132; email: [email protected]

    Annu. Rev. Biochem. 2014. 83:75377

    First published online as a Review in Advance onMarch 3, 2014

    TheAnnual Review of Biochemistryis online atbiochem.annualreviews.org

    This articles doi:10.1146/annurev-biochem-070611-102400

    Copyright c2014 by Annual Reviews.All rights reserved

    Keywords

    membrane, cell division, sporulation, transport, signal transductio

    protein

    Abstract

    Small proteins, here defined as proteins of 50 amino acids or few

    the absence of processing, have traditionally been overlooked du

    challenges in their annotation and biochemical detection. In the

    several years, however, increasing numbers of small proteins have

    identified either through the realization that mutations in interg

    regions are actually within unannotated small protein genes or thro

    the discovery that some small, regulatory RNAs encode small prot

    These insights, together with comparative sequence analysis, ind

    that tens if not hundreds of small proteins are synthesized in a gorganism. This review summarizes what has been learned abou

    functions of several of these bacterial small proteins, most of which

    at themembrane,illustratingthe astonishing range of processesin w

    these small proteins act and suggesting several general conclusions

    portant questions for future studies of these overlooked protein

    also discussed.

    753

    Click here for quick links to

    Annual Reviews content online,

    including:

    Other articles in this volume

    Top cited articles

    Top downloaded articles

    Our comprehensive search

    FurtherANNUAL

    REVIEWS

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    2/28

    Small protein:an experimentallydetected shortpolypeptide, heredefined as fewer than50 amino acids inlength

    Contents

    INTRODUCTION . . . . . . . . . . . . . . . . . . 754

    EXAMPLES OF SMALL PROTEIN

    F U N C T I O N . . . . . . . . . . . . . . . . . . . . . . 7 5 5

    Small Proteins That Affect Spore

    Formation: SpoVM and CmpA . . 755

    Small Proteins That Affect Cell

    Division: MciZ, SidA, and Blr . . . 757

    Small Protein Regulators of

    Transport: KdpF, AcrZ, and

    S g r T . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 5 9

    Small Protein Regulators of

    Membrane-Bound Enzymes:

    CydX, PmrR, and MgtR . . . . . . . . 761

    Small Protein Regulators of Protein

    Kinases and Signal Transduction:

    MgrB and Sda . . . . . . . . . . . . . . . . . . 762

    Small Protein Chaperones: MntS,

    FbpB, and FbpC . . . . . . . . . . . . . . . . 764

    FUNDAMENTAL QUESTIONS . . . . 765

    Further Identification of Small

    Proteins and the Elucidation of

    Their Functions . . . . . . . . . . . . . . . . 765

    Mechanisms of Action . . . . . . . . . . . . . . 766

    Synthesis, Subcellular Localization,

    and Degradation.. . . . . . . . . . . . . . . 767

    Evolution . . . . . . . . . . . . . . . . . . . . . . . . . . 768

    BROADER VIEW . . . . . . . . . . . . . . . . . . . 770

    Small Proteins Encoded byBacteriophages . . . . . . . . . . . . . . . . . 770

    Small Proteins Synthesized by

    Eukaryotes . . . . . . . . . . . . . . . . . . . . . 770

    Potential for Exploiting Small

    P r o t e i n s . . . . . . . . . . . . . . . . . . . . . . . . 7 7 1

    CONCLUSIONS . . . . . . . . . . . . . . . . . . . . 771

    INTRODUCTION

    Whereas the activities and structures of hun-

    dreds of thousands of proteins have been

    studied in exquisite detail, one class of pro-

    teins, small proteins, has largely been ignored.

    Small proteins are polypeptides that are en-

    coded by small open reading frames (ORFs). As

    opposed to the term peptideswhich m

    refer to (a) intrinsically unordered polype

    tides regardless of size, (b) smaller polype

    tides that arise from proteolytic processing

    a larger precursor (such as leader peptides

    or (c) polypeptides that are synthesized by

    ribosome-independent mechanism (in vitro,f

    example)small proteins are defined here those proteins that acquire their diminutive si

    directly by translation of a small ORF.

    Although defining a size limit for wh

    qualifies as small may seem arbitrary, the g

    nomic age has already, perhaps unwitting

    defined such a limit. For example, soon a

    ter the yeast genome was sequenced, an a

    bitrary minimum cutoff of 100 codons w

    applied to annotate putative ORFs (1). Th

    decision was more than simply a matter

    convenience: Had such a cutoff not been aplied, annotating every theoretical ORF b

    tween 2 and 99 codons would have resulte

    in 260,000 additional ORFs (2)! Accordingl

    a similar minimum cutoff of 100 codons h

    since been applied in the annotation of mo

    eukaryotic genome sequences. In bacteria, ow

    ing to smaller genome size, the arbitrary cuto

    is generally shorter but can nonetheless lead

    the exclusion of bona fide small proteincodin

    genes. Largely in keeping with these practice

    GenBank, the genetic sequence database of thNational Institutes of Health, presently do

    not accept submissions of individual sequenc

    with a length of less than 200 nucleotides, co

    responding to approximately 66 codons (

    Surprisingly, this exclusion even extends

    newlydiscovered,previously unannotated sm

    ORFs that demonstrably produce a prote

    whose function is experimentally elucidate

    Thus, proteins arising from small ORFs th

    are substantially smaller than 100 codons a

    typically ignored.In addition to the bioinformatic challeng

    associated with identifying and catalogi

    small proteins and small ORFs, the functio

    of small proteins are often difficult to identif

    As such, small proteins, even those that parti

    ipate in very well studied pathways, may elu

    754 Storz Wolf Ramamurthi

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    3/28

    Small open reaframe (ORF): sequence with thpotential to encsmall protein

    Sporulation:

    a developmentaprogram in somgram-positive bin which a celldifferentiates inhardy, dormant termed an endo(spore)

    discovery for many years. Mutations in genes

    encoding most small proteins characterized

    thus far typically do not result in an obvious

    phenotype on their own; thus, these genes

    are often not identified in screens for loss-of-

    function mutants. Additionally, even if such

    mutations in intergenic regions are successfully

    isolated, genes encoding small proteins run therisk of being ignored by researchers once the

    mutation is mapped if the small ORF is not

    annotated, an issue that is compounded if the

    ORF initiates with either a GTG or a TTG

    start codon. Complicating the matter further,

    classical biochemical experiments that aim to

    identify proteins that copurify with a molecule

    of interest typically employ methods that miss

    small proteins. For example, small proteins

    that may have copurified are simply run off the

    gel and are consequently not detected if thegel system employed is not optimized to detect

    proteins that are less than 5 kDa in size.

    Here we highlight the diverse functions of

    several bacterial small proteins that were dis-

    covered serendipitously by biochemical or ge-

    netic methods or whose functions were eluci-

    dated after their corresponding small ORF was

    identified by using bioinformatic approaches.

    For brevitys sake, we limit our discussion to

    polypeptides that are encoded by small ORFs

    thatcontain50orfewercodonsandareencodedas part of operons or as stand-alone genes. In an

    effort to focus on these ignored proteins, we do

    not discuss polypeptides that obtain their small

    size by proteolytic processing of larger pre-

    cursors (peptides, according to our definition),

    such as signaling molecules and leader peptides.

    We also omit the increasing number of identi-

    fied small ribosomal proteins (4), small secreted

    toxins, small proteins encoded by regulatory 5

    leader sequences and by toxin-antitoxin systems

    (reviewed in Reference 5), and small proteinsencoded by phage genomes (6) or by prophage-

    like regions of bacterial genomes (7). Despite

    these arbitrary limitations, there aremany small

    proteins left to discuss in detail here, and even

    more have been detected and still await charac-

    terization to determine their functions.

    SpoVM, CmpA

    MciZ

    CydX, PmrR, MgtR

    KdpF, AcrZ

    MgrBSda

    MntS, FbpB,FbpC

    SgrT

    SidA, Blr

    Plasma membrane

    Cytosol

    Figure 1

    Sites of small protein action. The cytosol of a composite gram-positive ogram-negative bacterium bounded by the plasma membrane (light gray) shown. The outer forespore membrane during sporulation is depicted ingray. Proteins associated with various cell functions are colored as followkinases, green; transporters, red; membrane-bound enzymes, blue; cell dseptum, yellow; forespore during sporulation, orange; soluble chaperonepurple. Small proteins are depicted as rectangles. Transmembrane smallproteins are depicted as rectangles that traverse the plasma membrane,amphipathic helical small proteins that are peripherally membrane assocare drawn as rectangles that are parallel to the plane of the membrane, asoluble small proteins are shown in the cytosol (pink).

    EXAMPLES OF SMALLPROTEIN FUNCTION

    As we begin our discussion of some of the

    best-characterized small proteins, we note that

    they participate in diverse cellular functions

    ranging from morphogenesis and cell division

    to transport, enzymatic activities, regulatorynetworks, and stress responses (Figure 1).

    Therefore, small proteins not only may provide

    insight into how biological functions may be

    carried out with very few amino acids, but also

    may be used as tools to probe how their larger

    interacting proteins participate in various

    cellular processes.

    Small Proteins That Affect SporeFormation: SpoVM and CmpA

    Endospore (hereafter simply spore) formation

    (sporulation) in gram-positive Bacillus subtilis

    is triggered when the cell senses the imminent

    onset of starvation, whereupon the rod-shaped

    bacterium elaborates a roughly spherical,

    double-membrane-bound organelle termed

    www.annualreviews.org Bacterial Small Proteins 755

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    4/28

    Spore coat:the complex,proteinaceous shellsurrounding bacterialendospores

    Spore cortex: aninner shell, made ofpeptidoglycan,surrounding bacterialendospores. Along

    with the coat, itconfers resistance tothe spore againstenvironmental insults

    the forespore that contains a copy of the

    genetic material (reviewed in Reference 8).

    The outer cell (termed the mother cell) then

    deposits a thick protein shell (the spore coat)

    onto the surface of the forespore and con-

    structs a peptidoglycan shell (the spore cortex)

    between the two membranes that encapsulates

    the forespore. Eventually, the forespore coredehydrates and becomes largely metabolically

    inactive, at which time the mother cell lyses,

    releasing the now dormant spore into the

    environment (reviewed in Reference 9).

    SpoVM. Classically, genes in which mutations

    arrest sporulation at defined morphological

    stages are named spo genes and are readily

    identified because loss-of-function mutations

    in them result in a dramatic decrease in the

    ability of sporulating cells to withstand extremeconditions (high heat and caustic chemicals,

    for example). This strong phenotype led to the

    identification of one of the first reported bac-

    terial genes encoding a small protein (10, 11).

    spoVMis a monocistronic locus that encodes a

    26-amino-acid protein that is exclusively con-

    served among endospore-forming species (12).

    Deletion of the gene results in a six-log de-

    crease in sporulation efficiency due to massive

    defects in spore coat and cortex morphogenesis

    (11). Like many small proteins identified thusfar (13), SpoVM is a membrane-associated

    protein. During sporulation, SpoVM is pro-

    duced only in the mother cell and localizes

    on the forespore surface (14). Unlike many

    other small proteins described below, however,

    SpoVM is not an integral membrane protein.

    Instead, in the presence of lipid bilayers,

    SpoVM assumes an -helical conformation

    that displays a striking amphipathicity (15),

    which drives it to orient itself parallel to the

    plane of the plasma membrane such that itshydrophobic face embeds into the lipid bilayer

    and its positively charged face is exposed to

    the mother cell cytosol (16). SpoVM therefore

    spontaneously inserts into membranes without

    the need for protein insertion machinery.

    What is the basis for the strong sporulation

    phenotype resulting from the absence of

    such a small protein? Despite its diminutiv

    size, SpoVM performs at least four distin

    functions during sporulation. First, SpoV

    is among the first coat proteins that locali

    to the surface of the developing forespo

    to mark this membrane as the site for futu

    coat assembly. Recent evidence suggests th

    SpoVM preferentially embeds in slightconvex, or positively curved, membran

    Because the surface of the forespore is the on

    convex membrane surface in the mother ce

    cytosol (all the other surfaces are concav

    this ability to discriminate between degre

    of membrane curvature drives the prop

    localization of SpoVM to its subcellul

    position (17). The introduction of amino ac

    substitutions that disrupt the ability of SpoVM

    to recognize membrane curvature results in th

    promiscuous localization of SpoVM and, asconsequence, the misassembly of the spore co

    at incorrect locations in the mother cell cytos

    (17).

    Second, upon arriving at the forespo

    surface, SpoVM recruits an ATPase term

    SpoIVA, which is the structural compone

    of the basement layer of the spore coat (18

    20). Interaction with SpoIVA requires at lea

    one amino acid residue on the charged face

    SpoVM that faces the mother cell (16). Di

    ruption of this residue abrogates the interactiobetween SpoVM and SpoIVA and results in t

    misassembly of the spore coat.

    The third function of SpoVM during spor

    lation is not well understood but stems from t

    observations that SpoVM is a competitive i

    hibitor of the membrane-bound protease Fts

    (21) and shares a limited amino acid sequen

    homology with the cIII protein of bacteri

    phage lambda (22), another small protein th

    inhibits FtsH in gram-negative Escherichia c

    (23). Although FtsH is required for entry insporulation (24), the role that SpoVM may pl

    in inhibiting FtsH later during the sporulatio

    program is not known. Nonetheless, the abili

    of SpoVM to inhibit this protease has been e

    ploitedas a tool by which to understand the ro

    of FtsH during other cellular processes such

    biofilm formation (25).

    756 Storz Wolf Ramamurthi

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    5/28

    sRNA: small,regulatory RNA

    GFP: greenfluorescent prot

    Divisome: the

    given to the baccell divisionmachinery; iscomposed ofapproximately tproteins

    FtsZ: a tubulinhomolog that isin most bacteriathat is responsibmembraneconstriction durcell division

    The fourth function of SpoVM concerns

    a sporulation phenomenon that was first de-

    scribed approximately 40 years ago, when re-

    searchers observed that mutations inB. subtilis

    that abrogate the initiation of assembly of the

    coat (the outer proteinaceous shell surrounding

    the spore) also abrogate assembly of the cortex

    (the inner peptidoglycan shell). This observa-tion suggested that the morphogenesis of these

    two structures separated by a membrane must

    somehow be linked (26). Of the approximately

    70 proteins present in the spore coat ofB. sub-

    tilis, only SpoVM and SpoIVA are required for

    this linkage (11, 18).

    CmpA. In an effort to determine how the co-

    ordinated assembly of the coat and cortex is

    achieved, investigators isolated a mutant allele

    ofspoVMthat specifically blocked cortex assem-bly but allowed normal initiation of coat assem-

    bly. A spontaneous suppressor mutation that

    corrected this sporulation defect was identified

    in an intergenic region that harbored a pre-

    viously unannotated small gene subsequently

    namedcmpA(cortex morphogenetic protein A)

    (27). The mRNA encoding this 37-amino-acid

    small protein was also identified in a global

    screen for small, regulatory RNAs (sRNAs) that

    were specifically upregulated during sporula-

    tion (28). Although cmpA is widely conservedin endospore-forming species of the Bacillales

    order, it is conspicuously absent in members

    of the Clostridiales order. This observation is

    consistent with a recent report that coat assem-

    bly and cortex assembly are likely not linked in

    Clostridium difficile(29).

    On the basis of the phenotypes of cmpA

    single mutants compared with those ofspoVM

    cmpA double mutants, CmpA is thought to

    function in a checkpoint that inhibits cortex

    assembly until coat assembly properly initi-ates (28). Accordingly, the lack of cmpA re-

    sults in cells sporulating faster (presumably due

    to unchecked cortex assembly). Overexpression

    of the gene impairs cortex assembly and, as a

    result, reduces sporulation efficiency. Consis-

    tent with its cortex-inhibiting function, CmpA-

    GFP (green fluorescent protein) was no longer

    detected by fluorescence microscopy in those

    cells that had reached a particular sporula-

    tion milestone. This observation suggests that

    the inhibitory effect of CmpA is eventually

    overcome in those cells by a posttranscrip-

    tional mechanism. Identification of the target

    of CmpA inhibition (presumably a cortex as-

    sembly factor) and factors that participate inthe regulatory pathway that ultimately leads to

    the relief of CmpA inhibition should reveal the

    mechanism underlying the linked morphogen-

    esis of the coat and cortex during sporulation.

    Small Proteins That Affect CellDivision: MciZ, SidA, and Blr

    Cytokinesis in bacteria depends on the assem-

    bly of the cell division machinery, termed the

    divisome, usually at or near mid-cell. The di-visome is composed of approximately ten core

    proteins that anchor the divisome to the mem-

    brane and mediate its constriction (reviewed in

    Reference 30). The component of the divi-

    some that actually exerts the force for this con-

    striction is a tubulin homolog termed FtsZ

    that polymerizes into a ring at the division

    site in a GTP-dependent manner (reviewed in

    Reference 31). As the central component of the

    divisome, FtsZ is a frequent target of positive

    or negative regulation either by components ofthe divisome or by other species-specific ac-

    cessory factors, among them small proteins. In

    this manner, cytokinesis may be carefully regu-

    lated both spatially and temporally (reviewed in

    Reference 32).

    MciZ. At the onset of sporulation inB. subtilis,

    before the elaboration of an asymmetrically po-

    sitioned septum, the FtsZ ring initially assem-

    bles at mid-cell, after which time it redeploysto two polar sites at the opposite ends of the

    cell (33). Only one of the polar FtsZ rings then

    constricts to form the polar septum (34). Upon

    elaboration of the polar septum, other cell di-

    vision events are generally blocked so that the

    progenitor cell differentiates into two dissimilar

    progeny cells.

    www.annualreviews.org Bacterial Small Proteins 757

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    6/28

    SOS response:bacterial response toDNA damage, in

    which genes requiredfor cell cycle arrest andDNA repair are

    induced

    In an interaction screen to identify FtsZ-

    associated proteins, Handler et al. (35) discov-

    ered a previously unannotated 40-amino-acid

    small protein that was specifically synthesized

    in the mother cell during sporulation; they

    named this protein MciZ (mother cell inhibitor

    of FtsZ). Purified MciZ not only bound di-

    rectly to FtsZ, but also abrogated polymeriza-tion of FtsZ in vitro. Consistent with this obser-

    vation, overproduction of MciZ in vegetatively

    growingB. subtilisresulted in cell filamentation

    that arose from a reduction in the number of

    FtsZ rings assembled. This cell division defect

    was suppressed by a single-amino-acid substi-

    tution in FtsZ, near its GTP-binding pocket,

    suggesting that binding of MciZ to FtsZ may

    directly occlude GTP binding, thereby inhibit-

    ing polymerization. A deletion of the mciZgene

    revealed the physiological role for this smallprotein. Whereas vegetative cells harboring a

    deletion of mciZ did not display an obvious

    phenotype, sporulating cells continued to form

    aberrant Z-rings in the mother cell even after

    the onset of polar septation. Taken together,

    the data are consistent with a model in which

    MciZ, by inhibiting FtsZ assembly, prevents

    additional cell division events after the mother

    cell commits to the sporulation pathway.

    SidA. Although FtsZ, the central componentof the divisome, is the most frequently identi-

    fiedtargetfor regulation of cell division, a newly

    discovered small protein inCaulobacter crescen-

    tusinhibits another component of the divisome,

    FtsW, and prevents membrane constriction af-

    ter the divisome has already assembled and is

    poised to constrict. Treatment ofC. crescentus

    with DNA-damaging agents results in an ar-

    rest in cell division (36), but not in cell elon-

    gation. Whole-genome microarray analysis re-

    vealed that the expression of approximately 5%of the genes in the C. crescentusgenome was

    concomitantly affected either positively or neg-

    atively (37). One of these genes, which the

    authors renamed sidA(SOS-induced inhibitor

    of cell division A), was upregulated in re-

    sponse to DNA damage and encoded a putative

    29-amino-acid hydrophobic protein containing

    a predicted transmembrane segment (37). Th

    presence of an upstream LexA binding site su

    gests that thesidAgene is part of the SOS re

    ulon ofC. crescentus.

    In a well-studied pathway in E. coli, indu

    tion of the SulA protein by DNA damage d

    lays cell division by inhibiting FtsZ polymeriz

    tion (38). C. crescentusdoes not harbor a Suhomolog, but Modell et al. (37) demonstrat

    that SidA may fulfill a similar inhibitory fun

    tion, as overproduction of SidA during no

    malgrowth resulted in cellfilamentation. How

    ever, unlike the case for SulA, overproducti

    of SidA did not prevent the localization or a

    semblyoftheFtsZringatmid-cell,eventhoug

    membrane constriction could not subsequent

    be observed.

    A genetic selection revealed that the targ

    of SidA inhibition is likely FtsW. FtsW is integral membrane protein that has been imp

    cated to be thelipid II flippase, which transpor

    peptidoglycan precursors from the cytosol in

    the periplasm (39). Bacterial two-hybrid ana

    ysis demonstrated that SidA directly interac

    with FtsW (as well as with FtsN and indirect

    with FtsI, two other late-arriving divisome pr

    teins). In addition, mutations in ftsWthat su

    pressed the SidA overexpression phenotype r

    sulted in FtsW variants that displayed a reduc

    interaction with the small protein.One hypothesis to be drawn from the dire

    interaction betweenSidA and FtsW is that Sid

    disrupts peptidoglycan synthesis at the nasce

    septum via inhibition of the lipid II flippase a

    tivity of FtsW. Surprisingly, however, as mo

    itored by fluorescein-labeled vancomycin, t

    initiation of peptidoglycan synthesis did not a

    pear to be disrupted by SidA overexpressio

    Moreover, in cells overexpressing SidA, the in

    tiation of membrane invagination at mid-ce

    also appeared to be normal, even though setum formation did not proceed past this poin

    Taken together, the data suggest that FtsW

    may be involved in a previously unappreciate

    step of cell division that occurs after its pep

    tidoglycan precursortransporting activity an

    that involves the final membrane-remodelin

    step of cytokinesis. Thus, the discovery of th

    758 Storz Wolf Ramamurthi

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    7/28

    Sensor kinase:the protein thatbacterial two-component signtransduction sysis a phosphor do

    a response regul

    Response reguthe protein (usutranscription facthat, in bacterialtwo-componenttransduction sysreceives thephosphoryl groufrom the sensor

    small protein SidA not only revealed a pre-

    viously unidentified target for the regulation

    of cell division, but also implicated that tar-

    get in a previously undescribed role in cell

    division.

    Blr. A second small protein that regulates a

    divisome component other than FtsZ is the41-amino-acid Blr small protein ofE. coli. blr

    was first identified as a stand-alone gene (40)

    that, when disrupted, caused increased sensi-

    tivity to -lactam antibiotics (-lactam resis-

    tance) (41) and, surprisingly, decreased sensi-

    tivity to cell envelope stress (42). Blr was later

    identified in a bacterial two-hybrid screen (43)

    for proteins that interacted with FtsL, an inte-

    gral membrane protein that has a very poorly

    understood function but that is nonetheless a

    core component of the division machinery (30).The two-hybrid assay also suggested that Blr

    can associate with other components of the cell

    division machinery, such as FtsI, FtsK, FtsN,

    FtsQ, and FtsW (43). Consistent with these re-

    sults, GFP-Blr was found to localize to the divi-

    sion septum ofE. coliin a divisome-dependent

    manner. Karimova et al. (43) were unable to

    identify a significant cell division defect in cells

    harboring a deletion of theblrgene. However,

    the cells became elongated when grown in a

    lowosmotic strength medium and when theblrmutation was combined with a temperature-

    sensitive variant of another cell division protein

    termed FtsQ, which also performs a very poorly

    understood function during cell division. This

    observation suggests that Blr modulates the

    function of FtsQ during specific growth con-

    ditions. Additional characterization of Blr and

    its interacting partners will undoubtedly pro-

    vide further insights into small protein action as

    well as into the functions of divisome proteins,

    similar to the manner in which SidA revealeda previously unappreciated function of FtsW.

    Future studies of MciZ, SidA, Blr, and other

    divisome regulators should also clarify whether

    small proteins predominate or whether larger

    proteins can have similar functions in modulat-

    ing cell division.

    Small Protein Regulators ofTransport: KdpF, AcrZ, and SgrT

    Bacterial cells have a multitude of transporters

    that allow for the import of critical nutrients

    and for the export of detrimental compounds.

    Several small proteins inE. coliassociate with

    small molecule transporters, with various con-

    sequences for activity.

    KdpF. Thehydrophobic 29-amino-acid KdpF

    protein was one of the first small proteins de-

    scribed to affect a transporter (44). Gael et al.

    (44) noted a small ORF initiating with a GTG

    codon in the 116-base-pair region between the

    promoter elements of theE. coli kdpABCoperon

    and the first annotated gene in the operon,

    kdpA. The kdpABC mRNA encodes a P-type

    ATPase high-affinity K+

    transporter and is in-duced in response to limiting K+ by the KdpD-

    KdpE sensor kinase and response regulator en-

    coded downstream (45).

    Synthesis of KdpF was confirmed by label-

    ing minicells with 35S methionine, by visual-

    izing the proteins in the purified Kdp com-

    plex by silver staining, and by purifying the

    small protein upon extracting membranes with

    chloroform and methanol. The purified KdpF

    protein was used to examine its effects on the

    purified KdpABC complex. Interestingly, thecomplex purified from a strain lacking kdpF

    had low activity. A significant increase in ac-

    tivity was observed upon the addition of puri-

    fied KdpF. The presence of high amounts of

    E. coli lipids also increased activity, although

    not to the same extent as did the addition of

    KdpF. Together, these results led to the con-

    clusion that KdpF helps stabilize the KdpABC

    complex. Despite the strong effects of KdpF

    addition in vitro, the lack ofkdpFdid not obvi-

    ously affectE. coligrowth in low K+

    , suggestingthat KdpF may be required only under specific

    growth conditions or that there may be redun-

    dant mechanisms for stabilizing the KdpABC

    complex (45).

    The kdpFABC genes are also present in

    Mycobacterium bovisBCG. Bacterial two-hybrid

    assays conducted with mycobacterial proteins

    www.annualreviews.org Bacterial Small Proteins 759

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    8/28

    revealed an interaction between KdpF and the

    sensor kinase KdpD as well as between KdpF

    and the MmpL7 and MmpL10 proteins, which

    are involved in lipid synthesis and transport.

    The significance of these two-hybrid signals is

    not clear, because high levels of KdpF did not

    alter the levels of thekdp mRNAs or of apolar

    lipids. However, KdpF overexpression led todecreased growth ofM. bovisBCG in murine

    and human macrophages, and cells with high

    KdpF levels had altered colony morphology,

    pointing to a physiological role for the small

    protein (46).

    AcrZ. AcrZ, which interacts with the major

    AcrB-AcrA-TolC efflux pump inE. coli, is an-

    other hydrophobic small protein that affects

    transport (47). This 49-amino-acid small pro-

    tein is encoded by a stand-alone gene near themodEF and modABC operons, which encode

    proteins involved in molybdenum uptake and

    in regulating the synthesis of molybdopterin.

    Despite the synteny with themodgenes across

    multiple organisms, strains lacking acrZhave

    not been found to have a phenotype related to

    molybdenum.

    However, acrZ expression is strongly in-

    duced by a range of antibiotics, detergents,

    and oxidizing compounds via the MarA, Rob,

    and SoxS transcription factors (47). Consis-tent with a role in protecting against deleteri-

    ous compounds, a functional, tagged version of

    the AcrZ protein (AcrZ-SPA) copurifies with

    AcrB, the inner-membrane component of the

    AcrAB-TolC efflux pump, even in the absence

    of AcrA and TolC. Bacterial two-hybrid assays

    further supported an interaction between AcrZ

    and AcrB, as did increased protease sensitivity

    of AcrB in extracts from cells lacking AcrZ. A

    dominant-negative AcrZ mutant that conferred

    sensitivity to chloramphenicol was suppressedby a mutation in the gene encoding AcrB, pos-

    sibly defining the region of interaction between

    the two proteins. The precise consequences of

    AcrZ binding to AcrB are not known, butacrZ

    mutants are sensitive to only a subset of com-

    pounds to which acrB strains are hypersen-

    sitive, both in global phenotypic screens (48)

    and in individual tests (47). These observatio

    suggest that one function of AcrZ may be

    enhance AcrAB-TolC export of certain class

    of substrates.

    SgrT. The 43-amino-acid SgrT protein is e

    coded on a transcript that was first iden

    fied as a sRNA. In fact, the sRNA denoteSgrS binds to the Hfq RNA chaperone pr

    tein and regulates several mRNAs by base pai

    ing, including the ptsG mRNA gene, whi

    encodes the EIICBGlc glucose transporter

    the phosphoenolpyruvate-dependent gluco

    phosphotransferase system(49).The SgrStra

    script is unusually long for an E. coli sRN

    and upon further examination, the sequen

    was found to also encode a small protein who

    expression was confirmed bylacZtranslation

    fusions (50). Phenotypic assays showed thSgrT inhibits glucose uptake, most likely

    inhibiting the activity of EIICBGlc. Thus, th

    SgrS RNA, whose expression is induced

    the SgrR transcription factor in response

    glucose-phosphate stress (51), provides an el

    gant defense against high levels of toxic glucos

    phosphate: The sRNA base pairs with ptsG

    block translation of EIICBGlc, and the encod

    small protein SgrT blocks transport by pree

    isting EIICBGlc.

    Further characterization of the SgrT prtein gave results consistent with a direct in

    teraction between EIICBGlc and SgrT (5

    The two proteins were cross-linked when ce

    expressing both C-terminally tagged protei

    (EIICBGlu-His5and SgrT-3HA) from plasmi

    in a deletion background were treated wi

    paraformaldehyde. These experiments also r

    vealed that SgrT has a preference for the d

    phosphorylated form of EIICBGlc, which

    prominent during glucose uptake. Addition

    evidence for an interaction between the twproteins came from a bimolecular fluorescen

    complementation assay in which both protei

    were fused to one-half of a GFP. This ass

    was used to delineate the regions of EIICBG

    required for the interaction with SgrT. Th

    results of these assays, together with cop

    rification studies of tagged EIICBGlc carryin

    760 Storz Wolf Ramamurthi

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    9/28

    LPS:lipopolysacchari

    different amino acid substitutions, suggest

    that SgrT may interact with a conserved

    KTPGRED motif present in the glucose trans-

    porter but not present in the transporters of

    other sugars. The SgrT protein does not have

    an obvious -helical transmembrane domain,

    but fluorescence microscopy with SgrT tagged

    with GFP (SgrT-GFP) indicated that SgrT islocalized to the membrane upon the interaction

    with EIICBGlc.

    Small Protein Regulators ofMembrane-Bound Enzymes:CydX, PmrR, and MgtR

    Some of the first small proteins to be iden-

    tified are those associated with photosystem

    I, photosystem II, and cytochrome complexes

    in cyanobacteria (reviewed in References 5355). Crystallographic studies have confirmed

    the interaction of these single-transmembrane-

    domain small proteins with these large com-

    plexes, although the exact role of each proteinis

    less clear. A survey of the phenotypes associated

    with the lack of the Psb small protein compo-

    nents of photosystem II proposes that most are

    involved in stabilization, assembly, or dimeriza-

    tion of the complex (reviewed in Reference 53).

    Similarly, a study of the PetG, PetL, and PetN

    proteins of Synechocystis suggested that PetGand PetN stabilize the cytochromeb6fcomplex

    (54). Thus, the functions of these small proteins

    associated with large complexes may be similar

    to the functions proposed for KdpF.

    CydX. Recent studies of the 37-amino-acid

    CydX protein, which is encoded downstream of

    the CydA and CydB subunits of the cytochrome

    bdoxidase in E. coli, found that a deletion of

    thecydXgene gave rise to the same phenotypes

    ascydAor cydBdeletions, such as slow growthunder aerobic conditions and sensitivity to the

    reducing agent-mercaptoethanol (56). Direct

    measurements of oxidase activity inE. colialso

    showed that activity is reduced in the absence

    of CydX. At 51 amino acids, the corresponding

    protein encoded adjacent to theBrucella abortus

    cydAB genes is slightly longer thanE. coliCydX,

    but again strains lacking this protein have de-

    fects consistent with a lack of cytochrome bd

    oxidase activity as well as impaired growth in

    macrophages (57).

    Copurification studies with functional,

    tagged derivatives ofE. coliCydX (CydX-SPA)

    confirmed that the small protein tightly asso-

    ciates with tagged CydA (CydA-His6) and withCydB (CydB-His6) (56). The AppX homolog

    encoded by theappABXoperon for the anaero-

    bically induced cytochromebdoxidase also as-

    sociates with CydA, albeit to a lower extent than

    CydX. The presence of a cysteine residue in

    the predicted transmembrane domain of CydX

    led to the attractive hypothesis that the cys-

    teine might coordinate a heme in the Cyd-

    ABX complex, but this hypothesis was not sup-

    ported by mutational studies.In fact, these stud-

    ies revealed that substitution of only 4 of the15 residues tested prevented full complementa-

    tion of the -mercaptoethanol sensitivity phe-

    notype associated with thecydXdeletion.

    PmrR. The hydrophobic 29-amino-acid

    PmrR protein in Salmonella was discovered

    by bioinformatics, which revealed a small

    ORF preceded by a binding site for the

    PmrA response regulator, both encoded

    downstream and on the opposite strand of

    the PmrA-regulatedpmrCABoperon (58, 59).Transcript mapping confirmed the presence

    of a PmrA-dependent transcript, and a tagged

    PmrR protein (PmrR-FLAG) that cofraction-

    ated with the inner membrane was detected

    by immunoblot analysis. Given the roles of

    the PmrB sensor kinase and PmrA response

    regulator in controlling genes that mediate

    modification of theSalmonellalipopolysaccha-

    ride (LPS) and the unexplained observation

    that a PmrA-regulated gene inhibited LpxT-

    mediated synthesis of diphosphorylated lipid A(60), Kato et al. (58) tested for an interaction

    between PmrR and LpxT. The predicted

    interaction was confirmed in both two-hybrid

    assays and copurification of tagged PmrR

    (FLAG-His6-PmrR) and tagged LpxT (LpxT-

    LacZ). Overexpression of PmrR and decreased

    expression of the small protein also had the

    www.annualreviews.org Bacterial Small Proteins 761

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    10/28

    expected effects of decreasing and increasing

    diphosphorylated lipid A levels, respectively.

    These experiments raised the broader ques-

    tion of the physiological consequences of dif-

    ferent levels of diphosphorylated lipid A, which

    increases the overall negative charge of the

    LPS (58). Assays of Fe3+ binding revealed that

    increased PmrR levels (which resulted in de-creased diphosphorylated lipid A synthesis and

    thus in an increased positive charge of the LPS)

    decreased the binding of Fe3+ to the LPS. Be-

    cause Fe3+ activates the PmrB response regula-

    tor, elevated PmrR and decreased Fe3+ binding

    also decreased activation of PmrB-PmrA target

    genes. These observations led to the proposal

    of a negative feedback loop wherein PmrB and

    PmrA are activated by high external Fe3+ and

    induce the expression of PmrR, which inhibits

    LpxT, leading to a change in LPS charge. Theresulting decrease in negative charge in turn re-

    sults in lower Fe3+ binding and finally in de-

    creased PmrB activation.

    MgtR. The PhoQ sensor kinase and PhoP re-

    sponse regulator have a critical role in patho-

    genesis in Salmonella species and control a large

    regulon in response to low Mg2+, acidic pH,

    and antimicrobial peptides. The PhoQ-PhoP

    system controls the expression of at least three

    small hydrophobic proteins: the 30-amino-acidMgtR protein (61), the 47-amino-acid MgrB

    protein (62), and the 31-amino-acid YneM

    protein (63).

    The first of these small proteins, MgtR,

    modulates the stability of the MgtC virulence

    factor in Salmonella typhimurium (61). The

    discovery of MgtR came from studies that

    followed up the observation that, although the

    levels of the mgtCB transcript are high in Mg2+-

    depleted medium, the MgtC protein is barely

    detectable (64). By mapping portions of themgtCB transcript associated with the instability,

    Alix & Blanc-Potard (61) found that a region

    downstream ofmgtBis involved and noted that

    this region may encode a 30-amino-acid pro-

    tein. Mutations that correlated MgtR coding

    potential with MgtC instability provided ev-

    idence that the small protein was synthesized.

    Alix & Blanc-Potard also detected a function

    tagged MgtR derivative (His6-MgtR).

    Western blot analysis of MgtC leve

    showed that MgtR affects the stability of Mgt

    in a manner that is dependent on FtsH, th

    membrane-bound protease involved in t

    degradation of membrane proteins. Bacteri

    two-hybrid assays supported a direct interation between MgtR and MgtC. Intriguingl

    substitution of only 2 of 11 residues in plasmi

    expressed MgtR had effects on MgtC leve

    In MgtC, mutations in the cytoplasmic loo

    between the third and fourth transmembran

    domains confer resistance to MgtR, possib

    defining the region of interaction between th

    two proteins.

    Although MgtC was long known to

    a virulence determinant, the activity of th

    protein was uncovered only recently (65Copurification experiments after cross-linkin

    revealed that tagged MgtC (MgtC-FLAG

    associated with FtsH and MgtR (as expecte

    from the results above), as well as with the F

    subunit of the F1F0 ATP synthase. Lee et a

    (65) went on to show that the MgtC interactio

    with the F0a subunit inhibits ATP synthes

    and ATP-driven proton translocation and h

    pothesized that the MgtC-mediated inhibitio

    of the F1F0ATP synthase may protect the ba

    teria against the acidification that occurs insidmacrophages.

    Another recent study using a two-hybrid a

    say showed that MgtR also posttranscriptio

    ally affects the levels of the MgtA magnesiu

    transporter and interacts with MgtA (66). Th

    interplay between the small protein MgtR an

    MgtA, MgtC, FtsH, and the F1F0ATP syntha

    is not completely understood but may const

    tute a feedback loop.

    Small Protein Regulators of ProteinKinases and Signal Transduction:

    MgrB and Sda

    The frequent localization of small protei

    at the inner membrane makes small pr

    teins candidates for regulating membran

    localized sensor kinases and for affecting sign

    762 Storz Wolf Ramamurthi

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    11/28

    transduction. Two such proteins are discussed

    here.

    MgrB. The second of the hydrophobic,

    PhoQ-PhoP-regulated small proteins, MgrB,

    negatively regulates the PhoQ sensor kinase

    in E. coli(67). Because PhoQ and PhoP were

    known to strongly regulate mgrB expression(62), Lippa & Goulian (67) monitored the ef-

    fects of deleting PhoP target genes in a strain

    background in which the yellow fluorescent

    protein (yfp) reporter was fused to the mgrB

    promoter. In this assay, the mgrB mutant

    showed a striking increase in YFP activity as

    monitored by colony color on plates in low and

    high magnesium. In addition, overexpression

    of E. coliMgrB as well as of homologs from

    SalmonellaandYersinaled to strong repression

    ofthe mgrB-yfp fusion. Thehydrophobicnatureof the proteinprompted the authors to examine

    subcellular localization. Both cell fractionation

    followed by immunoblot analysis with antibod-

    ies raised against a C-terminal peptide of MgrB

    as well as localization of a GFP-tagged deriva-

    tive of MgrB (GFP-MgrB) gave signals consis-

    tent with membrane localization.

    ThemgrBdeletion and overexpression phe-

    notypes led to the prediction that the small pro-

    tein might be interacting with the PhoQ sensor

    kinase, an expectation that was supported bytwo-hybrid analysis (67). The binding and

    repression of PhoQ resulted in a negative feed-

    back loop wherein PhoQ and PhoP activated

    expression of MgrB, which in turn repressed

    PhoQ. Because the kinetics of reporter induc-

    tion were not found to be altered, Lippa &

    Goulian (67) proposed that theMgrB-mediated

    feedback inhibition could be a mechanism by

    which to integrate additional environmental

    signals into the PhoQ-PhoP circuit. Consistent

    with this model, MgrB was found to be requiredfor the repressive effects of DsbA (68, 69), a

    periplasmic disulfide oxidase, on the induction

    of PhoQ-PhoP-regulated genes. Interestingly,

    MgrB contains three conserved cysteines: C16,

    which is predicted to be in the transmembrane

    domain, and C28 and C39, which are predicted

    to be in the periplasm. Future studies to

    determine the redox status of the cysteines un-

    der different stress conditions and to examine

    how oxidation and reduction influence MgrB

    binding to PhoQ should further elucidate the

    role of MgrB in modulating the PhoQ-PhoP

    response.

    Sda. The cytosolic 46-amino-acid Sda protein,one of the best-characterized small proteins,

    similarly inhibits the first kinase in the his-

    tidine kinase phosphorelay that regulates

    sporulation-specific genes inB. subtilis. When

    cells encounter starvation and stress, the histi-

    dine kinases KinA and KinB are activated and

    autophosphorylate; phosphates from these ki-

    nases are transferred to Spo0F, then to Spo0B,

    and finally to the transcription regulator

    Spo0A. An advantage of such a phosphorelay is

    that the drastic step to initiate sporulation canbe modulated at multiple steps in response to a

    range of environmental signals. A mutant allele

    ofdnaA blocks replication initiation and, as a

    result, blocks the entry into sporulation. The

    sdagene (suppressor ofdnaA) was identified in

    a screen for mutations that bypassed this block

    in sporulation (70). Thesdapromoter has mul-

    tiple DnaA binding sites, and expression of a

    sda-lacZfusion is induced by various mutations

    that affect replication, most likely via DnaA. In

    vitro assays with purified tagged derivatives ofsoluble KinA (KinA-His6) and Sda (Sda-His6)

    revealed that Sda inhibits the kinase activity

    of KinA. Burkholder et al. (70) also suggested

    that Sda may inhibit KinB but were unable to

    directly test this hypothesis in vitro, given the

    difficulty in purifying the integral membrane

    kinase. They proposed that Sda serves as a

    checkpoint to inhibit the KinA/KinB-Spo0F-

    Spo0B-Spo0A phosphorelay and thus sporula-

    tion if DNA cannot be replicated properly.

    The interaction between Sda and the KinAkinase has been examined in detail. The site

    of Sda binding to KinA was mapped to the

    KinA dimerization/phosphotransfer (DHp)

    domain, first by the determination of the

    Sda NMR structure and mutational studies

    (71) and later by small-angle X-ray scattering

    and neutron contrast variation studies on

    www.annualreviews.org Bacterial Small Proteins 763

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    12/28

    the Sda-KinA complex (72). Researchers first

    proposed that Sda acts only by blocking an

    interaction between the catalytic and DHp

    domains of KinA (71). However, later studies

    led to the conclusion that Sda inhibits KinA

    by inducing a conformational change via the

    DHp domain (72). Sda interaction with KinA

    may have multiple consequences. One of thelatest studies showed that Sda also interferes

    with the phosphotransfer from KinA to Spo0F

    because the Sda and Spo0F binding sites on

    KinA overlap (73). An X-ray structure of the

    Geobacillus stearothermophilus Sda protein in

    complex with the cytoplasmic portion of KinB

    comprising the catalytic and DHp domains

    supported the suggestion that Sda also inhibits

    KinB and showed that Sda again blocks the

    interaction between the catalytic and DHp

    domains and the interaction with Spo0F (74).These extensive structural studies, together

    with an X-ray structure ofB. subtilisSda (75),

    illustrate how a protein of only 46 amino acids

    and only two -helices can nevertheless signif-

    icantly impact interactions between proteins

    or interactions between domains of a single

    protein.

    Small Protein Chaperones:MntS, FbpB, and FbpC

    Several well-characterized protein, nucleic

    acid, and metal chaperone proteins are of rel-

    atively low molecular weight (reviewed in Ref-

    erences 76 and 77). Thus, some of the bacterial

    small proteins of even lower molecular weight

    may also have chaperone function.

    MntS. MntS is another example of a small

    protein discovered as a consequence of charac-

    terizing a transcript first annotated as a sRNA in

    E. coli. This small ORF, which was recognizedon the basis of sequence conservation, was re-

    ported to encode a 42-amino-acid small protein

    initiating with an ATG codon (78) but, upon

    further inspection, may instead encode a 33-

    amino-acid small protein initiating with a GTG

    codon (L.S. Waters, personal communication).

    Clues to MntS function came from the infor-

    mation about its expression. ThemntSgene

    adjacent to the gene encoding the manganes

    responsive MntR transcription repressor, an

    synthesis of a tagged MntS derivative (Mnt

    SPA) is strongly repressed by high manganes

    Consistent with MntR regulation of Mnt

    overexpression of MntS resulted in seve

    sensitivity to manganese, but not to high leveof other metals such as iron, nickel, and coppe

    This observation, together with the findin

    that another target of MntR is abnorma

    regulated in anmntSdeletion strain, led to th

    suggestion that MntS functions as a chaperon

    to deliver manganese to appropriate protei

    or locations in the cell. Experiments showin

    that the activities of manganese-depende

    superoxide dismutase and ribonucleoti

    reductase enzymes are reduced in E. coli mn

    mutants ( J. Martin, L.S. Waters & J.A. Imlapersonal communication) support an Mn

    role in increasing manganese availabili

    Nonetheless, further studies are needed

    distinguish between chaperone activity and a

    ternative functions such as blocking mangane

    export.

    FbpB and FbpC. Two other small protei

    whose expression is regulated by the presence

    metals, in this case iron, are the 48-amino-ac

    FbpB (Fur-regulated basic protein B) and th29-amino-acid FbpC proteins ofB. subtilis(79

    The transcript encoding FbpC was the fir

    identified member of the ferric uptake repre

    sor (Fur) regulon (80). The low ironinduc

    mRNA encoding FbpB (initially annotated as

    59-amino-acid protein) and the slightly larg

    54-amino-acid FbpA protein were identified

    a genome-wide screen for Fur-repressed gen

    (81). All three of these basic proteins can be d

    tected as FLAG-tagged derivatives. Given th

    repression of some operons by the regulatosRNA FsrA requires one or the other of the

    small proteins, researchers proposed that the

    basic small proteins act as chaperones to faci

    tate sRNA function (82). However, the reaso

    for the observed differential effects on targ

    operons, the binding partners for the three pr

    teins, and their exact functions are unknown

    764 Storz Wolf Ramamurthi

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    13/28

    Ribosome profia global method

    which to determmRNA sites boua ribosome at angiven time

    FUNDAMENTAL QUESTIONS

    The characterization of an increasing num-

    ber of small proteins with diverse physiological

    roles allows for reflection on general principles

    regarding the discovery and characterization of

    small proteins; their mechanisms of action, syn-

    thesis, and degradation; and their distribution

    and evolution. At the same time, these studiesraise interesting and important general ques-

    tions that can guide future studies.

    Further Identification of SmallProteins and the Elucidationof Their Functions

    The full complement of small proteins is not

    yet known for any organism. A review of how

    the small proteins described above were foundreveals that many were discovered by serendip-

    ity: some as mutations in unannotated regions

    of the chromosome and others by further in-

    spection of potential promoters and short tran-

    scripts. A limited number of computational

    screens based on comparative genomics and

    the identification of ribosome binding site se-

    quences have been designed to specifically de-

    tect bacterial small proteins (for examples, see

    References 13, 59, 83, and 84). However, a gen-

    eral drawback of computational screens is thelimited information content of a small protein

    coding gene, particularly if the gene encodes a

    hydrophobic small protein. This limitation has

    led to both the underannotation and the over-

    annotation of small protein genes in completed

    genomesequences(85). Thus,an essential com-

    ponent of small protein identification is inde-

    pendent data such as those acquired through

    direct detection or mutational analysis demon-

    strating that the small protein is synthesized.

    The newly developed technique of ribo-some profiling to identify the positions of ri-

    bosome occupancy on a transcriptome-wide

    level is leading to the identification of ribo-

    some binding signatures on small ORFs (86).

    Whether all these signals will be correlated with

    the existence of small proteins is not yet clear.

    Again, the onus will be to obtain independent

    data supporting the synthesis of small proteins.

    As more small proteins are characterized, gen-

    eral features that are uncovered may provide

    clues that lead to the productive identification

    of this family of proteins. For example, the hy-

    drophobic natureof most small proteins studied

    thus far suggests that membrane extraction

    for example, by methanol and chloroform, aswas carried out for KpdF (44) and found for the

    29-amino-acid YnhF protein of E. coli(87)

    may be a fruitful avenue to pursue. We en-

    vision that increased awareness of important

    roles for small proteins will also increase the

    scrutiny of mutations in unannotated regions

    of the chromosome as well as scrutiny of faint

    bands at the bottom of protein gels. A conse-

    quence of the identification of new small pro-

    teins is the need to revisit annotation of genome

    sequences, likely by both manual and improvedin silico methods, to fill in the missing genes.

    At this point, we are still left with the excit-

    ing question of the true extent of the small

    proteome.

    As the synthesis of new small proteins is

    confirmed, there will be the challenge of

    elucidating their functions. This challenge is

    not substantially different from the problem

    of characterizing larger proteins of unknown

    function but has the added complication that

    current biochemical assays are biased againstsmall proteins. For example, raising antibodies

    against a 30-amino-acid hydrophobic protein

    is difficult; the two reported antibodies against

    mature small proteins (-SgrT and -MgrB)

    detect only protein overproduced from plas-

    mids (67, 88). With the exception of KdpF

    and the subunits of the cytochrome bc1 and

    cytochrome oxidase complexes, all interactions

    between the small proteins described and their

    interacting protein partners were examined by

    using tagged derivatives of the small proteins.Most of these tags are larger than the small pro-

    tein. Although in many examples the tagged

    protein was confirmed to complement a spe-

    cific phenotype, this complementation may not

    be the case for all small proteins, and caution

    regarding unanticipated effects of the tags is

    warranted. The identification of interactions in

    www.annualreviews.org Bacterial Small Proteins 765

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    14/28

    Competence:the ability to uptakeexogenous DNA bybacteria

    two-hybrid assaysis a recurrenttheme, suggest-

    ing that this line of investigation may be pro-

    ductive for the characterization of other small

    proteins.

    The approach of examining the phenotypes

    of overproducing the small proteinand deleting

    the corresponding gene has also yielded use-

    ful insights into the functions of many of thesmall proteins. However, none of the small pro-

    teins have been found to be essential for viabil-

    ity, and only a subset of the null mutants gave

    strong phenotypes. Some small proteins may

    have partially redundant functions, as found

    for CydX and AppX (56). Alternatively, small

    proteins may generally be acting in regulatory

    roles, as seen for PmrR (58), MgrB (67), and

    Sda (70). These largely regulatory functions

    would be akin to the functions of regulatory

    sRNAs, which, despite having significant im-pacts on bacterial cell physiology, are also not

    essential (reviewed in Reference 89).

    Mechanisms of Action

    As increasing numbers of small proteins are

    characterized, we are learning more about their

    mechanisms of action. By definition, the small

    size of the proteins limits the number of pos-

    sible three-dimensional structures into which

    the proteins can fold and the number of ac-tivities the proteins can have. Accordingly, an

    -helical structure appears to be the common

    inferred structural motif among the transmem-

    brane and amphipathic small proteins discussed

    above. The set of small proteins studied thus far

    also suggests that small proteins are unlikely

    to possess enzymatic functions. Instead, as il-

    lustrated by the proteins discussed, this class

    of proteins appears to act in more mechanical

    ways. They can be inhibitors by directly block-

    ing a domain of a target protein, as suggestedfor MciZ (35) and MgrB (67), or by block-

    ing interactions between domains or interac-

    tions between proteins,as suggested forSda (73,

    74). Alternatively, a small protein may facili-

    tate interactionsbetween domains within or be-

    tween proteins as well as between proteins and

    othermolecules,assuggestedforMgtR(61)and

    MntS (78). An interaction with a small prote

    may also bring about a conformational chang

    as suggested for the binding of Sda to the Kin

    kinase (72). Finally, small proteins may pr

    vide a membrane anchor, as found for SpoV

    (17).

    An important direction for further studies

    small proteins will be the elucidation of thestructures, particularly in the context of th

    larger interacting proteins. These types of stu

    ies, together with mutational analyses to defin

    critical amino acids, will help to delineate ho

    small proteins interact with their respecti

    partners. In this context, surprisingly few of t

    mutations isolated thus far have dramatic effec

    on small protein activity. With only a limit

    number of amino acids, an initial assumptio

    might be that most of the residues are critic

    but the results to date have not confirmed thassumption. Of course, more systematic sca

    ning mutagenesis of small proteins may reve

    residues that aremore sensitive to substitution

    Many of the small toxic proteins synth

    sized as part of type I toxin-antitoxin system

    (reviewed in Reference 5) or serving as a d

    fense against host cells (reviewed in Referen

    90) are of similar length and hydrophobicity

    the transmembrane small proteins we descri

    above, yet the small proteins we discuss are n

    toxic or secreted. Which residues define the dferences in the modes of action is unclear. Inte

    estingly, ComI, a member of the TxpA fami

    of toxins (91) encoded on a large plasmid in u

    domesticated strains ofB. subtilis, performs th

    additional cellular function of inhibiting com

    petence (92), suggesting that the distinctio

    between the different types of small protei

    may be blurred.

    A related question is whether there are an

    lower limits to the size of a functional sma

    protein. The average length of a transmembrane helix is approximately 26 5 residue

    but transmembrane segments of as few as

    residues are possible (93). Other questio

    that remain to be addressed are whether sm

    proteins form oligomeric complexes, wheth

    one small protein can interact with multip

    larger proteins, and whether larger protei

    766 Storz Wolf Ramamurthi

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    15/28

    can be the target of a set of small proteins ex-

    pressed under different conditions. In this con-

    text, in E. coli, the slightly larger 72-amino-

    acid, hydrophobic YmgF protein interacts with

    some of the same divisome proteins as does

    Blr (94); the 65-amino-acid SafA (B1500), like

    MgrB, interacts with PhoQ (95); and the 110-

    amino-acid, single-transmembrane YajC pro-tein was reported to cocrystallize with AcrB

    (96).

    We assume that small proteins provide

    advantages not afforded by larger proteins. For

    example, small proteins may be able to assume

    functions directly after synthesis without

    the requirement for folding chaperones. A

    surprisingly large percentage of the described

    small proteins are localized at or in the mem-

    brane. Hydrophobic small proteins that anchor

    larger proteins to the membrane separate thelocalization function of the target protein from

    its activity, thereby providing another level of

    regulation. Small proteins can also fine-tune

    an activity absent the synthesis or degradation

    of an entire enzyme complex. This property

    may be particularly important for membrane

    complexes, given that the membrane may limit

    the number of ways in which the complexes

    can be regulated.

    Synthesis, Subcellular Localization,and Degradation

    The small sizes of small proteins raise interest-

    ing fundamental questions about the synthesis,

    subcellular localization, and degradation of this

    class of proteins. The first question is whether

    the small size of the ORF exerts any unusual

    demands on translation, particularly because

    many of the small proteins are hydrophobic.

    Do RNA secondary structures, alternative start

    codons, stop codons, and frameshift mutationshave the same effects on small proteins as on

    larger proteins? Are there any impediments to

    releasing small proteins that might not be much

    longer than the ribosome exit channel? The hy-

    drophobic sequences of several of thesmall pro-

    teins are not very different from the hydropho-

    bic sequences of small ORFs found in the 5

    leaders of antibiotic resistance operons. These

    5 leaderencoded peptides regulate the expres-

    sion of the downstream operons by interacting

    with the exit channel to lead to the formation of

    a stalled ribosome in the presence of antibiotics

    (reviewed in Reference 97).

    Similarly, how are small transmembrane

    proteins, which are approximately the size of asignal sequence, inserted into the membrane?

    Studies that examine the subcellular localiza-

    tion of small proteins are still in their infancy,

    but initial experiments indicate that small pro-

    teins display diversity in topology and mem-

    brane insertion pathways (98). For example,

    the tagging of single-transmembrane-domain

    small proteins in E. coliwith proteins that are

    active only in either the cytoplasm (GFP) or

    the periplasm (alkaline phosphatase) revealed

    that some small proteins have an Nin-Coutorientation and others an Nout-Cinorientation,

    whereas a few even have dual topology (98).

    Mutational analysis of fusions to one of the

    small proteins showed that positive residues

    adjacent to the transmembrane domain impact

    topology in a manner similar to that found

    for polytopic membrane proteins. However,

    a potential problem with these experiments

    is that the tags employed may impact the

    orientation of small proteins more than they do

    for larger proteins. TheE. coli study reportedan Nout-Cinorientation for CydX (98), but aB.

    abortusstudy using a different tag (-lactamase)

    reported an Nin-Coutorientation (57). Because

    several relatively small bacteriophage proteins

    require the YidC protein for membrane inser-

    tion (reviewed in Reference 99), it was assumed

    that insertion of theE. colismall proteins might

    also be dependent on YidC. However, the

    depletion of the essential YidC protein as well

    as of the essential SecE component of the Sec

    translocase again revealed diversity; some smallproteins were affected by the depletion of both

    proteins, other small proteins were affected by

    the depletion of only one or the other protein,

    and some small proteins were not affected by

    eitherdepletion (98). Finally, higher-resolution

    imaging is needed to determine whether small

    proteins are generally inserted into membranes

    www.annualreviews.org Bacterial Small Proteins 767

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    16/28

    uniformly or in punctate patterns that may

    indicate a preference for particular lipid

    microdomains or rafts. Overall, the initial

    studies reveal that much remains to be learned

    about small protein insertion into membranes.

    Given the hypothesis that small proteins

    most frequently act as regulators that modu-

    late processes or act under specific conditions,one must also assume that there are pathways

    to specifically degrade the small proteins, but

    this assumption has not been explored. In addi-

    tion, the possibility of posttranslational modifi-

    cationstosmallproteinsneedstobeconsidered.

    Evolution

    Other fundamental questions relate to the

    distribution and evolution of small proteins.

    The short length of the coding sequences;the fact that many of the small proteins are

    composed of hydrophobic amino acids such as

    leucine, isoleucine, and valine; and the uneven

    annotation of small protein genes in completed

    genome sequences provide formidable chal-

    lenges to identifying small protein orthologs.

    In one study, linkage to the adjacent SgrR

    proteincoding gene allowed for the recogni-

    tion of SgrT orthologs in other enteric bacteria

    (100), but conservation of synteny has not been

    established for many of the small protein genes.Taking into account the caveats associated

    with the identification of orthologs,Figure 2

    shows thephylogenetic distribution of thesmall

    proteins described in this study. Although a few

    of thesmall proteins, particularlythose encoded

    in operons with larger proteins, span more than

    one phylogenetic classsuch as the, , and

    proteobacteriamost small proteins are co

    served in only a limited number of organism

    As is the case for all proteins, the small pr

    tein distribution has implications for evolutio

    Did these small genes evolve independentl

    or are they fragments of genes that original

    encoded larger proteins, for example, larg

    proteins with which the small proteins interacConversely, are small protein genes buildin

    blocks for the evolution of genes encodin

    larger proteins? Again, due to the low info

    mation content of small proteins, establishin

    clear evolutionary relationships between sm

    proteins and the rest of the proteome may

    difficult. Upon examining translation, conse

    vation, and other features of small ORFs

    so-called nongenic sequences ofSaccharomy

    cerevisiae, Carvunis et al. (101) noted that le

    conserved ORFs in S. cerevisiae had highhydropathicity and a higher tendency to for

    transmembrane regions. These researche

    concluded that many of these small OR

    are protogenes that are a reservoir for t

    birth of new genes (101). Larger protei

    are less affected by thermal noise due to th

    energy of interactions between the residu

    in the polypeptide chain. As a result, sma

    proteins may be unordered on their own an

    may therefore depend on an external fact

    (another protein or a phospholipid bilayefor example) to obtain a stable conformatio

    Such a dependency may explain the skewe

    subcellular distribution of small protein

    either as components of larger complexes or

    membrane-associated proteins. Determinatio

    of whether these characteristics are gener

    evolutionary constraints will require furth

    Figure 2

    Phylogenetic distribution of small proteins. An unrooted prokaryotic phylogenetic tree based on 16S rRNA sequences is shown. At thcenter, selected major phylogenetic groupings are colored, and selected organisms are indicated. The phylogenetic distribution of smproteins discussed in this review is classified according to biological function and is highlighted in red on individual trees shown atlower magnification. Homologs were identified by a PSI-BLAST search of 2,262 completely sequenced genomes (as of February 201Supplemental Table 1(to access, follow theSupplemental Material linkfrom the Annual Reviews home page athttp://www.annualreviews.org) summarizes the presence or absence of particular small proteins. This analysis illustrates the disparate nature ofsmall protein annotation; for example, thepmrRgene is annotated in only 6 out of the 95 genomes in which homologs were found. Thalignments of annotated small proteins generated by MUSCLE are given in Supplemental Table 2and are available in FASTA formatftp://ftp.ncbi.nih.gov/pub/wolf/_suppl/small.Phylogenetic tree adapted from Nelson et al. (108).

    768 Storz Wolf RamamurthiSupplemental Materia

    http://www.annualreviews.org/http://www.annualreviews.org/ftp://ftp.ncbi.nih.gov/pub/wolf/_suppl/smallftp://ftp.ncbi.nih.gov/pub/wolf/_suppl/smallhttp://www.annualreviews.org/doi/suppl/10.1146/annurev-biochem-070611-102400http://www.annualreviews.org/doi/suppl/10.1146/annurev-biochem-070611-102400ftp://ftp.ncbi.nih.gov/pub/wolf/_suppl/smallhttp://www.annualreviews.org/http://www.annualreviews.org/
  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    17/28

    Blr

    SidA

    MciZ

    CydX

    PmrR

    Transport

    MntS

    FbpC

    Chaperones

    Membraneenzymer

    egulators

    SpoVM

    CmpA

    Sporulation

    Celldivision

    M

    grB

    MgtR

    Sda

    Signaltransduction

    SgrT

    KdpF

    AcrZ

    FbpB

    LowGC,gramp

    ositive

    H

    g

    G

    g

    a

    m

    p

    t

    v

    H

    ighGC,

    g

    ramp

    ositive

    Bacillussubtilis

    Staphylococcusaureus

    Bacillusanthracis

    Clostridiumdiffi

    cile

    Proteobacteria

    Escherich

    iacoli

    Shigellaexneri

    Salmonellatyphimurium

    Klebsiellapneumoniae

    P

    o

    e

    e

    a

    P

    roteobacteria

    Bordetellapertussis

    P

    o

    e

    e

    a

    Proteobacteria

    Caulobacter

    crescentus

    P

    o

    e

    e

    a

    Proteobacteri

    a

    S

    r

    o

    h

    e

    Spirochetes

    Yersiniapestis

    Sinorhizo

    biumm

    eliloti

    Legionellapneumophila

    Entero

    coccus

    faecalis

    Burkholderiapseudomallei

    Campylobacterjejuni

    Deinococcus

    radiodurans

    Desulfovib

    rio

    vulgaris

    P

    o

    e

    e

    a

    Proteobacteria

    Mycobacterium

    tuberculosis

    y

    o

    e

    u

    m

    tu

    c

    u

    o

    s

    Mycobacterium

    tuberculosis

    Pseudomonasaeruginosa

    Ralstoniasolanacearum

    Rhodobacters

    phaeroides

    Rhodo

    pseudomonas

    palustris

    Streptomyces

    coelicolor

    Synechococcussp.

    Cyanobacteria

    Xanthomonascitri

    SynechocystisPCC6803

    Photorhabdus

    luminescens

    Haemophilusducreyi

    Vibriocholerae

    www.annualreviews.org Bacterial Small Proteins 769

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    18/28

    structural studies and larger genomic data

    sets.

    BROADER VIEW

    This review focuses on the small proteins en-

    coded by specific genes on bacterial chromo-

    somes, but what has been learned will likely in-

    form our understanding of the many potentialsmall proteins encoded by bacteriophage and

    viruses as well as by archaeal and eukaryotic

    genomes. In addition, the small size of the gene

    products makes them attractive candidates for

    biotechnological applications.

    Small Proteins Encodedby Bacteriophages

    We suggest that bacteriophages may be an ideal

    system in which to study the presence and func-tions of small proteins, given the much smaller

    viral genome sizes and the enormous evolu-

    tionary timescale that allowed the streamlining

    of phage proteins to maximize efficiency. Re-

    cently, a ribosome profiling study of bacterio-

    phage lambda revealed that 55 potential ORFs

    of at least 5 codons in length, all previously

    unannotated, show evidence of translation (6).

    Although many of these ORFs had ribosome

    binding at levels comparable to, or higher than,

    those of well-studied lambda genes, they prob-ably remained undiscovered because they did

    not display obvious deletion phenotypes under

    routine laboratory conditions.

    One well-characterized smallprotein of bac-

    teriophage lambda is the 22-amino-acid-long

    product of the cIIIgene, which harbors some

    sequence similarity to the SpoVM small pro-

    tein ofB. subtilisdiscussed above. Like SpoVM,

    cIII is a membrane-associated, amphipathic

    -helix and is both a substrate and an in-

    hibitor of the membrane-bound protease FtsH(22). Kobiler et al. (22) demonstrated that cIII

    oligomerizes and competitively inhibits FtsH

    from degrading lambda cII, the transcription

    factor that mediates lambda lysogeny. We spec-

    ulate that many more small bacteriophage pro-

    teins that interact with andmodulatelargerbac-

    terial proteins will be found.

    Small Proteins Synthesizedby Eukaryotes

    As is the case for efforts to study small pr

    teins in bacteria, efforts directed at identif

    ing small ORFs in eukaryotes are beginnin

    to reveal potentially thousands of small pr

    teins in these more complex organisms (r

    viewed in Reference 102). Indeed, there isgrowing realization that previously discover

    regulatory, or noncoding, RNAs harbor sm

    ORFs that are translated. For example, Hana

    et al. (103) employed a bioinformatic approa

    to identify small ORFs with a high probabili

    of encoding a protein by exploiting differenc

    in codon usage between coding and nonco

    ing DNA sequences in theArabidopsis thalia

    genome. This analysis led to 7,901 candida

    small ORFs, all previously unannotated an

    located in intergenic regions that the authodeemed highly likely to produce a small pr

    tein. An array analysis led to the observatio

    that more than 2,000 of these candidates we

    highly expressed at the mRNA level during

    least one of many experimental conditions.

    separate experiments, overexpression of 49

    these genes that were highly conserved acro

    other plant species resulted in obvious, d

    cernible phenotypes inA. thaliana. However

    systematic analysis is needed to confirm that t

    observed overexpression phenotypes are specically related to the overproduction of an en

    coded small protein, rather than simply the p

    tative mRNA.

    More detailed studies of small proteins we

    recently carried out in Drosophila, finding th

    some transcripts previously annotated as no

    coding RNAs actually encode small protei

    (104106). For example, an RNA denoted p

    (or tal) encoded four small proteins rangin

    in size from 11 to 32 amino acids (104, 105

    Deletion ofpriresulted in defects in epidermdifferentiation, and a reduction inprilevels l

    to defects in leg morphogenesis. Flies lacking

    gene termedsvb, which encodes a transcriptio

    factor, showed similar defects. Althoug

    deletion ofpridid not reduce svb expressio

    the deletion prevented a normal switch in th

    770 Storz Wolf Ramamurthi

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    19/28

    localization pattern of Svb protein in the nu-

    cleus. This switch in Svb localization correlates

    with a switch in the activity of Svb from a

    transcriptional repressor to a transcriptional

    activator (107). Induced expression ofpriwas

    sufficient to elicit this switch in Svb localization.

    In addition, the expression ofpriresulted in a

    shift in the electrophoretic mobility of Svb thatcorresponded to the removal of its repressor

    domain, but not its activator domain. Taken

    together, Kondo et al. (107) concluded that

    the Pri small proteins mediate the processing

    of the Svb transcription factor to provide

    temporal regulation of Svb activity. How the

    Pri small proteins mediate Svb processing is an

    interesting question because the small proteins

    are unlikely to harbor proteolytic activity. If

    the precedents from bacterial studies apply,

    the Pri proteins may promote interactionswith factors that are responsible for the

    processing.

    Potential for Exploiting Small Proteins

    Given the low molecular weight of small pro-

    teins and their propensity to modulate the ac-

    tivities of proteins with which they interact, one

    intriguing possibility is that small proteins may

    be used as externally added agents, analogous

    to the way that small molecules are currentlyemployed, either as probes or to affect specific

    processes. SpoVM, when added in culture me-

    dia, inhibited the FtsH protease and prevented

    biofilm formation byB. subtilis(25), providing

    preliminary evidence that this approach could

    be fruitful. Thus, for example, one could ex-

    ploit the interactions of small proteins with ef-

    flux pumps and components of the cell division

    machinery as genetic tools to probe the active

    sites of these targets; such an approach could

    aid in designing antimicrobial agents that block

    either drug efflux or cell division.

    CONCLUSIONS

    We suggest that the study of small proteins

    is a field that is poised to explode. The

    characterization of a subset of small proteins

    in bacteria has shown that they are present,

    particularly in membranes. Furthermore, their

    synthesis is regulated, and they impact diverse

    processes ranging from spore formation and

    cell division to the movement of molecules

    across the membrane, enzymatic activities,and signal transduction. Thus, small proteins

    should not be overlooked in any organism.

    Most small proteins probably act mechanically

    to block protein domains or to block or

    facilitate interactions between domains or

    membranes, and there are likely hundreds

    of these proteins. In addition to questions

    regarding the identification and characteriza-

    tion of small proteins, numerous fundamental

    questions about the nature of small pro-

    tein interactions with other molecules, smallprotein synthesis and degradation, and small

    protein evolution remain to be answered not

    only in bacteria but also in viruses and eukary-

    otes. We look forward to the future exploration

    and exploitation of the ignored proteome.

    SUMMARY POINTS

    1. Small proteins in bacteria, here defined as polypeptides of 50 or fewer amino acids, have

    been ignored because they are difficult to detect by using routine biochemical methodsand because the corresponding genes historically have not been annotated in sequenced

    genomes and have been overlooked in classical genetic screens.

    2. Several small proteins discovered in bacteria are beginning to be characterized. These

    small proteins participate in morphogenesis, cell division, transport of ions and macro-

    molecules, regulation of enzyme activity, and signal transduction.

    www.annualreviews.org Bacterial Small Proteins 771

  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    20/28

    3. Most bacterial small proteins characterized thus far are membrane associated, and none

    have been reported to harbor enzymatic activity. Rather, the small proteins either mod-

    ulate the enzymatic activity of larger proteins, positively or negatively, via a direct inter-

    action or play a structural role in stabilizing large complexes or anchoring larger soluble

    proteins to the membrane.

    4. Multipronged strategies, including enhanced bioinformatics, twists on classical biochem-

    ical techniques like ribosome profiling, and other directed global screens, should lead tothe discovery of previously unappreciated smallproteins in bacterial, eukaryotic, archaeal,

    and viral genomes.

    FUTURE ISSUES

    1. What is the full complement of small proteins for a single organism?

    2. What other common features and mechanisms of action will be discerned for small

    proteins?

    3. How are small proteins folded, and how do they interact with other molecules?

    4. What is the smallest size for a functional small protein?

    5. How do small proteins evolve?

    DISCLOSURE STATEMENT

    The authors are not aware of any affiliations, memberships, funding, or financial holdings th

    might be perceived as affecting the objectivity of this review.

    ACKNOWLEDGMENTS

    We thank A. Blanc-Potard, S. Gottesman, M. Goulian, E. Groisman, M. Laub, K. Makarova, C

    Vanderpool, and current and former members of the Storz lab for helpful discussions and com

    ments on the review. This work was funded by the Intramural Research Program of the NIH; th

    Eunice Kennedy Shriver National Institute of Child Health and Human Development (G.S

    the National Center for Biotechnology Information, National Library of Medicine (Y.I.W.); an

    the National Cancer Institutes Center for Cancer Research (K.S.R.).

    LITERATURE CITED

    1. Harrison PM, Kumar A, Lang N, Snyder M, Gerstein M. 2002. A question of size: the eukaryotproteome and the problems in defining it. Nucleic Acids Res. 30:108390

    2. Basrai MA, Hieter P, Boeke JD. 1997. Small open reading frames: beautiful needles in the haystac

    Genome Res.7:76871

    3. Natl. Cent. Biotechnol. 2011. What kind of data can be submitted to GenBank? InGenBank Submissio

    Handbook, Natl. Cent. Biotechnol., Bethesda, Md. http://www.ncbi.nlm.nih.gov/books/NBK5370

    4. Yutin N, Puigbo P, Koonin EV, Wolf YI. 2012. Phylogenomics of prokaryotic ribosomal proteins.PL

    ONE7:e36972

    772 Storz Wolf Ramamurthi

    http://www.ncbi.nlm.nih.gov/books/NBK53707/http://www.ncbi.nlm.nih.gov/books/NBK53707/
  • 7/23/2019 2014 - Small Proteins Can No Longer Be Ignored

    21/28

    5. Fozo EM, Hemm MR, Storz G. 2008. Small toxic proteins and the antisense RNAs that repress them.

    Microbiol. Mol. Biol. Rev.72:57989

    6. Liu X, Jiang H, Gu Z, Roberts JW. 2013. High-resolution view of bacteriophage gene expression by

    ribosome profiling.Proc. Natl. Acad. Sci. USA 110:1192833

    7. Ansaldi M, Th eraulaz L, Mejean V. 2004. TorI, a response regulator inhibitor of phage origin in

    Escherichia coli.Proc. Natl. Acad. Sci. USA 101:942328

    8. Stragier P, Losick R. 1996. Molecular genetics of sporulation in Bacillus subtilis.Annu. Rev. Genet. 30:297

    341

    9. Setlow P. 2007. I will survive: DNA protection in bacterial spores.Trends Microbiol.15:1728010. Sandman K, Losick R, Youngman P. 1987. Genetic analysis ofBacillus subtilis spomutations generated

    by Tn917-mediated insertional mutagenesis.Genetics117:60317

    11. Levin PA, Fan N, Ricca E, Driks A, Losick R, Cutting S. 1993. An unusually small gene required for

    sporulation byBacillus subtilis.Mol. Microbiol.9:76171

    12. Galperin MY, Mekhedov SL, Puigbo P, Smirnov S, Wolf YI, Rigden DJ. 2012. Genomic determinants

    of sporulation in Bacilliand Clostridia: towards the minimal set of sporulation-specific genes.Environ.

    Microbiol.14:287090

    13. Hemm MR, Paul BJ, Schneider TD, Storz G, Rudd KE. 2008. Small membrane proteins found by

    comparative genomics and ribosome binding site models. Mol. Microbiol.70:1487501

    14. van Ooij C, Losick R. 2003. Subcellular localization of a small sporulation protein in Bacillus subtilis.

    J. Bacteriol. 185:139198

    15. Prajapati RS, Ogura T, Cutting SM. 2000. Structural and functional studies on an FtsH inhibitor fromBacillus subtilis.Biochim. Biophys. Acta1475:35359

    16. Ramamurthi KS, Clapham KR, Losick R. 2006. Peptide anchoring spore coat assembly to the outer

    forespore membrane inBacillus subtilis.Mol. Microbiol