structure and function of glycosylated tandem repeats from ... · tr sequences of the als family...

10
EUKARYOTIC CELL, Mar. 2010, p. 405–414 Vol. 9, No. 3 1535-9778/10/$12.00 doi:10.1128/EC.00235-09 Copyright © 2010, American Society for Microbiology. All Rights Reserved. Structure and Function of Glycosylated Tandem Repeats from Candida albicans Als Adhesins Aaron T. Frank, 1 ‡ Caleen B. Ramsook, 2 Henry N. Otoo, 2 Cho Tan, 2 Gregory Soybelman, 2 Jason M. Rauceo, 3 Nand K. Gaur, 4 Stephen A. Klotz, 4 and Peter N. Lipke 2 * Department of Biophysics, University of Michigan, Ann Arbor, Michigan 48109 1 ; Department of Biology, Brooklyn College of CUNY, Brooklyn, New York 11210 2 ; Department of Biology, John Jay College of Criminal Justice of CUNY, New York, New York 10019 3 ; and University of Arizona and Southern Arizona VA Health Care System, Tucson, Arizona 85724 4 Received 16 August 2009/Accepted 2 October 2009 Tandem repeat (TR) regions are common in yeast adhesins, but their structures are unknown, and their activities are poorly understood. TR regions in Candida albicans Als proteins are conserved glycosylated 36-residue sequences with cell-cell aggregation activity (J. M. Rauceo, R. De Armond, H. Otoo, P. C. Kahn, S. A. Klotz, N. K. Gaur, and P. N. Lipke, Eukaryot. Cell 5:1664–1673, 2006). Ab initio modeling with either Rosetta or LINUS generated consistent structures of three-stranded antiparallel -sheet domains, whereas randomly shuffled sequences with the same composition generated various structures with consistently higher energies. O- and N-glycosylation patterns showed that each TR domain had exposed hydrophobic surfaces surrounded by glycosylation sites. These structures are consistent with domain dimensions and stability measurements by atomic force microscopy (D. Alsteen, V. Dupres, S. A. Klotz, N. K. Gaur, P. N. Lipke, and Y. F. Dufrene, ACS Nano 3:1677–1682, 2009) and with circular dichroism determination of secondary structure and thermal stability. Functional assays showed that the hydrophobic surfaces of TR domains supported binding to polystyrene surfaces and other TR domains, leading to nonsaturable homophilic binding. The domain struc- tures are like “classic” subunit interaction surfaces and can explain previously observed patterns of promis- cuous interactions between TR domains in any Als proteins or between TR domains and surfaces of other proteins. Together, the modeling techniques and the supporting data lead to an approach that relates structure and function in many kinds of repeat domains in fungal adhesins. Yeast adhesins are a diverse set of cell adhesion proteins that mediate adhesion to host cells, environmental substrates, other fungi, and coinfecting bacteria (6, 8, 20, 21, 23, 29). The adhesins share common features, including compact N-termi- nal domains similar to Ig or lectin domains, Thr-rich mid- pieces, often in tandem repeats, and long highly glycosylated Ser/Thr-rich C-terminal regions that extend the functional do- mains out from the cell surface. No structures for the Thr-rich midpieces are known, but they can mediate aggregation of fungal cells (33, 35, 47). The prevalence and conservation of such repeats argue that they are functionally important, de- spite limited data on their structure and function. In Candida albicans, the Als adhesins are homologous pro- teins, products of 8 loci that encode numerous alleles of cell surface adhesins (16). In each mature Als protein, there are, from the N terminus, three tandem Ig-like domains, a -sheet- rich conserved 127-residue amyloid-forming T region, a vari- able number of 36-residue tandem repeats (TRs), and a highly glycosylated stalk region that extends the N-terminal domains away from the cell surface (Fig. 1) (16, 33, 41). The C termini of these and other wall-associated adhesins are covalently cross-linked into the cell wall through transglycosylation of a modified glycosylphosphatidylinositol (GPI) anchor (18, 25). This modular design, including tandem repeats, is typical of fungal adhesins (8). The Als protein Ig-like region, T region, and TR region all have protein-protein interaction activities (26, 33, 35). The Ig-like regions can interact with diverse mammalian proteins, presumably in a way analogous to antibody-antigen binding, as has been shown in the homologous protein -agglutinin from Saccharomyces cerevisiae (8, 24, 26, 35). The T regions interact through formation of amyloid-like structures both in vivo and in vitro (33, 34a, 36). An insight into the function of the tandem repeats followed from observations that Als proteins initiate and maintain cell-to-cell aggregations, either spontaneously (“autoaggregation”) or following adhesion to a bead-bound defined ligand (10, 11, 36). Aggregation is more extensive for Als proteins with more tandem repeats (26, 35). This result suggested that the tandem repeats are uniquely structured to facilitate or mediate the aggregative function. Circular dichro- ism spectroscopy of the TR region of Als5p shows a -sheet- rich structure in the soluble protein (35). In support of their direct involvement in aggregation, the repeat region of the C. albicans adhesin Als5p mediates cell- cell aggregation in the absence of the Ig-like and T domains (35). Moreover, the repeats can also potentiate binding of Als5p to fibronectin (35). Thus, the TR domains mediate cel- lular aggregation and increased binding to fibronectin. In ad- dition, TR domains and their amino acid sequences are highly conserved across several Candida species (3). These properties need to be explained by their three-dimensional structure. * Corresponding author. Mailing address: Department of Biology, Brooklyn College, 2900 Bedford Avenue, Brooklyn, NY 11210. Phone: (718) 951-5000, ext. 1949. Fax: (718) 951-4659. E-mail: plipke @brooklyn.cuny.edu. † Supplemental material for this article may be found at http://ec .asm.org/. ‡ Present address: Department of Chemistry, University of Califor- nia, Irvine, Irvine, CA 92697. Published ahead of print on 9 October 2009. 405 on May 28, 2020 by guest http://ec.asm.org/ Downloaded from

Upload: others

Post on 26-May-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

EUKARYOTIC CELL, Mar. 2010, p. 405–414 Vol. 9, No. 31535-9778/10/$12.00 doi:10.1128/EC.00235-09Copyright © 2010, American Society for Microbiology. All Rights Reserved.

Structure and Function of Glycosylated Tandem Repeats fromCandida albicans Als Adhesins�†

Aaron T. Frank,1‡ Caleen B. Ramsook,2 Henry N. Otoo,2 Cho Tan,2 Gregory Soybelman,2Jason M. Rauceo,3 Nand K. Gaur,4 Stephen A. Klotz,4 and Peter N. Lipke2*

Department of Biophysics, University of Michigan, Ann Arbor, Michigan 481091; Department of Biology, Brooklyn College of CUNY,Brooklyn, New York 112102; Department of Biology, John Jay College of Criminal Justice of CUNY, New York, New York 100193;

and University of Arizona and Southern Arizona VA Health Care System, Tucson, Arizona 857244

Received 16 August 2009/Accepted 2 October 2009

Tandem repeat (TR) regions are common in yeast adhesins, but their structures are unknown, and theiractivities are poorly understood. TR regions in Candida albicans Als proteins are conserved glycosylated36-residue sequences with cell-cell aggregation activity (J. M. Rauceo, R. De Armond, H. Otoo, P. C. Kahn, S. A.Klotz, N. K. Gaur, and P. N. Lipke, Eukaryot. Cell 5:1664–1673, 2006). Ab initio modeling with either Rosettaor LINUS generated consistent structures of three-stranded antiparallel �-sheet domains, whereas randomlyshuffled sequences with the same composition generated various structures with consistently higher energies.O- and N-glycosylation patterns showed that each TR domain had exposed hydrophobic surfaces surroundedby glycosylation sites. These structures are consistent with domain dimensions and stability measurements byatomic force microscopy (D. Alsteen, V. Dupres, S. A. Klotz, N. K. Gaur, P. N. Lipke, and Y. F. Dufrene, ACSNano 3:1677–1682, 2009) and with circular dichroism determination of secondary structure and thermalstability. Functional assays showed that the hydrophobic surfaces of TR domains supported binding topolystyrene surfaces and other TR domains, leading to nonsaturable homophilic binding. The domain struc-tures are like “classic” subunit interaction surfaces and can explain previously observed patterns of promis-cuous interactions between TR domains in any Als proteins or between TR domains and surfaces of otherproteins. Together, the modeling techniques and the supporting data lead to an approach that relates structureand function in many kinds of repeat domains in fungal adhesins.

Yeast adhesins are a diverse set of cell adhesion proteinsthat mediate adhesion to host cells, environmental substrates,other fungi, and coinfecting bacteria (6, 8, 20, 21, 23, 29). Theadhesins share common features, including compact N-termi-nal domains similar to Ig or lectin domains, Thr-rich mid-pieces, often in tandem repeats, and long highly glycosylatedSer/Thr-rich C-terminal regions that extend the functional do-mains out from the cell surface. No structures for the Thr-richmidpieces are known, but they can mediate aggregation offungal cells (33, 35, 47). The prevalence and conservation ofsuch repeats argue that they are functionally important, de-spite limited data on their structure and function.

In Candida albicans, the Als adhesins are homologous pro-teins, products of 8 loci that encode numerous alleles of cellsurface adhesins (16). In each mature Als protein, there are,from the N terminus, three tandem Ig-like domains, a �-sheet-rich conserved 127-residue amyloid-forming T region, a vari-able number of 36-residue tandem repeats (TRs), and a highlyglycosylated stalk region that extends the N-terminal domainsaway from the cell surface (Fig. 1) (16, 33, 41). The C terminiof these and other wall-associated adhesins are covalently

cross-linked into the cell wall through transglycosylation of amodified glycosylphosphatidylinositol (GPI) anchor (18, 25).This modular design, including tandem repeats, is typical offungal adhesins (8).

The Als protein Ig-like region, T region, and TR region allhave protein-protein interaction activities (26, 33, 35). TheIg-like regions can interact with diverse mammalian proteins,presumably in a way analogous to antibody-antigen binding, ashas been shown in the homologous protein �-agglutinin fromSaccharomyces cerevisiae (8, 24, 26, 35). The T regions interactthrough formation of amyloid-like structures both in vivo andin vitro (33, 34a, 36). An insight into the function of the tandemrepeats followed from observations that Als proteins initiateand maintain cell-to-cell aggregations, either spontaneously(“autoaggregation”) or following adhesion to a bead-bounddefined ligand (10, 11, 36). Aggregation is more extensive forAls proteins with more tandem repeats (26, 35). This resultsuggested that the tandem repeats are uniquely structured tofacilitate or mediate the aggregative function. Circular dichro-ism spectroscopy of the TR region of Als5p shows a �-sheet-rich structure in the soluble protein (35).

In support of their direct involvement in aggregation, therepeat region of the C. albicans adhesin Als5p mediates cell-cell aggregation in the absence of the Ig-like and T domains(35). Moreover, the repeats can also potentiate binding ofAls5p to fibronectin (35). Thus, the TR domains mediate cel-lular aggregation and increased binding to fibronectin. In ad-dition, TR domains and their amino acid sequences are highlyconserved across several Candida species (3). These propertiesneed to be explained by their three-dimensional structure.

* Corresponding author. Mailing address: Department of Biology,Brooklyn College, 2900 Bedford Avenue, Brooklyn, NY 11210. Phone:(718) 951-5000, ext. 1949. Fax: (718) 951-4659. E-mail: [email protected].

† Supplemental material for this article may be found at http://ec.asm.org/.

‡ Present address: Department of Chemistry, University of Califor-nia, Irvine, Irvine, CA 92697.

� Published ahead of print on 9 October 2009.

405

on May 28, 2020 by guest

http://ec.asm.org/

Dow

nloaded from

Because there are no homologous structures known, wemodeled by two independent ab initio methods. Rosetta as-sembles structures by combining short peptide structures ex-tracted from the protein structural database PDB (38), thencombines structures in a Monte Carlo approach, and assessesenergetics of assembled structures. Rosetta has recently beenshown to generate accurate models for protein-sized domains(40). We also predicted structures with LINUS, which gener-ates randomized structures and rapidly estimates energetics tochoose low-energy models (45). The models were supported bystructural analyses with atomic force microscopy and circulardichroism spectroscopy. Functional assays showed that the TRdomains can mediate binding activities predicted from thecalculated structures.

MATERIALS AND METHODS

Nomenclature. In this paper, we refer to each individual repeat by the ALSlocus that encodes the repeat followed by “R” and the number of the repeat ascounted from the N terminus. The repeats chosen for modeling from Als5pderive from sequence NCBI accession number O13368.1; thus, Als5p-R3 is thethird repeat in this Als5p sequence. Individual repeats from other Als proteinswere chosen as those repeats with sequences most divergent from the Als5psequences (Fig. 2). The NCBI accession numbers of the protein sequences are asfollows: Als1p, P46590; Als3p, XP712666.1; Als6p, AAD42033.1; Als7p,AAF98068.1; Als9p, AAP34370.1. The prefix “L” in the chosen Als3-LR4 des-ignates that the longer allele of Als3 was the source of the sequence.

Tertiary structure modeling. The conformational data-based protein-foldingprogram Rosetta was used to generate structural models of eleven representativeTR sequences of the Als family (Fig. 2 and Table 1). The Rosetta algorithm isbased on a fragment assembly approach. In this approach, a library of 3- and 9-residue fragments is created based on segments of proteins of known structure.This library represents the local conformational space accessible to these shortresidue fragments. To create a model, the fragments are combined using aMonte Carlo search protocol. One “move” in the modeling method consists ofsubstituting a local fragment with one from the fragment library, a process knownas “fragment insertion.” The resulting structure is evaluated using Rosetta’spotential energy function. The structure is accepted or rejected based on theMetropolis criterion (38). In our studies, about 88% of moves were energeticallyrejected.

The general protocol described below was followed for modeling each tandemrepeat. Rosetta was used to create 1,000 structures for each sequence shown inFig. 2. The resulting structures were clustered by structural similarity based onthe C� root mean square distance between the models. The lowest-energy modelfrom the largest cluster was selected, and the selected models were then energyminimized to remove bad contacts. Additionally, for each modeled sequence,two randomly permuted sequences were generated (except for Als5-R4 where 4random sequences were generated), and models of these randomized sequenceswere generated using the same protocol described above.

For comparison, modeling was also carried out using LINUS (local indepen-dently nucleated units of structure) which was developed by Srinivasan et al. (45).LINUS generates random perturbations in successive three-residue segments ofan extended peptide and then scores the energetics of the resulting structure.The LINUS scoring function includes only terms for hard sphere repulsion,contact energy, hydrogen bonds (short range and long range), and torsion en-

ergy. Nevertheless, LINUS gives reasonable results concerning the general to-pology and secondary structure of proteins (45). A major strength of LINUS isits truly ab initio approach; no information other than the sequence is used tobuild models. LINUS structures were produced and converged in reasonablecomputation times because there were only 36 residues per repeat.

Visualization and graphics used CHIMERA (13).Glycosylation of structural models. The Glycam Biomolecule Builder (http:

//glycam.ccrc.uga.edu) (19) was used to add the disaccharide �-D-mannose-(1-2)-�-D-mannose-1� to solvent-exposed Ser and Thr hydroxyls, as predicted byRosetta, in the top-scoring model of each repeat. This disaccharide represents an“average” C. albicans or S. cerevisiae O-glycoside and conforms to the expectedaverage stoichiometry of glycosylation of TRs synthesized by S. cerevisiae (12, 15,35). The same sugars were similarly added to the exposed hydroxyl groups of the

FIG. 1. Schematic diagram of the sequence of Als5p. The regionsare named above, and the number of amino acid residues in eachregion is shown below. The modeled sequences are in the TR region.

FIG. 2. Tandem repeat sequences. Alignments (top) are colorcoded by degree of conservation (the darker the color of the column,the more conservation of the residue). The residues with high �-strandpotential are colored blue, and residues with high turn potential arecolored red. The JPRED secondary structure predictions for eachsequence are shown below the alignment (7). The nomenclature isdescribed in Materials and Methods.

TABLE 1. RMSD values for C� relative to the mean structurefor foldsa

TR sequenceRMSD value for:

All C� �-Sheet Turn 1 Turn 2

Als5-R1 2.6 1.8 3.3 3.1Als5-R2 2.0 1.6 2.5 1.7Als5-R3 1.5 1.2 1.9 2.1Als5-R4 1.7 1.5 1.6 2.0Als5-R5 2.3 1.8 4.5 2.3Als5-R6 1.5 1.2 2.3 1.7Als1-R5 2.5 2.5 2.6 2.5Als3-LR4 1.7 1.7 2.4 1.5Als6-R3 1.8 1.4 3.2 2.0Als7-R2 1.6 1.4 2.4 1.8

Als5p mean 1.9 1.5 2.7 2.2

Overall mean 1.9 1.6 2.7 2.1

a The RMSD values (in Å) were calculated with the RMSD Calculator inVMD (17).

406 FRANK ET AL. EUKARYOT. CELL

on May 28, 2020 by guest

http://ec.asm.org/

Dow

nloaded from

top-scoring structure for the 72-residue, two-domain structure Als5-R1R2(Als5p repeat sequences 1 and 2). N-glycosylation sites were derivatized withGlcNAc2Man9 yeast core glycan (15, 19). The folded and glycosylated domainmodels were subjected to brief Molecular Dynamics simulations to minimizeenergy.

The root mean square differences (RMSDs) of the C� positions was estimatedafter alignment by a best-fit procedure. Als5R-1 was used as the referencestructure for the alignment. The RMSDs for the aligned structure were thencalculated relative to the average structure. For our calculations, the �-strandregions were defined as residues 3 to 11, 17 to 23, and 28 to 34. Turn 1 consistedof residues 11 to 15, and turn 2 included residues 23 to 27.

Circular dichroism. A 36-mer peptide with the sequence of Als5-R3 (HNPTVTTTEFWSESYATTETITNGPEGDTSVIVREP) was synthesized by the Ari-zona State University Peptide Facility and purified to �90% purity. Far UVspectra were obtained using AVIV and Applied Photophysics Chirascan circulardichroism (CD) spectrophotometers. The peptide was suspended at 1 mg/ml in10 mM phosphate buffer (pH 5.6), and 2,2,2-trifluoroethanol was added to 0, 10,20, 30, and 40% (vol/vol) concentrations. For each condition, three spectra wereobtained at 25°C and averaged, and the baseline was subtracted.

Secondary structural percentages were estimated from the raw data withCDSSTR, CONTINLL, and SELCON3 programs (43, 44, 46). All three pro-grams gave similar secondary structure predictions; CDSSTR results are re-ported, because they gave the most consistent results, regardless of changes inwavelength range or basis set.

Soluble truncated versions of Als5p, Als5p1–443 (Als5p protein with residues 1to 443), and Als5p1–664, were expressed and purified as previously described (33),and spectra were acquired within a week of purification. Denaturation andrenaturation CD studies were carried out by raising (for denaturation) or low-ering (for renaturation) the temperature in 10°C steps at 1°C min�1 and thenequilibrating for 10 min after each step. After each step, 10 spectra were ob-tained, averaged, and then smoothed (43).

Binding assays. Enzyme-linked immunosorbent assays (ELISAs) for Als5pbinding to substrates were carried out as previously described, using horseradishperoxidase-labeled anti-V5 at 1:500 dilution as the sole antibody (35). Theantibody reacted equivalently with both forms of Als5p (35; data not shown).Peroxidase activity was assayed with Quanta Blu fluorescent substrate (PierceChemical). Soluble forms of Als5p were purified as previously described andwere used within 2 weeks of isolation to minimize amyloid formation (33). Allassays were performed in triplicate, and standard deviations are shown as errorbars.

RESULTS

Predicted secondary structure of Als repeats. The individualtandem repeats (TRs) have highly conserved sequences andlow rates of nonsynonymous substitutions, no insertions, andalmost no deletions (Fig. 2 and data not shown) (33). Therepeats vary in number from 2 to 36 copies in the allelessequenced so far (16, 31, 49, 51). TR sequences include a highfrequency of aliphatic �-branched amino acids, with 9 to 11Thr residues and 6 Val or Ile residues in clusters within eachrepeat. All of these amino acid have high �-strand potential(blue in Fig. 2). These �-branched clusters are interspersedwith clusters of residues with high turn or helical potential(red in Fig. 2). In the multiple alignments like Fig. 2,�-branched residues Ile, Val, and Thr can substitute for oneanother (blue regions), and the residues with turn-inducing orhigh helical regions also substitute only for each other (red andwhite regions).

The sequence conservation leads to uniform consensus pre-dictions of �-sheet-rich structures (Fig. 2) (4, 7, 9). The se-quence-based predictions were highly consistent in predictingstructures composed of three �-strands in each of the Als5prepeats (7). The first and second strands sometimes varied intheir predicted length by one or two residues, but the thirdstrand predictions were invariant (Fig. 2, bottom). NeitherJPRED nor other secondary structure prediction methods

showed any helical regions (4, 7, 9). Therefore, we predict thatthe overall fold of the repeats is conserved.

Circular dichroism spectroscopy. The TR region has a�-sheet-rich secondary structure in soluble versions of Als5p(35). To establish whether the strongly predicted secondarystructure of individual repeats was reflected in solution, wecarried out far UV CD spectroscopy on a synthetic 36-merpeptide (HNPTVTTTEFWSESYATTETITNGPEGTDSVIVREP) (Als5p-R3) dissolved at 1 mg/ml in buffer. The spectrashowed a strong negative peak at 198 nm and a less negativeshoulder at �220 nm and varied little between pH 5.5 and 7.0,the approximate physiological pH range for this surface-ex-posed protein (data not shown). Curve fitting with CDSSTRgave secondary structure predictions of about 34% �-sheetcontent, slightly less �-turn and unstructured content, and verylow �-helix (Fig. 3). These fractions are consistent with anall-�-domain structure (7, 14). Increasing trifluoroethanol(TFE), which generally stabilizes peptide structures but favorshelix formation (50), shifted the negative peak to longer wave-length and made the shoulder more negative. The secondarystructure analyses showed that TFE had little effect on second-ary structure at concentrations up to 20% (vol/vol). At higherTFE concentrations, �-helix and aperiodic structures in-creased. Thus, the secondary structure of a single syntheticrepeat reflected the secondary structure predictions in its all �structure under a variety of pH levels and at TFE concentra-tions up to 20%.

Three-dimensional models of Als5p repeats. Three indepen-dent analyses implied uniform structure for tandem repeats:sequence conservation, secondary structure predictions, andCD of the synthetic peptide and repeats in situ. To test whetherthis uniformity led to uniform tertiary structures, we used in-dependent modelers to predict structure of many Als tandemrepeat domain sequences. Rosetta modeling generated highlysimilar structures for all six repeats in Als5p (Fig. 4A). In eachcluster of modeled structures, the lowest-energy structures had�-�-� antiparallel topology, with highly similar tertiary struc-ture. The RMS distance differences for the C�s in the super-imposed models were 1.9 Å for all residues and 1.6 Å for

FIG. 3. Secondary structure content for 36-mer peptide Als5p-R3.Secondary structure content was calculated from far UV CD spectrawith four different basis sets in the program CDSSTR (43). The meanvalue � standard error of the mean (error bar) with different basis setsis shown for each TFE concentration tested. Symbols: F, �-sheet; �,�-helix; f, turn; �, aperiodic structure.

VOL. 9, 2010 Candida Als TANDEM REPEATS 407

on May 28, 2020 by guest

http://ec.asm.org/

Dow

nloaded from

residues in the �-strands (Table 1). RMSD values for theresidues in the core �-sheet were considerably smaller (Fig. 4).These results thus showed uniform secondary and tertiarystructure for the repeats.

Because Rosetta is based on libraries of known peptideconformations, we were concerned that it might predict knownstructures preferentially. We therefore repeated the modelingwith LINUS, which assesses energetics for randomly generatedstructures. Interestingly, LINUS also predicted similar antipa-rallel three-strand �-sheet topology for all six repeats of Als5p.The LINUS results therefore provided an independent confir-mation of the propensity of Als repeat sequences to formcompact three-stranded antiparallel �-sheet domains and in-creased our confidence in the Rosetta models.

It was possible that the uniformity of the predictions was duepurely to the unusual amino acid composition of the repeats.We tested whether this was so by modeling randomly per-muted sequences of each repeat in Rosetta. The predictedstructures were much different from those of the authenticsequence. There were all-helical structures as well as antipar-

allel �-sheet structures. Moreover, none of the models of per-muted sequences showed the three-stranded �-�-� topology(for examples, see Fig. 4C). The energy values of the beststructures from permuted sequences were substantially worsethan those from the authentic sequences (see Fig. S1 in thesupplemental material). Therefore, the consistency of themodeled structures resulted from both the amino acid compo-sition and the conserved sequences of the repeats.

Tandem repeats in other Als proteins. The tandem repeatsin other Als proteins have more diverse sequences than thosein Als5p (Fig. 2). Therefore, we also modeled examples of therepeats, in each case choosing the repeats with sequences mostdivergent from the Als5p TR consensus. Rosetta models forthese repeats also showed uniform antiparallel �-�-� strandtopology (Fig. 4B). The �-sheets in these domains were super-imposable on those of the Als5 TR domains (Fig. 4B). In fourof the five models, RMSD variations were indistinguishablefrom those among the Als5p repeat domains (Table 1). In eachcase, random permutations of the repeat sequence gave dif-ferent structures (data not shown), a result that reinforced thesequence-specific nature of the three-strand models.

The single exception to the uniform results was that thetop-scoring models for Als9-R2 showed two sets of structures.For this sequence, too, all of the models gave three antiparallel�-strands as the other models did. However, the secondarystructure of the domain differed in that the spatial position ofthe first strand varied in different trials. In about 70% of thetrials, the topology, tertiary structure, and RMSD variation ofthe lowest-energy structure were similar to those of the lowest-energy models of the other domains (red in Fig. 4B). However,in about one-third of the models of Als9-R2, the first strandwas modeled above the plane formed by second and thirdstrands. Therefore, the structure was less strongly consistent.

Glycosylation patterns in the tandem repeats. Gel electro-phoresis, lectin blotting, and stoichiometric analyses show thatthe TR domains are heavily O glycosylated when exogenouslyexpressed in S. cerevisiae, with about 1.5 mannose units per Seror Thr hydroxyl group (35). In the models, about two-thirds ofthe Ser and Thr hydroxyls were solvent accessible, as expected(see Table S1 in the supplemental material). Therefore, wewould expect a mean glycosylation of about 2 mannose resi-dues per exposed hydroxyl group. This mean ratio of 2:1 isconsistent with known O-glycosylation structures in C. albicansand S. cerevisiae. In C. albicans, the O-linked oligosaccharidesinclude �-mannose, �1,2-linked mannobiose, and mannotriose(12, 15).

Typically, residues Thr4, Thr6, Thr7, Thr8, Ser12, Ser14,Thr18, Ser30, and sometimes Thr16, Thr22 and Thr28 wereaccessible to solvent to a degree consistent with glycosylation(Fig. 5; see Table S1 in the supplemental material). Therefore,these residues were glycosylated with the disaccharideMan�1,2Man�1. The resulting glycosylated structures weretested for stability by Molecular Dynamic simulations on atime scale of a few nanoseconds. The structures were stable, asindicated by small RMSD fluctuation of �0.03 Å2. The glyco-sylated models showed that oligosaccharides surround exposedsurface hydrophobic surface areas (Fig. 5). This pattern wasconserved in all modeled repeats of Als5p (Fig. 5C) and otherAls proteins (Fig. 5D).

Some TR domains, such as those from Als1p and Als3p,

FIG. 4. Rosetta models of TR domains. (A) Aligned models forALS5 TRs. ALS5-R1 (black), ALS5-R2 (blue), ALS5-R3 (green),ALS5-R4 (brown), ALS5-R5 (purple), and ALS5-R6 (orange) areshown. (B) Aligned models for other ALS TRs. ALS1-R5 (gray),ALS3-LR4 (pink), ALS6-R3 (cyan), ALS7-R2 (yellow) and ALS9-R2(red) are shown. (C) Lowest-energy Rosetta models for four randompermutations of the Als5-R4 amino acid sequence. The relative ener-gies and RMSD values are shown in Fig. S1 in the supplementalmaterial.

408 FRANK ET AL. EUKARYOT. CELL

on May 28, 2020 by guest

http://ec.asm.org/

Dow

nloaded from

VOL. 9, 2010 Candida Als TANDEM REPEATS 409

on May 28, 2020 by guest

http://ec.asm.org/

Dow

nloaded from

have a potential N-glycosylation site at Asn2. These sites weremodeled with a yeast core glycan of GlcNac2Man9, a minimalsized N-glycan in C. albicans (15). The result was a polarposition for these glycans on the more hydrophilic end of theprolate ellipsoidal domains (Fig. 5D).

Tandem domain models. To gain structural insights aboutthe structure of multiple repeats, Als5p repeat sequences 1 and2 (Als5-R1R2) were modeled together, as were repeats R3R4,and a set of triple sequences, R1 to R3, delaying nonlocalinteractions in the folding. For these models, we generated3,000 structures rather than our standard 1,000 in Rosetta andLINUS. Each of the repeats consistently folded as the mono-mers did, again showing the stability of the compact three-stranded domains (Fig. 6). Local folding of individual repeatdomains is consistent with sequential domain folding aftersynthesis and endoplasmic reticulum (ER) translocation and isalso consistent with force extension analyses by atomic forcemicroscopy of Als5p (2, 27, 32).

Functions of TR domains. The structural models predictedthat the hydrophobic effect should mediate the adherence ofTR domains to surfaces and to each other. A single-antibodyELISA measured binding to polystyrene for equivalent molaramounts of Als5p1–443 (without TR domains) and Als5p1–664

(including six TR domains) (35). Inclusion of TR domainsresulted in greater binding at lower protein concentrations, aresult indicative of increased affinity of Als5p for polystyrene (Fig.7A). Half-maximal binding was achieved at 5 � 10�10 M forAls5p1–664, whereas the value was 1.5 � 10�8 M for Als5p1–443,30-fold-lower affinity (Fig. 7A). The geometric mean differencein affinity was 10-fold in three independent experiments. Thus,the TR domains showed greater affinity for polystyrene thanthe Ig-like and T domains did.

The single-antibody ELISA was also used to estimate theaffinity of Als5p for a protein substrate. Excess Als5p with orwithout the TR domains bound to increasing amounts ofimmobilized fibronectin (Fig. 7B). Both Als5p1–443 andAls5p1–664 forms bound to fibronectin at similar concentra-tions, with half-maximal binding at 2.5 � 10�9 M (approxi-mately 1 �g/ml). Nevertheless, almost 10-fold-more Als5p1–664

bound than Als5p1–443. Thus, inclusion of TR domains did notaffect the affinity of the Ig-T region for fibronectin but alteredthe maximal binding.

To determine whether the increased binding resulted fromAls-Als aggregation, we titrated Als5p against limiting fi-bronectin. The results for Als5p1–443 were similar to thoseshown in Fig. 7B and showed saturation of binding, as expectedfor adhesin-ligand binding at limited discrete sites. On theother hand, titration with Als5p1–664 yielded binding propor-tional to the concentration of Als5p1–664 (Fig. 7C). This resultshowed that binding of Als5p1–664 was nonsaturable and in-

creased linearly with added protein, a result demonstratingthat each Als5p molecule that bound became a potential ligandfor more Als5p. Thus, the linear increase in binding for thisform of Als5p was characteristic of self-association of Als5p.Therefore, the TR domains mediated both increased affinityfor polystyrene and Als5p-Als5p homotypic binding.

Unfolding of TR domains. A second strong prediction of themodels is that the tandem repeats fold independently and thusshould unfold independently. This independent unfolding has,in fact, been observed by atomic force microscopy (see Discus-sion) (2). In addition, the uniformity and reproducibility of themodels implied that the unfolding of the TR domains would bea reversible equilibrium process in vivo. Furthermore, the TR

FIG. 5. Ribbon and surface representations of Rosetta best-scoring model of Als tandem repeat domains. (A) (Left) Ribbon drawing ofAls5p-R4 with side chain representation of Ser and Thr residues to be glycosylated. (Right) Ribbon drawing with added mannobiosyl groups(green). (B) Space-filling model of Als5-R4 peptide with carbohydrates shown as ball-and-stick models. Surface-exposed hydrophobic regions areshaded yellow. Glycosylated Ser and Thr residues are green, as are the mannosyl groups. In the leftmost representation, the orientation is the sameas in panel A. Each successive representation is rotated by 90o about the y axis. (C) Surface representations of Rosetta best-scoring models ofdomains R3 to R6 of Als5p. Coloring and domain orientations are the same as in panel B. (D). Surface representations of Rosetta best-scoringmodels of representative domains from other Als proteins. Domain orientations are the same as in panel C. Coloring is the same as in those figures,with the addition of blue for GlcNac and Asn residues of the Man9GlcNAc2N N-glycan in the Als1p and Als3p models.

FIG. 6. Models of two-domain structures from the sequence ofAls5p-R1R2. The structure shown had the lowest energy of all modelsand was part of the largest cluster of related structures. (A) LINUSmodel with long-range interactions repressed. The domain structuresare similar to the single-domain structures. (B) Space-filled glycosy-lated models. O-Glycosylated residues are green. Successive represen-tations are rotated by 90o around the x axis.

410 FRANK ET AL. EUKARYOT. CELL

on May 28, 2020 by guest

http://ec.asm.org/

Dow

nloaded from

domains should refold to their native structure after denatur-ation. Therefore, we thermally denatured and renaturedAls5p1–443, which lacks tandem repeats, and Als5p1–664, whichincludes them. For each protein, CD spectra were obtainedduring heating and cooling. Als5p1–664 denatured reversibly,and the unfolding was cooperative with an isosbestic point at216 or 217 nm, indicative of an unfolding equilibrium reaction(Fig. 8A) (34). The spectra showed great stability of the Ig-likeand T regions and equilibrium unfolding up to 80°C. Thedenaturation of each form was fully reversible upon recoolingto 20°C: the CD spectra were indistinguishable from those ofthe original untreated samples (Fig. 8B). The results weresimilar for Als5p1–443 (see Fig. S2 in the supplemental mate-rial). These data showed that the Als5p TR region has anintrinsic ability to refold to a stable native conformation, aresult in accord with the consistency and stability of secondarystructure predictors and the calculated structures.

DISCUSSION

We have calculated structures for the Thr-rich tandem re-peats from Candida albicans Als proteins. We believe thatthese structures are the first for Thr-rich repeats in any fungaladhesin or cell wall glycoprotein. Structural calculation seemsthe most practical approach currently, because there are nostructures in PDB on which to base homology models (theclosest match has e � 0.14) and because O glycosylation makesnuclear magnetic resonance (NMR) structure determinationimpractical and crystallization is not yet possible for X-raydiffraction. The Als repeats showed highly regular structures,and these structures were well-supported by independent phys-ical and functional data. In addition, the models explain manyof the observed and predicted activities of the repeat regions inAls proteins.

Sequence conservation led to similar structures. The align-ment shown in Fig. 2 shows high conservation of key residues,especially the aliphatic �-branched amino acids Ile, Thr, andVal that form the conserved �-strands. There are only 15nonconservative substitutions in 396 amino acid positions in 11sequences in Fig. 2, and this low frequency is reflected in thelow nonsynonymous substitution rate KA (33). At the sametime, the normal-to-high rate of synonymous substitution KS

shows that this sequence is subject to mutation at normalfrequency. Therefore, the region must be under strong purify-ing selection, and few amino acid substitutions are tolerated.The vast majority of the nonsynonymous mutations are in thesecond turn of each domain (residues 23 to 26 in Fig. 2), andthey are all substitutions between residues common to turns(37). Thus, the Als proteins and their individual repeats haveconserved structural propensities, implying that the structureitself is essential to domain function (3).

This conservation of sequence results in the remarkableconvergence in secondary and tertiary structure models. Ten ofeleven TR sequences generated the same fold as the lowest-energy structure in all trials, and the eleventh gave similarstructures in the majority of tests. The results were alsoconsistent between two independent prediction algorithms:Rosetta, based on a short sequence structural motif library, andLINUS, which finds stable structures from among randomlygenerated models. The programs also have different structure

FIG. 7. ELISAs of Als5p binding. Als5p1–664 (black and red circles)or Als5p1–443 (�). (A) Titrations of Als5p binding to polystyrene.Controls include wells without antibody and wells without antigen.(B) Als5p binding to wells coated with different concentrations offibronectin. Note the difference in scale on the fluorescence axes: theright-hand x-axis scale (red) refers to Als5p1–664 (red circles), and theleft-hand x-axis scale refers to Als5p1–443 (�). Controls in the lowerright portion of the graph include wells without antibody and wellswithout antigen. (C) Titrations of Als5p on limiting fibronectin. Notethe increase in binding of Als5p1–664 (F) at concentrations that satu-rate binding of Als5p1–443 (�).

VOL. 9, 2010 Candida Als TANDEM REPEATS 411

on May 28, 2020 by guest

http://ec.asm.org/

Dow

nloaded from

evaluation criteria (38, 45). This common structure was se-quence specific: random sequences with the same compositionas the repeats generated Rosetta models that were differentfrom each other and different from the three-strand fold thatthe authentic sequences generated (e.g., Fig. 4C; also see Fig.S1 in the supplemental material).

Features of the conserved structures. The spatial patterns ofsurface hydrophobicity and glycosylation were similar in allmodeled TR sequences (Fig. 5). The exposed hydrophobic sidechains were highly conserved: one hydrophobic patch consistedof Val5, Ile/Val21, and Ile or Val residues in positions 31 to 33;another patch was more aromatic and included conserved res-idues Phe/Tyr11, Trp12, Phe/Tyr15, and Pro36. The positionsof the glycosylated residues were also remarkably constant (seeTable S1 in the supplemental material). Therefore, the con-served sequences resulted in similar structures with conservedsurface hydrophobicity and glycosylations. Thus, the TR regionof each Als protein consists of a string of compact tandemdomains, each with hydrophobic binding surfaces with sur-rounding hydrophilic O-glycans (Fig. 5, 6, and 9A).

Supporting structural data. The models are supported byextensive physicochemical and functional data from indepen-dent approaches. Atomic force microscopy has demonstratedthat each tandem repeat folds into a discrete domain, and instudies of Als5p with different numbers of repeats, the numberof unfolding domains is equal to the number of repeats (2).Each TR domain unfolds to a length characteristic of extendedpeptide sequence, with a length increase corresponding to thatexpected for a 36-residue sequence with an original long axisdimension of 4.3 nm, the same dimension shown in the models(Fig. 5 and 6). Thus, each unfolding domain extends by alength equal to the difference between the folded three-stranded domains shown in Fig. 4 to 6 and the length of anextended peptide. We conclude that AFM confirms the num-ber and linear dimensions of the domains.

Circular dichroism spectra also supported the models. CD

spectroscopy of the intact proteins demonstrates that the TRdomains have a �-sheet-rich structure, as predicted by secondaryand tertiary modeling (35). That a synthetic sequence of a singlerepeat also forms a similar structure shows that the �-sheet pro-

FIG. 8. CD spectra of thermal unfolding (A) and refolding (B) of Als5p1–664. The sample was heated (A) or cooled (B) at 1°C/min andequilibrated for 10 min after each 10°C step of temperature change. An isosbestic point is present at 217 or 218 nm in each experiment.

FIG. 9. Cartoons of multidomain structures of TRs. Each three-strand �-sheet domain is rendered as a prolate spheroid, with hydro-phobic surfaces colored yellow and hydrophilic glycosylated regionscolored green as in Fig. 5. The circles in each domain refer to thehydrophobic (yellow) and more polar (white) ends of the ellipsoidstructures as shown in Fig. 5. (A) Drawing of three successive domains.(B) Model for interaction of three Als TR domains (colored as inpanel A and rotated 90o around the x axis relative to that shown inpanel A) with the surface of a larger protein. Hydrophobic areas(yellow), basic areas (blue), and acidic areas (red) are shown.(C) Model for binding to a hydrophobic surface such as polystyrene.(D) Model of interaction of TR domains from two Als proteins withantiparallel orientation. Similar models can be drawn with parallelorientations. Note that incomplete overlap of the two chains wouldresult in unoccupied binding surfaces able to bind TR domains of otherAls molecules or to heterologous ligands, making binding potentiallynoncompetitive.

412 FRANK ET AL. EUKARYOT. CELL

on May 28, 2020 by guest

http://ec.asm.org/

Dow

nloaded from

pensity resides in each individual repeat, as the convergent struc-ture calculations predict.

The experimentally demonstrated stability of the domainstructure also supports the models. First, the in vitro secondarystructure in the presence of TFE, which tends to induce �-helical structure, is remarkable for the synthetic peptide (Fig.3) (50). Second, the stability to TFE is also seen in spectra ofAls5p1–664 (data not shown). Finally, the stability and consis-tency of the calculated structures were also consistent with theobserved equilibrium unfolding and renaturation of the TRregion after thermal denaturation (Fig. 8) (44). Therefore, CDresults supported the �-sheet potential of each individual re-peat and also confirmed that the �-sheet-rich domain struc-tures are thermodynamically favored.

Als TR domain activities. The structures of the Als tandemrepeats showed the physical basis for their known activities.Specifically, the exposed hydrophobic surfaces of the TR do-mains should interact with other hydrophobic surfaces. Canon-ical protein-protein interaction domains consist of a core ofhydrophobic residues surrounded by an annulus of interactingpolar groups (5, 39, 40). These hydrophobic and polar regionseach form complementary interactions with the same kinds ofresidues on the protein ligand. This type of structure is car-tooned for the Als tandem repeat domains in Fig. 9.

Als tandem repeats and heterotypic binding. The TR do-main surfaces have the generally “sticky” characteristics ofprotein-protein interaction surfaces. Exposed aliphatic and ar-omatic hydrophobic side chain atoms form a central surface,and the polar interactions would be mediated through themannosyl residues (Fig. 5, 6, and 9). These structures wouldhave highly diverse interaction partners, because carbohy-drates can donate or accept H bonds or mediate charge-dipoleor dipole-dipole interactions with polar or charged amino acidsor with other glycosyl groups. Thus, each repeat domain pre-sents a hydrophobic surface of amino acid side chains sur-rounded by flexible polar mannose units that can mediatecarbohydrate-amino acid or carbohydrate-carbohydrate bond-ing. Such interactions are cartooned in Fig. 9B, which illus-trates apposed hydrophobic surfaces with polar carbohy-drates (green) interacting with polar acidic (red) and basic(blue) groups on the surface of a large protein ligand. Thismodel illustrates heretofore unexplained observations aboutthe broad specificity of Als binding to both heterologousligands, perhaps including interactions of Als3p with hydro-phobic regions of Hwp1p and ferritin (1, 8, 22, 30, 42). Asimilar model explains the affinity of the TR domains forpolystyrene (Fig. 9C).

Tandem repeats and Als-Als binding. Als proteins showhomotypic binding to other Als proteins displayed on a cellsurface (20). Our results demonstrate that at least some of thisactivity resides in the TR region domains. Cells expressing onlythe Als5p tandem repeat and stalk regions aggregate, so ag-gregation ability must be localized to one or both of thesedomains. On the other hand, deletion of the tandem repeatsfrom full-length Als5p greatly reduces aggregation (35). There-fore, the presence of the TR region of Als5p is necessary forrobust aggregation of cells expressing Als5p. The binding as-says confirmed the importance of TR-TR binding: Als5p1–664

showed first-order nonsaturable binding characteristic of self-association without competition for binding sites (Fig. 7C).

This binding must be mediated by the TR domains, becausethere is no similar first-order nonsaturability with binding ofAls5p1–443. Thus, TR domains mediated increased binding topolystyrene at low Als5p concentration (Fig. 7A and 9B) andbinding to other TR domains (Fig. 7C and 9D) to form aggre-gates of Als5p molecules.

Conclusions. The conserved sequences and structures of AlsTR domains explain their physical properties and binding ac-tivities. Each TR sequence folds compactly to give an indepen-dent �-sheet-rich domain with a conserved hydrophobic coreand consistent surface features. The domain surfaces promoteinteractions with a large variety of hydrophobic surfaces, in-cluding other TR domains.

The structural approach and functional consequences maybe more general, because other fungal adhesins also have tan-dem repeats with functional roles. These adhesins, like Alsproteins, are multidomain proteins 600 to 1,600 residues inlength and are displayed at the cell surface with their C-ter-minal regions covalently linked to wall polysaccharides. Tan-dem repeats are a common feature in the adhesins, and theyusually occur near the middle of the sequences. These repeatsare commonly conserved in length and sequence within anadhesin family, and many such repeats are predicted to be�-sheet-rich domains (8, 16, 28, 47, 49, 51). More repeats areassociated with greater aggregation in the S. cerevisiae floccu-lins, as they are in the C. albicans Als proteins (16, 47, 48).These similarities imply that the approach taken here shouldalso be applicable to tandem repeats in other fungal adhesins.

ACKNOWLEDGMENTS

We thank Janis Lee and Melissa Garcia for their consideration andcomments on the manuscript.

This work was supported by NIGMS SCORE grants S06 GM 070758and SC1 GM 083756 to Brooklyn College. Work in GLYCAM wassupported by GM 55230 and the Center for Functional Glycomicsthrough RR 05357. A.T.F. gratefully acknowledges support of an NSFGraduate Research Fellowship.

REFERENCES

1. Almeida, R. S., S. Brunke, A. Albrecht, S. Thewes, M. Laue, J. E. Edwards,S. G. Filler, and B. Hube. 2008. The hyphal-associated adhesin and invasinAls3 of Candida albicans mediates iron acquisition from host ferritin. PLoSPathog. 4:e1000217.

2. Alsteens, D., V. Dupres, S. A. Klotz, N. K. Gaur, P. N. Lipke, and Y. F.Dufrene. 2009. Unfolding individual Als5p adhesion proteins on live cells.ACS Nano 3:1677–1682.

3. Butler, G., M. D. Rasmussen, M. F. Lin, M. A. Santos, S. Sakthikumar, C. A.Munro, E. Rheinbay, M. Grabherr, A. Forche, J. L. Reedy, I. Agrafioti, M. B.Arnaud, S. Bates, A. J. Brown, S. Brunke, M. C. Costanzo, D. A. Fitzpatrick,P. W. de Groot, D. Harris, L. L. Hoyer, B. Hube, F. M. Klis, C. Kodira, N.Lennard, M. E. Logue, R. Martin, A. M. Neiman, E. Nikolaou, M. A. Quail,J. Quinn, M. C. Santos, F. F. Schmitzberger, G. Sherlock, P. Shah, K. A.Silverstein, M. S. Skrzypek, D. Soll, R. Staggs, I. Stansfield, M. P. Stumpf,P. E. Sudbery, T. Srikantha, Q. Zeng, J. Berman, M. Berriman, J. Heitman,N. A. Gow, M. C. Lorenz, B. W. Birren, M. Kellis, and C. A. Cuomo. 2009.Evolution of pathogenicity and sexual reproduction in eight Candida ge-nomes. Nature 459:657–662.

4. Chou, P. Y., and G. D. Fasman. 1978. Prediction of the secondary structureof proteins from their amino acid sequence. Adv. Enzymol. Relat. AreasMol. Biol. 47:45–148.

5. Clackson, T., and J. A. Wells. 1995. A hot spot of binding energy in ahormone-receptor interface. Science 267:383–386.

6. Coronado, J. E., S. Mneimneh, S. L. Epstein, W. G. Qiu, and P. N. Lipke.2007. Conserved processes and lineage-specific proteins in fungal cell wallevolution. Eukaryot. Cell 6:2269–2277.

7. Cuff, J. A., M. E. Clamp, A. S. Siddiqui, M. Finlay, and G. J. Barton. 1998.JPred: a consensus secondary structure prediction server. Bioinformatics14:892–893.

8. Dranginis, A. M., J. R. Rauceo, J. E. Coronado, and P. N. Lipke. 2007. A

VOL. 9, 2010 Candida Als TANDEM REPEATS 413

on May 28, 2020 by guest

http://ec.asm.org/

Dow

nloaded from

biochemical guide to yeast adhesins: glycoproteins for social and antisocialoccasions. Microbiol. Mol. Biol. Rev. 71:282–294.

9. Garnier, J., J. F. Gibrat, and B. Robson. 1996. GOR method for predictingprotein secondary structure from amino acid sequence. Methods Enzymol.266:540–553.

10. Gaur, N. K., and S. A. Klotz. 2004. Accessibility of the peptide backbone ofprotein ligands is a key specificity determinant in Candida albicans SRSadherence. Microbiology 150:277–284.

11. Gaur, N. K., S. A. Klotz, and R. L. Henderson. 1999. Overexpression of theCandida albicans ALA1 gene in Saccharomyces cerevisiae results in aggrega-tion following attachment of yeast cells to extracellular matrix proteins,adherence properties similar to those of Candida albicans. Infect. Immun.67:6040–6047.

12. Gemmill, T. R., and R. B. Trimble. 1999. Overview of N- and O-linkedoligosaccharide structures found in various yeast species. Biochim. Biophys.Acta 1426:227–237.

13. Goddard, T. D., C. C. Huang, and T. E. Ferrin. 2007. Visualizing densitymaps with UCSF Chimera. J. Struct. Biol. 157:281–287.

14. Grigorescu, A., M. H. Chen, H. Zhao, P. C. Kahn, and P. N. Lipke. 2000. ACD2-based model of yeast alpha-agglutinin elucidates solution propertiesand binding characteristics. IUBMB Life 50:105–113.

15. Hazen, K. C., D. R. Singleton, and J. Masuoka. 2007. Influence of outerregion mannosylphosphorylation on N-glycan formation by Candida albi-cans: normal acid-stable N-glycan formation requires acid-labile manno-sylphosphate addition. Glycobiology 17:1052–1060.

16. Hoyer, L. L., C. B. Green, S. H. Oh, and X. Zhao. 2008. Discovering thesecrets of the Candida albicans agglutinin-like sequence (ALS) gene family–asticky pursuit. Med. Mycol. 46:1–15.

17. Humphrey, W., A. Dalke, and K. Schulten. 1996. VMD: visual moleculardynamics. J. Mol. Graph. 14:33–38, 27–28.

18. Kapteyn, J. C., L. L. Hoyer, J. E. Hecht, W. H. Muller, A. Andel, A. J.Verkleij, M. Makarow, H. Van Den Ende, and F. M. Klis. 2000. The cell wallarchitecture of Candida albicans wild-type cells and cell wall-defective mu-tants. Mol. Microbiol. 35:601–611.

19. Kirschner, K. N., A. B. Yongye, S. M. Tschampel, J. Gonzalez-Outeirino,C. R. Daniels, B. L. Foley, and R. J. Woods. 2008. GLYCAM06: a general-izable biomolecular force field. Carbohydrates. J. Comput. Chem. 29:622–655.

20. Klotz, S. A., N. K. Gaur, R. De Armond, D. C. Sheppard, N. Khardori, E. J. J.Edwards, P. N. Lipke, and M. El-Azizi. 2007. Candida albicans Als proteinsmediate aggregation with bacteria and yeasts. Med. Mycol. 45:363–370.

21. Klotz, S. A., B. S. Chasin, B. Powell, N. K. Gaur, and P. N. Lipke. 2007.Polymicrobial bloodstream infections involving Candida species: analysis ofpatients and review of the literature. Diagn. Microbiol. Infect. Dis. 59:401–406.

22. Klotz, S. A., N. K. Gaur, D. F. Lake, V. Chan, J. Rauceo, and P. N. Lipke.2004. Degenerate peptide recognition by Candida albicans adhesins Als5pand Als1p. Infect. Immun. 72:2029–2034.

23. Kumamoto, C. A., and M. D. Vinces. 2005. Alternative Candida albicanslifestyles: growth on surfaces. Annu. Rev. Microbiol. 59:113–133.

24. Lipke, P. N., and J. Kurjan. 1992. Sexual agglutination in budding yeasts:structure, function, and regulation of adhesion glycoproteins. Microbiol.Rev. 56:180–194.

25. Lipke, P. N., and R. Ovalle. 1998. Yeast cell walls: new structures, newchallenges. J. Bacteriol. 180:3735–3740.

26. Loza, L., Y. Fu, A. S. Ibrahim, D. C. Sheppard, S. G. Filler, and J. E.Edwards, Jr. 2004. Functional analysis of the Candida albicans ALS1 geneproduct. Yeast 21:473–482.

27. Maier, T., L. Ferbitz, E. Deuerling, and N. Ban. 2005. A cradle for newproteins: trigger factor at the ribosome. Curr. Opin. Struct. Biol. 15:204–212.

28. Moreno-Ruiz, E., G. Ortu, P. W. de Groot, F. Cottier, C. Loussert, M. C.Prevost, C. de Koster, F. M. Klis, S. Goyard, and C. d’Enfert. 2009. TheGPI-modified proteins Pga59 and Pga62 of Candida albicans are required forcell wall integrity. Microbiology 155:2004–2020.

29. Nobile, C. J., and A. P. Mitchell. 2007. Microbial biofilms: e pluribus unum.Curr. Biol. 17:R349–R353.

30. Nobile, C. J., H. A. Schneider, J. E. Nett, D. C. Sheppard, S. G. Filler, D. R.Andes, and A. P. Mitchell. 2008. Complementary adhesin function in C.albicans biofilm formation. Curr. Biol. 18:1017–1024.

31. Oh, S. H., G. Cheng, J. A. Nuessen, R. Jajko, K. M. Yeater, X. Zhao, C. Pujol,D. R. Soll, and L. L. Hoyer. 2005. Functional specificity of Candida albicansAls3p proteins and clade specificity of ALS3 alleles discriminated by thenumber of copies of the tandem repeat sequence in the central domain.Microbiology 151:673–681.

32. Ostermeier, M. 2005. Engineering allosteric protein switches by domaininsertion. Protein Eng. Des. Sel. 18:359–364.

33. Otoo, H. N., K. G. Lee, W. Qiu, and P. N. Lipke. 2008. Candida albicans Alsadhesins have conserved amyloid-forming sequences. Eukaryot. Cell 7:776–782.

34. Povey, J. F., C. M. Smales, S. J. Hassard, and M. J. Howard. 2007. Com-parison of the effects of 2,2,2-trifluoroethanol on peptide and protein struc-ture and function. J. Struct. Biol. 157:329–338.

34a.Ramsook, C. B., C. Tan, M. C. Garcia, R. Fung, G. Soybelman, R. Henry, A.Litewka, S. O’Meally, H. N. Otoo, R. A. Khalaf, A. M. Dranginis, N. K. Gaur,S. A. Klotz, J. M. Rauceo, C. K. Jue, and P. N. Lipke. 2010. Yeast celladhesion molecules have functional amyloid-forming sequences. Eukaryot.Cell 9:xxx.

35. Rauceo, J. M., R. De Armond, H. Otoo, P. C. Kahn, S. A. Klotz, N. K. Gaur,and P. N. Lipke. 2006. Threonine-rich repeats increase fibronectin binding inthe Candida albicans adhesin Als5p. Eukaryot. Cell 5:1664–1673.

36. Rauceo, J. M., N. K. Gaur, K. G. Lee, J. E. Edwards, S. A. Klotz, and P. N.Lipke. 2004. Global cell surface conformational shift mediated by a Candidaalbicans adhesin. Infect. Immun. 72:4948–4955.

37. Richardson, J. S. 1981. The anatomy and taxonomy of protein structure.Adv. Protein Chem. 34:167–339.

38. Rohl, C. A., C. E. Strauss, K. M. Misura, and D. Baker. 2004. Proteinstructure prediction using Rosetta. Methods Enzymol. 383:66–93.

39. Rudd, P. M., M. R. Wormald, R. L. Stanfield, M. Huang, N. Mattsson, J. A.Speir, J. A. DiGennaro, J. S. Fetrow, R. A. Dwek, and I. A. Wilson. 1999.Roles for glycosylation of cell surface receptors involved in cellular immunerecognition. J. Mol. Biol. 293:351–366.

40. Sammond, D. W., Z. M. Eletr, C. Purbeck, R. J. Kimple, D. P. Siderovski,and B. Kuhlman. 2007. Structure-based protocol for identifying mutationsthat enhance protein-protein binding affinities. J. Mol. Biol. 371:1392–1404.

41. Sheppard, D. C., M. R. Yeaman, W. H. Welch, Q. T. Phan, Y. Fu, A. S.Ibrahim, S. G. Filler, M. Zhang, A. J. Waring, and J. E. Edwards, Jr. 2004.Functional and structural diversity in the Als protein family of Candidaalbicans. J. Biol. Chem. 279:30480–30489.

42. Soll, D. R. 2008. Candida biofilms: is adhesion sexy? Curr. Biol. 18:R717–R720.

43. Sreerama, N., and R. W. Woody. 2000. Estimation of protein secondarystructure from circular dichroism spectra: comparison of CONTIN,SELCON, and CDSSTR methods with an expanded reference set. Anal.Biochem. 287:252–260.

44. Sreerama, N., and R. W. Woody. 2004. Computation and analysis of proteincircular dichroism spectra. Methods Enzymol. 383:318–351.

45. Srinivasan, R., P. J. Fleming, and G. D. Rose. 2004. Ab initio protein foldingusing LINUS. Methods Enzymol. 383:48–66.

46. van Stokkum, I. H., H. J. Spoelder, M. Bloemendal, R. van Grondelle, andF. C. Groen. 1990. Estimation of protein secondary structure and erroranalysis from circular dichroism spectra. Anal. Biochem. 191:110–118.

47. Verstrepen, K. J., A. Jansen, F. Lewitter, and G. R. Fink. 2005. Intragenictandem repeats generate functional variability. Nat. Genet. 37:986–990.

48. Verstrepen, K. J., T. B. Reynolds, and G. R. Fink. 2004. Origins of variationin the fungal cell surface. Nat. Rev. Microbiol. 2:533–540.

49. Zhang, N., A. L. Harrex, B. R. Holland, L. E. Fenton, R. D. Cannon, and J.Schmid. 2003. Sixty alleles of the ALS7 open reading frame in Candidaalbicans: ALS7 is a hypermutable contingency locus. Genome Res. 13:2005–2017.

50. Zhao, H., M. H. Chen, Z. M. Shen, P. C. Kahn, and P. N. Lipke. 2001.Environmentally induced reversible conformational switching in the yeastcell adhesion protein alpha-agglutinin. Protein Sci. 10:1113–1123.

51. Zhao, X., S. H. Oh, R. Jajko, D. J. Diekema, M. A. Pfaller, C. Pujol, D. R.Soll, and L. L. Hoyer. 2007. Analysis of ALS5 and ALS6 allelic variability ina geographically diverse collection of Candida albicans isolates. FungalGenet. Biol. 44:1298–1309.

414 FRANK ET AL. EUKARYOT. CELL

on May 28, 2020 by guest

http://ec.asm.org/

Dow

nloaded from