accepted manuscript - université laval
Post on 23-Jun-2022
4 Views
Preview:
TRANSCRIPT
ACCEPTED MANUSCRIPT
1
Solution and electron-microscopy characterization of lactococcal phage
baseplates expressed in E. coli.
Valérie Campanaccia, *, David Veeslera, Julie Lichièrea, Stéphanie Blangya, Giuliano Sciaraa,
Sylvain Moineaub, c, Douwe van Sinderend, e, Patrick Bronf and Christian Cambillaua, *
a Architecture et Fonction des Macromolécules Biologiques (AFMB), UMR 6098 CNRS and
Universités Aix-Marseille I & II, Campus de Luminy, Case 932, 13288 Marseille Cedex 09,
France
b Groupe de Recherche en Écologie Buccale (GREB) and Félix d’Hérelle Reference Center
for Bacterial Viruses, Faculté de Médecine Dentaire, Université Laval, Québec City, Québec,
Canada, G1V 0A6
c Département de Biochimie et de Microbiologie, Faculté des Sciences et de Génie,
Université Laval, Québec City, Québec, Canada, G1V 0A6
d Department of Microbiology, National University of Ireland, Cork, Ireland
e Alimentary Pharmabiotic Centre, University College Cork, Cork, Ireland
f Centre de Biochimie Structurale, INSERM U554 / CNRS UMR 5048, 29 rue de Navacelles,
34090 Montpellier, France
*Corresponding authors.
Fax: +33 491 266 720.
E-mail: valerie.campanacci@afmb.univ-mrs.fr or cambillau@afmb.univ-mrs.fr.
ACCEPTED MANUSCRIPT
2
ABSTRACT
We report here the characterization of several large structural protein complexes
forming the baseplates (or part of them) of Siphoviridae phages infecting Lactococcus
lactis: TP901-1, Tuc2009 and p2. We revisited a “block cloning” expression strategy and
extended this approach to genomic fragments encoding proteins whose interacting
partners have not yet been clearly identified. Biophysical characterization of some of
these complexes using circular dichroism and size exclusion chromatography, coupled
with on-line light scattering and refractometry, demonstrated that the overproduced
recombinant proteins interact with each other to form large (up to 1.9 MDa) and stable
baseplate assemblies. Some of these complexes were characterized by electron
microscopy confirming their structural homogeneity as well as providing a picture of
their overall molecular shapes and symmetry. Finally, using these results, we were able
to highlight similarities and differences with the well characterized much larger
baseplate of the myophage T4.
Keywords: operon expression; Lactococcus lactis phage; multi-protein complex; multi-angle
light scattering; receptor binding protein; baseplate; electron microscopy.
Abbreviations: RBS, ribosome binding site; TMP, tape measure protein; Dit, distal tail
protein; Tal, tail-associated lysin; BppU, baseplate upper protein; BppL, baseplate lower
protein; RBP, receptor binding protein; SEC/MALS/RI, size exclusion chromatography
coupled with on-line static light scattering and refractometry; CD, circular dichroism; EM,
electron microscopy; TEV, tobacco etch virus.
ACCEPTED MANUSCRIPT
3
1. Introduction
During the course of several structural genomics programs, we designed several
methods to easily express hundreds of single proteins in Escherichia coli and screen for the
best conditions in order to obtain them in a soluble form and prone to crystallization (Alzari et
al., 2006; Campanacci et al., 2003; Graslund et al., 2008; Vincentelli et al., 2005; Vincentelli
et al., 2003; Vincentelli et al., 2004). However, the over-expression of individual subunits of a
multi-protein complex in E. coli can result in inclusion body formation, improper folding,
degraded products or cell growth inhibition due to product toxicity. In contrast, co-expression
of two or more subunits of such complexes can enhance expression yield, solubility, correct
folding and result in higher activities (Buddha et al., 2004; Dokland et al., 2002; Li et al.,
1997; Stebbins et al., 1999; Strong et al., 2006). Different strategies can be applied for co-
expression in E. coli such as the use of several vectors, a single vector with a single promoter,
a single vector with different promoters, or a combination of all of these approaches (for
review, see (Perrakis and Romier, 2008)). Recently, co-expression of four genes coding for
the Type IV secretion system made it possible to crystallize it and solve its 3D structure
(Chandran et al., 2009).
In this contribution, we describe the expression and characterization of the overall
assembly of different baseplates or baseplate components belonging to three lactococcal
phages (TP901-1, Tuc2009 and p2) infecting different strains of the low GC content gram-
positive lactic acid bacterium Lactococcus lactis (Fig. 1). Gene expression of lactococcal
phages is tightly and timely controlled during the replication cycle. In several cases, it was
shown that such phage genes are organized in at least three sequential clusters that are
temporally transcribed. The early genes are expressed just after infection and encode proteins
involved in phage replication and host control; the middle genes specify proteins involved in
ACCEPTED MANUSCRIPT
4
DNA packaging and morphogenesis; and the late genes mostly encode proteins allowing final
virion assembly and bacterial host lysis (Duplessis et al., 2005; Madsen and Hammer, 1998).
Besides, phage genomes are densely packed as more than 90% of the complete sequence
correspond to coding sequences (Hatfull, 2008). Each operon often contains clustered genes
having limited distance between each other. For example, in the L. lactis-infecting phage
TP901-1, genes that encompass orf37 through to orf41 all overlap, whereas for orf41 through
to orf49 (bppL) the distance between the stop codon of one gene and the ATG codon of the
following gene varies between 1 and 55 bp, with four non-coding intergenic regions of 12±3
bp (Brondsted et al., 2001; Johnsen et al., 1996). Similar densely packed genomes are also
observed in other lactococcal phages (Villion et al., 2009) including Tuc2009 (Seegers et al.,
2004).
Except for the production of receptor binding proteins (RBPs) (Ricagno et al., 2006;
Spinelli et al., 2006a; Spinelli et al., 2006b; Tremblay et al., 2006), our initial attempts to
express the individual components of lactococcal phages and reassemble them in huge
complexes failed in a large extent, due to lack of expression, inclusion body formation,
protein instability or the absence of interactions. We therefore implemented a partial operon-
based cloning strategy using a whole phage genomic segment that codes for two or more
phage structural proteins, followed by over-expression in E. coli. This strategy yielded
excellent expression levels of structural protein complexes from lactoccocal phages suitable
for biophysical and structural studies. This approach made it possible to obtain the baseplates
from phages TP901-1 (ORF46, ORF48 and ORF49) and p2 (ORF15, ORF16 and ORF18).
Moreover, we were able to produce chimerical baseplate protein complexes using a protein-
shuffling approach between the two related phages TP901-1 and Tuc2009. We then
characterized both individual proteins (when available) and complexes by circular dichroism
(CD) and SEC/MALS/RI (Veesler et al., 2009a; Veesler et al., 2009b). Some of these
ACCEPTED MANUSCRIPT
5
complexes were also studied by electron microscopy (EM) providing a picture of their overall
molecular shapes and symmetry. Finally, we used these results to compare lactococcal phage
baseplates organization to the well-known Myoviridae phage T4.
2. Materials and methods
2.1. Cloning procedure
All primers used were salt-free and purchased from Eurofins MWG Operon. Primers
were designed to introduce protein purification tags as well as att recombination sequences to
facilitate cloning. In silico optimized primers were designed using the ExSenSo software
(Care et al., 2008) and are listed in Table 1. DNA regions containing two or more genes (Fig.
1B) were amplified from genomic DNA using PrimeStar HS DNA polymerase (Takara Bio
Inc.) or Platinum Pfx DNA polymerase (Invitrogen) following manufacturers’ instructions.
For cloning of chimera or multiple, non-contiguous genes (to form an artificial operon, Fig.
1B), we amplified the individual genes with inclusion of an overlapping sequence before
performing a second PCR reaction to achieve joining of the initial PCR products. PCR
amplicons were cloned by Gateway recombination (Invitrogen) in pDEST™14 (Invitrogen),
pDEST™17 (Invitrogen) or pETG-20A (a kind gift from Dr Arie Geerlof, EMBL Hamburg)
vector, as previously described (Vincentelli et al., 2003). The various constructs are listed in
Table 2. All clones were confirmed by sequencing (GATC Biotech).
2.2. Protein production
Recombinant plasmids were transformed in E. coli Rosetta(DE3)pLysS (Novagen) or
T7 Express Iq pLysS (New England Biolabs) strains. Cells were grown at 37°C in Terrific
Broth or M9 minimal medium until the OD600nm reached 0.6, after which protein expression
ACCEPTED MANUSCRIPT
6
was induced with 0.5 mM IPTG overnight at 25°C or 17°C. Protein purification was
performed as previously described (Sciara et al., 2008; Siponen et al., 2009). Briefly, after
cell harvesting, lysis was performed by adding 0.25 mg/ml lysozyme, followed by a
freezing/thawing cycle and sonication. Soluble proteins were separated from inclusion bodies
and cell debris by a 30-min centrifugation step at 20,000 x g. Purification was performed on
an ÄKTA FPLC system in two steps: a Ni2+-NTA column (HisTrap Ni 5 mL, GE Healthcare)
with a step gradient of 250 mM imidazole, followed by a preparative Superdex 200 HR 26/60
gel filtration. For complexes with a Strep-tag II, an additional affinity chromatography was
performed prior to gel filtration by means of a Strep-Tactin superflow resin (IBA GmbH)
using 2.5 mM D-desthiobiotin as eluant. For the thioredoxin fusion, a cleavage by
recombinant TEV protease was performed after the Ni affinity chromatography.
2.3. Determination of complexes molecular masses by SEC/MALS/RI
Size exclusion chromatography was carried out on an Alliance 2695 HPLC system
(Waters) using either a Superose 6 HR10/30 column (GE Healthcare) or a KW405-4F column
(Shodex) run in a buffer containing 10 mM Hepes, 150 mM NaCl, and 0.02% NaN3 at pH 7.5
with a flow rate of 0.3 mL/min or 0.35 mL/min, respectively. Detection was performed using
a three-detectors static light-scattering apparatus (MiniDAWN TREOS, Wyatt Technology), a
quasi-elastic light-scattering instrument (Dynapro, Wyatt Technology) and a refractometer
(Optilab rEX, Wyatt Technology). Molecular weight calculations were performed with the
ASTRA V software (Wyatt Technology) using a dn/dc value of 0.185 mL/g, as previously
described (Veesler et al., 2009a). Proteins were injected at a final concentration of 0.02 mM.
Errors were assigned by the Astra software (Wyatt).
ACCEPTED MANUSCRIPT
7
2.4. Circular dichroism
Far-UV CD spectra were recorded with a JASCO J-810 spectropolarimeter (JASCO
Corporation) equipped with a Peltier temperature control and using 1-mm thick quartz cells.
CD spectra were averaged on three accumulations using a scanning speed of 50 nm/min.
Measurements were performed between 190 and 260 nm at 20°C in 10 mM Na/Na2 phosphate
buffer pH 7.5 with a protein concentration of 0.1 mg/mL. Mean residual ellipticity values
([Θ]) were calculated as [Θ] = 3300 M ∆A/(l•c•n), where l (path length) = 0.1 cm, n = number
of residues, M = molecular mass (Daltons), and c = protein concentration (mg/mL).
Secondary structure analysis was done with the Dichroweb server (Whitmore and Wallace,
2004) using the CDSSTR method (Sreerama and Woody, 2000) and the reference set SP175
(Lees et al., 2006).
2.5. Electron microscopy and image processing
Samples were diluted to a final concentration of 0.05 mg/mL. Three microliters of a
given protein sample was applied to glow-discharged, carbon-coated, copper grids. Following
a 2 min exposure period to allow contact, fluid excess was removed by blotting paper and 4 µl
of 1% uranyl-acetate were added to the grid and left for 1 min. The grids were then dried and
kept in a desiccator cabinet until examination. Electron micrographs were recorded under
low-dose conditions with a FEI Tecnai Sphera LaB6 200 kV microscope. Images were
collected at 50,000X and 29,000X magnification for TP901-1 complexes and p2 baseplate,
respectively; defocus range was of 0.4 to 1.0 µm. Micrographs were checked by optical
diffraction and digitized on a Nikon Coolscan 9000 ED with a step size of 10 µm. The
digitized images were coarsened by a factor of 2, resulting in a pixel size corresponding to 4
Å and 6.9 Å at the specimen level for the TP901-1 and p2 complexes, respectively.
ACCEPTED MANUSCRIPT
8
Preliminary image processing was performed using the IMAGIC V software (van Heel
et al., 1996). Briefly, single molecule images were extracted semi-automatically from raw
micrographs using Boxer (Ludtke et al., 1999). The phase-contrast-transfer function was
corrected by phase flipping. Images were auto-centered employing the CENTER-IMAGE
program by using the average of raw images as reference. Some preferential views were
selected by visual inspection and chosen as references using the MRA (multi reference
alignment) program from the IMAGIC V software. Images were then grouped into classes
and averaged using the MSA (multi-statistical alignment) procedure. Best images of each
averaged class were used as new references for a new alignment cycle. In the present study,
three iterative alignment cycles of images were performed.
ACCEPTED MANUSCRIPT
9
3. Results
3.1. Production of baseplate proteins from lactococcal phages TP901-1 and Tuc2009.
TP901-1 and Tuc2009 are two temperate phages belonging to the lactococcal
Siphoviridae P335 group (Deveau et al., 2006; Vegge et al., 2006). The distal tail structure of
TP901-1 contains an initiator complex formed by the tape measure protein (TMP, ORF45),
the distal tail protein (Dit, ORF46) and the tail-associated lysin (Tal, ORF47) as well as a
double-disk-shaped baseplate, which is made of a lower baseplate protein (BppL also called
receptor binding protein (RBP), ORF49) and an upper baseplate protein (BppU, ORF48)
(Vegge et al., 2005; Vegge et al., 2006). BppL was previously shown to be involved in the
host recognition process (Vegge et al., 2006).
We individually over-produced and purified TP901-1 BppU and BppL yielding 4 and
2 mg of pure protein per liter of culture, respectively (Table 2, rows 1 & 2). As we observed
that mixing of BppU and BppL from Tuc2009 did not result in complex formation (Sciara et
al., 2008), we generated bicistronic vectors for the co-expression of TP901-1 and Tuc2009
bppU/bppL to overcome this problem. In these constructs, an in silico optimized ribosome
binding site sequence (RBS) was introduced to promote efficient translation of bppU and
bppL transcripts (Care et al., 2008). Furthermore, constructs were designed in such a way that
BppU and BppL carried a hexahistidine tag and a Strep-tag II (Schmidt and Skerra, 2007),
respectively (constructs 3 and 11 in Figure 1B). For TP901-1, we did not observe the
expression of a protein complex (Table 2, row 3). In contrast, clear production of a
BppU/BppL complex was observed for the Tuc2009-derived proteins, enabling us to
previously propose a topological model for the baseplate assembly of this latter phage (Sciara
et al., 2008).
ACCEPTED MANUSCRIPT
10
Because operon-based expression has been successfully used in some cases (Droge et
al., 2000; Fronzes et al., 2009; Takagi et al., 2005; Wang et al., 2006; Wilbur et al., 2005), we
decided to clone a larger phage genomic fragment containing the genes coding for BppU and
BppL from TP901-1 or Tuc2009 as well as the intervening sequence between these two
genes. Moreover, to test the influence of RBS sequences, two constructs were designed where
either the endogenous RBS or an in silico optimized RBS (Care et al., 2008) controlled bppU
mRNA translation, while endogenous lactococcal phage RBS of bppL and bppA (a baseplate-
associated protein found in Tuc2009) were kept (Tables 1, 2). A hexahistidine tag was
introduced at the BppL C-terminus for purification purpose.
For the TP901-1 baseplate proteins, excellent expression levels were achieved using
an in silico optimized RBS upstream of bppU: 10 mg of the BppU/BppL complex and 20 mg
of uncomplexed excess BppL were obtained per liter of culture (Table 2, row 4). This latter
uncomplexed protein was not subject to proteolytic cleavage, in contrast to what was
previously observed (Spinelli et al., 2006a). This might result from a more favorable
BppL:protease ratio as compared to the situation where BppL was expressed alone. It is
noteworthy to point out that the TP901-1 BppU/BppL complex production dropped
considerably when the endogenous RBS of bppU was used (Table 2, row 5). Likewise, we
designed an artificial operon containing the genes encoding Dit, BppU and BppL and cloned
it in frame as a C-terminal fusion to the thioredoxin gene. After two affinity-purification steps
(including a TEV protease cleavage) and a gel filtration, we obtained 8 mg per liter of culture
of a stable complex constituted by these three proteins as well as an excess of Dit (Table 2,
row 6).
Surprisingly, for the Tuc2009 baseplate proteins, no BppU/BppL complex was
produced using either an endogenous or an in silico optimized RBS upstream of bppU,
although we obtained large amounts of BppL alone (Table 2, rows 12 & 13). We observed the
ACCEPTED MANUSCRIPT
11
BppU/BppA/BppL complex formation in low amounts (Table 2, row 14), but its co-elution
with the host chaperone protein DnaK during gel filtration (data not shown) precluded further
characterization.
We were thus able to produce and purify individual baseplate proteins from the two
P335 lactococcal phages, TP901-1 and Tuc2009. Furthermore, our operon-based approach
coupled with optimal RBS screening allowed us to obtain high quantity of several protein
complexes formed by two or three protein components.
3.2. Production of chimerical baseplate complexes from TP901-1 and Tuc2009 lactococcal
phages
Isolation of chimerical phages or phage mutants was originally used with the RBP-
encoding gene in E. coli T-even phages (Tetart et al., 1998; Tetart et al., 1996). This approach
was also successfully used for Streptococcus thermophilus phages (Duplessis and Moineau,
2001), in two Lactobacillus delbrueckii infecting phages (Ravin et al., 2002) and in several
lactococcal phages (Dupont et al., 2004; Stuer-Lauridsen et al., 2003; Vegge et al., 2006). In
the latter case, it was shown that a chimerical TP901-1 phage encoding the BppL from
Tuc2009 (BppLTuc2009) exhibited altered host range specificity (Vegge et al., 2006). We thus
decided to produce both BppUTP901-1/BppLTuc2009 and BppUTuc2009/BppLTP901-1 chimerical
complexes. Using the strategy described above for the TP901-1 BppU/BppL complex, we
cloned a DNA fragment containing the bppUTP901-1 (or bppUTuc2009) gene with an optimized
RBS upstream, and, the bppLTuc2009 (or bppLTP901-1) gene with the endogenous bppLTP901-1
RBS. As for the TP901-1 complex, high yields of purified complexes were obtained (5 and 15
mg/L for BppUTP901-1/BppLTuc2009 and BppUTuc2009/BppLTP901-1 respectively), as well as a
large excess of either BppLTuc2009 or BppLTP901-1 (Table 2, rows 15 & 16).
ACCEPTED MANUSCRIPT
12
3.3. Production of baseplate proteins from the lactococcal phage p2
The virulent phage p2 belongs to the lactococcal 936 group, which represents the
predominant phage group responsible for industrial milk fermentation failures (Chandry et al.,
1997; Crutz-Le Coq et al., 2002; Mahony et al., 2006). We previously demonstrated that
ORF18 of phage p2 is located within the baseplate and identified it as the RBP (Spinelli et al.,
2006b). Furthermore, ORF14 of phage p2 is most likely the tape measure protein (TMP)
based on its genomic position and on the fact that it is the largest structural protein (Dupont et
al., 2004). Similarly, using genomic organization as well as gene sequence similarities (albeit
very limited) with their homologous counterparts in phages TP901-1 and Tuc2009, we
hypothesized that the p2 phage baseplate is encoded by orf15 to orf18.
The DNA fragment encompassing orf15 through to orf18 was PCR-amplified using
primers designed to add a hexahistidine tag at the N-terminus of ORF15 and a Strep-tag II at
the C-terminus of ORF18. Following two different affinity chromatography steps and a gel
filtration, we obtained a high yield of the ORF15/ORF16/ORF18 complex (6 mg per liter of
culture) as well as some excess ORF15/ORF18 complex (Table 2, row 17). ORF17 was
neither detectable associated with any of these two complexes nor in the whole cell lysate.
This is in agreement with its absence in virion particles (data not shown) and suggests a non-
structural role for this protein.
To further investigate the interaction network, we also cloned orf15 and orf16, using
our operon strategy, and obtained 4 mg per liter of culture of the ORF15/ORF16 complex
(Table 2, row 18). As stable complexes of ORF15/ORF16, ORF15/ORF18 and
ORF15/ORF16/ORF18 could be obtained, we hypothesize that ORF15 plays a central
structural role in the p2 baseplate assembly, as was proposed for the Dit protein in the P335-
group of lactococcal phages (Vegge et al., 2005; Vegge et al., 2006).
ACCEPTED MANUSCRIPT
13
3.4. Molecular mass and size determination of baseplate protein complexes
SEC/MALS/RI characterization (Veesler et al., 2009a) (Table 2, Fig. 2) revealed
masses of 102,000 Da and 54,000 Da for TP901-1 BppU and BppL, respectively (Table 2,
rows 1 & 4; Fig. 2A). These data attest the trimeric nature of both proteins. Analysis of the
TP901-1 BppU/BppL complex resulted in a measured mass of 255,000 Da, suggesting a 3:9
ratio (Table 2, row 4; Fig. 2A), that is compatible with the observation of a tripod in EM (see
below). Interestingly, this mass was identical to the one obtained for the Tuc2009
BppU/BppA/BppL complex (Table 2, row 14) (Sciara et al., 2008) indicating that, in this
latter phage, 6 BppL have been replaced by 3 BppA, the exact role of this latter protein
remaining still undocumented. The measured molar mass of the Dit/BppU/BppL complex was
1,900,000 Da. Combining this latter result with the EM evidence of a 6-fold symmetry (see
below), strongly suggests the presence of 6 (BppU/BppL) tripods associated with 12 Dit,
yielding a 12:18:54 ratio of Dit/BppU/BppL. However, it should be noted that expressing Dit
alone surprisingly resulted in a monomer (27,300 Da) (Table 2, row 6). This multimerization
process upon partner binding is a very common feature in phages as illustrated by the phage
λ tail terminator protein (gpU) (Pell et al., 2009), the phage λ capsid-stabilizing protein (gpD)
(Imber et al., 1980; Yang et al., 2000), the Bacillus subtilis SPP1 phage adaptor (gp15) and
stopper (gp16) proteins (Lhuillier et al., 2009) as well as the phage P22 tail accessory factor
(gp4) (Olia et al., 2006).
Analysis of the BppUTP901-1/BppLTuc2009 and BppUTuc2009/BppLTP901-1 chimeras
provided complexes with 3:9 ratios deduced from measured masses of 264,000 Da and
284,000 Da, respectively (Table 2, rows 15 & 16). As observed for the Tuc2009 proteins
(Sciara et al., 2008), higher molecular weight complexes, corresponding to 2x(3:9) and
3x(3:9) oligomers, were also detected for these two chimeras. These results suggest that, in a
ACCEPTED MANUSCRIPT
14
chimerical context with TP901-1, either BppUTuc2009 or BppLTuc2009 proteins assemble as a
native TP901-1 BppU/BppL complex with respect to their stoichiometry.
Complex formation was also observed when mixing BppLTP901-1, obtained from this
operon-based production, with individually produced BppUTP901-1 or BppUTuc2009, (Table 2,
row 7; Fig. 2A). In contrast, no complex formation occurred when BppLTuc2009 was added to
BppUTP901-1 or BppUTuc2009 (data not shown). This is consistent with our previous
observations using separately produced Tuc2009 proteins (Sciara et al., 2008) and is probably
due to the rapid proteolytic degradation of BppL, as illustrated by crystallographic structures
(Sciara et al., 2008; Siponen et al., 2009; Spinelli et al., 2006a). This indicates a full integrity
requirement of the BppL N-terminus for protein complex formation.
Analyses of the p2 protein complexes revealed masses of approximately 1,000,000
Da, 120,000 Da and 310,000 Da for ORF15/ORF16/ORF18, ORF15/ORF18 and
ORF15/ORF16, respectively (Table 2, rows 17 & 18; Fig. 2B). As ORF18 is a trimer (Spinelli
et al., 2006b; Tremblay et al., 2006), we thus propose that the ORF15/ORF18 complex exists
in a 1:3 protein ratio, the ORF15/ORF16 complex in a 6:3 ratio, while the
ORF15/ORF16/ORF18 complex, considering its 6-fold symmetry seen by EM (see below),
might occur in ratios of 12:3:18 or 6:3:18, of respective masses 1,100,000 and 900,000 Da.
3.5. Characterization of TP901-1 and chimerical BppU/BppL complexes by circular
dichroism
In order to verify the structural integrity of the different BppU/BppL complexes, far-
UV circular dichroism spectra were recorded. All spectra were typical of structured proteins
(data not shown). Estimation of secondary structural contents based on CD spectra
deconvolution revealed a significant discrepancy in the α-helical and β-strand contents
between the complex from TP901-1 (7% α-helix and 43% β-strands) and the two chimeras
ACCEPTED MANUSCRIPT
15
(17%-18% α-helix and 37% β-strand, Table 3). These results associated with the
SEC/MALS/RI ones, suggest that BppU and BppL shuffling between P335-like phages can
lead to quite stable complexes, thereby illustrating means by which these phages evolve
(Haggard-Ljungquist et al., 1992; Labrie et al., 2008). Obtaining such shuffled complexes
also reinforces our hypothesis that the BppL N-terminus mediates anchoring to BppU because
of the observed 64% amino acid identity between the first 20 residues of the two BppL
proteins (from Tuc2009 and TP901-1 phages) (Mc Grath et al., 2006). Finally, the secondary
structural content discrepancies between chimeras and native TP901-1 complex may be
explained by the presence of BppA in the native Tuc2009 baseplate as an additional partner
during protein complex formation.
3.6. Electron microscopy of protein complexes reveals baseplate structures
The over-expressed TP901-1 Bppu/BppL and Dit/BppU/BppL as well as p2
ORF15/ORF16/ORF18 complexes, were negatively stained using 1% uranyl-acetate and
observed with a transmission electron microscope (Fig. 3). Particle images were extracted
from digitized images and processed using the IMAGIC V software. Images were centered,
aligned, and grouped into classes presenting structural similarities. We selected representative
class averages (insets in Fig. 3A) computed after 3 alignment cycles of raw images. In the
case of the TP901-1 Bppu/BppL complex, particles were adsorbed on the carbon film and
well resolved. Many of them appear to have a fork-like shape showing two or three prongs
(tripod, Fig. 3A, indicated by black arrows) terminated by a globular domain. Particles are
approximately 16.4 nm long and 13.6 nm wide. It is worth noting that most class averages
were related to these structural characteristics highlighting the structural homogeneity of
BppU/BppL particles.
ACCEPTED MANUSCRIPT
16
The production of TP901-1 Dit/BppU/BppL complex (without Tal) resulted in the
formation of particles displaying protuberances at the periphery and annular densities in the
middle, as illustrated in Fig. 3B. Image analysis revealed two main class averages presented
in insets of Fig. 3B. These two averaged views are similar but not identical, corresponding
probably to the top and bottom view of these particles. They reveal an apparent 6-fold
symmetry axis with two central rings connected via small-elongated parts to protruding
triangular domains. Outer diameters of the whole particle and of the inner ring are 45 nm and
13.4 nm, respectively. The 10 nm long protruding domains are linked to the external ring
consisting of 12 circular densities being approximately 3.4 nm wide.
In the case of the ORF15/ORF16/ORF18 protein complex from phage p2,
micrographs analysis reveals particles with a highly homogeneous circular shape. Image
processing indicated that particles had preferential views showing an apparent 6-fold
symmetry axis with two rings of density containing well resolved domains inside, as shown in
the class averages (Fig. 3C, insets). The two ring outer diameters are 23.5 nm and 9.6 nm,
respectively. Most of the identified class averages correspond to those shown in the insets 1
and 2 of Fig. 3C. This is typical of a particle interacting in a preferential way with the carbon
film of the grid. Class averages 1 and 2 of Fig. 3C possibly correspond to the up and down
view, while class average 3 is a presumed representation of the side view of the particle.
3. Discussion
We obtained different baseplate protein complexes, belonging to lactococcal phages
TP901-1, Tuc2009, and p2, thanks to an expression strategy in which internal translational
signals were conserved using operon fragments encompassing 2 to 4 different genes. Within
this context, this approach is both faster and more efficient than those involving classical
multi-cistronic vectors and was useful to determine interacting partners. We were also able to
ACCEPTED MANUSCRIPT
17
obtain chimerical complexes where the BppL of two phages belonging to the P335 group
were shuffled providing an interesting approach to study phage evolution.
Over-expression of three complexes, TP901-1 BppU/BppL and Dit/BppU/BppL as
well as p2 ORFs 15/16/18, yielded particles showing relative homogeneity as revealed by the
well-resolved EM class averages. Although these complexes interact with the carbon film of
the EM grid, limiting thus the number of observable orientations, their overall shape and
symmetry can be deduced from both SEC/MALS/RI and EM results. This approach can be
applied to phages with relatively small baseplates such as Lactococcus lactis siphophages. It
would be probably more difficult or even impossible for myophages, such as T4, which are
much larger and composed of numerous different ORFs.
EM pictures of TP901-1 Bppu/BppL particles reveal that they might be composed of
two or three elongated domains. Combining these results with our SEC/MALS/RI analysis,
led us to propose that the tripod architecture is the most probable one, in which a trimer of
BppU forms the dome-shaped region of the tripod, while BppL trimers form each foot. In the
Dit/BppU/BppL complex, Dit most probably form a dodecameric hub structure (indicated by
the SEC/MALS/RI data) on which six tripods would be accommodated, leading to the overall
6 fold symmetry revealed by the EM pictures. The formation of such a complex constituted
by 12 Dit, 18 BppU and 54 BppL (Table 2) allows us to rule out that Tal, a component of the
initiator complex, is an essential element in the initial baseplate assembly of P335 phages.
Vegge et al. showed that tal- or dit- mutant phages exhibited head structures only, lacking the
entire tail (Vegge et al., 2005). Our results clearly demonstrated that the lack of Tal did not
prevent the formation of a stable TP901-1 baseplate structure. We therefore hypothesize that
Tal, or at least its N-terminal part, exerts a role in anchoring the baseplate to the tail,
stabilizing thus the phage tail probably via interactions with the TMP (Mc Grath et al., 2006).
ACCEPTED MANUSCRIPT
18
We identified the three proteins forming the p2 baseplate, namely ORF15, ORF16 and
ORF18. This complex has an apparent mass of ~1,0 MDa, that is in between masses of
complexes having 6 or 12 ORF15, 3 ORF16 and 18 ORF18 (Table 2). If 12 ORF15 were
present, the assembly of p2 ORF15 would be comparable to that of TP901-1 Dit. The role of
ORF15 as a hub on which other proteins are plugged is consistent with the presence of the
isolated ORF16/ORF18 complex having a 1:3 stoichiometry (Table 2). Furthermore, the fact
that ORF16 is trimeric could indicate an analogous function to Tal proteins in P335 phages
(Sciara et al., 2008). The transition from 6 to 3-fold symmetry in baseplates is a feature which
has also been reported in the myophage T4 (Kanamaru et al., 2002).
It is noteworthy that an overall 6-fold symmetry is exhibited by both baseplates, from
TP901-1 and p2 phages. The same symmetry was also observed for the baseplate of the
myophage T4 despite its larger mass (accounted by ~130 proteins of 14 different types) and
size (520 nm large x 270 nm high) (Leiman et al., 2004; Rossmann et al., 2004). In this latter
case, the 6-fold to 3-fold symmetry transition is observed at the level of the puncturing
device. This should be also observed with phage p2 baseplate (3 ORFs16) and is very likely
the case with phage TP901-1 Tal (as is the case in Tuc2009).
Myoviridae phages as well as some Siphoviridae phages possess a baseplate for
adsorption to the host cell wall. This feature seems to be the hallmark of phages binding to
saccharidic receptors. It is remarkable that both phage families appear to have convergently
evolved to adopt such complex structures. This work, dedicated to lactococcal phage
baseplates characterization, makes it possible to pursue higher resolution analysis of these
large complexes, with the perspective to understand mechanisms taking place during infection
at the molecular level.
ACCEPTED MANUSCRIPT
19
Acknowledgements
We thank Dr. Arie Geerlof, who kindly provided the Gateway plasmid pETG-20A for
His-Trx fusion. This work was supported, in part, by grants from the Marseille-Nice
Génopole (to J.L.), the CNRS, the Natural Sciences and Engineering Research Council of
Canada through its strategic program (to S.M.), the Science Foundation Ireland Principal
Investigator program (ref. no. 08/IN.1/B1909 (to D.v.S.)), the ANR (ANR-07-BLAN-0095)
and by a PhD grant from the "Ministère français de l'Enseignement Supérieur et de la
Recherche" to D.V. (ref. no. 22976-2006).
ACCEPTED MANUSCRIPT
20
References
Alzari, P.M., H. Berglund, N.S. Berrow, E. Blagova, D. Busso, C. Cambillau, V.
Campanacci, E. Christodoulou, S. Eiler, M.J. Fogg, G. Folkers, A. Geerlof, D. Hart,
A. Haouz, M.D. Herman, S. Macieira, P. Nordlund, A. Perrakis, S. Quevillon-Cheruel,
F. Tarandeau, H. van Tilbeurgh, T. Unger, M.P. Luna-Vargas, M. Velarde, M.
Willmanns, R.J. Owens, 2006. Implementation of semi-automated cloning and
prokaryotic expression screening: the impact of SPINE. Acta Crystallogr D Biol
Crystallogr 62, 1103-13.
Brondsted, L., S. Ostergaard, M. Pedersen, K. Hammer, F.K. Vogensen, 2001. Analysis of the
complete DNA sequence of the temperate bacteriophage TP901-1: evolution,
structure, and genome organization of lactococcal bacteriophages. Virology 283, 93-
109.
Buddha, M.R., K.M. Keery, B.R. Crane, 2004. An unusual tryptophanyl tRNA synthetase
interacts with nitric oxide synthase in Deinococcus radiodurans. Proc Natl Acad Sci U
S A 101, 15881-15886.
Campanacci, V., M.P. Egloff, S. Longhi, F. Ferron, C. Rancurel, A. Salomoni, C.
Durousseau, F. Tocque, N. Bremond, J.C. Dobbe, E.J. Snijder, B. Canard, C.
Cambillau, 2003. Structural genomics of the SARS coronavirus: cloning, expression,
crystallization and preliminary crystallographic study of the Nsp9 protein. Acta
Crystallogr D Biol Crystallogr 59, 1628-31.
Care, S., C. Bignon, M.-C. Pelissier, E. Blanc, B. Canard, B. Coutard, 2008. The translation
of recombinant proteins in E. coli can be improved by in silico generating and
screening random libraries of a -70/+96 mRNA region with respect to the translation
initiation codon. Nucleic Acids Res 36, e6.
ACCEPTED MANUSCRIPT
21
Chandran, V., R. Fronzes, S. Duquerroy, N. Cronin, J. Navaza, G. Waksman, 2009. Structure
of the outer membrane complex of a type IV secretion system. Nature.
Chandry, P.S., S.C. Moore, J.D. Boyce, B.E. Davidson, A.J. Hillier, 1997. Analysis of the
DNA sequence, gene expression, origin of replication and modular structure of the
Lactococcus lactis lytic bacteriophage sk1. Mol Microbiol 26, 49-64.
Crutz-Le Coq, A.M., B. Cesselin, J. Commissaire, J. Anba, 2002. Sequence analysis of the
lactococcal bacteriophage bIL170: insights into structural proteins and HNH
endonucleases in dairy phages. Microbiology 148, 985-1001.
Deveau, H., S.J. Labrie, M.C. Chopin, S. Moineau, 2006. Biodiversity and classification of
lactococcal phages. Appl Environ Microbiol 72, 4338-46.
Dokland, T., S. Wang, B.H. Lindqvist, 2002. The structure of P4 procapsids produced by
coexpression of capsid and external scaffolding proteins. Virology 298, 224-231.
Droge, A., M.A. Santos, A.C. Stiege, J.C. Alonso, R. Lurz, T.A. Trautner, P. Tavares, 2000.
Shape and DNA packaging activity of bacteriophage SPP1 procapsid: protein
components and interactions during assembly. J Mol Biol 296, 117-32.
Duplessis, M., S. Moineau, 2001. Identification of a genetic determinant responsible for host
specificity in Streptococcus thermophilus bacteriophages. Mol Microbiol 41, 325-36.
Duplessis, M., W.M. Russell, D.A. Romero, S. Moineau, 2005. Global gene expression
analysis of two Streptococcus thermophilus bacteriophages using DNA microarray.
Virology 340, 192-208.
Dupont, K., F.K. Vogensen, H. Neve, J. Bresciani, J. Josephsen, 2004. Identification of the
receptor-binding protein in 936-species lactococcal bacteriophages. Appl Environ
Microbiol 70, 5818-24.
Fronzes, R., E. Schafer, L. Wang, H.R. Saibil, E.V. Orlova, G. Waksman, 2009. Structure of a
type IV secretion system core complex. Science 323, 266-8.
ACCEPTED MANUSCRIPT
22
Graslund, S., P. Nordlund, J. Weigelt, B.M. Hallberg, J. Bray, O. Gileadi, S. Knapp, U.
Oppermann, C. Arrowsmith, R. Hui, J. Ming, S. dhe-Paganon, H.W. Park, A.
Savchenko, A. Yee, A. Edwards, R. Vincentelli, C. Cambillau, R. Kim, S.H. Kim, Z.
Rao, Y. Shi, T.C. Terwilliger, C.Y. Kim, L.W. Hung, G.S. Waldo, Y. Peleg, S.
Albeck, T. Unger, O. Dym, J. Prilusky, J.L. Sussman, R.C. Stevens, S.A. Lesley, I.A.
Wilson, A. Joachimiak, F. Collart, I. Dementieva, M.I. Donnelly, W.H. Eschenfeldt,
Y. Kim, L. Stols, R. Wu, M. Zhou, S.K. Burley, J.S. Emtage, J.M. Sauder, D.
Thompson, K. Bain, J. Luz, T. Gheyi, F. Zhang, S. Atwell, S.C. Almo, J.B. Bonanno,
A. Fiser, S. Swaminathan, F.W. Studier, M.R. Chance, A. Sali, T.B. Acton, R. Xiao,
L. Zhao, L.C. Ma, J.F. Hunt, L. Tong, K. Cunningham, M. Inouye, S. Anderson, H.
Janjua, R. Shastry, C.K. Ho, D. Wang, H. Wang, M. Jiang, G.T. Montelione, D.I.
Stuart, R.J. Owens, S. Daenke, A. Schutz, U. Heinemann, S. Yokoyama, K. Bussow,
K.C. Gunsalus, 2008. Protein production and purification. Nat Methods 5, 135-46.
Haggard-Ljungquist, E., C. Halling, R. Calendar, 1992. DNA sequences of the tail fiber genes
of bacteriophage P2: evidence for horizontal transfer of tail fiber genes among
unrelated bacteriophages. J Bacteriol 174, 1462-77.
Hatfull, G.F., 2008. Bacteriophage genomics. Curr Opin Microbiol 11, 447-53.
Imber, R., A. Tsugita, M. Wurtz, T. Hohn, 1980. Outer surface protein of bacteriophage
lambda. J Mol Biol 139, 277-95.
Johnsen, M.G., K.F. Appel, P.L. Madsen, F.K. Vogensen, K. Hammer, J. Arnau, 1996. A
genomic region of lactococcal temperate bacteriophage TP901-1 encoding major
virion proteins. Virology 218, 306-15.
Kanamaru, S., P.G. Leiman, V.A. Kostyuchenko, P.R. Chipman, V.V. Mesyanzhinov, F.
Arisaka, M.G. Rossmann, 2002. Structure of the cell-puncturing device of
bacteriophage T4. Nature 415, 553-7.
ACCEPTED MANUSCRIPT
23
Labrie, S.J., J. Josephsen, H. Neve, F.K. Vogensen, S. Moineau, 2008. Morphology, genome
sequence, and structural proteome of type phage P335 from Lactococcus lactis. Appl
Environ Microbiol 74, 4636-44.
Lees, J.G., A.J. Miles, F. Wien, B.A. Wallace, 2006. A reference database for circular
dichroism spectroscopy covering fold and secondary structure space. Bioinformatics
22, 1955-62.
Leiman, P.G., P.R. Chipman, V.A. Kostyuchenko, V.V. Mesyanzhinov, M.G. Rossmann,
2004. Three-dimensional rearrangement of proteins in the tail of bacteriophage T4 on
infection of its host. Cell 118, 419-29.
Lhuillier, S., M. Gallopin, B. Gilquin, S. Brasiles, N. Lancelot, G. Letellier, M. Gilles, G.
Dethan, E.V. Orlova, J. Couprie, P. Tavares, S. Zinn-Justin, 2009. Structure of
bacteriophage SPP1 head-to-tail connection reveals mechanism for viral DNA gating.
Proc Natl Acad Sci U S A 106, 8507-12.
Li, C., J.W. Schwabe, E. Banayo, R.M. Evans, 1997. Coexpression of nuclear receptor
partners increases their solubility and biological activities. Proc Natl Acad Sci U S A
94, 2278-83.
Ludtke, S.J., P.R. Baldwin, W. Chiu, 1999. EMAN: semiautomated software for high-
resolution single-particle reconstructions. J Struct Biol 128, 82-97.
Madsen, P.L., K. Hammer, 1998. Temporal transcription of the lactococcal temperate phage
TP901-1 and DNA sequence of the early promoter region. Microbiology 144, 2203-
15.
Mahony, J., H. Deveau, S. Mc Grath, M. Ventura, C. Canchaya, S. Moineau, G.F. Fitzgerald,
D. van Sinderen, 2006. Sequence and comparative genomic analysis of lactococcal
bacteriophages jj50, 712 and P008: evolutionary insights into the 936 phage species.
FEMS Microbiol Lett 261, 253-61.
ACCEPTED MANUSCRIPT
24
Mc Grath, S., H. Neve, J.F. Seegers, R. Eijlander, C.S. Vegge, L. Brondsted, K.J. Heller, G.F.
Fitzgerald, F.K. Vogensen, D. van Sinderen, 2006. Anatomy of a lactococcal phage
tail. J Bacteriol 188, 3972-82.
Olia, A.S., J. Al-Bassam, D.A. Winn-Stapley, L. Joss, S.R. Casjens, G. Cingolani, 2006.
Binding-induced stabilization and assembly of the phage P22 tail accessory factor gp4.
J Mol Biol 363, 558-76.
Pell, L.G., A. Liu, L. Edmonds, L.W. Donaldson, P.L. Howell, A.R. Davidson, 2009. The X-
ray crystal structure of the phage lambda tail terminator protein reveals the
biologically relevant hexameric ring structure and demonstrates a conserved
mechanism of tail termination among diverse long-tailed phages. J Mol Biol 389, 938-
51.
Perrakis, A., C. Romier, 2008. Assembly of protein complexes by coexpression in prokaryotic
and eukaryotic hosts: an overview. Methods Mol Biol 426, 247-56.
Ravin, V., L. Raisanen, T. Alatossava, 2002. A conserved C-terminal region in Gp71 of the
small isometric-head phage LL-H and ORF474 of the prolate-head phage JCL1032 is
implicated in specificity of adsorption of phage to its host, Lactobacillus delbrueckii. J
Bacteriol 184, 2455-9.
Ricagno, S., V. Campanacci, S. Blangy, S. Spinelli, D. Tremblay, S. Moineau, M. Tegoni, C.
Cambillau, 2006. Crystal structure of the receptor-binding protein head domain from
Lactococcus lactis phage bIL170. J Virol 80, 9331-5.
Rossmann, M.G., V.V. Mesyanzhinov, F. Arisaka, P.G. Leiman, 2004. The bacteriophage T4
DNA injection machine. Curr Opin Struct Biol 14, 171-80.
Schmidt, T.G., A. Skerra, 2007. The Strep-tag system for one-step purification and high-
affinity detection or capturing of proteins. Nat Protoc 2, 1528-35.
ACCEPTED MANUSCRIPT
25
Sciara, G., S. Blangy, M. Siponen, S. Mc Grath, D. van Sinderen, M. Tegoni, C. Cambillau,
V. Campanacci, 2008. A topological model of the baseplate of lactococcal phage
Tuc2009. J Biol Chem 283, 2716-23.
Seegers, J.F., S. Mc Grath, M. O'Connell-Motherway, E.K. Arendt, M. van de Guchte, M.
Creaven, G.F. Fitzgerald, D. van Sinderen, 2004. Molecular and transcriptional
analysis of the temperate lactococcal bacteriophage Tuc2009. Virology 329, 40-52.
Siponen, M., S. Spinelli, S. Blangy, S. Moineau, C. Cambillau, V. Campanacci, 2009. Crystal
structure of a chimerical receptor binding protein constructed from two lactococcal
phages. J Bacteriol 191, 3220-5.
Spinelli, S., V. Campanacci, S. Blangy, S. Moineau, M. Tegoni, C. Cambillau, 2006a.
Modular structure of the receptor binding proteins of Lactococcus lactis phages. The
RBP structure of the temperate phage TP901-1. J Biol Chem 281, 14256-62.
Spinelli, S., A. Desmyter, C.T. Verrips, H.J. de Haard, S. Moineau, C. Cambillau, 2006b.
Lactococcal bacteriophage p2 receptor-binding protein structure suggests a common
ancestor gene with bacterial and mammalian viruses. Nat Struct Mol Biol 13, 85-9.
Sreerama, N., R.W. Woody, 2000. Estimation of protein secondary structure from circular
dichroism spectra: comparison of CONTIN, SELCON, and CDSSTR methods with an
expanded reference set. Anal Biochem 287, 252-60.
Stebbins, C.E., W.G. Kaelin, Jr., N.P. Pavletich, 1999. Structure of the VHL-ElonginC-
ElonginB complex: implications for VHL tumor suppressor function. Science 284,
455-61.
Strong, M., M.R. Sawaya, S. Wang, M. Phillips, D. Cascio, D. Eisenberg, 2006. Toward the
structural genomics of complexes: crystal structure of a PE/PPE protein complex from
Mycobacterium tuberculosis. Proc Natl Acad Sci U S A 103, 8060-5.
ACCEPTED MANUSCRIPT
26
Stuer-Lauridsen, B., T. Janzen, J. Schnabl, E. Johansen, 2003. Identification of the host
determinant of two prolate-headed phages infecting Lactococcus lactis. Virology 309,
10-7.
Takagi, H., Y. Kakuta, T. Okada, M. Yao, I. Tanaka, M. Kimura, 2005. Crystal structure of
archaeal toxin-antitoxin RelE-RelB complex with implications for toxin activity and
antitoxin effects. Nat Struct Mol Biol 12, 327-31.
Tetart, F., C. Desplats, H.M. Krisch, 1998. Genome plasticity in the distal tail fiber locus of
the T-even bacteriophage: recombination between conserved motifs swaps adhesin
specificity. J Mol Biol 282, 543-56.
Tetart, F., F. Repoila, C. Monod, H.M. Krisch, 1996. Bacteriophage T4 host range is
expanded by duplications of a small domain of the tail fiber adhesin. J Mol Biol 258,
726-31.
Tremblay, D.M., M. Tegoni, S. Spinelli, V. Campanacci, S. Blangy, C. Huyghe, A. Desmyter,
S. Labrie, S. Moineau, C. Cambillau, 2006. Receptor-binding protein of Lactococcus
lactis phages: identification and characterization of the saccharide receptor-binding
site. J Bacteriol 188, 2400-10.
van Heel, M., G. Harauz, E.V. Orlova, R. Schmidt, M. Schatz, 1996. A new generation of the
IMAGIC image processing system. J Struct Biol 116, 17-24.
Veesler, D., S. Blangy, M. Siponen, R. Vincentelli, C. Cambillau, G. Sciara, 2009a.
Production and biophysical characterization of the CorA transporter from
Methanosarcina mazei. Anal Biochem 388, 115-21.
Veesler, D., B. Dreier, S. Blangy, J. Lichiere, D. Tremblay, S. Moineau, S. Spinelli, M.
Tegoni, A. Pluckthun, V. Campanacci, C. Cambillau, 2009b. Crystal Structure and
Function of a DARPin Neutralizing Inhibitor of Lactococcal Phage TP901-1:
Comparison of Darpin and Camelid VHH Binding Mode. J Biol Chem 284, 30718-26.
ACCEPTED MANUSCRIPT
27
Vegge, C.S., L. Brondsted, H. Neve, S. Mc Grath, D. van Sinderen, F.K. Vogensen, 2005.
Structural characterization and assembly of the distal tail structure of the temperate
lactococcal bacteriophage TP901-1. J Bacteriol 187, 4187-97.
Vegge, C.S., F.K. Vogensen, S. Mc Grath, H. Neve, D. van Sinderen, L. Brondsted, 2006.
Identification of the lower baseplate protein as the antireceptor of the temperate
lactococcal bacteriophages TP901-1 and Tuc2009. J Bacteriol 188, 55-63.
Villion, M., M.C. Chopin, H. Deveau, S.D. Ehrlich, S. Moineau, A. Chopin, 2009. P087, a
lactococcal phage with a morphogenesis module similar to an Enterococcus faecalis
prophage. Virology 388, 49-56.
Vincentelli, R., S. Canaan, J. Offant, C. Cambillau, C. Bignon, 2005. Automated expression
and solubility screening of His-tagged proteins in 96-well format. Anal Biochem 346,
77-84.
Vincentelli, R., C. Bignon, A. Gruez, S. Canaan, G. Sulzenbacher, M. Tegoni, V.
Campanacci, C. Cambillau, 2003. Medium-scale structural genomics: strategies for
protein expression and crystallization. Acc Chem Res 36, 165-72.
Vincentelli, R., S. Canaan, V. Campanacci, C. Valencia, D. Maurin, F. Frassinetti, L.
Scappucini-Calvo, Y. Bourne, C. Cambillau, C. Bignon, 2004. High-throughput
automated refolding screening of inclusion bodies. Protein Sci 13, 2782-92.
Wang, S., J.R. Chang, T. Dokland, 2006. Assembly of bacteriophage P2 and P4 procapsids
with internal scaffolding protein. Virology 348, 133-40.
Whitmore, L., B.A. Wallace, 2004. DICHROWEB, an online server for protein secondary
structure analyses from circular dichroism spectroscopic data. Nucleic Acids Res 32,
W668-73.
ACCEPTED MANUSCRIPT
28
Wilbur, J.S., P.T. Chivers, K. Mattison, L. Potter, R.G. Brennan, M. So, 2005. Neisseria
gonorrhoeae FitA interacts with FitB to bind DNA through its ribbon-helix-helix
motif. Biochemistry 44, 12515-24.
Yang, F., P. Forrer, Z. Dauter, J.F. Conway, N. Cheng, M.E. Cerritelli, A.C. Steven, A.
Pluckthun, A. Wlodawer, 2000. Novel fold and capsid-binding properties of the
lambda-phage display platform protein gpD. Nat Struct Biol 7, 230-7.
ACCEPTED MANUSCRIPT
29
Figure legends
Fig. 1. Schematic representation of the orfs constituting the baseplate of the lactococcal
phages TP901-1, Tuc2009 and p2 and the expression constructs of their complexes. (A) The
structural genes encompassing the orfs belonging to the segment following the TMP and
finishing at the RBP. (B) Schematic view of the various constructs generated during this
study. The numbers in parenthesis correspond to the lines in Table 2. The red, green and
orange flags represent His6 tag, Strep tag and TRX fusion, respectively. Full lines and dashed
lines represent endogenous and engineered RBS, respectively.
Fig. 2. SEC/MALS/RI analysis of the different proteins and complexes. The molar mass (left
axis, solid lines) and the UV280nm absorbance (right axis, dashed lines) are plotted as a
function of the column elution volume. (A) TP901-1 BppU (pink), BppL (blue), BppU/BppL
(green) and mix of individually produced BppU with BppL from operon expressed
BppU/BppL (black). The column used was a 24 mL Superose 6 HR10/30 column (GE
Healthcare). (B) TP901-1 Dit/BppU/BppL complex (green), p2 ORF15/18 (orange) and p2
ORF15/16/18 (violet). The column used was a 5 mL KW405-4F column (Shodex).
Fig. 3. Electron microscopy analysis and image processing of negatively stained images of
TP901-1 BppU/BppL and Dit/BppU/BppL complexes as well as p2 baseplate proteins.
Images of TP901-1 BppU/BppL complex (A), TP901-1 Dit/BppU/BppL complex (B) and p2
baseplate (C) were recorded at 50,000 X (A and B) and 29,000 X (C) magnification. Scale
bar, 500 Å. Protein complexes were observed using 1% uranyl-acetate and imaged on a FEI
Tecnai Sphera LaB6 200 kV microscope. Proteins are in white. The insets show representative
class averages obtained after three iterative alignment cycles of raw images. The size of insets
ACCEPTED MANUSCRIPT
30
in A, B and C are 360 Å x 360 Å, 983 Å x 983 Å and 621 Å x 621 Å, respectively. No
symmetry was applied on images.
ACCEPTED MANUSCRIPT
31
ACCEPTED MANUSCRIPT
32
ACCEPTED MANUSCRIPT
33
ACCEPTED MANUSCRIPT
Table 1. Primers sequences and features
Name Sequence (5’ to 3’)a
TP901-1 BppU-ins-F attB1-GAAGGAGATAGAACC ATG AAA ACA GAA CAT TTT ATA ACA CTG TC
BppU-endo-F attB1-AGAAAGGGTCTATT ATG ACA GAA CAT TTT ATA ACA CTG TCC ACC
BppU-HC-R attB2-CTA gtg atg gtg atg gtg atg ATC TAT TGG ATA TGT ACC AGT C
BppU/L-ins-F cat cac cat cac cat cac TAA TAATCTAGAAATTTTGTTTAACTTTAAGAAGGAGATAGAACC ATG AAA GCT AGT ATT AAA AAA GTA TAT CGT GGC
BppU/L-ins-R TTT CAT GGTTCTATCTCCTTCTTAAAGTTAAACAAAATTTCTAGATTA TTA gtg atg gtg atg gtg atg AGC TAC AAA AAC ATA GCC TTT TG
BppL-ins-F attB1-GAAGGAGATAGAACC ATG GCT AGT ATT AAA AAA GTA TAT CGT GG
BppL-SC-R attB2-TTA TTA ttt ttc gaa ctg cgg gtg gct cca agc gct ATC TAT TGG ATA TGT ACC AGT CC
Dit-F attB1- TA gaa aac ctg tac ttc cag ggt TAC AAG TTT AGA GAT ACG AC
Dit/BppU-F CG ATT CAA TAT AGA AAG GCG GTA CTT TAA AAAAAGGAGAAAAA ATG ACA GAA CAT TTT ATA ACA CTG TCC ACC
Dit/BppU-R GGT GGA CAG TGT TAT AAA ATG TTC TGT CAT TTTTTCTCCTTTTT TTA AAG TAC CGC CTT TCT ATA TTG AAT CG
Tuc2009
BppU-ins-F attB1-GAAGGAGATAGAACC ATG AAA ACAGAACATTTTATAACACTGTCC
BppU-endo-F attB1-AGAAAGGGTCTATT ATG ACA GAA CAT TTT ATA ACA CTG TCC ACC
BppU/L-endo-F CG TGG GAA AAC GGG GGA TAA TAGAAATAGGAGAATAAA ATG GCT GAA TTA ACT AAA ATT AC
BppU/L-endo-R GT AAT TTT AGT TAA TTC AGC CAT TTTATTCTCCTATTTCTA TTA TCC CCC GTT TTC CCA CG
BppL-HC-R attB2-TTA gtg atg gtg atg gtg atg ATT CCG ATA AAG TTT TAC AAT C
Chimera
BppUTP901-1/LTuc2009-F ACA AAA GGC TAT GTT TTT GTA GCT TAA AAAAAGGAGAAAAA ATG GCT GAA TTA ACT AAA ATT ACT CG
BppUTP901-1/LTuc2009-R CG AGT AAT TTT AGT TAA TTC AGC CAT TTTTTCTCCTTTTT TTA AGC TAC AAA AAC ATA GCC TTT TGT
BppUTuc2009/LTP901-1-F TT TCG TGG GAA AAC GGG GGA TAA AAAAAGGAGAAAAA ATG GCT AGT ATT AAA AAA GTA TAT CGT GG
BppUTuc2009/LTP901-1-R CC ACG ATA TAC TTT TTT AAT ACT AGC CAT TTTTTCTCCTTTTT TTA TCC CCC GTT TTC CCA CGA AA
p2
ORF15-F attB1-TA gaa aac ctg tac ttc cag ggt GTA AGA CAG TAC AAA ATA CAT AC
ORF16-R attB2-TTA TTA
ORF18-R attB2-TTA TTA ttt ttc gaa ctg cgg gtg gct cca agc gct TTT AAT GAA GTA ACT TCC GTT ACC
ACCEPTED MANUSCRIPT
a For clarity, attB1 (5’-GGGGACAAGTTTGTACAAAAAAGCAGGCT-3’) and attB2 (5’-GGGGACCACTTTGTACAAGAAAGCTGGGT-3’)
sequences are abbreviated in the table. RBS are underlined, ATG and stop codons in bold, tag encoding sequences in lower case, lysine codon in
italic and capitals, Tobacco Etch Virus (TEV) protease encoding sequence in italic and lower case.
ACCEPTED MANUSCRIPT
Table 2. Summary of protein production and SEC/MALS analysis on baseplate proteins and complexes
Name First
gene RBS
origin
Middle
gene
RBS
origin
Last
gene
RBS
origin
Purified proteins Purification
yield
(mg/L)
Theoretical
MW (kDa)
SEC/MALS MW
(in kDa) d
Protein
ratio
Measured RH (nm)
(hydrodynamic
radius)
TP901-1 1- BppU-His-C in silico - - BppU 4 34.8 102±0.10 3 4.27±0.27 2
-
BppL-His-C in silico - - BppL 2 18.0 - -
3
-
BppU-His-C/BppL-Strep-C
in silico in silico
- BppU/BppL & BppL
- 1
34.8 / 18.5 18.5
- - -
4
-
BppU/BppL-His-C in silico endo - BppU/BppL & BppL
10 20
34.0 / 18.0 18.0
255±0.20 54±0.40
3:9 3
6.0±0.38 3.42±0.22
5
-
BppU/BppL-His-C endo endo - BppU/BppL & BppL
2 10
34.0 / 18.0 18.0
- -
- -
-
6
-
Dit/BppU/BppL-Strep-C
pETG-20A
c
endo (bppL)
endo Dit/BppU/BppL & Dit
8 3
29.0 / 34.0 / 18.5 29.0
1,906±17.0 27.3±0.20
12:18:54 1
12.25±1.06 3.06±0.19
7
-
BppU-His-C + BppL-His-C
a
- - - - - 34.0 / 18.0
231.7±0.3 – 127.9±1.20 –
54.8±1.28
3:9 - 3 (BppU) - 3 (BppL)
-
Tuc2009
8
-
BppU-His-Cb in silico - - BppU 25 37.4 109.3±0.5 3 4.76±0.17
9
-
BppA-His-Cb in silico - - BppA 3 32.9 28.1±0.49 1 2.48±0.20
1
0-
BppL-His-Cb in silico - - BppL 12 20.0 60.3±0.23 3 -
1
1-
BppU-His-C/BppL-Strep-C
b
in silico in silico
- BppU/BppL & BppU
2 15
37.4 / 20.2 37.4
370±2.41 170 ±1.72
3:3 – 2x(3:3)
-
ACCEPTED MANUSCRIPT
1
2-
BppU/BppL-His-C endo endo - BppU/BppL & BppL
- 10
36.4 / 19.7 19.7
- -
- -
1
3-
BppU/BppL-His-C in silico endo - BppU/BppL & BppL
- 25
36.5 / 19.7 19.7
- -
- -
1
4-
BppU/BppA/BppL-His-C
b
in silico endo endo BppU/BppA/BppL & BppL
<1 25
36.5 / 31.9 / 19.7 19.7
255±1.43 -
3:3:3 -
Chimera BppU/BppL
1
5-
BppUTP901-
1/BppLTuc2009-His-C in silico Endo
(bppL
TP901-1)
- BppUTP901-
1/BppLTuc2009
& BppLTuc2009
5 40
34.0 / 18.0 19.7
494.8±2.50 - 264.8±0.5
2x(3:9) - 3:9
- -
1
6-
BppUTuc2009/BppLTP901-
1-His-C in silico endo
- BppUTuc2009/BppLTP901-
1
& BppLTP901-1
15 30
36.5 / 18.0 18.0
794.0±1.40 -581±1.2 -
284.0±3.10
3x(3:9) - 2x(3:9) -
3:9
- - -
p2
1
7-
ORF15-His-N/ORF16/ ORF17/ORF18-Strep-C
pDEST17d endo endo ORF15/ORF16/ORF18
& ORF15/ORF18
6
<1
37.8 / 42.8 / 29.8
37.9 / 29.8
1,004±150.0
132.2±0.50
6:3:18 or 12:3:18
1:3
-
4.7±0.25
1
8-
ORF15-His-N/ORF16 pDEST17d endo - ORF15/ORF16
& ORF15 4
<1 37.8 / 42.8 37.8
305.8±1.40 -
6:3 -
6.78±0.42
a individually produced BppU-His-C was mixed with excess BppL-His-C obtained via BppU/BppL-His-C operon expression;
b from (Sciara et al., 2008);
c RBS from the pETG-20A vector;
d RBS from the pDEST17 vector. ‘Endo’ and ‘in silico’ refer to endogenous and ‘in silico’ designed RBS, respectively.
When using the endogenous RBS from an another gene, its origin is indicated between parentheses. d
errors estimated for each experiment with the Astra software (Wyatt inc.). Note that the real MWs can be within 10% from the measured values.
ACCEPTED MANUSCRIPT
Table 3. Secondary structure content analysis obtained by circular dichroism of BppU/BppL
complexes
Helix1 Helix2 Strand1 Strand2 Turns Other NRMSDa
TP901-1 BppU/BppL
0.01 0.06 0.29 0.14 0.11 0.37 0.052
Chimera BppUTP901-1/BppLTuc2009
0.07 0.10 0.25 0.12 0.10 0.35 0.078
Chimera BppUTuc2009/BppLTP901-1
0.08 0.10 0.25 0.12 0.10 0.36 0.026
a NRMSD (normalized root-mean-square deviation) parameter (Mao et al., 1982) is a measure
of the difference between the experimental ellipticities ( exp) and the ellipticities of the back-
calculated spectra ( cal) for the derived structure. NRMSD is defined as [( exp-
cal)2/( exp)
2]1/2
, summed over all wavelengths. NRMSD values of <0.1 mean that the back-
calculated and experimental spectra are in close agreement (Brahms and Brahms, 1980).
top related