|
|
||||||||


1 Department of Infection, Immunity and Inflammation, Maurice Shock Medical Sciences Building, University of Leicester, Leicester LE1 9HN, UK
2 Department of Biological Sciences, University of Warwick, Coventry CV4 7AL, UK
3 U IBBMC-CNRS UMR 8619, Bât. 430 – Université de Paris-Sud, F-91405 Orsay, Paris, France
Correspondence
Martha R. J. Clokie
mrjc1{at}le.ac.uk
| ABSTRACT |
|---|
|
|
|---|
These authors contributed equally to this work. | INTRODUCTION |
|---|
|
|
|---|
The phage selected for this study was S-PM2 which infects marine cyanobacteria of the genus Synechococcus. These organisms, together with their close relatives from the genus Prochlorococcus, make a substantial contribution to the primary productivity of the oceans (Field et al., 1998
). This phage was originally isolated from water collected off the coast of Plymouth, UK (Wilson et al., 1993
). It is a lytic myovirus with an icosahedral head and contractile tail and its genome has been sequenced (Mann et al., 2005
).
The archetypal myovirus T4 devotes more than 40 % of its genome to the synthesis and assembly of one of the most complex virus particles known (Miller et al., 2003a
). S-PM2 is morphologically similar to T4 and therefore it is likely that a similar number of proteins are involved in building its virion structure. The genome of S-PM2 is slightly larger than T4 and contains 236 ORFs, 69 of which exhibit similarity to other phage or bacterial genes of known function, including 38 which have similarity to T4 (Mann et al., 2005
), 15 of which have similarity to hypothetical proteins and the remaining 152 have no similarity to any known proteins.
There is a surprising degree of synteny between the S-PM2 and T4 ORFs which encode virion structural genes (Hambly et al., 2001
; Mann et al., 2005
). It is known that the T4 virion is composed of around 39 structural proteins (Miller et al., 2003a
), 15 of which have homologues in S-PM2. These ORFs are shown in Fig. 1
and appear to comprise a core set of conserved proteins that have not evolved significantly from an ancestral T4-type phage. These form the complex contractile tail necessary for the injection of phage DNA (Fig. 1
). Both phages encode the major capsid protein (gp23), three neck proteins (gp20, gp13 and gp14), four tail proteins (gp3, gp15, gp18 and gp19) and six baseplate hub proteins that are part of the baseplate–tail tube complex (gp5, gp6, gp8, gp25, gp48 and gp53) (Kostyuchenko et al., 2003
). This leaves in the order of 20 S-PM2 virion structural proteins to be identified, which was the purpose of this study. T4 was used as a control to assess the efficiency of the proteomic approach developed in this study.
|
| METHODS |
|---|
|
|
|---|
Phage preparation.
Phage were removed from the CsCl gradients and dialysed in SM buffer (l–1: 5.8 g NaCl, 2 g MgSO4 . 7H2O, 50 ml 1 M Tris/HCl, pH 7.5). Phage ghosts were prepared as described by Konopa & Taylor (1979)
. Briefly, phages were concentrated by ultracentrifugation at 100 000 g in a TL100 (Beckman Coulter) at 4 °C for 30 min. They were resuspended in 10 M LiCl and heated to 46 °C for 20 min, then diluted 10x in 50 mM Tris/HCl, pH 8, 100 mM NaCl and 5 mM MgCl2, and treated with 40 units DNase (Ambion) for 2 h at 37 °C. Samples were then reconcentrated and resuspended in protein buffer, boiled for 2 min and then spun down to remove any particulate matter.
Protein preparation and identification.
The supernatant from above was resolved on a 1-D SDS-PAGE gel which was stained with 0.1 % Coomassie brilliant blue G-250 (Bio-Rad) in 40 % methanol and 10 % acetic acid for 1 h. The gel was destained in 40 % methanol and 10 % acetic acid until the background had faded (after around 2 h) and stored in deionized water until required.
Protein digestion.
Protein gel plugs were excised and processed automatically by the MassPrep robotic protein handling system (Waters Micromass MS Technologies), using the manufacturer's recommended protocol. In brief, the gel plugs were destained, the disulfide bonds were reduced by the addition of dithiothreitol and the free cysteine residues were alkylated with iodoacetamide. The gel plugs were washed prior to a dehydration step, followed by the addition of trypsin, and incubated for 4.5 h. The resultant tryptic peptides were extracted up to three times in total and transferred to a cooled 96-well microtitre plate; if necessary, they were stored at –20 °C.
Analysis by liquid chromatography electrospray ionization tandem MS (LC-ESI-MS/MS).
The 96-well microtitre plate containing the extracted tryptic peptides was transferred to a modular CapLC and autosampler system (Waters Micromass MS Technologies). A 5 µl aliquot of the extract was mixed with 15 µl 0.1 % formic acid and loaded onto an LC Packings µ-Precolumn (C18 PepMap 100, 5 µm, 100 Å, 300 µm i.d.x5 mm) cartridge. Flushing the column with solution A desalted the bound peptides before a linear gradient of solution B at a flow rate of 0.2 nl min–1 eluted the peptides for further resolution on a 15 cm LC Packings C18 PepMap analytical column (5 µm, 5 Å, 75 µm i.d.).
ProteinLynx Global Server 2.0 (Waters Micromass MS Technologies) was used to interrogate the data obtained from the LC-ESI-MS/MS experiments. MS/MS protein identifications were accepted with a minimum of one manually validated peptide per protein.
Construction of the protein database.
Identification of a protein by MS-based proteomics approaches requires the proteins of the organism under investigation (or a close homologue) to be present in a database. At the time the project was conducted, S-PM2 and the host Synechococcus WH7803 had only recently been sequenced and the genomes were not present in any existing databases. The genomic sequence of S-PM2 was used to predict potential protein sequences which were then added to the latest release of the SWISS-PROT database. This allowed identification of contaminants both in the form of host proteins and those introduced during sample handling (mainly keratins). The use of a large database provided higher confidence in the database interrogation results; if the database consisted only of S-PM2 proteins, false-positives would be more likely. The raw genomic sequence of S-PM2 and Synechococcus WH7803 were translated in all six reading frames using the Translate tool from the ExPASy web server (http://ca.expasy.org/). ORFs giving rise to proteins with a molecular mass greater than 3000 Da were included in the database. Genes were also predicted using GeneMarkS (http://opal.biology.gatech.edu/GeneMark/) and Glimmer (www.tigr.org/software/glimmer/). The output was converted to a FASTA-formatted database by the use of Perl scripts, and all three databases were added to SWISS-PROT. The different methods of creating a database allowed for all possible proteins to be identified.
Cryoelectron microscopy.
A sample (5 µl) of an S-PM2 phage suspension was deposited on a pre-cleaned carbon-coated lacey grid (Agar Ltd). After 1 min incubation, the excess liquid was blotted and the grids were flash-frozen in liquid ethane cooled by liquid nitrogen. In this way the phages were trapped in a thin layer of amorphous ice in as close to native conditions as possible (Frank, 2006
).
Digital images were collected at x70 000 magnification on a 4096x4096 pixel (15 µm per pixel resolution) CCD camera (Ultrascan 4000; Gatan Inc.) with a JEM2010-FEG transmission electron microscope operated at 200 kV and at low-dose conditions (<16 electrons Å–2). All images were visualized and evaluated with Digital Micrograph Software (Gatan Inc.).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
Database interrogation
A list of the number of matched peptides per protein for both T4 and S-PM2 is shown in Table 1
. For S-PM2, all ORF numbers are consistent with the latest version of the genome published on EMBL website (www.ebi.ac.uk/embl/). The table also displays the molecular mass of identified proteins and the copy number for T4 proteins (if known). All proteins identified were predicted by both the GeneMark and Glimmer gene prediction programs.
|
|
When the same approach was applied to S-PM2, 23 structural proteins were identified (Table 1
). There was a close match between the position of the protein on the gel and the predicted molecular mass of the protein (Fig. 3a
). ORFs encoding 11 of the 15 predicted virion structural proteins (Mann et al., 2005
), were confirmed as such. These were the major capsid protein gp23, the prohead assembly protein gp22, 2 of the 3 neck proteins (gp13 and gp20), and 7 of the tail and baseplate proteins (gp3, gp6, gp8, gp15, gp18, gp19 and gp48). A further 12 ORFs were established to encode S-PM2 virion structural proteins, which have no detectable homology to any entries in the public protein databases (Table 1
).
|
In T4, gp20 is a dodecamer protein that forms the proximal vertex of the head and through which DNA is injected (Leiman et al., 2004
). This protein was found in both S-PM2 and T4. gp13 forms a ring with gp14 that sits closest to the head (Akhter et al., 2007
; Leiman et al., 2004
). gp13 was detected in both S-PM2 and T4 preparations, but gp14 was not found in either phage preparation and thus it may not be amenable to the approach taken in this study (see discussion below). There is no homologue of the gene that encodes the whiskers (gpwac) in S-PM2, and electron microscopy studies have not revealed such a structure (Fig. 2
).
Tail sheath and tail tube
The contractile tail sheath protein gp18 and tail tube protein gp19 are very abundant within the phage particle (Leiman et al., 2003
). This abundance is consistent with their detection in both T4 and S-PM2 preparations. gp3 and gp15 are at the top of the tail tube in T4 (Leiman et al., 2004
). Although they are present at relatively low abundance, both proteins were found in S-PM2, and gp15 was detected in T4. The end of the tail tube in T4 is thought to be either gp48 or gp54; S-PM2 only has homology with gp48, and this protein was detected in S-PM2.
Baseplate
At the bottom of the tail in T4 is a baseplate wedge which consists of gp6, gp7, gp8, gp10, gp11, gp25 and gp53. Six wedges are joined around the central hub with the help of gp9 (the long-tail-fibre attachment protein) and gp12 (the short tail fibre) (Kostyuchenko et al., 2003
; Van Raaij et al., 2001
). The hub consists of gp5, gp27 and gp29. These proteins in T4 were not amenable to our approach for identification and thus we detected only one peptide for gp5 and gp27 and none for gp29.
S-PM2 has homology to four of the baseplate wedge proteins (gp6, gp8, gp25 and gp53) and the hub protein and gp5. If the tail is relatively constrained in evolutionary terms, it seems likely that the proteins attached to it are also constrained. Of the five predicted baseplate proteins, only gp6 and gp8 were detected.
The baseplate-related ORFs present in the T4 genome are at either end of the structural gene cluster with the genes encoding gp5, gp6, gp7, gp8, gp9, gp10, gp11, gp12 and gp53 at one end, and the rest (gp25, gp26, gp27, gp28, gp29, gp48, gp51 and gp54) at the other (Miller et al., 2003a
). This is not the case in other phages such as KVP40, where only one such cluster of baseplate proteins has been identified and this contains most of the genes found in the two separate clusters in T4 (Miller et al., 2003b
). The arrangement in the S-PM2 genome seems to be similar to KVP40 in that the gene encoding gp6 is next to the gene encoding gp25. In this region of the S-PM2 genome there are several newly identified ORFs and it seems likely that some of these encode proteins involved in the baseplate. A second cluster of genes encoding baseplate subunits (gp53, gp48, gp26 and gp5) are found at around 150–160 kb of the genome, thus the two clusters of baseplate-related ORFs are not grouped in the same way in S-PM2 as they are in T4.
Tail fibres
The tail fibres are very important determinants of specificity in host–phage interactions. Unsurprisingly, due to the differences in the cyanobacterial and E. coli surfaces, the tail fibres of S-PM2 are highly divergent from known T4-type phages, with none of the predicted ORFs in S-PM2 resembling known tail fibres. Indeed, no tail fibres from cyanophages have been identified by anything other than a bioinformatic approach.
In T4, the short tail fibres (gp12) recognize the bacterial cell surface and undergo an irreversible binding reaction (Miller et al., 2003a
). Putative short tail fibres in S-PM2 may be encoded by ORF 225 which has two amino acid sequence domains which are similar to the tail-fibre receptor of T4 (gp12, with a BLAST e-value of 6e–06).
In T4, the long tail fibres are gp34, gp35, gp36 and gp37. Three of these – gp34, gp36 and gp37 – were detected in this study. Based on the fact that we found the T4 tail fibres, it seems likely that some of our new proteins identified in this study are S-PM2 tail-fibre proteins. In T4 the only structural genes which are separated from all the other structural ORFs are the long-tail-fibre genes. Possible candidates for these tail-fibre genes in the S-PM2 genome are newly identified ORFs 172 and 174 that belong to a similarly discrete cluster.
In summary, a combination of proteomic and bioinformatic approaches has proved highly successful in the identification of proteins from the newly characterized virus S-PM2. At least 50 % of the structural proteins from T4 and S-PM2 have been identified, including a significant number of head, tail, neck, baseplate and tail-fibre proteins. Other recent approaches based on N-terminal sequencing and on MS/MS have identified a total of 10 and 8 proteins, respectively (which assuming there are a similar number of proteins as there are in T4, is around 26 and 20 % of total proteins in the virions, respectively), so our approach has been far more extensive in terms of coverage (Pope et al., 2007
; Weigele et al., 2007
). We are also the first to use this approach on T4, as a control, to see how many capsid proteins could be identified in a well-characterized bacteriophage.
The proteins that remained elusive T4 included those from the baseplate and head. This may be due to incompatibility of these proteins with our sample preparation procedures (e.g. a lack of solubility), or they may not be present in suitable abundance within the assembled phage capsid to be detected post-digestion in the mass spectrometer. Proteins present at very low concentrations may be masked by higher abundance proteins of similar molecular mass occurring in the same protein band. For example, in this study, gp23, gp18 and gp19 were highly abundant and therefore could mask proteins of a similar molecular mass. Furthermore, the presence of a protein on a gel does not ensure recovery of all the peptides following in-gel digestion (Speicher et al., 2000
).
Certainly it seems that one or more of these underlying causes resulted in some proteins not being detected in either T4 or S-PM2. For example, the only three predicted proteins that we did not detect in S-PM2 (the neck protein gp14 and two of the baseplate proteins, gp25 and gp53) were also not detected in the T4 preparations. We propose to utilize alternate methodologies to identify the proteins that remain elusive to our gel-resolved sample preparation methods.
Our study clearly indicates widespread applicability and a high success rate of this approach in studying novel proteomes where little or no detectable homology with other organisms exists. The prerequisites of a study of this type are a genome sequence from which a proteome can be predicted, and in-house bioinformatics expertise to manipulate the various datasets produced in conjunction with MS-based protein identification techniques.
Edited by: L. S. Frost
| REFERENCES |
|---|
|
|
|---|
Black, L. W., Showe, M. K. & Steven, A. C. (1994). Morphogenesis of the T4 head. In Molecular Biology of Bacteriophage T4, pp. 218–258. Edited by J. D. Karam, J. W. Drake, K. N. Kreuzer and others. Washington, DC: American Society for Microbiology.
Field, C. B., Behrenfeld, M. J., Randerson, J. T. & Falkowski, P. (1998). Primary production of the biosphere: integrating terrestrial and oceanic components. Science 281, 237–240.
Frank, J. (2006). Three Dimensional Electron Microscopy of Macromolecular Assemblies. Oxford: Oxford University Press.
Fuhrman, J. A. (1999). Marine viruses and their biogeochemical and ecological effects. Nature 399, 541–548.[CrossRef][Medline]
Hambly, E., Tétart, F., Desplats, C., Wilson, W. H., Krisch, H. M. & Mann, N. H. (2001). A conserved genetic module that encodes the major virion components in both the coliphage T4 and the marine cyanophage S-PM2. Proc Natl Acad Sci U S A 98, 11411–11416.
Konopa, G. & Taylor, K. (1979). Coliphage lambda ghosts obtained by osmotic shock or LiCl treatment are devoid of J- and H-gene products. J Gen Virol 43, 729–733.
Kostyuchenko, V. A., Leiman, P. G., Chipman, P. R., Kanamaru, S., van Raaij, M. J., Arisaka, F., Mesyanzhinov, V. V. & Rossmann, M. (2003). Three-dimensional structure of bacteriophage T4 baseplate. Nat Struct Biol 10, 688–693.[CrossRef][Medline]
Kostyuchenko, V. A., Chipman, P. R., Leiman, P. G., Arisaka, F., Mesyanzhinov, V. V. & Rossmann, M. G. (2005). The tail structure of bacteriophage T4 and its mechanism of contraction. Nat Struct Mol Biol 12, 810–813.[CrossRef][Medline]
Leiman, P. G., Kanamaru, S., Mesyanzhinov, V. V., Arisaka, F. & Rossmann, M. G. (2003). Structure and morphogenesis of bacteriophage T4. Cell Mol Life Sci 60, 2356–2370.[CrossRef][Medline]
Leiman, P. G., Chipman, P. R., Kostyuchenko, V. A., Mesyanzhinov, V. V. & Rossmann, M. G. (2004). Three-dimensional rearrangement of proteins in the tail of bacteriophage T4 on infection of its host. Cell 118, 419–429.[CrossRef][Medline]
Mann, N. H., Clokie, M. R. J., Millard, A., Cook, A., Wilson, W. H., Wheatley, P. J., Letarov, A. & Krisch, H. M. (2005). The genome of S-PM2, a photosynthetic T4-type bacteriophage that infects marine Synechococcus strains. J Bacteriol 187, 3188–3200.
Miller, E. S., Kutter, E., Mosig, G., Arisaka, F., Kunisawa, T. & Rüger, W. (2003a). Bacteriophage T4 genome. Microbiol Mol Biol Rev 67, 86–156.
Miller, E. S., Heidelberg, J. F., Eisen, J. A., Nelson, W. C., Durkin, A. S., Ciecko, A., Feldblyum, T. V., White, O., Paulsen, I. T. & other authors (2003b). Complete genome sequence of the broad-host-range vibriophage KVP40: comparative genomics of a T4-related bacteriophage. J Bacteriol 185, 5220–5233.
Pope, W. H., Weigele, P. R., Chang, J., Pedulla, M. L., Ford, M. E., Houtz, J. M., Jiang, W., Chiu, W., Hatfull, G. F. & other authors (2007). Genome sequence, structural proteins, and capsid organization of the cyanophage Syn5: a horned bacteriophage of marine Synechococcus. J Mol Biol 368, 966–981.[CrossRef][Medline]
Prangishvili, D. & Garrett, R. J. (2004). Exceptionally diverse morphotypes and genomes of crenarchaeal hyperthermophilic viruses. Biochem Soc Trans 32, 204–208.[CrossRef][Medline]
Sambrook, J., Fritsch, E. F. & Maniatis, T. (1989). Molecular Cloning: a Laboratory Manual. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory.
Speicher, K. D., Kolbas, O., Harper, S. & Speicher, D. W. (2000). Systematic analysis of peptide recoveries from in-gel digestions for protein identifications in proteome studies. J Biomol Tech 11, 74–86.
Suttle, C. A. (2007). Marine viruses – major players in the global ecosystem. Nat Rev Microbiol 5, 801–812.[CrossRef][Medline]
Van Raaij, M. J., Schoehn, G., Burda, M. R. & Miller, S. (2001). Crystal structure of a heat and protease-stable part of the bacteriophage T4 short tail fibre. J Mol Biol 314, 1137–1146.[CrossRef][Medline]
Weigele, P. R., Pope, W. H., Pedulla, M. L., Houtz, J. M., Smith, A. L., Conway, J. F., King, J., Hatfull, G. F., Lawrence, J. G. & Hendrix, R. W. (2007). Genomic and structural analysis of Syn9, a cyanophage infecting marine Prochlorococcus and Synechococcus. Environ Microbiol 9, 1675–1695.[CrossRef][Medline]
Wilson, W. H., Joint, I. R., Carr, N. G. & Mann, N. H. (1993). Isolation and molecular characterization of 5 marine cyanophages propagated on Synechococcus sp. strain WH7803. Appl Environ Microbiol 59, 3736–3743.
Received 26 December 2007;
revised 5 March 2008;
accepted 12 March 2008.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| INT J SYST EVOL MICROBIOL | MICROBIOLOGY | J GEN VIROL |
| J MED MICROBIOL | ALL SGM JOURNALS | |