|
|
||||||||
Center for Environmental Genomics, Department of Biology, McMaster University, 1280 Main St West, Hamilton, Ontario L8S 4K1, Canada
Correspondence
Turlough M. Finan
finan{at}mcmaster.ca
| ABSTRACT |
|---|
|
|
|---|
54 promoters in the alphaproteobacteria is not well developed. In this study, 25 experimentally verified Sinorhizobium meliloti promoter sequences were compiled and used to predict the location of other related promoters in the S. meliloti genome. Fourteen candidate predictions were targeted for verification and of these at least 12 proved to be genuine promoters. As a result, the experimental identification of 12 novel promoters linked to genes rpoD, topA, rpmJ, trpS, ropB1, metC, rpsT, secE, trkH and three tRNA genes is reported. In all, 99 predicted and verified promoters are reported, including those linked with 13 tRNA genes, eight ribosomal protein genes and a number of other physiologically important or essential genes. On the basis of sequence conservation and a mutational analysis of promoter activity, the 35 and 10 consensus for these promoters is 5-CTTGAC-N17-CTATAT. This promoter structure, which seems to be widely conserved amongst several other genera in the alphaproteobacteria, shares significant similarity with, but is skewed by a 1 nt step from, the canonical Escherichia coli
70 promoter. Perhaps this difference is responsible for the observation that S. meliloti promoters are often poorly expressed in E. coli. In this regard, expression data from plasmid-borne gfp-reporter fusions to eight of the S. meliloti promoters verified in this work revealed that while these promoters were very active in S. meliloti and Agrobacterium tumefaciens only very low, near-background activity was detected in E. coli. | INTRODUCTION |
|---|
|
|
|---|
70 promoters is based on sequence conservation amongst several hundred promoter sequences (Harley & Reynolds, 1987
54 promoters (Dombrecht et al., 2002| METHODS |
|---|
|
|
|---|
|
|
Enzyme assays and other techniques.
Estimates of promoter activity were obtained by cloning DNA fragments into the transcriptional reporter vector pOT1 (Allaway et al., 2001
) and measuring green fluorescent protein (GFP) fluoresence in a Tecan Safire fluorimeter. Cells were grown overnight then used to inoculate fresh volumes of LB broth to an OD600 of
0.05. Cultures were grown to an OD600 of 0.70.8, washed once in 0.85 % saline and resuspended in 1 vol. saline. Two hundred microlitre volumes were deposited in black microtitre plates and fluorescence was measured with an excitation wavelength of 405 nm and an emission wavelength of 505 nm. The optical densities of equivalent volumes were measured at 600 nm in clear microtitre plates. Specific fluorescence was obtained by dividing relative fluorescence by the optical density.
DNA sequencing and oligonucleotide synthesis was conducted by MOBIX lab (McMaster University, Hamilton).
| RESULTS |
|---|
|
|
|---|
First-generation prediction of novel promoter sequences
Recently the characterization and mutational analysis of a promoter found upon the pSymB megaplasmid in S. meliloti led to the discovery of a small RNA gene that plays a role in regulating the replication of the plasmid (MacLellan et al., 2005
). We noticed a striking conservation of sequence (CTTGAC) at the 35 region of the small RNA (incA1) promoter with other S. meliloti promoter sequences (Bae et al., 1989
; Fisher et al., 1987
; Leong et al., 1985
; Osteras et al., 1995
; Ronson et al., 1987
) previously reported in the literature. We therefore compiled 11 experimentally verified promoter sequences that were related to one another (most obviously at the 35 hexanucleotide sub-sequences) with the idea of using them to predict the location of other related promoter sequences in the S. meliloti genome. These 11 promoter sequences are listed (and aligned) in Fig. 1
(sequences 111). Two strategies were employed to identify sequences in the genome that had similarity to these sequences. First, we used the program PatScan (Dsouza et al., 1997
) to perform a string pattern search of the S. meliloti genome (see http://sequence.toulouse.inra.fr/meliloti.html) using IUPAC symbols that constituted a degenerate consensus of the verified promoter sequences and a consensus based solely upon the rRNA operon promoter sequence. Amongst over 160 hits that were within
250 bp on the proper strand upstream of a predicted ORF we identified two hits occurring upstream of genes involved in transcription (suhR, rpoD), twelve involved in translation [four tRNA genes (SMc01378, SMc01936, SMc00758, SMc00303), two aminoacyl synthetases (cysS, trpS) and six ribosomal proteins (rluD, rpmE, rpmH, rpmJ, rplM, rpsT)] and hits linked with a number of other physiologically important or essential genes including secE (protein translocation subunit), topA (topoisomerase I), two proteases (ptrB, SMc03769), fixN3 (cytochrome c oxidase subunit), expA1 (one of a 21-gene cluster required for exopolysaccharide synthesis), and a predicted chaperone gene, grpE. Eleven hits were linked with solute transporter loci and 14 were linked with transcriptional regulatory protein genes. These hits are tabulated and aligned in Fig. 2
.
|
The hits tabulated in Fig. 2
were selected from the first-generation prediction stage as their function is generally well established in bacteria and several are either very important or essential to cellular viability. These fall into well-defined physiological groups such as genes involved in translation or in ribosomal structure and function. The degree of sequence conservation within the 10 and 35 sub-sequences aligned in Fig. 2
is in most cases striking and obvious. The 5' nucleotide of the 35 region is almost always a C or a G (as is the case with all of the input sequences used for the analyses) but the weight matrix analysis with low frequency specified some hits with an A (but never a T) in this position. One of the first indications of the broad predictive power of our analysis came with the subsequent but independent identification of two promoters in the genome (repA2 and pcaI, sequences 12 and 13 in Fig. 1
) that indeed possess an A at the 5' nucleotide of the 35 hexanucleotide. Thus, our sequence analysis suggested that such promoters may exist and we subsequently identified two of these promoters in the course of unrelated experimental investigations in our laboratory. Although the absolute frequency of the nucleotides found at this position may change as more promoters are identified, it seems that the hierarchy of nucleotide identity at the 5' position of the 35 sub-sequence is: C>G>>A>>T. All of the 31 predicted promoters listed in Fig. 2
are of course putative and we targeted 14 of these for subsequent experimental verification.
Experimental verification of predicted putative promoter sequences
One way of providing support for a predicted promoter element is to find that the sequence is positionally conserved in the genomes of related organisms. In the case of S. meliloti, we chose A. tumefaciens as a related alphaproteobacterium since the chromosomes of these organisms show a high degree of collinearity (Wood et al., 2001
). Sequences upstream of homologous genes in A. tumefaciens were compared with the S. meliloti intergenic sequences encompassing four predicted promoters (rpmE, rpoD, topA, secE) from the list in Fig. 2
. For each locus, the intergenic sequences were manually aligned (Fig. 3
) and in each case the critical 10 and 35 sub-sequences were completely conserved (except for a single nucleotide substitution in the 35 region of the secE predicted promoters). In secE and rpmE, the conserved hexanucleotide sub-sequences are flanked by rather extensive degrees of sequence conservation, but in the case of rpoD and topA, the conserved 10 and 35 sub-sequences are distinct in otherwise poorly conserved regions. This suggests that these discrete regions have experienced the selective pressure that would be expected for functionally important sequences such as promoter elements. It also suggests that, with whatever facility we can predict a subset of S. meliloti promoters, this same ability can be extended to A. tumefaciens and probably other related genera (see Discussion).
|
|
92 %) of the promoter elements we predicted and targeted for verification (not including ptrB, for which no information was obtained) are indeed genuine promoters and it is reasonable to suspect that the lone candidate that was not confirmed (rpmE) may be the result of premature termination of the primer extension reaction. In a circumstance that parallels the situation in E. coli, an A residue is the most frequently (50 %) utilized transcription start nucleotide (Fig. 4b
Second-generation weight-based matrix
We compiled the newly verified 12 promoter sequences (this work) with the original 11 sequences and with the two additional promoter sequences independently verified in our laboratory (sequences 12 and 13, Fig. 1
) to generate a list of 25 verified promoter sequences (Fig. 1
, sequences 125). Using these 25 verified promoter sequences we constructed a second-generation (and presumably more robust) weight matrix that we again used to scan intergenic regions in the S. meliloti genome using the program PatScan. The weight matrix in this case only identifies putative promoters that have a 17 nt linker region between the 35 and 10 sub-sequences.
The highest possible score with this matrix is 1484 and no sequence in the genome obtained this score. Five randomly generated 30-mer sequences (with the same G+C content as S. meliloti intergenic DNA: 56 %) had a mean score of only 710±59 (SE). Of the sequences used to construct the matrix (Fig. 1
) the promoter for the rRNA gene has the highest score (1376) while the promoter for the trpE gene has the lowest score (1116). The highest scoring sequence (with a 17 nt linker) in the genome had a score of 1364 and was a hit linked to gene SMa0229, a hypothetical ORF whose predicted product has similarity to the translation elongation factor GreA. Hits linked to genes cysS and topA (both with a score of 1360) were the next highest scoring sequences in the genome. Using an arbitrary threshold score of >1190, 411 hits were obtained. Ninety-five (23 %) of these hits were not obviously linked to annotated genes. Some of these hits occurred in large intergenic regions while others occurred close to annotated genes but on the opposite strand. Seven of these orphan promoter-like sequences were closely linked (within
200 nt) to orphan rho-independent transcriptional termination signals (that are annotated on the S. meliloti genome website), raising the possibility that together these elements may indicate the presence of small RNA genes. The number of orphan promoters closely linked with orphan terminators is likely an underestimation since a great many sequences that are good candidates for termination sequences are currently not annotated.
Eighteen and 12 hits, respectively, were closely linked with transcriptional regulatory protein genes and oxidoreductase/dehydrogenase genes. Twenty-nine hits were linked with transporter genes. As befitting the frequency distribution of transporter types in the genome (Galibert et al., 2001
), most of the hits link to ABC transporter genes. These, along with other selected predictions, are listed in Table 2
. Most of the hits listed in Table 2
come from the second-generation weight matrix analysis but some of the hits (including several tRNA gene and ribosomal protein gene promoter predictions) are duplications from Fig. 2
(from the first round of promoter predictions) and are included solely to facilitate sequence comparison amongst genes within a functional family. Table 2
also includes 32 hits linked to genes involved in translation and RNA metabolism and other hits to physiologically important or essential genes. Except for those few that we verified using primer extension, all of the sequences in Table 2
are of course putative but we expect roughly the same prediction success rate as demonstrated from the first generation of promoter predictions (that is 12 of 13 or minimally
90 %).
|
|
|
56 nt untranslated RNAs) mediate incompatibility against the resident megaplasmids of S. meliloti (MacLellan et al., 2005
As shown in Table 4
, the incA1 (pTH1560) and incA2 (pTH1982) promoters are highly active in both A. tumefaciens and E. coli. Mutations in the nucleotide 3' of the consensus (pTH1985 and pTH1983, respectively) have little or no impact on promoter activity in both species. We expected this result for A. tumefaciens cells since that nucleotide falls outside the consensus hexanucleotide and is less biased than the other adjacent nucleotides. In the case of E. coli, however, we found this surprising since we expected that a T in that nucleotide position (which is found much less frequently than an A in that position) might lower activity if E. coli RNA polymerase continued to recognize the E. coli consensus hexanucleotide that is incidentally formed in plasmids pTH1560 and pTH1983 but not plasmids pTH1985 and pTH1982. In all cases, when the inferred 5' nucleotide of the S. meliloti consensus (CTTGAC) was mutated to form TTTGAC (plasmids pTH1991 and pTH1984), promoter activity was significantly affected. For reasons we have not determined, the influence of the mutation is much more dramatic in the incA2 promoter than in the incA1 promoter. Although the incA1 and incA2 promoter regions are highly similar, sequence differences elsewhere in the promoters are presumably responsible for this effect. In any case, the weight matrix analysis suggests that the 35 hexanucleotide rarely if ever begins with a T residue and this is consistent with the dramatic loss of promoter activity from promoters that carry this nucleotide in that position.
|
| DISCUSSION |
|---|
|
|
|---|
54 promoters (Dombrecht et al., 2002
The structure demonstrated for S. meliloti promoters appears to be conserved amongst other bacteria in the alphaproteobacteria for the following reasons: (1) we showed that the promoter sequences for the rpmE, rpoD, topA and secE genes were conserved between S. meliloti and A. tumefaciens (Fig. 3
); (2) homologues of the incA1 and incA2 genes whose promoters have been verified have homologues in A. tumefaciens, Rhizobium etli, R. leguminosarum and Brucella spp. (Chai & Winans, 2005
; Dombrecht et al., 2002
; Izquierdo et al., 2005
; MacLellan et al., 2005
; Venkova-Canova et al., 2004
) and their promoters are nearly identical; and (3) a small compilation (eight genes) of Bradyrhizobium japonicum promoters (Beck et al., 1997
) demonstrated the same general pattern of nucleotide conservation as described in this work.
We have not experimentally deduced which sigma factor recognizes the confirmed and putative promoters that have been predicted in this work. However, the sigma factor is likely to be rpoD, the vegetative sigma factor, since many of the confirmed and predicted loci associated with these promoters encode products required for translation and transcription, and other essential or important functions.
In this work we have identified consensus 10 and 35 hexameric promoter sequences defining what we believe to be a subset of S. meliloti promoters. Further experimental work will be required to better define these sequences from a functional point of view and to determine whether the E. coli paradigm of conserved hexamers is appropriate for S. meliloti promoters, particularly with regard to the poorly conserved 10 region. From the perspective of predicting related sequences (putative promoters) in the alphaproteobacteria, however, the identification of patterns of nucleotide conservation described in this work is a productive first step towards more effective promoter prediction in these bacteria.
| ACKNOWLEDGEMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
Allaway, D., Schofield, N. A., Leonard, M. E., Gilardoni, L., Finan, T. M. & Poole, P. S. (2001). Use of differential fluorescence induction and optical trapping to isolate environmentally induced genes. Environ Microbiol 3, 397406.[CrossRef][Medline]
Argaman, L., Hershberg, R., Vogel, J., Bejerano, G., Wagner, E. G., Margalit, H. & Altuvia, S. (2001). Novel small RNA-encoding genes in the intergenic regions of Escherichia coli. Curr Biol 11, 941950.[CrossRef][Medline]
Bae, Y. M., Holmgren, E. & Crawford, I. P. (1989). Rhizobium meliloti anthranilate synthase gene: cloning, sequence, and expression in Escherichia coli. J Bacteriol 171, 34713478.
Beck, C., Marty, R., Klausli, S., Hennecke, H. & Gottfert, M. (1997). Dissection of the transcription machinery for housekeeping genes of Bradyrhizobium japonicum. J Bacteriol 179, 364369.
Chai, Y. & Winans, S. C. (2005). A small antisense RNA downregulates expression of an essential replicase protein of an Agrobacterium tumefaciens Ti plasmid. Mol Microbiol 56, 15741585.[Medline]
Chen, S., Lesnik, E. A., Hall, T. A., Sampath, R., Griffey, R. H., Ecker, D. J. & Blyn, L. B. (2002). A bioinformatics based approach to discover small RNA genes in the Escherichia coli genome. Biosystems 65, 157177.[CrossRef][Medline]
DelVecchio, V. G., Kapatral, V., Elzer, P., Patra, G. & Mujer, C. V. (2002). The genome of Brucella melitensis. Vet Microbiol 90, 587592.[CrossRef][Medline]
Dombrecht, B., Marchal, K., Vanderleyden, J. & Michiels, J. (2002). Prediction and overview of the RpoN-regulon in closely related species of the Rhizobiales. Genome Biol 3, RESEARCH0076.[Medline]
Dsouza, M., Larsen, N. & Overbeek, R. (1997). Searching for patterns in genomic data. Trends Genet 13, 497498.[Medline]
Fisher, R. F., Brierley, H. L., Mulligan, J. T. & Long, S. R. (1987). Transcription of Rhizobium meliloti nodulation genes. Identification of a nodD transcription initiation site in vitro and in vivo. J Biol Chem 262, 68496855.
Galibert, F., Finan, T. M., Long, S. R. & 53 other authors (2001). The composite genome of the legume symbiont Sinorhizobium meliloti. Science 293, 668672.
Gustafson, A. M., O'Connell, K. P. & Thomashow, M. F. (2002). Regulation of Sinorhizobium meliloti 1021 rrnA-reporter gene fusions in response to cold shock. Can J Microbiol 48, 821830.[CrossRef][Medline]
Halling, S. M., Peterson-Burch, B. D., Bricker, B. J., Zuerner, R. L., Qing, Z., Li, L. L., Kapur, V., Alt, D. P. & Olsen, S. C. (2005). Completion of the genome sequence of Brucella abortus and comparison to the highly similar genomes of Brucella melitensis and Brucella suis. J Bacteriol 187, 27152726.
Harley, C. B. & Reynolds, R. P. (1987). Analysis of E. coli promoter sequences. Nucleic Acids Res 15, 23432361.
Izquierdo, J., Venkova-Canova, T., Ramirez-Romero, M. A., Tellez-Sosa, J., Hernandez-Lucas, I., Sanjuan, J. & Cevallos, M. A. (2005). An antisense RNA plays a central role in the replication control of a repC plasmid. Plasmid 54, 259277.[Medline]
Kaneko, T., Nakamura, Y., Sato, S. & 14 other authors (2002). Complete genomic sequence of nitrogen-fixing symbiotic bacterium Bradyrhizobium japonicum USDA110 (supplement). DNA Res 9, 225256.[CrossRef][Medline]
Leong, S. A., Williams, P. H. & Ditta, G. S. (1985). Analysis of the 5' regulatory region of the gene for delta-aminolevulinic acid synthetase of Rhizobium meliloti. Nucleic Acids Res 13, 59655976.
Lisser, S. & Margalit, H. (1993). Compilation of E. coli mRNA promoter sequences. Nucleic Acids Res 21, 15071516.
MacLellan, S. R., Smallbone, L. A., Sibley, C. D. & Finan, T. M. (2005). The expression of a novel antisense gene mediates incompatibility within the large repABC family of alpha-proteobacterial plasmids. Mol Microbiol 55, 611623.[CrossRef][Medline]
Osteras, M., Driscoll, B. T. & Finan, T. M. (1995). Molecular and expression analysis of the Rhizobium meliloti phosphoenolpyruvate carboxykinase (pckA) gene. J Bacteriol 177, 14521460.
Papp, P. P. (2004). tRNA gene containing an attachment site of a temperate phage is functional: proof by converting attB to amber suppressor. GenBank accession no. AJ698943.
Paulsen, I. T., Seshadri, R., Nelson, K. E. & 28 other authors (2002). The Brucella suis genome reveals fundamental similarities between animal and plant pathogens and symbionts. Proc Natl Acad Sci U S A 99, 1314813153.
Ronson, C. W., Nixon, B. T., Albright, L. M. & Ausubel, F. M. (1987). Rhizobium meliloti ntrA (rpoN) gene is required for diverse metabolic functions. J Bacteriol 169, 24242431.
Ross, W., Aiyar, S. E., Salomon, J. & Gourse, R. L. (1998). Escherichia coli promoters with UP elements of different strengths: modular structure of bacterial promoters. J Bacteriol 180, 53755383.
Thony, B. & Hennecke, H. (1989). The 24/12 promoter comes of age. FEMS Microbiol Rev 5, 341357.[CrossRef][Medline]
Venkova-Canova, T., Soberon, N. E., Ramirez-Romero, M. A. & Cevallos, M. A. (2004). Two discrete elements are required for the replication of a repABC plasmid: an antisense RNA and a stem-loop structure. Mol Microbiol 54, 14311444.[CrossRef][Medline]
Wood, D. W., Setubal, J. C., Kaul, R. & 49 other authors (2001). The genome of the natural genetic engineer Agrobacterium tumefaciens C58. Science 294, 23172323.
Zacharias, M., Goringer, H. U. & Wagner, R. (1990). The signal for growth rate control and stringent sensitivity in E. coli is not restricted to a particular sequence motif within the promoter region. Nucleic Acids Res 18, 62716275.
Zacharias, M., Theissen, G., Bradaczek, C. & Wagner, R. (1991). Analysis of sequence elements important for the synthesis and control of ribosomal RNA in E coli. Biochimie 73, 699712.[Medline]
Received 4 December 2005;
revised 26 February 2006;
accepted 28 February 2006.
This article has been cited by other articles:
![]() |
C. Bahlawane, B. Baumgarth, J. Serrania, S. Ruberg, and A. Becker Fine-Tuning of Galactoglucan Biosynthesis in Sinorhizobium meliloti by Differential WggR (ExpG)-, PhoB-, and MucR-Dependent Regulation of Two Promoters J. Bacteriol., May 15, 2008; 190(10): 3456 - 3466. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Chen, M. Bagdasarian, M. G. Kaufman, A. K. Bates, and E. D. Walker Mutational Analysis of the ompA Promoter from Flavobacterium johnsoniae J. Bacteriol., July 15, 2007; 189(14): 5108 - 5118. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. I. Muglia, D. H. Grasso, and O. M. Aguilar Rhizobium tropici response to acidity involves activation of glutathione synthesis Microbiology, April 1, 2007; 153(4): 1286 - 1296. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Cheng, C. D. Sibley, R. Zaheer, and T. M. Finan A Sinorhizobium meliloti minE mutant has an altered morphology and exhibits defects in legume symbiosis Microbiology, February 1, 2007; 153(2): 375 - 387. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. H. Mauchline, J. E. Fowler, A. K. East, A. L. Sartor, R. Zaheer, A. H. F. Hosie, P. S. Poole, and T. M. Finan Mapping the Sinorhizobium meliloti 1021 solute-binding protein-dependent transportome PNAS, November 21, 2006; 103(47): 17933 - 17938. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Rotter, S. Muhlbacher, D. Salamon, R. Schmitt, and B. Scharf Rem, a New Transcriptional Activator of Motility and Chemotaxis in Sinorhizobium meliloti. J. Bacteriol., October 1, 2006; 188(19): 6932 - 6942. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| INT J SYST EVOL MICROBIOL | MICROBIOLOGY | J GEN VIROL |
| J MED MICROBIOL | ALL SGM JOURNALS | |