|
|
||||||||


Department of Microbiology and Alimentary Pharmabiotic Centre, University College Cork, Ireland
Correspondence
Paul W. O'Toole
pwotoole{at}ucc.ie
| ABSTRACT |
|---|
|
|
|---|
These authors contributed equally to this work.
| INTRODUCTION |
|---|
|
|
|---|
Beginning with Lactobacillus plantarum WCFS1 in 2003 (Kleerebezem et al., 2003
), five Lactobacillus genome sequences are now available. L. plantarum WCFS1 was isolated from human saliva, and is a metabolically diverse organism with a relatively large genome of 3.3 Mb (Kleerebezem et al., 2003
). This species is used for plant fermentation to produce silage (Cai et al., 1999
), but also has probiotic properties (Molin, 2001
), and its metabolic diversity was explained at a molecular level by the complexity of its genome (Kleerebezem et al., 2003
). Lactobacillus johnsonii NCC533 was isolated from human faeces, and has been thoroughly characterized for probiotic properties (reviewed by Pridmore et al., 2004
). The 1.99 Mb genome sequence of L. johnsonii NCC533 (Pridmore et al., 2004
) highlighted the host dependence of this multiply auxotrophic strain and its potential for host interaction via surface proteins. The Lactobacillus acidophilus NCFM genome sequence of 1.99 Mb (Altermann et al., 2005
) revealed the genetic basis for the biochemical complexity and host-interaction capability of this probiotic strain, which has been employed extensively by the dairy industry. The recent genome sequence determination for Lactobacillus sakei strain 23K (Chaillou et al., 2005
) provided a contrasting insight into the adaptive mechanisms of this meat-associated organism with extensive auxotrophy, including purine nucleoside scavenging, resilience against changing redox and oxygen levels, and potential mechanisms for biofilm formation.
We recently sequenced the genome of Lactobacillus salivarius UCC118 (Claesson et al., 2006
). This strain was isolated from the terminal ileum of a healthy human subject undergoing reconstructive urinary tract surgery (Dunne et al., 1999
). It has been extensively studied for its probiotic properties in human trials and animal models (Dunne et al., 1999
, 2001
; McCarthy et al., 2003
; Sheil et al., 2004
). Members of this species are commonly isolated from the oral and GI tracts of humans and other animals (Ahrne et al., 1998
; Heilig et al., 2002
; Molin et al., 1993
; Rogosa et al., 1953
). The 2.13 Mb genome of L. salivarius UCC118 comprised a 1.83 Mb chromosome, a 242 kb megaplasmid and two smaller plasmids of 20 kb and 44 kb. Annotation of the genome sequence indicated an unexpected level of prototrophy compared to other enteric lactobacilli, partly due to the contribution of megaplasmid-encoded genes, including those for completion of the pentose-phosphate pathway (Claesson et al., 2006
). The availability of the genome sequence is enabling a systems approach to investigate the interaction of L. salivarius UCC118 with the human GI tract, exemplified by genome-wide analysis of sortase-anchored adhesins (van Pijkeren et al., 2006
).
Preliminary comparative analysis of the total genome of this strain to other Lactobacilllus genomes suggested a lack of extensive genome conservation (Claesson et al., 2006
). Determination of numbers of shared orthologues revealed highest numbers with L. plantarum (61.1 % of L. salivarius proteins), followed by Enterococcus faecalis (50.4 %), L. sakei (49.8 %), L. acidophilus (46.3 %) and L. johnsonii (45.1 %); other relevant genome features are shown in Table 1
. The genomes of L. plantarum and L. johnsonii have been subjected to a detailed pairwise comparative analysis (Boekhorst et al., 2004
). A total of 28 regions of conserved gene order were identified, totalling 0.75 Mb, but these regions were not collinear. Major differences were identified in reconstructed metabolic maps, biosynthetic capabilities and extracellular protein repertoires, providing a rational basis for directing experimental analyses. A focused comparative analysis of genome synteny and exopolysaccharide biosynthesis loci in L. acidophilus, L. plantarum, L. johnsonii and Lactobacillus gasseri has also been published (Klaenhammer et al., 2005
), but exhaustive phylogenetic issues were not within the scope of that review.
|
| METHODS |
|---|
|
|
|---|
Predicted proteome comparisons.
Each of the six predicted proteomes was searched for orthologues against the other proteomes, where orthology between two proteins was defined as best bidirectional FASTA (Pearson, 2000
) hits. Such hits were also required to have at least 30 % identity over at least 80 % of both protein sequences. While this is an arbitrary definition of orthology, it has been applied in many comparative analyses from the Sanger Institute (e.g. Parkhill et al., 2003
). This orthology definition excluded hits between single-component proteins and fusion proteins. Proteins sharing the same putative orthologues with each and every one of the other genomes were identified, to generate a set of shared orthologues. This stringent approach was also applied to the five Lactobacillus genomes (i.e. E. faecalis excluded) to investigate genus-specific proteins. These were then further searched against the non-redundant database to check for other non-Lactobacillus hits. Those proteins showing no orthology with any of the other five genomes were regarded as unique.
Extraction of shared core genes and functional annotation.
To reconstruct the phylogeny of lactobacilli, our analysis started with the identification of those core genes that had been most conserved over a long period through evolution, so that the core was defined as the set of genes shared in each studied genome. To determine the core genes and their corresponding protein sequences, we extracted the orthologues present in each of the respective predicted proteomes in a two-step procedure. First we compared each protein against all the other proteins (all-against-all) using BLAST (Altschul et al., 1997
) (cutoff: E-value 1x1010) and clustered them in families' according to their homology using the BLAST results. This clustering of families was performed using MCL, a graph-theory-based Markov clustering algorithm (Van Dongen, 2000
; http://micans.org/mcl/). Following this, we selected the families that contained a single protein member in each genome, thereby discarding paralogues that could bias our analyses. Protein sequences corresponding to genes coming from mobile elements such as phages and IS elements were discarded manually. An in-house Perl script was used to compare the resulting list of proteins with that obtained using protein-based FASTA pairwise genome comparisons; only those proteins found by both methodologies were selected for analysis. A final total of 354 core proteins was obtained following this methodology.
Functional COG category assignments were collated for the core proteins thus identified, in order to gain further biological understanding from differences and similarities observed between the genomes. In rare cases where individual protein COG category assignments differed for members of core protein sets (presumably due to borderline-confidence COG assignments), the category containing the majority of proteins was adapted.
Construction of a 16S rRNA-based phylogenetic tree.
A complete phylogenetic analysis was performed using the 111 16S rRNA gene sequences corresponding to the Lactobacillus species available in public databases (this exceeds the number of validated species discussed above). A non-lactobacillus DNA sequence was also included in the 16S tree, derived from the Pediococcus pentosaceus type strain. Most of the sequences were extracted from the Ribosomal Database Project-II (Cole et al., 2005
); type strain sequences were chosen from each species where available. Those sequences that were not already curated in this database were extracted from the EMBL database using the public Sequence Retrieval System (SRS) server at EBI (Zdobnov et al., 2002
). The 16S rRNA sequence of the Bacillus subtilis type strain was added as outgroup, since previous analyses have suggested that enterococci and streptococci are too close to the lactobacilli to use as outgroups for 16S phylogeny (Heilig et al., 2002
).
Sequences were aligned with CLUSTALW (Thompson et al., 1994
) using default parameters, prior to manual removal of gap regions, which was then followed by building the phylogenetic tree using the method of maximum-likelihood implemented in PhyML (Guindon & Gascuel, 2003
) with the general time-reversible (GTR) model.
Whole-genome alignments between species at DNA and protein level.
Lactobacillus full genome sequence alignments were performed in an all-against-all comparison using MUMmer (Kurtz et al., 2004
). Comparisons were performed at both DNA and protein level. For the L. salivarius genome, the megaplasmid sequence was concatenated in succession to its chromosome sequence. To complement our graphical analysis, a quantitative approach was designed based on the numerical data obtained using MUMmer. Distances were calculated based on a modification of the method applied by Chan et al. (2006)
where the values of total length of regions aligned at protein level with an identity higher than 60 % using the PROmer (part of the MUMmer package) algorithm results replaced the total mums lengths. The calculation of the distance between two genomes is the logarithm of the PROmer alignments normalized by the size of the shortest genome being compared as follows:
|
|
Orthologue tree.
Another quantitative approach, based upon distances derived from orthologue repertoires, was undertaken to compare with the phylogeny described above. After removing paralogues and mobile elements from the orthologue sets, two phylogenetic trees were constructed; the first was based on orthologue counts, where distances DAB were calculated as the logarithm of shared orthologues between two genomes, normalized by the weighted mean of the total number of genes (Korbel et al., 2002
):
|
|
Supertree construction.
Using CLUSTALW, 354 protein families were aligned and phylogenetic trees were built using MULTIPHYL (Keane et al., 2006
, http://www.cs.nuim.ie/distributed/multiphyl.php). The best supertree was found following a heuristic search of tree space using the Most Similar Supertree (dfit) (Creevey et al., 2004
), Maximum Quartet fit (qfit) and Maximum Split fit (sfit) analysis methods as implemented in Clann 2.0.2 (Creevey & McInerney, 2005
). The final representation of the supertree was built by SplitsTree (Huson, 1998
).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
Phylogeny based on the 16S rRNA gene and diversity of lactobacilli
Since the original description of the genus Lactobacillus by Beijerinck (Beijerink, 1901
; Skerman et al., 1980
), more than 100 different species have been described. It is widely recognized that the taxonomy of this genus is unsatisfactory due to the highly heterogeneous nature of its members (Schleifer & Ludwig, 1995
). Despite the significant increase in the number of recognized Lactobacillus species, a complete phylogeny has not been published since Dellaglio's revision (Dellaglio et al., 2005
). For this comparative genomic analysis, we elected to use 16S rRNA gene phylogeny as our baseline, as it is the most established marker. A maximum-likelihood phylogenetic tree was therefore built using 111 16S rRNA gene sequences from currently available Lactobacillus species entries. The resulting tree is depicted in Fig. 1
. This analysis using the most classic molecular marker clearly depicts the biological diversity in lactobacilli; however, some order can also be observed. Based on the phylogeny reconstruction, it is possible to group lactobacilli in five main divisions by imposing the criteria of number of species (>10 species) or high bootstrap values. Some of the latter values are low at major divisions, as is typical when large numbers of taxa are analysed (Takahashi et al., 1999
). Group A is the largest division, containing 28 species, and includes the established members of the L. acidophilus complex (Kullen et al., 2000
). Species not present in any group thus correspond to branches with small numbers of species and with low bootstrap values.
|
Based on their taxonomy, Dellaglio has recently suggested reclassification of the lactobacilli (Dellaglio et al., 2005
), proposing a division into eight groups. As noted, the presence of clear divisions can also be observed in our tree topology of the 16S-based phylogeny. However, a single phylogenetic marker may be insufficient for tree resolution due to bias or lack of fine resolution, favouring usage of several markers, as proposed by the International Committee on Systematics of Prokaryotes (ICSP) (Vandamme et al., 1996
). We therefore applied a range of whole-genome-based approaches. Four of the species for which genome sequences were available fell into three of the five major 16S rRNA phylogeny groups: L. acidophilus and L johnsonii in group A, L. plantarum in group D and L. salivarius in group E. Complete genome sequences for representatives of groups B and C were not available.
Whole-genome alignments
Our first approach was the alignment of whole genome sequences. Though some degree of relationships could be visualized at DNA level (data not shown), phylogenetic signals quickly disappear as DNA sequences diverge, as a reflection of the increasing distance between the lactobacilli. The fact that long-range genome alignments of Lactobacillus spp. could not be produced at a DNA level is a significant indication of profound diversity, contrasting for example with other DNADNA interspecific alignments such as those of Mycobacterium spp. (Brussow et al., 2004
). The closest DNADNA alignments that could be achieved for the lactobacilli were between members of the proposed Lactobacillus groups, such as in group A between L. acidophilus and L. johnsonii (data not shown). Protein rather than DNA alignments were therefore employed for these interspecific genome comparisons, providing a better picture across the genus.
The degree of alignment between species (Fig. 2
) varied depending on the phylogenetic distance of the genomes being compared. The sequences with the best alignment are L. johnsonii and L. acidophilus, the almost continuous line observed in the xy graph being an expression of the high level of synteny between these two genomes. Their high degree of similarity reflected by the PROmer alignment can also be seen in their proximity in the 16S-based tree, both belonging to group A in Fig. 1
. In contrast, the least related genomes, based on their low degree of alignment, were the L. salivarius genome (group E), versus those of L. acidophilus and L. johnsonii (group A), which correlates well with the distance between these genomes observed in the 16S-based tree.
|
Quantitative data can also be extracted from genome alignments as demonstrated by Chan et al. (2006)
, who used the total lengths of the mums regions. Though the use of genome alignments at DNA level using MUMmer has been validated for reconstruction of broad phylogenetic relationships of prokaryotic genomes (Chan et al., 2006
; Henz et al., 2005
), it might lack sensitivity for reconstructing phylogenies if we limit our analysis at intrageneric level. Due to the relatedness of our genomes, we slightly modified the distance measure formula: the regions aligned using PROmer were used instead of using the mums lengths. Finally, a neighbour-joining tree was built using the calculated distances (Fig. 3
). The position of E. faecalis relative to Streptococcus thermophilus and B. subtilis used to outgroup this tree highlights the close relationship of E. faecalis with the Lactobacillus group compared to other close genera, which in turn indicates the complexity of the number of genera that might be included within this group. The close relationship between L. johnsonii and L. acidophilus is still conserved in the PROmer-based tree. However, some discrepancies in the positions of other species with respect to the 16S tree are observed. L. salivarius is no longer one of the more external species in the tree topology, this position being now occupied by L. sakei. In this new tree, L. salivarius and L. plantarum share a common ancestor based on the regions aligned at protein level. A deeper analysis based on gene content and the corresponding predicted proteome in lactobacilli was therefore performed to complement and clarify these relationships.
|
|
A logical extension of the analysis of common characteristics was to investigate species-specific proteins. Between 30 % and 50 % of each of the six genomes (i.e. including E. faecalis) analysed showed no significant similarity to the other genomes, and the corresponding proteins were thus considered unique. Fig. 4(b)
shows the COG categories of unique proteins as a percentage of the total number of proteins for each species. The largest category represents proteins of unknown function. It is also evident that the two largest genomes (L. plantarum and E. faecalis) have a much higher percentage of unique proteins involved in transcription and signal transduction. The correlation of increased genome size and increased number of proteins in these categories has been previously noted for Pseudomonas (Stover et al., 2000
). This probably reflects a requirement for more complex regulation of genome expression, as metabolic and cellular complexity increases proportional to genome size. Certain other functional categories in bacterial genomes have previously been described to correlate either positively or negatively with genome size (Konstantinidis & Tiedje, 2004
; Ranea et al., 2004
). Our analysis of the six genomes identified such categories that display the same pattern for the two largest genomes, namely transport and metabolism of carbohydrates, amino acids and secondary metabolites.
Gene duplication is postulated to have played a major role in the evolution of biological novelty and biodiversity (Hancock, 2005
). Such events within the respective genomes were elucidated by investigating and comparing paralogue numbers. Some examples where one genome has a significantly higher number of genes with the same or similar function are DNA-binding response regulators (11 in E. faecalis, 56 in others);
-glucoside phosphotransferase system (PTS) enzyme IIABC (13 in L. johnsonii, 28 in others); PTS system, IID component (13 in E. faecalis, 15 in others); multidrug transport proteins (10 in L. johnsonii, 25 in others); ABC transporter, ATP-binding proteins (10 in L. johnsonii, 25 in others); and maltose O-acetyltransferase (7 in L. johnsonii, 03 in others). Thus, gene duplications appear to be most frequent in E. faecalis and L. johnsonii, which is surprising for the latter genome, since it is relatively small. Targeted analysis of these paralogues may explain the selective pressure for their amplification in the respective species and test if they are niche-specific adaptations.
Since E. faecalis is an opportunistic pathogen that may be vancomycin resistant, we also investigated if related antibiotic-resistance genes were present in the genomes of the commensals. The vancomycin-resistance region (EF1955EF1963) of E. faecalis contains genes for an enolase, a triosephosphate isomerase and a phosphoglycerate kinase (EF1961EF1963), which are present as a conserved arrangement in all the Lactobacillus genomes except L. acidophilus. Two additional sets of adjacent genes in the pathogenicity island of E. faecalis, ornithine cyclodeaminase with two hypothetical proteins (EF0616EF618), and DNA-damage-inducible protein J with one hypothetical protein (EF0512EF0513), are present in the genome of L. salivarius (LSL_1273LSL_1275 and LSL_0030LSL_0030b, respectively). Based upon the current annotations, therefore, only metabolism genes or stress-resistance genes from this E. faecalis vancomycin-resistance locus are present in the lactobacilli.
The genome sizes of the sequenced lactobacilli are fairly homogeneous, with the exception of L. plantarum, whose genome is 6075 % larger than the others. A comparison of regions with anomalous G+C content and interruptions of synteny revealed only a few likely sites where insertions in the L. plantarum genome relative to the other four lactobacilli may have occurred (data not shown). These regions make up a total of approximately 380 kb, which fails to explain the considerable difference in size. Furthermore, no extensive gene duplications were found in the L. plantarum genome. Obvious deletions in the other Lactobacillus genomes are harder to distinguish due to the low degree of gene conservation among them. A study of the top BLAST hits of L. plantarum proteins against the non-redundant database showed that hits against the draft P. pentosaceus ATCC 25745 genome (http://www.jgi.doe.gov/) were most abundant, with 434 top matches, followed by L. salivarius (329), L. sakei (280) and L. casei ATCC 334 (229). It is theoretically possible that P. pentosaceus is a major contributor of laterally acquired genes in L. plantarum, but lineage-specific gene loss in the other lactobacilli is a more plausible explanation for genome size differences. The imminent publication of P. pentosaceus and other lactobacilli genomes will shed light on genome plasticity and diversity among these related species.
Interestingly and on a different note, there were four genes present in the core that are harboured by the megaplasmid pMP118 of L. salivarius: alcohol dehydrogenase (LSL_1901), L-serine dehydratase alpha subunit (LSL_1931), ribulose-5-phosphate-3-epimerase (LSL_1948) and ribose 5-phosphate isomerase (L. salivarius LSL_1806). The presence of these core genes in pMP118 may be evidence of a process of mobilization between the megaplasmid and the chromosome of vertically evolved genes, since no other core gene were found in the megaplasmid.
Lactobacillus phylogeny and phylogenomics
To complement the sequence-based and synteny-based phylogeny studies, we also constructed a distance-based phylogeny on the basis of orthologous gene content. Although not calculated from sequence-derived distances, trees of orthologue-number distances are based on the principle that the number of genes two genomes have in common depends on their evolutionary distance. These distances can be seen as reflecting evolutionary events such as gain and loss of genes, whereas the underlying properties such as the gene content can be interpreted in terms of function (Snel et al., 1999
). Equations 1 and 2 (see Methods) were applied to the numbers of orthologue pairs and fractions of identical protein sequences for these pairs. Table 2
shows the number of orthologues for the 15 possible comparisons between the six species, as well as the mean percentage identity for these pairs, as another measure of similarity. Neighbour-joining trees were constructed based on these values. When the neighbour-joining tree was rooted with E. faecalis, S. thermophilus and B. subtilis as outgroups (Fig. 3b
), it showed the same topology and similar evolutionary distances for both equations. Reassuringly, it was also in concordance with the topology displayed by a transformed UPGMA tree constructed from the same distance matrix (data not shown). The phylogeny based on orthology-derived distance values is also strikingly similar to that based upon PROmer-derived distances (Fig. 3a
), confirming L. sakei as being the most peripheral of the sequenced species. In this respect it contradicts the 16S rRNA gene phylogeny. However, some discrepancies are found for the positions of S. thermophilus and B. subtilis in the PROmer-based tree, indicating perhaps the limitation of the PROmer approach to reconstruct the phylogeny if it goes beyond the genus level.
|
|
|
| NOTE ADDED IN PROOF |
|---|
|
|
|---|
| ACKNOWLEDGEMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
Altermann, E., Russell, W. M., Azcarate-Peril, M. A. & 11 other authors (2005). Complete genome sequence of the probiotic lactic acid bacterium Lactobacillus acidophilus NCFM. Proc Natl Acad Sci U S A 102, 39063912.
Altschul, S. F., Madden, T. L., Schaffer, A. A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D. J. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25, 33893402.
Beijerink, M. W. (1901). Sur les ferments lactiques de l'industrie. Arch Néerl Sci Exactes Nat Série II, 212243.
Boekhorst, J., Siezen, R. J., Zwahlen, M. C. & 7 other authors (2004). The complete genomes of Lactobacillus plantarum and Lactobacillus johnsonii reveal extensive differences in chromosome organization and gene content. Microbiology 150, 36013611.
Brussow, H., Canchaya, C. & Hardt, W. D. (2004). Phages and the evolution of bacterial pathogens: from genomic rearrangements to lysogenic conversion. Microbiol Mol Biol Rev 68, 560602.
Cai, Y., Benno, Y., Ogawa, M. & Kumai, S. (1999). Effect of applying lactic acid bacteria isolated from forage crops on fermentation characteristics and aerobic deterioration of silage. J Dairy Sci 82, 520526.[Abstract]
Candela, M., Vitali, B., Matteuzzi, D. & Brigidi, P. (2004). Evaluation of the rrn operon copy number in Bifidobacterium using real-time PCR. Lett Appl Microbiol 38, 229232.[CrossRef][Medline]
Chaillou, S., Champomier-Verges, M. C., Cornet, M. & 8 other authors (2005). The complete genome sequence of the meat-borne lactic acid bacterium Lactobacillus sakei 23K. Nat Biotechnol 23, 15271533.[CrossRef][Medline]
Chan, P. Y., Lam, T. W. & Yiu, S. M. (2006). A more accurate and efficient whole genome phylogeny. In Proceedings of 4th Asia-Pacific Bioinformatics Conference, Taipei, Taiwan, pp. 337352. London: Imperial College Press.
Claesson, M. J., Li, Y., Leahy, S. & 12 other authors (2006). Multireplicon genome architecture of Lactobacillus salivarius. Proc Natl Acad Sci U S A 103, 67186723.
Cole, J. R., Chai, B., Farris, R. J., Wang, Q., Kulam, S. A., McGarrell, D. M., Garrity, G. M. & Tiedje, J. M. (2005). The Ribosomal Database Project (RDP-II): sequences and tools for high-throughput rRNA analysis. Nucleic Acids Res 33, D294D296.
Creevey, C. J. & McInerney, J. O. (2005). Clann: investigating phylogenetic information through supertree analyses. Bioinformatics 21, 390392.
Creevey, C. J., Fitzpatrick, D. A., Philip, G. K., Kinsella, R. J., O'Connell, M. J., Pentony, M. M., Travers, S. A., Wilkinson, M. & McInerney, J. O. (2004). Does a tree-like phylogeny only exist at the tips in the prokaryotes? Proc Biol Sci 271, 25512558.
Daubin, V. & Perriere, G. (2003). G+C3 structuring along the genome: a common feature in prokaryotes. Mol Biol Evol 20, 471483.
Dellaglio, F. & Felis, G. E. (2005). Taxonomy of lactobacilli and bifidobacteria. Probiotics and Prebiotics: Scientific Aspects, pp. 2549. Edited by G. W. Tannock. Norfolk, UK: Caister Academic Press.
Dellaglio, F., Felis, G. E. & Torriani, S. (2005). Is the genus Lactobacillus a single genus? In LAB8 Symposium on Lactic Acid Bacteria. Egmond aan Zee, Netherlands: FEMS.
Dunne, C., Murphy, L., Flynn, S. & 12 other authors (1999). Probiotics: from myth to reality. Demonstration of functionality in animal models of disease and in human clinical trials. Antonie Van Leeuwenhoek 76, 279292.[CrossRef][Medline]
Dunne, C., O'Mahony, L., Murphy, L. & 11 other authors (2001). In vitro selection criteria for probiotic bacteria of human origin correlation with in vivo findings. Am J Clin Nutr 73, 386S392S.
Eisen, J. A., Heidelberg, J. F., White, O. & Salzberg, S. L. (2000). Evidence for symmetric chromosomal inversions around the replication origin in bacteria. Genome Biol 1, RESEARCH0011.[Medline]
Fitch, W. M. (1970). Distinguishing homologous from analogous proteins. Syst Zool 19, 99113.[Medline]
Flynn, S. (2001). Molecular characterisation of bacteriocin producing genes and plasmid encoded functions of the probiotic strain Lactobacillus salivarius subsp. salivarius UCC118. PhD thesis, Department of Microbiology, University College Cork, Ireland.
Guarner, F. & Schaafsma, G. J. (1998). Probiotics. Int J Food Microbiol 39, 237238.[CrossRef][Medline]
Guindon, S. & Gascuel, O. (2003). A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol 52, 696704.[CrossRef][Medline]
Hammes, W. P. & Hertel, C. (2003). The Genera Lactobacillus and Carnobacterium. In The Prokaryotes, release 3.15. Edited by M. Dworkin.
Hancock, J. M. (2005). Gene factories, microfunctionalization and the evolution of gene families. Trends Genet 11, 591595.[CrossRef]
Heilig, H. G., Zoetendal, E. G., Vaughan, E. E., Marteau, P., Akkermans, A. D. & de Vos, W. M. (2002). Molecular diversity of Lactobacillus spp. and other lactic acid bacteria in the human intestine as determined by specific amplification of 16S ribosomal DNA. Appl Environ Microbiol 68, 114123.
Henz, S. R., Huson, D. H., Auch, A. F., Nieselt-Struwe, K. & Schuster, S. C. (2005). Whole-genome prokaryotic phylogeny. Bioinformatics 21, 23292335.
Huson, D. H. (1998). SplitsTree: analyzing and visualizing evolutionary data. Bioinformatics 14, 6873.
Keane, T. M., Page, A. J., Naughton, T. J., Travers, S. A. A. & McInerney, J. O. (2006). Building large phylogenetic trees on coarse-grained parallel machines. Algorithmica 45, 285300.[CrossRef]
Klaenhammer, T. R. & Russell, W. M. (2000). In Encyclopedia of Food Microbiology, pp. 11511157. Amsterdam: Elsevier.
Klaenhammer, T. R., Barrangou, R., Buck, B. L., Azcarate-Peril, M. A. & Altermann, E. (2005). Genomic features of lactic acid bacteria effecting bioprocessing and health. FEMS Microbiol Rev 29, 393409.[CrossRef][Medline]
Kleerebezem, M., Boekhorst, J., van Kranenburg, R. & 17 other authors (2003). Complete genome sequence of Lactobacillus plantarum WCFS1. Proc Natl Acad Sci U S A 100, 19901995.
Konings, W. N., Kok, J., Kuipers, O. P. & Poolman, B. (2000). Lactic acid bacteria: the bugs of the new millennium. Curr Opin Microbiol 3, 276282.[CrossRef][Medline]
Konstantinidis, K. T. & Tiedje, J. M. (2004). Trends between gene content and genome size in prokaryotic species with larger genomes. Proc Natl Acad Sci U S A 101, 31603165.
Korbel, J. O., Snel, B., Huynen, M. A. & Bork, P. (2002). SHOT: a web server for the construction of genome phylogenies. Trends Genet 18, 158162.[CrossRef][Medline]
Kullen, M. J., Sanozky-Dawes, R. B., Crowell, D. C. & Klaenhammer, T. R. (2000). Use of the DNA sequence of variable regions of the 16S rRNA gene for rapid and accurate identification of bacteria in the Lactobacillus acidophilus complex. J Appl Microbiol 89, 511516.[CrossRef][Medline]
Kurtz, S., Phillippy, A., Delcher, A. L., Smoot, M., Shumway, M., Antonescu, C. & Salzberg, S. L. (2004). Versatile and open software for comparing large genomes. Genome Biol 5, R12.[CrossRef][Medline]
McCarthy, J., O'Mahony, L., O'Callaghan, L. & 8 other authors (2003). Double blind, placebo controlled trial of two probiotic strains in interleukin 10 knockout mice and mechanistic link with cytokine balance. Gut 52, 975980.
Molin, G. (2001). Probiotics in foods not containing milk or milk constituents, with special reference to Lactobacillus plantarum 299v. Am J Clin Nutr 73, 380S385S.
Molin, G., Jeppsson, B., Johansson, M. L., Ahrne, S., Nobaek, S., Stahl, M. & Bengmark, S. (1993). Numerical taxonomy of Lactobacillus spp. associated with healthy and diseased mucosa of the human intestines. J Appl Bacteriol 74, 314323.[Medline]
Nakamura, C. E. & Whited, G. M. (2003). Metabolic engineering for the microbial production of 1,3-propanediol. Curr Opin Biotechnol 14, 454459.[CrossRef][Medline]
Parkhill, J., Sebaihia, M., Preston, A. & 50 other authors (2003). Comparative analysis of the genome sequences of Bordetella pertussis, Bordetella parapertussis and Bordetella bronchiseptica. Nat Genet 35, 3240.[CrossRef][Medline]
Pearson, W. R. (2000). Flexible sequence similarity searching with the FASTA3 program package. Methods Mol Biol 132, 185219.[Medline]
Pridmore, R. D., Berger, B., Desiere, F. & 12 other authors (2004). The genome sequence of the probiotic intestinal bacterium Lactobacillus johnsonii NCC 533. Proc Natl Acad Sci U S A 101, 25122517.
Ranea, J. A., Buchan, D. W., Thornton, J. M. & Orengo, C. A. (2004). Evolution of protein superfamilies and bacterial genome size. J Mol Biol 336, 871887.[CrossRef][Medline]
Rogosa, M., Wiseman, R. F., Mitchell, J. A., Disraely, M. N. & Beaman, A. J. (1953). Species differentiation of oral lactobacilli from man including description of Lactobacillus salivarius nov spec and Lactobacillus cellobiosus nov spec. J Bacteriol 65, 681699.
Satokari, R. M., Vaughan, E. E., Smidt, H., Saarela, M., Matto, J. & de Vos, W. M. (2003). Molecular approaches for the detection and identification of bifidobacteria and lactobacilli in the human gastrointestinal tract. Syst Appl Microbiol 26, 572584.[CrossRef][Medline]
Schleifer, K. H. & Ludwig, V. (1995). Phylogenetic relationships of lactic acid bacteria. In The Genera of Lactic Acid Bacteria, pp. 717. Edited by B. J. B. Wood & W. H. Holzapfel. Glasgow, UK: Chapman & Hall.
Sheil, B., McCarthy, J., O'Mahony, L., Bennett, M. W., Ryan, P., Fitzgibbon, J. J., Kiely, B., Collins, J. K. & Shanahan, F. (2004). Is the mucosal route of administration essential for probiotic function? Subcutaneous administration is associated with attenuation of murine colitis and arthritis. Gut 53, 694700.
Skerman, V. B. D., McGowan, V. & Sneath, P. H. A. (1980). Approved Lists of Bacterial Names. Edited by Skerman, V. B. D., McGowan, V. and Sneath, P. H. A. on behalf of the Ad Hoc Committee of the Judicial Commission of the International Committee on Systematic Bacteriology of the International Association of Microbiological Societies. Washington, DC: American Society for Microbiology.
Snel, B., Bork, P. & Huynen, M. A. (1999). Genome phylogeny based on gene content. Nat Genet 21, 108110.[CrossRef][Medline]
Stiles, M. E. (1996). Biopreservation by lactic acid bacteria. Antonie Van Leeuwenhoek 70, 331345.[CrossRef][Medline]
Stover, C. K., Pham, X. Q., Erwin, A. L. & 28 other authors (2000). Complete genome sequence of Pseudomonas aeruginosa PA01, an opportunistic pathogen. Nature 406, 959964.[CrossRef][Medline]
Takahashi, T., Satoh, I. & Kikuchi, N. (1999). Phylogenetic relationships of 38 taxa of the genus Staphylococcus based on 16S rRNA gene sequence analysis. Int J Syst Bacteriol 49, 725728.[CrossRef][Medline]
Tannock, G. W. (2004). A special fondness for lactobacilli. Appl Environ Microbiol 70, 31893194.
Thompson, J. D., Higgins, D. G. & Gibson, T. J. (1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight-matrix choice. Nucleic Acids Res 22, 46734680.
Tillier, E. R. & Collins, R. A. (2000). Genome rearrangement by replication-directed translocation. Nat Genet 26, 195197.[CrossRef][Medline]
Vandamme, P., Pot, B., Gillis, M., de Vos, P., Kersters, K. & Swings, J. (1996). Polyphasic taxonomy, a consensus approach to bacterial systematics. Microbiol Rev 60, 407438.
Van Dongen, S. (2000). Graph clustering by flow simulation. PhD thesis, University of Utrecht.
van Kranenburg, R., Golic, N., Bongers, R., Leer, R. J., de Vos, W. M., Siezen, R. J. & Kleerebezem, M. (2005). Functional analysis of three plasmids from Lactobacillus plantarum. Appl Environ Microbiol 71, 12231230.
van Pijkeren, J.-P., Canchaya, C., Ryan, K. A. & 8 other authors (2006). Comparative and functional genomics of sortase-dependent proteins in the predicted secretome of Lactobacillus salivarius UCC118. Appl Environ Microbiol 72, 41434153.