|
|
||||||||

TNO Quality of Life, PO Box 360, 3700 AJ Zeist, The Netherlands
Correspondence
Mariët J. van der Werf
vanderWerf{at}voeding.tno.nl
| ABSTRACT |
|---|
|
|
|---|
Supplementary tables are available with the online version of this paper.
Present address: BioDetection Systems BV, Kruislaan 406, 1098 SM Amsterdam, The Netherlands.
| INTRODUCTION |
|---|
|
|
|---|
Currently, two approaches are commonly employed in order to identify the relevant biomolecules from functional genomics data. Most frequently, analysis of the fold difference in expression (univariate data analysis) between the condition of interest and a reference condition is used to identify the biomolecules (i.e. genes, proteins or metabolites) that are potentially of interest. Subsequently, biomolecules whose response is above a certain threshold (e.g. more than twofold difference) are selected and studied in more detail. Increasingly, hierarchical cluster analysis (HCA) is employed for analysing functional genomics data. In this case, biomolecules or experiments with similar expression profiles are grouped. HCA is only useful when more than two functional genomics datasets of more than two situations are available. Visual inspection of the hierarchical clusters results in the identification of coexpressed biomolecules that are specifically regulated under the condition of interest (e.g. Heyer et al., 1999
; Tefferi et al., 2002
).
The drawback of both these approaches is that they generally result in large numbers of leads: in the order of tens or hundreds. As it is too time-consuming to analyse all of these in more detail with molecular biological, biochemical or bioinformatic approaches in order to (experimentally) validate the targets identified, there is a need for data analysis tools that allow one to rank the potential targets. Potentially, the fold difference in expression level can be used to rank the targets. However, the fundamental basis behind such a ranking, i.e. that biomolecules that show the largest response are also the most important for the question under study, is doubtful (van der Werf, 2005
).
Multivariate data analysis (MVDA) tools seem much better suited to prioritize leads from functional genomics datasets. These tools take into account the inherent interdependency of biomolecules. Principal component analysis (PCA) is the most frequently applied multivariate statistics tool. It has been applied for many decades in epidemiology, econometry, ecology, etc., but only recently has the potential of these tools in cellular biology been recognized (Orr & Scherf, 2002
; Michaud et al., 2003
). Although the mathematics behind PCA might seem complex to the untrained cellular biologist, the basic principle behind it is straightforward, i.e. PCA combines two, or more, correlated factors (i.e. transcripts) into one new variable, a principal component (PC) (Orr & Scherf, 2002
; van der Werf et al., 2005
). Thus in PCA the dimensionality of the dataset is reduced by replacing the original variables by a smaller number of newly formed variables that are linear combinations of the original variables and that explain the majority of the information (variability) from the experiment. For each PC, loadings (or weights) reflect the influence of the original variables, whereas scores (coefficients of the PC) reflect the contribution of each PC in every sample. PCA and the related tool principal component discriminant analysis (PCDA; Hoogerbrugge et al., 1983
) are currently mostly used for the classification of samples with a similar expression pattern, e.g. related to a specific treatment or phenotype. However, these tools are not only descriptive but also allow the identification of the specific biomolecules that are most important for the differences between the groups. The most important variables are identified by analysing the strength of the correlation of every biomolecule with the biological process of interest.
The goal of the research described here was to empirically investigate the effectiveness of the multivariate data analysis tools PCA and PCDA for the ranking of important transcripts from microarray data and to compare the results with those obtained by the fold-difference and HCA approaches. To this end, genes from Pseudomonas putida S12 were identified whose expression is specific for growth on one or more of four different carbon sources (i.e. glucose, gluconate, fructose and succinate), as the genes encoding enzymes of carbohydrate catabolism and their regulation are still largely unknown (Petruschka et al., 2002
). In order to avoid biological prejudgments during the evaluation of the data analysis tools, anonymous clone-based arrays were used. The identity of the genes only became known after completing the data analysis phase by sequencing the inserts in the clones corresponding to the relevant spots.
| METHODS |
|---|
|
|
|---|
RNA isolation.
RNA was isolated from the cells using the hot borate method, basically as described by Wan & Wilkins (1994)
. RNA purity and concentrations were determined both spectrophotometrically and on agarose gel. The RNA isolates were checked for residual RNase activity by comparing samples that were incubated for 1 h at 37 °C with the initial material on an agarose gel.
Array design.
The microarray used is a clone-based array. To this end, a chromosomal library of P. putida S12 was constructed by Baseclear (Leiden, The Netherlands). DNA fragments were obtained by shearing, and fragments of 23 kb were blunt-end cloned in pSMARTLC (Lucigen). The chromosomal library was ordered in 96-well microtitre plates and stored as glycerol stocks at 80 °C. Subsequently, in total 5000 genomic fragments were amplified by PCR, purified and arrayed as described previously (Pieterse et al., 2005
).
Fluorescent labelling and hybridization.
Differential transcript levels were determined by two-colour fluorescent hybridizations of the corresponding cDNAs on the clone-array. The RNA samples were labelled by random hexamers primed in vitro reverse transcription with either Cy5- or Cy3-labelled dUTP. Labelling, hybridization and washing were performed as previously described (Pieterse et al., 2005
). In all instances, Cy3-labelled cDNA of a batch of the fructose-F3 sample was used as the reference condition.
Image analysis.
The fluorescent signals from the two different labels on the hybridized arrays were quantified with a ScanArray Express scanner (Packard Bioscience) and Imagene 4.2 software (BioDiscovery). Spots with a Flag 0 (spots the quality of which was approved by the Imagene software package) or a Flag 3 (spots which obtained a warning by the Imagene softwate package for a manual check on their quality) were selected. Subsequently, spots from which the difference between the mean signal of the spot and the mean signal of the background was larger than zero times the background standard deviation were excluded from further analysis. After removal of the empty spots and the spots from which the signal exceeded the detection limit of the scanner, 3676 spots (68 %) remained for further analysis.
Data preprocessing and normalization.
The data from the microarray analysis were delivered as Excel files and the data were imported into Matlab (version 6.5.1; The Mathworks). Within-slide, intensity-dependent normalizations were performed with the scatter plot smoother LOWESS (polynomial order=1) using a Matlab routine (copyright 1998 by Datatool). The user-defined fraction of data used for smoothing at each point was set at 25 % for all slides. Subsequently, these preprocessed data were used as the input for significance and MVDA analysis.
Significance analysis.
Prior to significance analysis, a data transformation was applied to the normalized ratios in order to obtain a normal distribution of the data (Pieterse et al., 2005
). Significance analysis was performed by means of a 1-way ANOVA. Subsequently a Tukey HSD test was performed to determine whether a significant differential expression level (99 % confidence interval) was observed under a specific condition.
Multivariate data analysis.
Datasets were scaled [x/(xmaxxmin)] per variable prior to MVDA analysis. Two different MVDA tools were applied in Matlab (The Mathworks): (i) principal component analysis (PCA) (PLS Toolbox for Matlab, version 3.0.2, Eigenvector Research), and (ii) principal component discriminant analysis (PCDA) [algorithm reproduced from Hoogerbrugge et al. (1983)
and programmed into Matlab]. In PCDA, the centre of a group was established by taking the mean score on D1 and D2. Subsequently, the loadings were determined under the angle present between the fructose group and that of one of the other three carbon sources.
For the hierarchical clustering and visualization of the results, the programs CLUSTER and TREEVIEW were applied (Eisen et al., 1998
). Only those genes or operons were included that fell within the 99 % confidence interval of the Tukey HSD test. Average linkage clustering was performed on the natural logarithm of the ratios of the spots taken into account.
Nucleotide sequencing and sequence analysis.
Clones containing the inserts selected by MVDA as being relevant were traced back in the 96-well plates. Approximately 500 bp of both the 3'- and the 5'-end of the inserts in these clones was subsequently sequenced using universal primers based on pSMART by Baseclear (Leiden, the Netherlands), and the complete gene content of these inserts was inferred from the P. putida KT2440 genome sequence (Nelson et al., 2002
). DNA sequences were identified by similarity searches against the TIGR (www.tigr.org/) and NCBI (www.ncbi.nlm.nih.gov/) database libraries using BLAST. Gene numbers used in this study (PP numbers) are based on the gene numbering of the P. putida KT2440 genome (Nelson et al., 2002
).
| RESULTS |
|---|
|
|
|---|
In order to address the biological and technical (array) variability, P. putida S12 was grown on the four carbon sources, i.e. D-fructose [F] (µ=0·18±0·02 h1), D-glucose [G] (µ=0·28±0·02 h1), D-gluconate [N] (µ=0·21 h1) and succinate [S] (µ=0·21±0·01 h1), in triplicate in independent batch cultures. The cells were harvested and immediately quenched in order to prevent alterations in the mRNA composition of the samples (Pieterse et al., 2005
). In one instance, two samples were harvested from the same fermenter (fermentation 3 of glucose-grown cells). Subsequently, mRNA was isolated from these samples. In all instances, RNA isolated from the third fructose fermentation was used as the reference. Two independent micorarray hybridizations of every sample were performed.
PCA analysis of transcriptomes
In order to identify the transcripts that are the most important for the differences between the cells grown on the different carbon sources, the multivariate data analysis tool PCA was applied. In Fig. 1
(a), the results of the PCA analysis of the transcription datasets are visualized in a two-dimensional plot. A cloud of points is observed, with each point representing the transcriptome of the different samples (van der Werf et al., 2005
). Transcriptomes that end up close together are overall more similar, while more dissimilar transcriptomes are further apart. It can clearly be seen that, with the exception of transcriptomes of gluconate- and succinate-grown cells, the transcriptomes of cells grown on the same carbon source are more similar than the transcriptomes of cells grown on different carbon sources. Also in plots of PC1 versus PC3 and of PC2 versus PC3, the transcriptome datasets of gluconate- and succinate-grown cells slightly overlapped (results not shown). The separation of the different groups of transcriptomes originating from the same carbon sources by PCA indicates that the overall variation in the datasets due to biological and technical variation is less than the differences introduced by changing the growth substrate of P. putida S12. PC1 explains 49 % of the total variance in these datasets, while PC2 explains only 6 % of the variance.
|
As with PCA, in PCDA the strongly correlating variables are combined into one new variable that is now called a discriminant (D). The discriminants are linear combinations of the original variables, i.e. the transcripts. When the (absolute values of the) loadings for each of the transcripts in the different D's are studied, transcripts can be identified that are the most important for the variance explained by that D. D1 is mainly responsible for explaining the difference between fructose and succinate. This can most easily be seen by projecting all transcriptomes in Fig. 1(b)
on D1 (the x-axis). In a similar way, transcripts with a high absolute loading in D2 are important for explaining the difference between glucose and succinate. The 3'- and the 5'-ends of the inserts in the clones belonging to the spots with the highest absolute value for the loadings on D1 and D2 were subsequently sequenced. The identities of the genes present on these inserts (Table 1
) were identified by performing a homology search using BLAST (Altschul et al., 1990
) based mainly on the annotated P. putida KT2440 genome sequence (Nelson et al., 2002
).
|
|
|
Comparing PCDA analysis with the fold-difference and hierarchical clustering approaches
Currently, two approaches other than the above-described PCA and PCDA analysis are commonly used for identifying the important transcripts from microarray studies: the fold-difference approach and HCA. We also applied the fold-difference approach to rank the transcripts based on the ratio or, when the ratio was <1, i.e. in the case of down-regulated genes, on 1/ratio. Again, inserts of the 30 clones belonging to the spots with the highest value for (1/)ratio were sequenced and the genes present on these inserts identified (see Table S1, available as supplementary data with the online version of this paper). By and large, completely different transcripts were identified compared to the PCDA approach: only 30 % of the spots were the same.
The datasets were also analysed by HCA (Fig. 3
) and subsequently clusters of transcripts whose expression was specifically affected by (one of) the carbon sources were identified (yellow boxes in Fig. 3
). All inserts of the spots in these clusters were sequenced and the genes present on these inserts identified (see supplementary Table S2). Most of the genes identified by HCA were also identified by the PCDA and/or the fold-difference approach (see also Table 3
). However, several gene clusters, encoding proteins such as surface adhesion protein, formate dehydrogenase and flagellar proteins, were not identified in one of the other two approaches. Remarkable is the fact that also the gene clusters encoding proteins such as 6-phosphogluconate dehydratase/glucokinase (PP1010PP1011 downregulated in S), glucose-6-phosphate dehydrogenase/6-phosphogluconolactonase/2-dehydro-3-deoxyphosphogluconate aldolase (PP1022PP1024 slightly upregulated in G and N, downregulated in S) and cytochrome o ubiquinol oxidase (PP0812PP0814 upregulated in G, N and S), which are of key importance in the degradation of (one of the) carbon sources studied (Fig. 2
), are clearly visible as clusters in the HCA plot (Fig. 3
), but do not end up in the top 30 by either PCDA or ratio analysis.
|
|
| DISCUSSION |
|---|
|
|
|---|
In order to avoid bias during the data analysis, caused by a perceived immediate understanding of the importance of a specific transcript (or transcripts) being detected as relevant, and in that way directing the data analysis process, an anonymous clone-based array was used. Genome fragments of P. putida were cloned, and the inserts were amplified and spotted on glass slides. Only when spots were identified to be relevant by one of the data analysis methods were the inserts sequenced, and the identity of the transcripts unravelled. In many instances genes or operons of relevance proved to be present on multiple clones, indicating that clone-based arrays are a reliable means of identifying genes that are relevant for a specific biological process. This generic approach seems therefore very suitable for studying the comprehensive transcript response of micro-organisms whose full genome sequence is not available.
This study clearly shows that the data analysis method chosen has a profound effect on the transcripts identified as being the most relevant; large differences were observed between the transcripts that ranked the highest based on the PCA approach (results not shown), the PCDA approach or the fold-difference approach. The most frequently used method for selecting data from transcriptomics experiments, the fold-difference or ratio approach, has the disadvantage that genes that inherently show a low response, like constitutively expressed genes, are overlooked (van der Werf, 2005
; Wu, 2001
; Slonim, 2002
). In this study, this is clearly illustrated by the 6-phosphogluconate dehydratase/glucokinase (PP1010PP1011) and glucose-6-phosphate dehydrogenase/6-phosphogluconolactonase/2-dehydro-3-deoxyphosphogluconate aldolase (PP1022PP1024) operons that are only slightly up- or downregulated (at most a factor of 2; Table 3
) compared to the responses of the 2-ketogluconate 6-phosphate reductase/2-ketogluconate transporter/2-ketogluconate kinase/epimerase KguE/transcriptional regulator PtxS (PP3376PP3380) and the fructose degradation (PP0792PP0795) gene clusters that are up- and downregulated by a factor of 7 to 24 (Table 3
). The other frequently used tool for analysing transcriptome data, HCA, has the disadvantage that only gene clusters that show a specific expression profile are identified as relevant. For instance, in this study, the fructose degradation (PP0792PP0795) gene cluster, which is strongly downregulated in cells grown on glucose, gluconate and succinate, was not identified by HCA as it ended up in the large bulk of downregulated genes (lower third of the hierarchical cluster; Fig. 3
), and was therefore not specifically identified.
So far, there have only been a few isolated studies in which PC(D)A biplots, i.e. making use of the loadings under an angle, have been applied for analysing transcriptome data (Chapman et al., 2001
). PCDA is particularly well suited for the analysis of functional genomics datasets derived from samples originating from more than two different biological groups. This study clearly demonstrates that the loadings under an angle resulting from PCDA analysis are an appropriate quantitative statistical parameter with which relevant transcripts for a specific phenotype can be ranked, as illustrated by the fact that many genes encoding enzymes known to be involved in the degradation of the carbon sources studied [Fig. 2
i.e. the fructose degradation operon, the gluconokinase/gluconate transporter (PP3415PP3417) gene cluster, the 2-ketogluconate 6-phosphate reductase/2-ketogluconate transporter/2-ketogluconate kinase/epimerase KguE/transcriptional regulator PtxS (PP3376PP3380) gene cluster, and the C4-dicarboxylate transporter (PP1188)], were identified in this way. Moreover, in many instances, a transcript identified to be relevant for a specific carbon source by PCDA analysis was present on genome fragments of several other spots that were the most important, again indicating that this is relevant information, and that it is not chance correlations that identified these genes as being relevant. In this respect, the many regulatory genes on the inserts of spots identified by PCDA to strongly correlate with one of the different carbon sources are of special interest.
However, PCDA did not identify all the genes that are involved in the degradation of the different carbon sources; the complete set of genes involved in the degradation of the carbon sources studied (Fig. 2
) was only obtained by combining the results of PCDA analysis, the fold-difference and the HCA approach. In this respect, the complementary nature of the three approaches (HCA, fold-difference and PCDA; Table 3
) is very notable.
Besides the complementary nature of the different data analysis tools studied, this paper also demonstrates the strength of the clone-based array approach for the identification of relevant transcripts. It resulted in the identification of all except two of the genes involved in the degradation of the different carbohydrates studied (Fig. 2
; Lessie & Phibbs, 1984
; Temple et al., 1998
): the fructose utilization gene cluster (PP0792PP0795), the glucose utilization gene cluster (PP1010PP1012; Sage et al., 1996
), the zwfpgleda gene cluster (PP1021PP1024; Petruschka et al., 2002
; Hager et al., 2000
), the 2-ketogluconate utilization gene cluster (PP3376PP3380; Swanson et al., 2000
) and the gluconate utilization gene cluster (PP3415PP3417). Only the genes encoding glucose dehydrogenase and gluconate dehydrogenase were not positively identified. Unfortunately, the genes encoding these enzymes have so far not been isolated from a Pseudomonas species. Although these genes were annotated in the P. putida KT2240 genome (PP1444 and PP3383, respectively) a BLAST study showed no significant homology between these genes and any of the functionally characterized glucose and gluconate dehydrogenase genes. Therefore, it is possible that one of the many genes encoding hypothetical proteins identified in this study encodes one of these two enzymes. Also a sugar ABC transporter gene cluster (PP1015PP1019; Wylie & Worobec, 1994
, 1995
) was identified that was specifically induced upon growth on glucose (Table 3
). This gene cluster encodes a glucose porin and a sugar transporter, of which the sugar-binding protein (PP1015) is very likely the previously purified glucose-specific glucose binding protein (Stinson et al., 1977
), as this gene encodes a protein of a similar size as the purified protein (44·5 kDa) and has a similar amino acid composition.
The clone-based-array approach, in combination with the different data analysis tools, not only resulted in the identification of genes encoding the enzymes known to be involved in the degradation of the carbon sources studied, but also gave new insights into the physiology of the degradation of the carbon sources studied. Most remarkable was an upregulation of a large number of genes that respond to iron limitation in glucose-, gluconate- or succinate-grown cells in comparison with fructose-grown cells (Table 3
). This includes six different iron chelate receptors the siderophore receptors (PP0267, PP0535, PP0861, PP3155 and PP4217) and the ferric citrate receptor FecA (PP0867; Enz et al., 2003
) TonB, which is involved in the translocation of the iron chelate bound to the siderophore and ferric citrate receptors across the outer membrane (PP3612; Moeck & Coulton, 1998
), and the iron-responsive transcript FagA (PP0943; Hassett et al., 1997
). This coincides with an upregulation of two RNA polymerase
70 factors of the ECF subfamily (PP0162 and PP0704) that are involved in the regulation of siderophore biosynthesis (Redly & Poole, 2003
). All these transcripts are under control of the Fur repressor protein. The Fur repressor (PP4730) was not amongst the 200 clones that were sequenced in this study.
Moreover, also several cytochrome- or quinone-associated genes were specifically upregulated (i.e. PP0812, PP0813, PP0814, PP0815, PP3606) or downregulated (i.e. PP0071, PP1841, PP2010 and PP3823). This indicates that the composition of the electron-transport chain in P. putida S12 is different in fructose-grown cells compared to glucose-, gluconate- or succinate-grown cells. This is in agreement with the distinct degradation pathway for fructose compared to that of glucose and gluconate in pseudomonads (Fig. 2
; Lessie & Phibbs, 1984
; Temple et al., 1998
). Potentially, the different composition of the electron-transport chain reflects its greater importance in cells grown on glucose or gluconate, which are initially degraded extracellularly in a sequence from glucose to gluconate and subsequently 2-ketogluconate by a PQQ-dependent glucose dehydrogenase (Matsushita & Ameyama, 1982
) and a cytochrome-containing gluconate dehydrogenase (Matsushita et al., 1979
), respectively. Both glucose and gluconate dehydrogenase are directly linked to the electron-transport chain. The observed iron limitation when cells were grown on glucose or gluconate suggests that there is a larger demand for iron as the prosthetic groups in proteins, such as cytochrome-containing enzymes, when P. putida S12 is cultivated on either of these carbon sources. Gluconate dehydrogenase is known to contain dihaem cytochrome c as a prosthetic group (Matsushita et al., 1979
). Also succinate dehydrogenase is directly linked to the electron-transport chain and is a cytochrome-containing enzyme (http://www.tigr.org/tigr-scripts/CMR2/GenomePage3.spl?database=gpp).
The multivariate data analysis tools that are currently used in functional genomics research originate from other research fields. Although specific adaptations have been made to these tools in order to optimize them for biological purposes (Eisen et al., 1998
; Heyer et al., 1999
; Tamayo et al., 1999
; Tavazoie et al., 1999
), this paper demonstrates an important role for multiple complementary approaches. In the near future further improvement of multivariate data analysis methods for analysing functional genomics datasets is to be expected using mathematical considerations that are based on a molecular biological rationale. Further improvements are also expected to overcome the problem of having far more variables than samples available for statistical analysis. This can lead to both false positives and false negatives with the existing multivariate data analysis tools when applied to functional genomics datasets.
In conclusion, this paper clearly demonstrates that the data analysis method used has a large effect on the ranking of the transcripts that are relevant for a specific phenotype. The methods used in this study were complementary: only when the results of the transcripts that were ranked the highest were combined did a complete picture of the processes important for the catabolism of the different carbon sources studied become apparent. For the more subtle, regulatory processes in a cell, especially the multivariate data analysis tool PCDA seemed to be very effective, as relatively more regulator genes were identified by this method. Moreover, this study showed that anonymous cloned-based arrays provide a reliable means of identifying relevant genes from micro-organisms whose full genome sequence is not available.
| ACKNOWLEDGEMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
Altschul, S. F., Gish, W., Miller, W., Meyers, E. W. & Lipman, D. J. (1990). Basic local alignment search tool. J Mol Biol 215, 403410.[CrossRef][Medline]
Carpentier, A.-S., Riva, A., Tisseur, P., Didier, G. & Henaut, A. (2004). The operons, a criterion to compare the reliability of transcriptome analysis tools: ICA is more reliable than ANOVA, PLS and PCA. Comput Biol Chem 28, 310.[CrossRef][Medline]
Chapman, S., Schenk, P., Kazan, K. & Manners, J. (2001). Using biplots to interpret gene expression patterns in plants. Bioinformatics 18, 202204.
Eisen, M. B., Spellman, P. T., Brown, P. O. & Botstein, D. (1998). Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci U S A 95, 1486314868.
Enz, S., Brand, H., Orellana, C., Mahren, S. & Braun, V. (2003). Sites of interaction between the FecA and FecR signal transduction proteins of ferric citrate transport in Escherichia coli K-12. J Bacteriol 185, 37453752.
Hager, P. W., Calfee, M. W. & Phibbs, P. V. (2000). The Pseudomonas aeruginosa devB/SOL homolog, pgl, is a member of the hex regulon and encodes 6-phosphogluconolactonase. J Bacteriol 182, 39343941.
Hartmans, S., van der Werf, M. J. & de Bont, J. A. M. (1990). Bacterial degradation of styrene involving a novel flavin adenine dinucleotide-dependent styrene monooxygenase. Appl Environ Microbiol 56, 13471351.
Hassett, D. J., Howell, M. L., Ochsner, U. A., Vasil, M. L., Johnson, Z. & Dean, G. E. (1997). An operon containing fumC and sodA encoding fumarase C and manganese superoxide dismutase is controlled by the ferric uptake regulator in Pseudomonas aeruginosa: fur mutants produce elevated alginate levels. J Bacteriol 179, 14521459.
Heyer, L. J., Kruglyak, S. & Yooseph, S. (1999). Exploring expression data: indentification and analysis of coexpressed genes. Genome Res 9, 11061115.
Hoogerbrugge, R., Willig, S. J. & Kistemaker, P. G. (1983). Discriminant analysis by double stage principal component analysis. Anal Chem 55, 17101712.[CrossRef]
Lessie, T. G. & Phibbs, P. V. (1984). Alternative pathways of carbohydrate utilization in Pseudomonas. Annu Rev Microbiol 38, 359387.[CrossRef][Medline]
Matsushita, K. & Ameyama, M. (1982). D-Glucose dehydrogenase from Pseudomonas fluorescens, membrane-bound. Methods Enzymol 89, 149154.[CrossRef][Medline]
Matsushita, K., Shinagawa, E., Adachi, O. & Ameyama, M. (1979). Membrane-bound D-gluconate dehydrogenase from Pseudomonas aeruginosa. J Biochem 85, 11731181.
Michaud, D. J., Marsh, A. G. & Dhurjati, P. S. (2003). eXPatGen: generating dynamic expression patterns for the systematic evaluation of analytical methods. Bioinformatics 19, 11401146.
Moeck, G. S. & Coulton, J. W. (1998). TonB-dependent iron acquisition: mechanisms of siderophore-mediated active transport. Mol Microbiol 28, 675681.[CrossRef][Medline]
Nelson, K. E., Weinel, C., Paulsen, I. T. & 40 other authors (2002). Complete genome sequence and comparative analysis of the metabolically versatile Pseudomonas putida KT2440. Environ Microbiol 4, 799808.[CrossRef][Medline]
Orr, M. S. & Scherf, U. (2002). Large-scale gene expression analysis in molecular target discovery. Leukemia 16, 473477.[CrossRef][Medline]
Petruschka, L., Adlf, K., Burchardt, G., Dernedde, J., Jurgensen, J. & Hermann, H. (2002). Analysis of the zwf-pgl-eda operon in Pseudomonas putida strains H and KT2440. FEMS Microbiol Lett 215, 8995.[CrossRef][Medline]
Pieterse, B., Jellema, R. H. & van der Werf, M. J. (2005). Quenching of microbial samples for increased reliability of microarray data. J Microbiol Methods doi:10.1016/j.mimet.2005.04.035
Quackenbush, J. (2001). Computational analysis of microarray data. Nat Rev Genet 2, 418427.[CrossRef][Medline]
Redly, G. A. & Poole, K. (2003). Pyoverdine-mediated regulation of FpvA synthesis in Pseudomonas aeruginosa: involvement of a probable extracytoplasmic-function sigma factor, FpvI. J Bacteriol 185, 12611265.
Sage, A. E., Proctor, W. D. & Phibbs, P. V. (1996). A two-component response regulator, gltR, is required for glucose transport activity in Pseudomonas aeruginosa PAO1. J Bacteriol 178, 60646066.
Slonim, D. K. (2002). From patterns to pathways: gene expression data analysis comes of age. Nat Genet Suppl 32, S502S508.[CrossRef]
Stinson, M. W., Cohen, M. A. & Merrick, J. M. (1977). Purification and properties of the periplasmic glucose-binding protein of Pseudomonas aeruginosa. J Bacteriol 131, 672681.
Swanson, B. L., Hager, P., Phibbs, P., Ochsner, U., Vasil, M. & Hamood, A. N. (2000). Characterization of the 2-ketogluconate utilization operon in Pseudomonas aeruginosa PAO1. Mol Microbiol 37, 561573.[CrossRef][Medline]
Tamayo, P., Slonim, D., Mesirov, J., Zhu, Q., Kitareewan, S., Dmitrovsky, E., Lander, E. S. & Golub, T. R. (1999). Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. Proc Natl Acad Sci U S A 96, 29072912.
Tavazoie, S., Hughes, J. D., Campbell, M. J., Cho, R. J. & Church, G. M. (1999). Systematic determination of genetic network architecture. Nat Genet 22, 281285.[CrossRef][Medline]
Tefferi, A., Bolander, M. E., Ansell, S. M., Wieben, E. D. & Spelsberg, T. C. (2002). Primer on medical genomics. Part III: microarray experiments and data analysis. Mayo Clin Proc 77, 927940.
Temple, L. M., Sage, A. E., Schweizer, H. P. & Phibbs, P. V. (1998). Carbohydrate catabolism in Pseudomonas aeruginosa. In Pseudomonas, pp. 3572. Edited by T. C. Montie. New York: Plenum.
van der Werf, M. J. (2005). Towards replacing closed with open target selection strategies. Trends Biotechnol 23, 1116.[CrossRef][Medline]
van der Werf, M. J., Jellema, R. H. & Hankemeier, T. (2005). Microbial metabolomics: replacing trial-and-error by the unbiased selection and ranking of targets. J Ind Microbiol Biotechnol 32, 234252.[CrossRef][Medline]
Wan, C.-Y. & Wilkins, T. A. (1994). A modified hot borate method significantly enhances the yield of high-quality RNA from cotton (Gossypium hirsutum L.). Anal Biochem 223, 712.[CrossRef][Medline]
Wu, T. D. (2001). Analysing gene expression data from DNA microarrays to identify candidate genes. J Pathol 195, 5365.[CrossRef][Medline]
Wylie, J. L. & Worobec, E. A. (1994). Cloning and nucleotide sequence of the Pseudomonas aeruginosa glucose-selective OrpB porin gene and distribution of oprB within the family Pseudomonaceae. Eur J Biochem 220, 505512.[Medline]
Wylie, J. L. & Worobec, E. A. (1995). The OprB porin plays a central role in carbohydrate uptake in Pseudomonas aeruginosa. J Bacteriol 177, 30213026.
Received 17 June 2005;
revised 21 October 2005;
accepted 26 October 2005.
This article has been cited by other articles:
![]() |
T. del Castillo, J. L. Ramos, J. J. Rodriguez-Herva, T. Fuhrer, U. Sauer, and E. Duque Convergent Peripheral Pathways Catalyze Initial Glucose Catabolism in Pseudomonas putida: Genomic and Flux Analysis J. Bacteriol., July 15, 2007; 189(14): 5142 - 5152. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Koide, R. Z. N. Vencio, and S. L. Gomes Global Gene Expression Analysis of the Heat Shock Response in the Phytopathogen Xylella fastidiosa. J. Bacteriol., August 1, 2006; 188(16): 5821 - 5830. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| INT J SYST EVOL MICROBIOL | MICROBIOLOGY | J GEN VIROL |
| J MED MICROBIOL | ALL SGM JOURNALS | |