Microbiology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Microbiology 148 (2002), 2967-2973
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Camus, J.-C.
Right arrow Articles by Cole, S. T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Camus, J.-C.
Right arrow Articles by Cole, S. T.
Agricola
Right arrow Articles by Camus, J.-C.
Right arrow Articles by Cole, S. T.
Microbiology (2002), 148, 2967-2973.
© 2002 Society for General Microbiology


Molecular Genetics and Immunobiology of Mycobacteria

Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv

Jean-Christophe Camus1,2, Melinda J. Pryor1,2, Claudine Médigue3 and Stewart T. Cole1

Unité de Génétique Moléculaire Bactérienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex, France1
Annotation-Bases de Données (PT4), Génopole, Institut Pasteur, Paris, France2
Génoscope/UMR 8030, Atelier de Génomique Comparative, 2 rue Gaston Crémieux, 91006 Evry Cedex, France3

Author for correspondence: Stewart T. Cole. Tel: +33 1 45688446. Fax: +33 1 40613583. e-mail: stcole{at}pasteur.fr


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
Original genome annotations need to be regularly updated if the information they contain is to remain accurate and relevant. Here the complete re-annotation of the genome sequence of Mycobacterium tuberculosis strain H37Rv is presented almost 4 years after the first submission. Eighty-two new protein-coding sequences (CDS) have been included and 22 of these have a predicted function. The majority were identified by manual or automated re-analysis of the genome and most of them were shorter than the 100 codon cut-off used in the initial genome analysis. The functional classification of 643 CDS has been changed based principally on recent sequence comparisons and new experimental data from the literature. More than 300 gene names and over 1000 targeted citations have been added and the lengths of 60 genes have been modified. Presently, it is possible to assign a function to 2058 proteins (52% of the 3995 proteins predicted) and only 376 putative proteins share no homology with known proteins and thus could be unique to M. tuberculosis.

Keywords: mycobacteria, tuberculosis, genomics

Abbreviations: CDS, protein-coding sequences


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
Since the completion of the first prokaryotic genome sequence (Fleischmann et al., 1995Down ) the number of sequencing projects and related biological databases have been increasing exponentially. This has in turn led to the development of downstream sciences that take advantage of this sequence information such as comparative genomics, transcriptomics, proteomics and metabolomics. Significant advances have been made in these fields, thereby increasing our knowledge of the functions of many gene products. It is critical, therefore, that genome annotations are frequently updated if the information they contain is to remain accurate, relevant and useful. Several other genomes have thus been re-annotated recently (Dandekar et al., 2000Down ; Serres et al., 2001Down ; Gaasterland & Oprea, 2001Down ; Bocs et al., 2002Down ).

Mycobacterium tuberculosis H37Rv was first isolated in 1905, has remained pathogenic and is the most widely used strain in tuberculosis research. The complete genome sequence and annotation of this strain was published in 1998 (Cole et al., 1998Down ). The information from this project was incorporated into the public database TubercuList (http://genolist.pasteur.fr/TubercuList/) which was created using the GenoList model (Moszer et al., 2002Down ).

In this paper we describe the re-annotation of the M. tuberculosis H37Rv genome. We have manually re-evaluated each of the coding sequences (CDS) previously annotated and present the combined results of recent database searches and literature surveys. This annotation also contains new comparisons with the recently completed genome sequence of Mycobacterium leprae (Cole et al., 2001Down ).


    METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
Sequence analysis and annotation.
Each of the M. tuberculosis H37Rv CDS previously predicted and annotated (Cole et al., 1998Down ) has been manually re-analysed based on the results of BLASTP (Altschul et al., 1990Down ) and FASTA (Pearson & Lipman, 1988Down ) sequence comparisons using non-redundant data from the EMBL, TrEMBL and SWISS-PROT databases. Additional functional insight was obtained by using the PROSITE database (Falquet et al., 2002Down ) and the programs TMHMM 1.0 (Sonnhammer et al., 1998Down ) and SIGNALP (Nielsen et al., 1997Down ) to predict subcellular localization. Information about transport proteins has been incorporated from recent expert reviews (Braibant et al., 2000Down ; http://www.biology.ucsd.edu/~ipaulsen/transport/; Paulsen et al., 2000Down ). The coding sequence length has been re-evaluated in particular by examining M. tuberculosis protein families generated with the MEME/MAST program as described previously (Tekaia et al., 1999Down ). Re-annotation was supported by Artemis software release 4 (http://www.sanger.ac.uk/; Rutherford et al., 2000Down ). This update has recently been deposited in EMBL (accession no. AL123456) and in our TubercuList website (http://genolist.pasteur.fr/TubercuList). In this release (R4), a number of new features has been included where possible: SWISS-PROT accession number, synonyms, EC number and catalytic reaction for enzymes, protein family name, most important references from the literature with a qualifier for proteins studied experimentally, information from numerous published proteomic or transcriptomic studies, M. leprae orthologues, name of the putative product and a short description of the predicted function. A table with all of the predicted CDS and the corresponding functional classification, adapted from the work of Riley (1993)Down , is available from TubercuList.

Identification of new CDS.
Three independent approaches were used for detecting new potential coding sequences. In the first, some new CDS with appropriate GC content, correlation scores and codon usage were found manually during the re-annotation of the genome. In the second, a new program, AMIGA (Automatic MIcrobial Genome Annotation), was used to identify possible frameshifts and potential coding sequences that had been overlooked (for details see Bocs et al., 2002Down ). Briefly, AMIGA found the most likely CDS longer than 60 bp and merged the results with those generated by a modified GeneMark analysis. The combined results were then compared with the original annotation and the additional CDS detected by AMIGA investigated further using the criteria in the first approach and database searches. In the third approach, other CDS were found following TBLASTN searches of TubercuList using protein sequence data from the literature (Jungblut et al., 2001Down ; Rosenkrands et al., 2000aDown ; Corixa Corporation patent no. WO 97/09428, 1997) or personal communications (N. Stoker, P. Jungblut).


    RESULTS AND DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
Revising the number of genes in the genome
The original sequence and annotation of Mycobacterium tuberculosis strain H37Rv identified 3974 genes (Cole et al., 1998Down ). This included 3924 genes thought to encode proteins and 50 encoding stable RNA. Following the re-annotation, we have included 82 additional genes. All of the new genes are believed to encode polypeptides and no change has been detected in the number of RNA molecules. The numbering of the new CDS has not interfered with the labelling of the existing genes (Table 1Down). The new CDS use the same Rv number as the preceding CDS followed by a letter (e.g. Rv2307A, Rv2307B or Rv2307D have been introduced between CDS Rv2307c and Rv2308). The 82 new CDS are inventoried in Table 1Down and comprise 75 found by re-examining the genome using manual or automatic analysis methods (see Methods) and seven that were included based on experimental results from other laboratories.


View this table:
[in this window]
[in a new window]
 
Table 1. Rv numbers of new and removed CDS, and CDS with lengths changed

 
The cut-off for gene length used during the initial analysis of the genome was 100 codons (Cole et al., 1998Down ) and unknown genes smaller than this are often difficult to identify reliably by predictive methods using hidden Markov models or by other approaches that do not use sequence similarity. Lowering the gene length threshold, coupled with checking for appropriate codon usage, resulted in the identification of 27 putative new CDS with a median length of 207 bases, e.g. Rv0979A (product: 57 aa) and Rv0634B (product: 55 aa). These two protein-coding genes share homology with the products of rpmF and rpmG, respectively, and in several other micro-organisms they encode part of the ribosomal machinery. Another gene, Rv1028A, was identified because we have changed the parameters of the BLAST program and thus detected similarities with other orthologues. This short CDS, termed kdpF, is localized in the kdp cluster involved in phosphate transport and is thought to play a role in the stabilization of the kdp system in Escherichia coli (Gassel et al., 1999Down ).

Using AMIGA, an in-house program employing M. tuberculosis codon usage (see Methods), we have found 30 other CDS whose putative products comprise between 45 and 211 aa (Table 1Up; see AMI). With these different methods, nine new CDS replaced genes identified in the original annotation, but localized on the opposite strand. In these cases, the same Rv number has been kept but the letter extension c, denoting localization on the complementary strand, has been added or removed as appropriate. AMIGA also predicts real or potential frameshifts, and these results led to the correction of four sequencing errors. The current nucleotide sequence now contains 4411532 nt.

Seven other CDS were identified using data from the literature. Thus, the proteomic study of Jungblut et al. (2001)Down identified five new proteins by two-dimensional electrophoresis and mass spectrometry. Two other CDS were uncovered using the results of antigen discovery programmes of the Corixa Corporation (Patent no. WO 97/09428, 1997) and the Statens Serum Institute (Rosenkrands et al., 2000aDown ), respectively.

All new CDS have been classified into one of the 11 functional classes used by Cole et al. (1998)Down and described in Table 2Down. For 60 of the putative proteins we were unable to predict a precise function and this group includes proteins classified as conserved hypothetical proteins (class 10), unknown proteins without homology (class 8), proteins putatively involved in cell-wall processes or localized in the membrane (class 3) and two PE family proteins (class 6). Functions were predicted for 22 of the new proteins, e.g. RpmF or KdpF as discussed previously and classified in classes 2 (information pathways) or 3 (cell-wall and cell processes), respectively.


View this table:
[in this window]
[in a new window]
 
Table 2. Functional classification of Mycobacterium tuberculosis genes

 
Updating the annotation of CDS
To enhance the value of the annotation, systematic re-analysis has been undertaken to include principally the findings from reiterative BLASTP/FASTA searches and new scientific data from the literature. We have updated all the protein-coding genes previously identified in 1998 and tried to assign new or more precise functions when possible. This re-annotation led to changes in the functional classification of more than 600 CDS, the annotation of ~350 other CDS has been changed without affecting their classification and almost 300 gene names have been added. In addition, we have altered the related product description of 33 CDS, in many cases making this more specific, and changed the gene name of >50 existing CDS. Where gene names have been changed, the old name is retained within the note. Over 1000 targeted literature citations have been added to support the functional information.

In addition, during the new analysis, the lengths of 60 coding sequences have been altered based on the results of BLASTP/FASTA and MEME/MAST or due to correction of the nucleotide sequence (Table 1Up). Thus 35 CDS have been shortened and 15 extended at their 5' ends, whereas 10 have been altered at their 3' ends due to sequence corrections or errors in the original annotation. One gene, Rv2233, has been removed, but not replaced as this merged with the preceding CDS, Rv2232. When the coding sequence of a previously determined gene has been shortened, we have checked for the possible presence of a new CDS in another reading frame. So, for example, after shortening the smc gene, encoding a putative chromosome segregation protein, the acyP gene (Rv2922A) was identified.

Changes to functional classes
The functional classification of 643 of the predicted proteins of M. tuberculosis H37Rv identified in 1998 has been changed during the update (Fig. 1aDown). Not unexpectedly, the two functional classes exhibiting the greatest number of transfers to other classes were the unknown category with 354 changes (class 8) and the conserved hypotheticals category with 183 changes (class 10). Figs 1(bDown, cDown) show the functional categories to which the protein-coding genes annotated as unknown and conserved hypothetical proteins in 1998 have moved.



View larger version (24K):
[in this window]
[in a new window]
 
Fig. 1. Changes to the functional classification of protein-coding genes following the re-annotation of the M. tuberculosis H37Rv genome. (a) The percentage of protein-coding genes changed in each of the functional classes. (b) The functional classes where the protein-coding genes originally annotated as unknown function (class 8) have moved. (c) The functional classes where the protein-coding genes originally annotated as conserved hypothetical proteins (class 10) have moved.

 
During the re-annotation, functions have been postulated for 94 unknown proteins based on new sequence similarities with other proteins or experimental data from the literature (studies on M. tuberculosis or other organisms). For example, the putative protein encoded by Rv2476c (class 8) now shows consistent homology with several gdh products, NAD+-dependent glutamate dehydrogenases (EC 1.4.1.2), which have been well characterized recently in other bacteria (Kersten et al., 1999Down ; Minambres et al., 2000Down ; Lu & Abdelal, 2001Down ). Consequently, Rv2476c has been transferred to class 7. However, the majority of the class 8 proteins (245) have been reclassified now as conserved hypothetical proteins without any indication of a function. These changes are principally due to an increase in genomic data generated from sequencing projects in the last 4 years. Notably, many of the M. tuberculosis CDS have orthologues in M. leprae (Cole et al., 2001Down ; http://genolist.pasteur.fr/Leproma/) and Streptomyces coelicolor (Bentley et al., 2002Down ; http://www.sanger.ac.uk/Projects/S_coelicolor/).

Many of the class 10 and class 8 proteins have been reclassified in the cell wall and cell process category (class 3) (Fig. 1bUp, cUp). There are two reasons for these changes. First, the criteria for classification in this group (class 3) have been amended since 1998. Class 3 now comprises all predicted membrane proteins or proteins believed to be involved in a cell process (including secreted and transmembrane proteins), regardless of whether they have similarities with other proteins or a predicted function. For example in 1998, Rv0970, encoding an integral membrane protein with no similarity to other proteins, was classified as unknown (class 8). Second, some of the predicted proteins have moved to the cell-wall and cell processes group because of new information from the databases or new data from research. For example, Rv2450c is believed to encode a bacterial growth factor or cytokine involved in promoting the resuscitation and growth of dormant cells (Mukamalova et al., 1998Down ) and has been renamed rpfE (M. Young, personal communication).

There have also been transfers from the other functional groups as follows (Fig. 1aUp): 63 transfers from class 7 (intermediary metabolism and respiration), 24 transfers from class 3, 11 transfers from class 9 (regulatory proteins), 6 transfers from class 1 (lipid metabolism), 1 transfer from class 0 (virulence, detoxification or adaptation) and 1 transfer from class 2 (information pathways). These transfers often involve a change from a predicted function in one class to a more precise function in another (e.g. 7 to 1), but can also involve a regression. For example, Rv3522 was predicted to be a transcriptional regulatory protein in the first annotation (class 9), but re-analysis of its sequence leads us to consider it to be involved in lipid metabolism (class 1). The total number of regressions was 82, the largest group involving transfer from class 7 to 10. The original annotation of these putative proteins involved either incorrect analysis of the results or the existence of errors in the database used. For example, the CDS Rv0382c was originally annotated as a probable uridine 5'-monophosphate synthase based on the best similarities after BLASTP or FASTA analysis. However, this assignment is misleading as uridine 5'-monophosphate synthase is a bifunctional enzyme containing orotate phosphoribosyltransferase activity (EC 2.4.2 . 10) at its N terminus and orotidine 5'-phosphate decarboxylase activity (EC 4.1.1.23) at its C terminus. Several proteins annotated as uridine 5'-monophosphate synthases in the databases share similarity only at their N terminus and these are therefore incorrectly annotated because they lack the second domain found in authentic uridine 5'-monophosphate synthases. We thus consider the Rv0382c product to be an orotate phosphoribosyltransferase and not a uridine 5'-monophosphate synthase as predicted in the first annotation. Note that in certain regressions, gene names attributed in 1998 have been removed.

Changes within functional classes
Updating the genome annotation of M. tuberculosis H37Rv has also resulted in many changes within the functional classes usually due to new information from the literature. These have included updating the product names, changing 58 specific gene names and introducing appropriate new citations. Gene names were included when there was significant similarity or a pertinent publication. For example, the previously annotated umaA2 (Rv0470c), unknown mycolic acid synthase, has recently been changed to pcaA, encoding an S-adenosyl methionine (SAM)-dependent methyl transferase, required for {alpha}-mycolic acid cyclopropanation and lethal chronic persistence in M. tuberculosis infection (Glickman et al., 2000Down ). On occasion, a gene name has been given to a CDS which previously had just an Rv number, e.g. Rv0981 is identified now as mpr (Zahrt & Deretic, 2001Down ).

Changes have also been generated from new sequences in the databases and from detailed studies of synteny, particularly with the related pathogen M. leprae. For example, Rv1860 was previously known as apa, encoding a 45–47 kDa secreted antigen (Laqueyrerie et al., 1995Down ), located at the end of the modABC operon (molybdate transport system). Its name was recently changed to modD, on the basis of its proximity to modABC, and it is now part of the ModD family described by SWISS-PROT (e.g. P46842, Q50906). However, the protein which has been shown to be glycosylated and to have fibronectin-binding activity (Schorey et al., 1995Down ) shares no significant sequence similarity with other proteins involved in molybdate uptake. Furthermore, all of the functions involved in molybdate transport, and the enzymes that synthesize or require molybdopterin for activity, have been inactivated or lost from M. leprae (Eiglmeier et al., 2001Down ) with the exception of modD. This strongly suggests that ModD does not participate in molybdate uptake and we propose, therefore, that the name apa should be maintained.

Functional distribution of predicted CDS in 1998 and 2002
The new genomic annotation of M. tuberculosis H37Rv has incorporated many changes to the functional classifications of the predicted proteins. A comparison of the number of predicted proteins in each of the functional categories between 1998 and 2002 is shown in Table 2Up. An important change has been the decrease in the number of unknown proteins from 606 to 272. Presently we are able to predict a function for 2058 proteins (52% of the proteome) and more than 150 of these have been experimentally proven in mycobacterial research. The number of conserved hypothetical proteins has changed from 910 in 1998 to 1051 today. 376 putative proteins show no similarity to known proteins from other organisms and some of them may be specific to M. tuberculosis. To date, more than 400 M. tuberculosis proteins have been detected experimentally, most of them by proteomic studies (Weldingh et al., 1998Down ; Jungblut et al., 1999Down ; Mollenkopf et al., 1999Down ; Rosenkrands et al., 2000bDown ; Betts et al., 2000Down ). In the coming years, the number of unknown proteins should continue to decrease as more similarities are found by database searches or as functions are identified for some of these potentially M. tuberculosis-specific proteins. The structural genomics programmes currently under way on mycobacteria should have a significant impact in this respect (http://www.doe-mbi.ucla.edu/TB/; http://www.pasteur.fr/recherche/X-TB/).


    ACKNOWLEDGEMENTS
 
We thank Q. T. Huynh, B. Caudron, L. Jones and N. Joly for their generous assistance with informatics, and J. Parkhill and K. D. James for help with sequence analysis. Special thanks to N. Stoker, P. Jungblut, I. Rosenkrands, A. Wietzorrek, A. Marcel, L. Frangeul, T. Garnier and T. Stinear, and the members of the mycobacterial research community who provided helpful comments. Financial support for this work was from the Wellcome Trust and the Génopole programme.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. (1990). Basic local alignment search tool. J Mol Biol 215, 403-410.[Medline]

Bentley, S. D., Chater, K. F., Cerdeno-Tarraga, A. M. & 40 other authors (2002). Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2). Nature 417, 141–147.[Medline]

Betts, J. C., Dodson, P., Quan, S., Lewis, A. P., Thomas, P. J., Duncan, K. & McAdam, R. A. (2000). Comparison of the proteome of the Mycobacterium tuberculosis strain H37Rv with clinical isolate CDC1551. Microbiology 146, 3205-3216.[Abstract/Free Full Text]

Bocs, S., Danchin, A. & Medigue, C. (2002). Re-annotation of genome microbial CoDing-Sequences: finding new genes and inaccurately annotated genes. BMC Bioinformatics 3, 1-5.[Medline]

Braibant, M., Gilot, P. & Content, J. (2000). The ATP binding cassette (ABC) transport systems of Mycobacterium tuberculosis. FEMS Microbiol Rev 24, 449-467.[Medline]

Cole, S. T., Brosch, R., Parkhill, J. & 39 other authors (1998). Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature 393, 537–544.[Medline]

Cole, S. T., Eiglmeier, K., Parkhill, J. & 40 other authors (2001). Massive gene decay in the leprosy bacillus. Nature 409, 1007–1011.[Medline]

Dandekar, T., Huynen, M., Regula, J. T. & 10 other authors (2000). Re-annotating the Mycoplasma pneumoniae genome sequence: adding value, function and reading frames. Nucleic Acids Res 28, 3278–3288.[Abstract/Free Full Text]

Eiglmeier, K., Parkhill, J., Honore, N., & 12 other authors (2001). The decaying genome of Mycobacterium leprae. Lepr Rev 72, 387–398.[Medline]

Falquet, L., Pagni, M., Bucher, P., Hulo, N., Sigrist, C. J., Hofmann, K. & Bairoch, A. (2002). The PROSITE database, its status in 2002. Nucleic Acids Res 30, 235-238.[Abstract/Free Full Text]

Fleischmann, R. D., Adams, M. D., White, O. & 37 other authors (1995). Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science 269, 496–512.[Abstract/Free Full Text]

Gaasterland, T. & Oprea, M. (2001). Whole-genome analysis: annotations and updates. Curr Opin Struct Biol 11, 377-381.[Medline]

Gassel, M., Mollenkamp, T., Puppe, W. & Altendorf, K. (1999). The KdpF subunit is part of the K(+)-translocating Kdp complex of Escherichia coli and is responsible for stabilization of the complex in vitro. J Biol Chem 274, 37901-37907.[Abstract/Free Full Text]

Glickman, M. S., Cox, J. S. & Jacobs, W. R.Jr (2000). A novel mycolic acid cyclopropane synthetase is required for cording, persistence, and virulence of Mycobacterium tuberculosis. Mol Cell 5, 717-727.[Medline]

Jungblut, P. R., Schaible, U. E., Mollenkopf, H. J. & 7 other authors (1999). Comparative proteome analysis of Mycobacterium tuberculosis and Mycobacterium bovis BCG strains: towards functional genomics of microbial pathogens. Mol Microbiol 33, 1103–1117.[Medline]

Jungblut, P. R., Muller, E. C., Mattow, J. & Kaufmann, S. H. (2001). Proteomics reveals open reading frames in Mycobacterium tuberculosis H37Rv not predicted by genomics. Infect Immun 69, 5905-5907.[Abstract/Free Full Text]

Kersten, M. A., Muller, Y., Baars, J. J., Op den Camp, H. J., van der Drift, C., Van Griensven, L. J., Visser, J. & Schaap, P. J. (1999). NAD+-dependent glutamate dehydrogenase of the edible mushroom Agaricus bisporus: biochemical and molecular characterization. Mol Gen Genet 261, 452-462.[Medline]

Laqueyrerie, A., Militzer, P., Romain, F., Eiglmeier, K., Cole, S. T. & Marchal, G. (1995). Cloning, sequencing, and expression of the apa gene coding for the Mycobacterium tuberculosis 45/47-kilodalton secreted antigen complex. Infect Immun 63, 4003-4010.[Abstract]

Lu, C. D. & Abdelal, A. T. (2001). The gdhB gene of Pseudomonas aeruginosa encodes an arginine-inducible NAD(+)-dependent glutamate dehydrogenase which is subject to allosteric regulation. J Bacteriol 183, 490-499.[Abstract/Free Full Text]

Minambres, B., Olivera, E. R., Jensen, R. A. & Luengo, J. M. (2000). A new class of glutamate dehydrogenases (GDH). Biochemical and genetic characterization of the first member, the AMP-requiring NAD-specific GDH of Streptomyces clavuligerus. J Biol Chem 275, 39529-39542.[Abstract/Free Full Text]

Mollenkopf, H. J., Jungblut, P. R., Raupach, B., Mattow, J., Lamer, S., Zimny-Arndt, U., Schaible, U. E. & Kaufmann, S. H. (1999). A dynamic two-dimensional polyacrylamide gel electrophoresis database: the mycobacterial proteome via Internet. Electrophoresis 20, 2172-2180.[Medline]

Moszer, I., Jones, L. M., Moreira, S., Fabry, C. & Danchin, A. (2002). SubtiList: the reference database for the Bacillus subtilis genome. Nucleic Acids Res 30, 62-65.[Abstract/Free Full Text]

Mukamolova, G. V., Kaprelyants, A. S., Young, D. I., Young, M. & Kell, D. B. (1998). A bacterial cytokine. Proc Natl Acad Sci USA 95, 8916-8921.[Abstract/Free Full Text]

Nielsen, H., Engelbrecht, J., Brunak, S. & von Heijne, G. (1997). Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Protein Eng 10, 1-6.[Abstract/Free Full Text]

Paulsen, I. T., Nguyen, L., Sliwinski, M. K., Rabus, R. & Saier, M. H.Jr (2000). Microbial genome analyses: comparative transport capabilities in eighteen prokaryotes. J Mol Biol 301, 75-100.[Medline]

Pearson, W. R. & Lipman, D. J. (1988). Improved tools for biological sequence comparison. Proc Natl Acad Sci USA 85, 2444-2448.[Abstract/Free Full Text]

Riley, M. (1993). Functions of the gene products of Escherichia coli. Microbiol Rev 57, 862-952.[Abstract/Free Full Text]

Rosenkrands, I., Weldingh, K., Jacobsen, S., Hansen, C. V., Florio, W., Gianetri, I. & Andersen, P. (2000a). Mapping and identification of Mycobacterium tuberculosis proteins by two-dimensional gel electrophoresis, microsequencing and immunodetection. Electrophoresis 21, 935-948.[Medline]

Rosenkrands, I., King, A., Weldingh, K., Moniatte, M., Moertz, E. & Andersen, P. (2000b). Towards the proteome of Mycobacterium tuberculosis. Electrophoresis 21, 3740-3756.[Medline]

Rutherford, K., Parkhill, J., Crook, J., Horsnell, T., Rice, P., Rajandream, M. A. & Barrell, B. (2000). Artemis: sequence visualization and annotation. Bioinformatics 16, 944-945.[Abstract/Free Full Text]

Schorey, J. S., Li, Q., McCourt, D. W., Bong-Mastek, M., Clark-Curtiss, J. E., Ratliff, T. L. & Brown, E. J. (1995). A Mycobacterium leprae gene encoding a fibronectin binding protein is used for efficient invasion of epithelial cells and Schwann cells. Infect Immun 63, 2652-2657.[Abstract]

Serres, M. H., Gopal, S., Nahum, L. A., Liang, P., Gaasterland, T. & Riley, M. (2001). A functional update of the Escherichia coli K-12 genome. Genome Biol 2, 0035.1–0035.7.

Sonnhammer, E. L., von Heijne, G. & Krogh, A. (1998). A hidden Markov model for predicting transmembrane helices in protein sequences. Proc Int Conf Intell Syst Mol Biol 6, 175-182.[Medline]

Tekaia, F., Gordon, S. V., Garnier, T., Brosch, R., Barrell, B. G. & Cole, S. T. (1999). Analysis of the proteome of Mycobacterium tuberculosis in silico. Tuber Lung Dis 79, 329-342.[Medline]

Weldingh, K., Rosenkrands, I., Jacobsen, S., Rasmussen, P. B., Elhay, M. J. & Andersen, P. (1998). Two-dimensional electrophoresis for analysis of Mycobacterium tuberculosis culture filtrate and purification and characterization of six novel proteins. Infect Immun 66, 3492-3500.[Abstract/Free Full Text]

Zahrt, T. C. & Deretic, V. (2001). Mycobacterium tuberculosis signal transduction system required for persistent infections. Proc Natl Acad Sci USA 98, 12706-12711.[Abstract/Free Full Text]

Received 2 April 2002; revised 27 June 2002; accepted 18 July 2002.


This article has been cited by other articles:


Home page
J. Bacteriol.Home page
S. K. Ward, E. A. Hoye, and A. M. Talaat
The Global Responses of Mycobacterium tuberculosis to Physiological Levels of Copper
J. Bacteriol., April 15, 2008; 190(8): 2939 - 2946.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Lechat, L. Hummel, S. Rousseau, and I. Moszer
GenoList: an integrated environment for comparative analysis of microbial genomes
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D469 - D474.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
C. Z. Schneider, T. Parish, L. A. Basso, and D. S. Santos
The Two Chorismate Mutases from both Mycobacterium tuberculosis and Mycobacterium smegmatis: Biochemical Analysis and Limited Regulation of Promoter Activity by Aromatic Amino Acids
J. Bacteriol., January 1, 2008; 190(1): 122 - 134.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
I. Rajan, S. Aravamuthan, and S. S. Mande
Identification of compositionally distinct regions in genomes using the centroid method
Bioinformatics, October 15, 2007; 23(20): 2672 - 2677.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Roback, J. Beard, D. Baumann, C. Gille, K. Henry, S. Krohn, H. Wiste, M.I. Voskuil, C. Rainville, and R. Rutherford
A predicted operon map for Mycobacterium tuberculosis
Nucleic Acids Res., August 1, 2007; 35(15): 5085 - 5095.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. Becq, M. C. Gutierrez, V. Rosas-Magallanes, J. Rauzier, B. Gicquel, O. Neyrolles, and P. Deschavanne
Contribution of Horizontally Acquired Genomic Islands to the Evolution of the Tubercle Bacilli
Mol. Biol. Evol., August 1, 2007; 24(8): 1861 - 1871.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
A. M. Talaat, S. K. Ward, C.-W. Wu, E. Rondon, C. Tavano, J. P. Bannantine, R. Lyons, and S. A. Johnston
Mycobacterial Bacilli Are Metabolically Active during Chronic Tuberculosis in Murine Lungs: Insights from Genome-Wide Transcriptional Profiling
J. Bacteriol., June 1, 2007; 189(11): 4265 - 4274.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
T. Parish, G. Roberts, F. Laval, M. Schaeffer, M. Daffe, and K. Duncan
Functional Complementation of the Essential Gene fabG1 of Mycobacterium tuberculosis by Mycobacterium smegmatis fabG but Not Escherichia coli fabG
J. Bacteriol., May 15, 2007; 189(10): 3721 - 3728.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
C. J. Alteri, J. Xicohtencatl-Cortes, S. Hess, G. Caballero-Olin, J. A. Giron, and R. L. Friedman
Mycobacterium tuberculosis produces pili during human infection
PNAS, March 20, 2007; 104(12): 5145 - 5150.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
K. Ranguelova, S. Girotto, G. J. Gerfen, S. Yu, J. Suarez, L. Metlitsky, and R. S. Magliozzo
Radical Sites in Mycobacterium tuberculosis KatG Identified Using Electron Paramagnetic Resonance Spectroscopy, the Three-dimensional Crystal Structure, and Electron Transfer Couplings
J. Biol. Chem., March 2, 2007; 282(9): 6255 - 6264.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
K. G. Thakur, A. M. Joshi, and B. Gopal
Structural and Biophysical Studies on Two Promoter Recognition Domains of the Extra-cytoplasmic Function {sigma} Factor {sigma}C from Mycobacterium tuberculosis
J. Biol. Chem., February 16, 2007; 282(7): 4711 - 4718.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
S. C. G. Rison, J. Mattow, P. R. Jungblut, and N. G. Stoker
Experimental determination of translational starts using peptide mass mapping and tandem mass spectrometry within the proteome of Mycobacterium tuberculosis
Microbiology, February 1, 2007; 153(2): 521 - 528.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
S.-K. Kim, S. K. Reddy, B. C. Nelson, G. B. Vasquez, A. Davis, A. J. Howard, S. Patterson, G. L. Gilliland, J. E. Ladner, and P. T. Reddy
Biochemical and Structural Characterization of the Secreted Chorismate Mutase (Rv1885c) from Mycobacterium tuberculosis H37Rv: an *AroQ Enzyme Not Regulated by the Aromatic Amino Acids
J. Bacteriol., December 15, 2006; 188(24): 8638 - 8648.
[Abstract] [Full Text] [PDF]


Home page
Antimicrob. Agents Chemother.Home page
U. H. Manjunatha, R. Lahiri, B. Randhawa, C. S. Dowd, J. L. Krahenbuhl, and C. E. Barry III
Mycobacterium leprae Is Naturally Resistant to PA-824.
Antimicrob. Agents Chemother., October 1, 2006; 50(10): 3350 - 3354.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
P. Johansson, A. Castell, T. A. Jones, and K. Backbro
Structure and function of Rv0130, a conserved hypothetical protein from Mycobacterium tuberculosis
Protein Sci., October 1, 2006; 15(10): 2300 - 2309.
[Abstract] [Full Text] [PDF]


Home page
J. Clin. Microbiol.Home page
C. Benabdesselem, D. M. Fathallah, R. C. Huard, H. Zhu, M. A. Jarboui, K. Dellagi, J. L. Ho, and R. M. Barbouche
Enhanced Patient Serum Immunoreactivity to Recombinant Mycobacterium tuberculosis CFP32 Produced in the Yeast Pichia pastoris Compared to Escherichia coli and Its Potential for Serodiagnosis of Tuberculosis.
J. Clin. Microbiol., September 1, 2006; 44(9): 3086 - 3093.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. M. Joshi, A. K. Pandey, N. Capite, S. M. Fortune, E. J. Rubin, and C. M. Sassetti
Characterization of mycobacterial virulence genes through genetic interaction mapping
PNAS, August 1, 2006; 103(31): 11760 - 11765.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
C. T. Madsen, L. Jakobsen, K. Buriankova, F. Doucet-Populaire, J.-L. Pernodet, and S. Douthwaite
Methyltransferase Erm(37) Slips on rRNA to Confer Atypical Resistance in Mycobacterium tuberculosis
J. Biol. Chem., November 25, 2005; 280(47): 38942 - 38947.
[Abstract] [Full Text] [PDF]


Home page
Int. J. Syst. Evol. Microbiol.Home page
B. Gao and R. S. Gupta
Conserved indels in protein sequences that are characteristic of the phylum Actinobacteria
Int J Syst Evol Microbiol, November 1, 2005; 55(6): 2401 - 2412.
[Abstract] [Full Text] [PDF]


Home page
Antimicrob. Agents Chemother.Home page
C. E. Maus, B. B. Plikaytis, and T. M. Shinnick
Molecular Analysis of Cross-Resistance to Capreomycin, Kanamycin, Amikacin, and Viomycin in Mycobacterium tuberculosis
Antimicrob. Agents Chemother., August 1, 2005; 49(8): 3192 - 3197.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
A. Castell, P. Johansson, T. Unge, T. A. Jones, and K. Backbro
Rv0216, a conserved hypothetical protein from Mycobacterium tuberculosis that is essential for bacterial survival during infection, has a double hotdog fold
Protein Sci., July 1, 2005; 14(7): 1850 - 1862.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
P.-E. Jacques, A. L. Gervais, M. Cantin, J.-F. Lucier, G. Dallaire, G. Drouin, L. Gaudreau, J. Goulet, and R. Brzezinski
MtbRegList, a database dedicated to the analysis of transcriptional regulation in Mycobacterium tuberculosis
Bioinformatics, May 15, 2005; 21(10): 2563 - 2565.
[Abstract] [Full Text] [PDF]


Home page
Eur Respir JHome page
S. B. Sable, I. Verma, D. Behera, and G. K. Khuller
Human immune recognition-based multicomponent subunit vaccines against tuberculosis
Eur. Respir. J., May 1, 2005; 25(5): 902 - 910.
[Abstract] [Full Text] [PDF]


Home page
Antimicrob. Agents Chemother.Home page
C. E. Maus, B. B. Plikaytis, and T. M. Shinnick
Mutation of tlyA Confers Capreomycin Resistance in Mycobacterium tuberculosis
Antimicrob. Agents Chemother., February 1, 2005; 49(2): 571 - 577.
[Abstract] [Full Text] [PDF]


Home page
J. Clin. Microbiol.Home page
K. Weldingh, I. Rosenkrands, L. M. Okkels, T. M. Doherty, and P. Andersen
Assessing the Serodiagnostic Potential of 35 Mycobacterium tuberculosis Proteins and Identification of Four Novel Serological Antigens
J. Clin. Microbiol., January 1, 2005; 43(1): 57 - 65.
[Abstract] [Full Text] [PDF]


Home page
Mol. Biol. CellHome page
K. G. Mawuenyega, C. V. Forst, K. M. Dobos, J. T. Belisle, J. Chen, E. M. Bradbury, A. R.M. Bradbury, and X. Chen
Mycobacterium tuberculosis Functional Network Analysis by Global Subcellular Protein Profiling
Mol. Biol. Cell, January 1, 2005; 16(1): 396 - 404.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
R. Vincentelli, S. Canaan, V. Campanacci, C. Valencia, D. Maurin, F. Frassinetti, L. Scappucini-Calvo, Y. Bourne, C. Cambillau, and C. Bignon
High-throughput automated refolding screening of inclusion bodies
Protein Sci., October 22, 2004; 13(10): 2782 - 2792.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. G. Tsolaki, A. E. Hirsh, K. DeRiemer, J. A. Enciso, M. Z. Wong, M. Hannan, Y.-O. L. G. de la Salmoniere, K. Aman, M. Kato-Maeda, and P. M. Small
From the Cover: Functional and evolutionary genomics of Mycobacterium tuberculosis: Insights from genomic deletions in 100 strains
PNAS, April 6, 2004; 101(14): 4865 - 4870.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
K. C. Onwueme, J. A. Ferreras, J. Buglino, C. D. Lima, and L. E. N. Quadri
Mycobacterial polyketide-associated proteins are acyltransferases: Proof of principle with Mycobacterium tuberculosis PapA5
PNAS, March 30, 2004; 101(13): 4608 - 4613.
[Abstract] [Full Text] [PDF]