Microbiology 153 (2007), 633-651; DOI 10.1099/mic.0.2006/004762-0
© 2007 Society for General Microbiology
Constructing the wonders of the bacterial world: biosynthesis of complex enzymes
Frank Sargent
Centre for Metalloprotein Spectroscopy and Biology, School of Biological Sciences, University of East Anglia, Norwich NR4 7TJ, UK
Correspondence
Frank Sargent
f.sargent{at}uea.ac.uk
 |
ABSTRACT
|
|---|
The prokaryotic cytoplasmic membrane not only maintains cell integrity and forms a barrier between the cell and its outside environment, but is also the location for essential biochemical processes. Microbial model systems provide excellent bases for the study of fundamental problems in membrane biology including signal transduction, chemotaxis, solute transport and, as will be the topic of this review, energy metabolism. Bacterial respiration requires a diverse array of complex, multi-subunit, cofactor-containing redox enzymes, many of which are embedded within, or located on the extracellular side of, the membrane. The biosynthesis of these enzymes therefore requires carefully controlled expression, assembly, targeting and transport processes. Here, focusing on the molybdenum-containing respiratory enzymes central to anaerobic respiration in Escherichia coli, recent descriptions of a chaperone-mediated proofreading system involved in coordinating assembly and export of complex extracellular enzymes will be discussed. The paradigm proofreading chaperones are members of a large group of proteins known as the TorD family, and recent research in this area highlights common principles that underpin biosynthesis of both exported and non-exported respiratory enzymes.
 |
Introduction
|
|---|
Prokaryotes are renowned for their metabolic adaptability and diversity. Many prokaryotes can switch with relative ease between aerobic and anaerobic conditions, for example, or can improvise the use of inorganic compounds, such as hydrogen or sulfur, to form respiratory chains under even the most extreme environmental conditions. This remarkable respiratory adaptability is thanks to an elaborate bank of complex, multi-subunit, cofactor-containing redox enzymes, ready and waiting to be called upon (Richardson, 2000
).
The well-studied Gram-negative
-proteobacterium Escherichia coli is a facultative anaerobe and so is typically very flexible in its use of respiratory electron donors and acceptors (Fig. 1
). In E. coli, respiratory chains are relatively short, with reducing power being shuttled between only two enzymes by lipid-soluble quinones (Fig. 1
), namely ubiquinone, menaquinone and demethylmenaquinone (Soballe & Poole, 1999
). The generation of energy by respiratory electron-transfer chains involves the ionically sealed cytoplasmic membrane and, in the context of this particular review, it is most notable that respiratory enzymes, or at least their catalytic domains, can be located on either side of the lipid bilayer (Fig. 1
). Indeed, in terms of physiological function, the subcellular localization of all of these enzymes is absolutely critical. Some substrates, for example trimethylamine N-oxide (TMAO) and nicotinamide adenine dinucleotide (NADH), are membrane impermeable and therefore must be metabolized on one particular side of the membrane. More important, perhaps, is the application of the redox loop mechanism by respiratory chains to develop a transmembrane proton-motive force (Mitchell, 1961
; Mitchell & Moyle, 1967
; recently reviewed by Jormakka et al., 2003
). Redox loops are generated by membrane-associated respiratory enzymes in which the substrate and quinone/quinol-binding sites are juxtaposed on opposite sides of the lipid bilayer. Separation of the active sites in this way allows protons to be released, and taken up, on opposite sides of the membrane and thus contribute to a proton electrochemical gradient. Not all quinone/quinol-dependent respiratory systems are so electrogenic, but in E. coli the activities of two such enzymes are coupled to great effect under anaerobic conditions in the presence of nitrate (Enoch & Lester, 1975
; Jormakka et al., 2003
). Under these conditions formate oxidation in the periplasm is coupled, through a quinone mediator, to nitrate reduction in the cytoplasm. The formate dehydrogenase enzyme (often termed formate dehydrogenase-N as it is nitrate-inducible) is composed of three polypeptides, FdnG, FdnH and FdnI, with the formate-oxidation site being located on the FdnG subunit at the periplasmic side of the membrane (Jormakka et al., 2002
, 2003
). The quinone-reducing site is located on FdnI at the cytoplasmic side of the membrane (Jormakka et al., 2002
, 2003
). The topology and location of the substrate-binding sites are unequivocal since the 3D crystal structure of the intact complex has been solved to 1.6 Å resolution (Jormakka et al., 2002
; Fig. 2A
). The 3D crystal structure of the intact nitrate reductase located at the other end of this short respiratory chain has also been solved to near atomic resolution (Bertero et al., 2003
; Fig. 2B
). The crystal structure of this enzyme, termed nitrate reductase-A (i.e. the predominant nitrate reductase synthesized by E. coli) and comprising the NarG, NarH and NarI subunits, confirmed that the quinol-oxidizing site on NarI was located at the periplasmic side, and that the nitrate-reducing site on the NarG subunit was located at the cytoplasmic side, of the membrane (Fig. 2B
). The determination of the structural bases for formate-to-nitrate respiration in E. coli has underscored the fundamental importance of enzyme topology in this type of bacterial energy metabolism.

View larger version (33K):
[in this window]
[in a new window]
|
Fig. 1. Respiratory chains of E. coli. A summary of possible electron donors and acceptors that can be used to generate respiratory electron-transport chains in E. coli. Enzymes known, or predicted, to be involved in the metabolism of these compounds are listed by their standard gene-product names. Compounds that are metabolized on the periplasmic side of the inner membrane are coloured red; compounds metabolized on the cytoplasmic side are coloured blue. *Molybdenum-containing enzymes synthesized with twin-arginine signal peptides.
|
|

View larger version (54K):
[in this window]
[in a new window]
|
Fig. 2. Structures of FdnGHI, NarGHI and TorA. The crystal structures and subcellular localization of (A) the E. coli membrane-bound formate dehydrogenase-N (PDB accession 1KQF) with FdnG in sky blue, FdnH in chocolate and FdnI in green, (B) the E. coli membrane-bound nitrate reductase-A (1Q16) with NarG in red, NarH in yellow and NarI in slate, and (C) the periplasmic TMAO reductase from Shewanella massilia (1TMO). The figure was compiled using PyMol.
|
|
In this review, advances in the understanding of the biosynthetic processes governing assembly and orientation of complex respiratory enzymes will be discussed. In particular, recent genetic, biochemical, biophysical and structural studies of a new family of molecular chaperones (the TorD family) will be used to highlight common principles that underpin biosynthesis of both exported and non-exported respiratory enzymes.
 |
Molybdenum-dependent respiratory enzymes
|
|---|
The E. coli respiratory formate dehydrogenases and nitrate reductases contain molybdenum as a key catalytic cofactor (Enoch & Lester, 1975
; Jormakka et al., 2002
, 2004
; Bertero et al., 2003
). The chemical properties of molybdenum, which can cycle between the MoIV and MoVI valence states, have been seized upon by nature and exploited in numerous 2-electron transfer reactions. E. coli formate dehydrogenase-N catalyses the dehydrogenation of formate (Khangulov et al., 1998
):
The active site that catalyses this reaction is located on the FdnG subunit and comprises one molybdenum ion ligated by two molybdopterin guanine dinucleotide (bis-MGD) cofactors and a selenocysteine side-chain (Jormakka et al., 2002
). Following dehydrogenation of formate, the electrons are presumably passed via one of the MGD cofactors to an FeS cluster located in FdnG and then on to the FdnH subunit, which contains four FeS clusters, and finally on to FdnI, which has a proximal haem b close to FdnH and a distal haem b close to the quinone-binding site at the cytoplasmic side of the membrane (Jormakka et al., 2002
, 2003
).
The E. coli respiratory nitrate reductase-A catalyses the 2-electron reduction of nitrate to nitrite (an oxotransferase reaction), with the electrons usually being supplied by quinol (Berks et al., 1995
; Bertero et al., 2003
; Jormakka et al., 2004
):
The pathway the electrons take through nitrate reductase-A is precisely the opposite to electron transfer in formate dehydrogenase-N. First, quinol binds close to a distal haem b within NarI at the periplasmic side of the membrane, and is reduced to quinone, with the electrons passing from the distal haem to a proximal haem b at the cytoplasmic side of the membrane. The four FeS clusters in NarH then channel the electrons to an FeS cluster located in the NarG subunit, and finally on to the catalytic molybdenum ion, which is ligated by two MGD cofactors and in this case an aspartate side-chain donated by the NarG polypeptide (Bertero et al., 2003
; Jormakka et al., 2004
).
When taken together with phylogenetic analyses, the alternative active-site structures of molybdoenzymes such as those of FdnGHI and NarGHI have proven useful in classifying this vast superfamily of enzymes. As a result, the bis-MGD-containing catalytic proteins themselves can be grouped together into three different types (McDevitt et al., 2002
; Jormakka et al., 2004
). Type I bis-MGD molybdoenzymes almost always have a cysteine or selenocysteine side-chain as a direct molybdenum ligand (McDevitt et al., 2002
; Jormakka et al., 2004
), and this group includes E. coli FdnG, E. coli NapA (the periplasmic nitrate reductase), Salmonella TtrA (the catalytic subunit of tetrathionate reductase) (Hensel et al., 1999
), and the assimilatory nitrate reductases, for example Synechococcus NarB (Rubio et al., 2002
). Type II bis-MGD molydoenzymes display an aspartate molybdenum ligand (Bertero et al., 2003
) and include E. coli NarG and its huge array of homologues such as E. coli NarZ, Paracoccus NarG, proteins from bacilli, actinomycetes and the archaea (Richardson et al., 2001
; van Keulen et al., 2005
), and Rhodovulum sulfidophilum DdhA, the catalytic subunit of dimethyl sulfide dehydrogenase (McDevitt et al., 2002
). Finally, the type III bis-MGD molybdoenzymes possess a serine ligand to the molybdenum and include the E. coli TMAO reductases (TorA and TorZ), E. coli DmsA (the catalytic subunit of dimethyl sulfoxide (DMSO) reductase), the TMAO/DMSO reductases of Rhodobacter sp., and the E. coli biotin sulfoxide reductase, BisC (Pierson & Campbell, 1990
).
The type I, II and III classification of bis-MGD molybdoenzymes, however, does not discriminate between exported and non-exported proteins. For example, under type I there are the periplasmic FdnG and NapA proteins together with the cytoplasmic FdhF and NarB enzymes. Under type II there is the cytoplasmically oriented E. coli NarG together with the periplasmic Rv. sulfidophilum DdhA. Under type III there is the cytoplasmic E. coli BisC enzyme together with the E. coli periplasmic TorA enzyme as well as, most strikingly, the Shewanella DMSO reductases which reside on the outer leaflet of the outer membrane (Gralnick et al., 2006
). Since correct targeting of respiratory enzymes to their sites of physiological function is clearly an essential feature for prokaryotic respiration, the cell is faced with a problem of logistics: how to assemble homologous enzymes in different cellular compartments. The solution, it seems, is to assemble the enzymes in the cell cytoplasm then, if necessary, perform the translocation step last.
 |
The twin-arginine transport system
|
|---|
Many eubacteria and archaea (and their chloroplast descendants) have the ability to transport pre-folded, very often oligomeric, and enzymically active respiratory enzymes across ionically sealed membranes (Palmer & Berks, 2003
; Berks et al., 2003
). The proteins so transported are usually synthesized with, or associated with folded partner proteins that possess, distinctive N-terminal signal peptides that bear a common twin-arginine SRRxFLK amino acid sequence motif (Berks, 1996
). Twin-arginine signal peptides have a common tripartite structure that includes a polar N-terminal (n-) region of variable length, a moderately hydrophobic (h-) region of 1220 amino acids, and a C-terminal (c-) region that often contains basic residues (Table 1
). The consensus motif is always located at the junction between the n- and h-regions (Table 1
). These conserved common features have led to the development of algorithms designed to identify twin-arginine signal peptides from genomic information (Rose et al., 2002
; Bendtsen et al., 2005
). E. coli is predicted to produce
26 such proteins and the vast majority of these are cofactor-containing respiratory enzymes (Berks et al., 2005
). Indeed, all of the extracellular molybdoenzyme complexes synthesized by E. coli are either known or predicted to possess a subunit with an N-terminal twin-arginine signal peptide, including the type I molybdoproteins FdnG and NapA, and the type III molybdoenzymes TorA and DmsA (Fig. 1
). Some other prokaryotes make more extensive use of the twin-arginine signal peptide; for example the genome of the Gram-positive actinomycete Streptomyces coelicolor is predicted to encode upwards of 250 diverse proteins bearing twin-arginine signal peptides (Dilks et al., 2003
; Widdick et al., 2006
), and almost all exported proteins expressed by some halophilic archaea are predicted to contain twin-arginine signal peptides (Bolhuis, 2002
; Rose et al., 2002
).
All proteins bearing bona fide twin-arginine signal peptides are transported by the twin-arginine translocation (Tat) system (Sargent et al., 1998a
; Palmer & Berks, 2003
). The Tat translocase is essentially a membrane-bound nanomachine dedicated to the transmembrane translocation of fully folded proteins. Depending on the biological model system under investigation, the complete Tat translocation machinery comprises two or three different proteins. In E. coli, three types of integral membrane proteins TatA (and its homologue TatE), TatB, TatC have been identified that are involved in the transport process (Bogsch et al., 1998
; Sargent et al., 1998a
, 1999
; Weiner et al., 1998
). TatA/E and TatB are predicted to be structurally related, each comprising a single N-terminal transmembrane
-helix with the extreme N-terminus on the periplasmic side of the membrane (Settles et al., 1997
; Chanal et al., 1998
; Lee et al., 2006
). The TatC protein has six transmembrane helices, with both termini at the cytoplasmic side of the membrane (Drew et al., 2002
; Behrendt et al., 2004
; Ki et al., 2004
). Some other bacteria, including the Gram-positive Bacillus subtilis, have minimalist Tat translocation systems that comprise only TatA and TatC (Jongbloed et al., 2006
). Indeed, the Gram-positive Staphylococcus aureus has a genome that lacks a tatB homologue altogether (Pop et al., 2002
). In E. coli TatB is essential for physiological Tat transport activity and seems to form a link between TatA and TatC in the inner membrane (Sargent et al., 1999
; Bolhuis et al., 2001
).
Two distinct Tat complexes can be isolated from resting E. coli inner membranes. TatA can be purified as a large, heterogeneous complex of up to 600 kDa (McDevitt et al., 2005
) that also contains trace amounts of TatB (Sargent et al., 2001
; de Leeuw et al., 2002
; Porcelli et al., 2002
; Oates et al., 2005
). The TatC protein is found in another large complex with an equimolar amount of TatB (Bolhuis et al., 2001
; de Leeuw et al., 2002
; Oates et al., 2003
; McDevitt et al., 2005
; Richter & Brüser, 2005
). TatBC is the signal recognition module of the Tat system and contains the recognition site for the twin-arginine motif on the signal peptide (Mori & Cline, 2002
; Alami et al., 2003
). Low-resolution structural analysis points to the TatA protein forming the transport module of the Tat system i.e. numerous bundles of TatA may form the protein-conducting channel (Gohlke et al., 2005
; Dabney-Smith et al., 2006
). During protein transport a transient TatABC complex is thought to form in the membrane, but then rapidly dissociate again once the transport step has been completed (Mori & Cline, 2002
). The whole system is probably powered by the transmembrane proton-motive force (Alder & Theg, 2003
), although this is controversial (Finazzi et al., 2003
; Theg et al., 2005
).
The establishment of the Tat pathway as an exciting, vibrant and fast-moving research area was preceded by a long gestation period. During that time microbiologists studying complex respiratory enzymes recognized the importance of the signal peptide and its twin-arginine motif (e.g. van Dongen et al., 1988
; Nivière et al., 1992
; Bernhard et al., 1996
; Dreusch et al., 1997
) and, with a hint of irony, some had already isolated tat mutant strains and even sequenced the genes (e.g. Friedrich et al., 1981
; Yates et al., 1997
). In parallel, plant scientists were characterizing biochemically a thylakoid protein transport system for folded proteins in which twin-arginine signal peptides were critical to the targeting process (e.g. Creighton et al., 1995
; Chaddock et al., 1995
). Mutant plants defective in this transport system had also already been isolated (Voelker & Barkan, 1995
). Any possible commonality between the work of the microbial and plant research groups was originally dismissed because the substrate proteins transported by the plant thylakoid system were not present in cyanobacteria or any other prokaryotes. The seminal hypothesis of Berks (1996)
, however, raised the possibility that a prokaryotic protein transport system for folded proteins was indeed waiting to be unearthed. The race to identify components of the translocase was won by the plant scientists, who cloned and sequenced a gene subsequently shown to encode a homologue of E. coli TatB (Settles et al., 1997
; Weiner et al., 1998
; Sargent et al., 1998a
; Chanal et al., 1998
).
The central dogma of the Tat research field is that this system has evolved for the transmembrane translocation of fully folded proteins. That is not to say the Tat translocase cannot transport unfolded or misfolded proteins, as this has been clearly shown for the thylakoid Tat system (e.g. Hynds et al., 1998
). Nevertheless, in the case of eubacteria such as E. coli where the majority of Tat substrates are cofactor-containing, multi-subunit, respiratory enzymes, and in halophilic archaea, where protein folding in the extracellular environment would be biophysically impossible, the ultimate challenge in the Tat research field is to understand the molecular mechanism of the transport of fully folded proteins.
 |
Biosynthesis of TorA, a paradigm Tat-targeted respiratory enzyme
|
|---|
The ultimate aim for a bacterial physiologist would be to understand how a multi-subunit, multi-cofactor enzyme like formate dehydrogenase-N is assembled and transported across a membrane. In the first step, the FeS cluster and molybdenum cofactor must be loaded into FdnG (which also contains the twin-arginine signal peptide). At the same time the four FeS clusters must be loaded into FdnH. The FdnGH complex must then be formed in the cytoplasm before transport of the dimer via the Tat system to the periplasm. Next, part of FdnH must be integrated into the membrane (Hatzixanthis et al., 2003
) before FdnGH associates with FdnI (which will have already followed a different biosynthetic pathway) and finally forms a trimer-of-trimers' in the membrane. All this must occur before a single molecule of formate is oxidized. The complexity of this protein production line raises many questions about quality control and coordination of different biological events. Why, for example, is the FdnG protein not exported without its FdnH partner? Because of the myriad of different processes that need to be considered when studying the biosynthesis of a large enzyme like formate dehydrogenase-N, most of the early research in this area focused on a simpler, more tractable system.
The E. coli TMAO reductase (TorA) enzyme is a type III molybdoenzyme (McDevitt et al., 2002
) and is a water-soluble periplasmic protein that contains bis-MGD as its only cofactor. Crystal structures of TorA homologues from Shewanella massilia (Czjzek et al., 1998
), Rhodobacter capsulatus (Schneider et al., 1996
; McAlpine et al., 1998
) and Rhodobacter sphaeroides (Schindelin et al., 1996
) are available. The E. coli enzyme is abundant under anaerobic conditions in the presence of TMAO, an osmoprotectant found widely in marine animals (McCrindle et al., 2005
), and the evidence for cytoplasmic assembly and folding of TorA prior to export is strong (reviewed by Berks et al., 2003
).
The biosynthesis of the MGD cofactor itself has been extensively studied and crystal structures are now available for most of the biosynthetic enzymes. This entire process requires the elaborate use of nucleotide cofactors and so occurs exclusively in the cytoplasm (reviewed by Schwarz, 2005
). To summarize, the biosynthesis of the cofactor is mediated by 10 proteins each given the prefix Mo-. First, the MoaAC complex rearranges GTP into a pterin called cyclic pyranopterin monophosphate (cPMP) in a process requiring S-adenosylmethionine. Thiol groups are next added to cPMP in an elaborate process driven by protein adenylation and involving MoaD, MoaE and MoeB. The MogA and MoaB proteins then catalyse the ligation of molybdenum to the thiol groups to generate molybdopterin (MPT) and this process involves an adenylation/de-adenylation cycle of the cofactor itself. Finally, a GMP moiety is added to MPT by the MobA protein to generate MGD in a process that may be enhanced by MobB, a cytoplasmic GTPase. The take-home messages from this are that this elaborate cofactor must be constructed in the regulated environment of the cytoplasm, and that its extreme susceptibility to oxidation means that it is very unlikely to be secreted unprotected to the cell exterior.
The next pressing issue is the insertion of the completed MGD cofactor into the TorA apoenzyme. This event occurs before transport of TorA to the periplasm because if the above cofactor biosynthesis pathway is genetically blocked the E. coli TorA apoprotein remains in the cytoplasm (Santini et al., 1998
). The experiments of Santini et al. (1998)
are compelling and helped to pave the way for the characterization of the bacterial Tat system (Sargent et al., 1998a
; Weiner et al., 1998
); however, it should be noted that the Rhodobacter TorA homologue behaves quite differently (Buchanan et al., 2001
and references therein). In this case the enzyme is still transported to the periplasm in the absence of cofactor. The most likely explanation is that this particular enzyme folds around GMP instead, as also seen in the crystal structure of NarG purified from a mo mutant (Rothery et al., 2004
), and thus is recognized as a suitable substrate by the Tat translocase. Why does the E. coli TorA enzyme not behave in the same way? Is it possible that E. coli employs a more stringent quality control system which prevents premature export of the apoprotein? The key players in coordinating assembly and export of TorA-type enzymes are the members of a new class of molecular chaperones the TorD family. Cofactor-loading into E. coli TorA is enhanced both in vivo and in vitro by an accessory protein called TorD (Pommier et al., 1998
; Ilbert et al., 2003
). The TorD protein binds specifically to the TorA apoenzyme and maintains it in a conformation suitable for cofactor binding (Pommier et al., 1998
; Ilbert et al., 2003
, 2004
; Genest et al., 2005
, 2006a
, b
). However, it is abundantly clear that E. coli TorD is not absolutely essential for cofactor loading into the TorA enzyme since deletion of the torD gene reduces cellular TorA activity by only around 50 % (Pommier et al., 1998
; Jack et al., 2004
). By the same token, it should be noted that all the cellular TorA activity in a torD mutant strain is located in the periplasm and the mutant can therefore respire TMAO (Jack et al., 2004
; Ilbert et al., 2004
). As a result it is also clear that TorD is not absolutely essential for transport on the Tat pathway either. So how and why does the TorD protein recognize and interact with TorA?
 |
The TorD family: structural analyses
|
|---|
E. coli TorD is encoded by the final gene on the torCAD operon (Mejean et al., 1994
) and is a member of a vast family of related bacterial and archaeal proteins (Turner et al., 2004
; Ilbert et al., 2004
). The sequence databases currently contain hundreds of members of the TorD family which, after sequence analysis, broadly separate into three clades: the TorD clade, the DmsD clade and the NarJ clade (Turner et al., 2004
). The E. coli genome encodes a total of five members of the wider TorD family TorD itself, DmsD and YcdY from the DmsD clade, and NarJ and NarW from the NarJ clade (Turner et al., 2004
). When overproduced and purified, TorD family proteins can adopt several oligomeric and/or conformational states (Tranier et al., 2002
; Sarfo et al., 2004
). While the monomeric form always dominates, homodimers and homotrimers have been clearly identified and the 3D crystal structure of a dimer form of the TorD homologue from Shewanella massilia has been solved by X-ray methods (Tranier et al., 2003
). The structure of the Shewanella TorD dimer is interesting in several respects, not least because it contains no extensive regions of
-sheet and, apart from some unstructured loops, is therefore made up entirely of 10
-helices (Fig. 3A
).

View larger version (34K):
[in this window]
[in a new window]
|
Fig. 3. Structures of TorD, DmsD and NarJ. The crystal structures of (A) the TorD homodimer from Shewanella massilia (PDB accession 1N1C), (B) the DmsD monomer from Salmonella typhimurium (1S9U) and (C) the NarJ monomer from Archaeoglobus fulgidus (1ZE0) are shown. The figure was compiled using Yasara.
|
|
In addition, the Shewanella TorD dimer exhibits domain-swapping to the extreme (Fig. 3A
; Tranier et al., 2003
). Each Shewanella TorD protomer consists of two distinct domains linked by a hinge region. The N-terminal domain, comprising the first six
-helices, from one protomer interacts with the C-terminal domain, comprising the last four
-helices, of the second protomer, and Tranier et al. (2003)
postulated that, based on small-angle solution X-ray scattering, the monomer form of TorD would involve more-or-less identical intra-protomer interactions. The 3D crystal structure of the Salmonella typhimurium LT2 DmsD monomer (Fig. 3B
) corroborates this hypothesis completely [Y. Qiu, R. Zhang, V. Tereshko, Y. Kim, F. Collart, A. Joachimiak & A. Kossiakoff, unpublished; deposited in Protein DataBank in 2004 under 1S9U]. Although sharing only
15 % overall sequence identity with Shewanella TorD, Salmonella DmsD also displays an all-
-helical secondary structure (Fig. 3B
) and has an analogous fold to a single lobe of the domain-swapped Shewanella TorD dimer (Fig. 3A
).
A third crystal structure of a member of the TorD family is also available (Fig. 3C
). The crystal structure of the Af0173 monomer from the hyperthermophilic archaeon Archaeoglobus fulgidus has been solved in a structural genomics project [O. Kirillova, M. Chruszcz, T. Skarina, E. Gorodichtchenskaia, M. Cymborowski, I. Shumilin, A. Savchenko, A. Edwards & W. Minor, Midwest Center For Structural Genomics, unpublished; deposited in Protein DataBank in 2005 under 1ZE0]. The Af0173 protein is a peripheral member of the NarJ clade (Turner et al., 2004
). Despite some sequence similarity with DmsD proteins (Turner et al., 2004
), A. fulgidus Af0173 is probably best classified as a NarJ since it is encoded by a four-cistron cluster also containing the genes for a NarG-type nitrate reductase Af0176 (Richardson et al., 2001
; Dridge et al., 2006
), an FeS protein with limited homology with NarH (Af0175), and an unusual integral membrane protein, Af0174 (Dridge et al., 2006
). Thus it is clear from the 3D structures available that a common fold is conserved across this entire family of proteins (Fig. 3
).
 |
The TorD family: twin-arginine signal peptide-binding proteins
|
|---|
The identification of E. coli DmsD as a cytoplasmic twin-arginine signal peptide-binding protein was a watershed in the understanding of how TorD family proteins work. The E. coli DMSO reductase is a trimeric complex comprising the DmsA catalytic subunit, which contains bis-MGD and an FeS cluster as cofactors, the DmsB protein, which contains four FeS clusters, and an unusual integral membrane protein termed DmsC (Bilous et al., 1988
). The DmsA protein contains a twin-arginine signal peptide at its N-terminus (Berks, 1996
; Sambasivarao et al., 2000
) and this single signal peptide directs the export of the DmsAB dimer to the periplasmic side of the membrane, where it finally attaches to the DmsC membrane protein to form the holoenzyme (Stanley et al., 2002
). Oresnik et al. (2001)
developed an affinity chromatography method to detect proteins that bound tightly to the E. coli DmsA twin-arginine signal peptide, and the primary interactor was identified as YnfI (re-named DmsD). The DmsD protein is encoded by the ynfEFGHdmsD operon for a second (essentially cryptic) DMSO reductase in E. coli (Lubitz & Weiner, 2003
). A study of the Tat transport activity of the DmsA signal peptide in the absence of dmsD suggested that DmsD does not have an obligate role in transport per se (Ray et al., 2003
), and therefore does not operate as a kind of signal recognition particle. However, DmsD was certainly found to be essential for cellular DMSO respiration and DMSO reductase activity, which demonstrated that the signal-peptide-binding event was, in this case, critical for the biosynthesis of the native substrate (Ray et al., 2003
).
The sequence similarity between DmsD and TorD suggested strongly that TorD could also be a twin-arginine signal peptide-binding protein (Sargent et al., 2002
). This was initially confirmed genetically using a bacterial two-hybrid screen with torD as bait against a random genomic library as prey (Jack et al., 2004
). The strongest interacting clones identified by this completely blind screen encoded the Tat signal peptide of TorA (Jack et al., 2004
). The powerful biophysical technique of calorimetry was subsequently used to characterize in vitro the interaction between TorD and a synthetic TorA signal peptide. The isolated E. coli TorD monomer binds to the TorA signal peptide with an apparent dissociation constant (Kd) of
1 µM (Hatzixanthis et al., 2005
). By comparison, recent calorimetric experiments have shown E. coli DmsD to bind the DMSO reductase Tat signal peptide with an apparent Kd of 0.2 µM (Winstone et al., 2006
). Optimum ligand binding to TorD requires a 27-residue fragment of the TorA signal peptide stretching from the twin-arginine motif, through the h-region, to another arginine dipeptide at the C-terminus (Hatzixanthis et al., 2005
). The nature of the peptide-binding site has not yet been visualized for any TorD family member, and it is not immediately obvious from the available crystal structures where the peptide-binding site might be located. It is likely, however, that the TorA signal peptide is unstructured in aqueous solution, as this has been shown to be the case for the twin-arginine signal peptides of the Allochromatium vinosum high-potential iron protein (HiPIP) (Kipping et al., 2003
) and the E. coli SufI protein (San Miguel et al., 2003
). Note, however, that such an unstructured peptide would be excessively lengthy when compared to the folded, globular, TorD monomer. To be bound in its entirety by TorD, the TorA signal peptide will certainly need to adopt a high degree of secondary structure. Given the all-helical nature of TorD family proteins (Fig. 3
), it is possible the signal peptide could also display an
-helical conformation in the TorD-bound state, such as occurs when the signal peptide of the Sec substrate LamB is bound by the SecA protein (Chou & Gierasch, 2005
).
What is TorD actually doing when it binds to the TorA signal peptide? One possibility is that the binding event simply protects the exposed, unstructured, signal peptide from degradation by cytoplasmic proteases prior to export (Li et al., 2006
; Genest et al., 2006a
). For example, when the native TorA enzyme was overproduced in the absence of TorD the signal peptide was prematurely cleaved within the c-region betwixt the arginine dipeptide at positions 35 and 36 (Table 1
). However, co-expression of TorA with TorD prevented this degradation, presumably because chaperone-binding sterically hindered attack by a protease (Genest et al., 2006a
). It is certainly true that premature degradation of the TorA precursor would undoubtedly be deleterious to the assembly process. The physiological relevance of this particular observation, however, remains in question since the protease cleavage site on TorA appears to fit exactly with the OmpT recognition sequence (R/K)
(R/K)-A (Dekker et al., 2001
), and OmpT is an outer-membrane protease that would not normally be exposed to cytoplasmic substrates (Vandeputte-Rutten et al., 2001
). Moreover, experiments with the TorA signal peptide fused to a [NiFe] hydrogenase suggest a more sophisticated modus operandi for TorD (Jack et al., 2004
).
E. coli hydrogenase-2 is a heterodimer of a nickel-containing
-subunit (HybC) and an FeS cluster-binding
-subunit (HybO) (Sargent et al., 1998b
). Only the HybO protein bears a twin-arginine signal peptide and the 
dimer is transported as a pre-formed active complex by the Tat translocase (Rodrigue et al., 1999
; Sargent et al., 1998a
). This observation raised many questions, one of the most compelling being: what prevents premature export of HybO without its HybC partner? One possible answer came some years later when Jack et al. (2004)
swapped the native signal peptide of E. coli HybO for that of TorA. The assembly of the TorA : : hydrogenase-2 chimera was disrupted such that the HybO subunit carrying the TorA signal peptide was targeted prematurely to the membrane without its HybC partner and as a result cellular hydrogenase activity dropped dramatically in this strain (Jack et al., 2004
). Most remarkably, perhaps, this mutant phenotype could be rescued by overproduction of TorD cellular hydrogenase activity returned since the HybOC dimer was once again correctly assembled and exported (Jack et al., 2004
). In this case, protection of the TorA signal peptide from proteolysis was probably not an issue because the TorA : : HybO fusion was not being degraded and was still being targeted in a Tat-dependent manner (Jack et al., 2004
). Instead, TorD was somehow coordinating assembly and export of the hydrogenase. This result had wide-ranging connotations. First, it meant that what was previously assumed to be a private chaperone for a molybdenum enzyme (e.g. Pommier et al., 1998
), could actually be employed in the maturation of a completely alien enzyme with no molybdenum cofactors at all. Second, it meant that a significant part of TorD's biological activity was focused solely on the TorA signal peptide with scant regard to the passenger protein to which the signal peptide was attached. Indeed, this finding was corroborated when Li et al. (2006)
showed that extra copies of torD could enhance the levels of periplasmic jellyfish green fluorescent protein (GFP) when it was routed through the Tat pathway by the TorA signal peptide. Third, the action of TorD on the TorA signal peptide must mimic almost exactly a biochemical process normally central to the native hydrogenase assembly pathway (probably the precursor-binding activity of the HybE protein: Dubini & Sargent, 2003
; Jack et al., 2004
; Butland et al., 2006
). These papers, and others, led to the hypothesis that TorD was acting as a Tat proofreading chaperone, which is actually preventing protein export by shielding the signal peptide from the Tat translocase until all assembly processes are complete (Santini et al., 1998
; Wu et al., 2000
; Sargent et al., 2002
; Dubini & Sargent, 2003
; Jack et al., 2004
, 2005
).
The phrase proofreading in this context of enzyme assembly is akin to a quality control process (i.e. checking that folding is complete, as opposed to scanning for specific mistakes). However appropriate here, it should be noted well that the term quality control in the Tat transport field had already been coined by Delisa et al. (2003)
for a mechanism by which the TatABC translocase itself accepts or rejects proteins for export based on their folded state. As a result the term Tat proofreading is preferred to distinguish the pre-export chaperone-mediated system described here (Jack et al., 2005
).
Under laboratory conditions the stringency of different Tat proofreading systems is variable. The phenotype of an E. coli dmsD mutant is severe and results in complete disruption of DMSO reductase biosynthesis and blockage of all anaerobic respiration with DMSO as electron acceptor (Oresnik et al., 2001
; Ray et al., 2003
). This is different to that observed for an E. coli torD deletion mutant, in which respiration with TMAO is not significantly hampered and TorA levels are only reduced by around 50 % (Pommier et al., 1998
; Jack et al., 2004
). The differences are probably not due to the different structures of the TMAO and DMSO reductases in E. coli since disruption of the single dorD gene from Rhodobacter capsulatus, which encodes a TorD homologue, was enough to completely abolish activity of the cognate TorA-type reductase (Shaw et al., 1999
). These discrepancies are not easy to explain, although it should be considered that another member of the TorD family may be participating in TorA maturation in E. coli in the absence of TorD.
 |
The involvement of nucleotides in TorD function
|
|---|
Clearly, TorD binds tightly and specifically to the TorA signal peptide in an energetically favourable reaction, i.e. with a large negative Gibbs free energy change (Hatzixanthis et al., 2005
). In the cell a physiological TorD cycle could be envisaged in which the Tat proofreading chaperone would first bind, but then release, the TorA signal peptide once cofactor loading and protein folding were complete. What, if anything, triggers signal peptide release by TorD? It is certainly possible that, akin to the mitochondrial protein import system (Stan et al., 2000
), the TorA signal peptide would be naturally handed over from TorD to the TatBC signal-binding module if TatBC has a significantly lower Kd (and thus a more negative Gibbs free energy for binding) for the signal peptide than TorD. To date, rigorous experiments to determine the binding affinity of twin-arginine signal peptides to TatBC have not been reported. Moreover, this particular model lacks any obvious control mechanism. An alternative hypothesis was put forward by Hatzixanthis et al. (2005)
and was based on the known molecular mechanisms of other peptide-binding proteins: given the nucleotide-based mechanisms of the signal recognition particle (Doudna & Batey, 2004
; Focia et al., 2004
), the SecA protein (Vrontou & Economou, 2004
) and the DnaKDnaJGrpE system (Ellis & Hartl, 1999
), TorD was also tested for its ability to interact with nucleotide triphosphates (Hatzixanthis et al., 2005
).
By monitoring intrinsic tryptophan fluorescence, isolated TorD was shown to interact loosely (Kd
375 µM) with GTP in vitro (Hatzixanthis et al., 2005
). No GTP hydrolysis was observed, but the affinity for GTP was increased (to an apparent Kd
200 µM) by pre-incubation of TorD with the TorA signal peptide (Hatzixanthis et al., 2005
). This finding was controversial because TorD is an all-helical protein (Fig. 3
) and as such does not display a classic nucleotide-binding Rossmann fold (Rossmann et al., 1975
) or canonical Walker-A (or P-loop) or -B nucleotide-binding amino acid motifs (Walker et al., 1982
). Nevertheless, inspection of the primary sequences of TorD family proteins does suggest some similarity with metal-dependent phosphohydrolases. The interdomain hinge region of TorD family proteins contains a very highly conserved EPxDH amino acid motif where the aspartate is invariable and the histidine is only very occasionally substituted by tyrosine (Tranier et al., 2003
; Ilbert et al., 2004
; Turner et al., 2004
). Indeed, in E. coli TorD the D-124 and H-125 residues within this signature have been shown to be essential in the in vivo Tat proofreading process (Jack et al., 2004
). This motif is highly reminiscent of those identified in a range of phosphohydrolases including the GNH[DE] superfamily (Aravind & Koonin, 1998b
), the HKD phospholipases (Koonin, 1996
) and the DHH phosphoesterases (Aravind & Koonin, 1998a
). Most notable of all, perhaps, is the HD superfamily of phosphohydrolases, which includes dGTPases, ppGpp hydrolases and cyclic-di-GMP phosphodiesterases (Aravind & Koonin, 1998c
; Galperin et al., 2001
; Yakunin et al., 2004
; Ryan et al., 2006
). The aspartate and histidine side-chains in these systems are usually the ligands for magnesium or manganese cofactors (e.g. Yamagata et al., 2002
), ions that are also intimately linked with classical nucleotide-hydrolysing systems (e.g. Walker et al., 1982
). It is notable, of course, that the available crystal structures (Fig. 3
) do not show any opportunistic metal binding at the EPxDH motifs, and it has also been shown that GTP binding by E. coli TorD is magnesium-independent (Hatzixanthis et al., 2005
). Therefore, if the EPxDH motif is not a metal-binding site it is also worth considering that members of the J-domain co-chaperone family (including E. coli DnaJ) contain a functionally important conserved DPH tripeptide in their sequences (e.g. Mayer et al., 1999
). As with the TorD EPxDH motif, the DnaJ DPH signature is located in a loop between two
-helices, and, most importantly, has been shown to be involved in mediating proteinprotein interactions with DnaK (e.g. Mayer et al., 1999
).
It is worth considering at this point the alternative possibility that the observed GTP binding by E. coli TorD (Hatzixanthis et al., 2005
) is more closely related to the role of TorD in assisting molybdenum cofactor insertion into TorA (Pommier et al., 1998
) rather than Tat proofreading (Jack et al., 2004
). Since the molybdenum cofactor in question (MGD) is a pyranopterin dinucleotide where the pyranopterin moiety is itself derived from rearranged GTP and is linked via a phosphate bridge to another guanosine (Schwarz, 2005
), is it possible Hatzixanthis et al. (2005)
have identified an MGD-binding site on TorD rather than a GTP-binding pocket? The available evidence argues against this. First, the Kd for GTP binding would seem to be too high to account for efficient binding of MGD, of which there is apparently very little in the cell (Johnson et al., 1991
). Second, the Tat proofreading activity of TorD has been shown to be biochemically distinct from the cofactor insertion activity (Jack et al., 2004
). In contrast, Hatzixanthis et al. (2005)
have shown that GTP binding by TorD is intimately linked to signal peptide binding since its presence served to increase the affinity of TorD for GTP about twofold. Third, molybdoenzymes purified in the absence of MGD cofactor often bind GMP or GDP in their active sites by default (e.g. Rothery et al., 2004
; Temple & Rajagopalan, 2000
). Opportunistic insertion of the more abundant GTP into vacant MGD-binding sites has never been reported. Finally, further molecular modelling suggests that the pyranopterin moiety of MGD does not fit into the hypothetical GTP-binding pocket described by Hatzixanthis et al. (2005)
. Molecular modelling can, of course, superimpose the guanosine moiety of MGD onto GTP in the putative binding site; however, numerous apparent steric clashes prevent binding of the pyranopterin half of the molecule (A. Oubrie & F. Sargent, unpublished).
The latest working model for the TorD cycle (as proposed by Hatzixanthis et al., 2005
) takes into account the direct signal peptide-binding activity of this protein and its homologues, as well as the putative binding of GTP (Fig. 4
). Since affinity for GTP is increased by pre-incubation of TorD with the TorA signal peptide (and pre-loading of TorD with GTP did not enhance signal peptide binding), it is proposed that TorD binds the TorA signal first, then GTP binds, then either GTP hydrolysis or nucleotide exchange drives a conformational change in TorD that results in signal peptide release (Fig. 4
). The missing link in this model, however, is the trigger for signal release how does TorD know when to release the signal peptide; how does it know TorA maturation is complete? It seems unlikely that TorD can specifically sense the TorA protein itself, otherwise the experiments with hydrogenase (Jack et al., 2004
) and GFP (Li et al., 2006
) fusions to the TorA signal peptide would likely not work. It is possible, if TorD binds close enough to the junction between signal peptide and mature domain as some studies suggest (Genest et al., 2006a
), that folding of the passenger protein could sequester part of the binding site and thus force off the chaperone, although it should be noted that the free energy released in most protein-folding events is relatively small. There is some evidence to suggest that the TorD family protein DmsD interacts with the Tat machinery at the cytoplasmic membrane (Papish et al., 2003
). It is therefore conceivable that the action of a component of the Tat system could induce signal peptide release and/or GTP hydroloysis. One attractive alternative hypothesis, however, is that TorD operates as a molecular clock (Berks et al., 2005
), perhaps along similar lines as described for proteins involved in flagellar biosynthesis in Salmonella (Moriya et al., 2006
). In this model the signal peptide release rate (effectively the dissocation rate constant, Koff) would coincide with the rate of assembly of the passenger protein.

View larger version (47K):
[in this window]
[in a new window]
|
Fig. 4. The Tat proofreading mechanism the TorD cycle. A model for chaperone-mediated Tat proofreading. An apoprotein bearing a twin-arginine signal peptide is first released from the ribosome. Step 1: a proofreading chaperone (TorD) binds to the twin-arginine signal peptide, thus preventing premature export. Step 2: GTP binding to TorD may follow and cofactor-loading into the mature portion of the protein is completed. Step 3: TorD is released, possibly in a GTP-dependent manner. Step 4: targeting of the precursor to the Tat translocon can commence.
|
|
 |
Dual functionality of twin-arginine signal peptides
|
|---|
It is now clear from signal-swapping experiments (Sambasivarao et al., 2000
; Jack et al., 2004
), and from the recent studies of the chaperone-mediated Tat proofreading system decribed above, that some twin-arginine signal peptides have a separate biological role distinct from that of protein targeting: biosynthesis. As a result, twin-arginine signal peptides have been separated into two broad classes, class 1 containing those signal peptides with dual functions in biosynthesis (i.e. chaperone binding) and targeting, and class 2 containing those with a role in targeting only (Sargent et al., 2006
). The terminology chosen to classify these signal peptides, however, was not altogether satisfactory and could conceivably cause some confusion with the different classes of Sec signal peptide (e.g. Tokuda & Matsuyama, 2004
). It is therefore proposed here to dub instead the dual-function Tat signal peptides as A-class' and the targeting-only Tat signal peptides as B-class' (Table 1
).
 |
Common principles in the assembly of molybdoenzymes...
|
|---|