|
|
||||||||
1 Department of Molecular Biosciences, University of Oslo, 0316 Oslo, Norway
2 NIVA, Norwegian Institute for Water Research, 0411 Oslo, Norway
3 Department of Biology, Centre for Ecological and Evolutionary Synthesis (CEES), University of Oslo, 0316 Oslo, Norway
Correspondence
Kjetill S. Jakobsen
kjetill.jakobsen{at}bio.uio.no
| ABSTRACT |
|---|
|
|
|---|
The GenBank/EMBL/DDBJ accession number for the cyanopeptolin synthetase gene cluster described in this paper is DQ075244.
Four supplementary figures are available with the online version of this paper.
| INTRODUCTION |
|---|
|
|
|---|
Despite the large number of cyclic peptides found in cyanobacteria, relatively few classes of NRPS operons have been characterized to date. The most intensively studied so far are the microcystin class of synthetase gene clusters (mcy, nda), which have been sequenced from two strains of Microcystis (Nishizawa et al., 2000
; Tillett et al., 2000
), Planktothrix agardhii (Christiansen et al., 2003
) Anabaena strain 90 (Rouhiainen et al., 2004
) and Nodularia spumigena (Moffitt & Neilan, 2004
). The synthetases encoded by these operons contain both NRPS and polyketide synthetase (PKS) modules. Other examples of hybrid biosynthesis between NRPS and PKS are nostopeptolide and nostocyclopeptide synthetases from Nostoc spp. (Becker et al., 2004
; Hoffmann et al., 2003
). Many cyanobacterial groups, particularly Microcystis strains, produce other cyclic and linear peptides, including cyanopeptolins, microviridins and aeruginosins (Birk et al., 1989
; Ishitsuka et al., 1990
; Murakami et al., 1997
). So far only a single pure (without PKS modules) NRPS gene cluster, encoding a seven-residue depsipeptide (anabaenopeptilide) belonging to the cyanopeptolin family, has been characterized from Anabaena (Rouhiainen et al., 2000
).
The modular structure of NRPS gene clusters, their diversity and their patchy distribution among cyanobacterial species may hint at recombination and lateral gene transfer (LGT) as key processes in the evolution of these gene clusters. Phylogenetic studies of prokaryotic A-domains have shown that they essentially cluster according to function (i.e. according to the amino acid activated) (Challis et al., 2000
). For Bacillus subtilis it has been proposed that LGT of large NRPS gene clusters between strains is responsible for the very similar gene clusters present (Tsuge et al., 2001
). Investigating the sequence variation in mcy genes of various Microcystis strains, Mikalsen et al. (2003)
suggested that frequent recombinations between imperfect repeats (i.e. domains) as well as gene loss/deletions and LGT events have contributed to the variation seen for the mcyABC operon within different strains. Recombination events, possibly involving intragenomic recombinations and/or LGT events, were recently suggested for mcy genes in Planktothrix (Christiansen et al., 2003
) and for the mcy-like nda genes of N. spumigena (Moffitt & Neilan, 2004
). Tanabe et al. (2004)
inferred a reticulate phylogeny of mcyA and identified several potential recombination tracts of mcyA, but none in the PKS part (mcyD, G and J) of the gene cluster. Notably, transposases are found to be associated with all mcy gene clusters characterized as well as with the nda gene cluster (Christiansen et al., 2003
, 2006
; Moffitt & Neilan, 2004
; Rouhiainen et al., 2004
; Tillett et al., 2000
). The high number of isoforms and the recent discoveries of frequent recombinations (Kurmayer et al., 2004
; Mikalsen et al., 2003
; Tanabe et al., 2004
) may indicate a rapid evolution of mcy genes. In contrast, the findings of Rantala et al. (2004)
, suggesting that microcystin synthetase genes were present in the last common ancestor of a deep radiation of cyanobacteria, are consistent with an evolutionary process that acts in preserving the basic structure of the peptides, enzymes and operons.
The genus Microcystis represent an evolutionary lineage synthesizing both microcystin and other peptides, and co-production of microcystin and cyanopeptolin has been shown in several strains (see Birk et al., 1989
; Ishida et al., 1995
, 1998
; Martin et al., 1990
; Murakami et al., 1997
; Yasuno et al., 1998
). An interesting question then is what happens over evolutionary time when two or more NRPS gene clusters coexist in the same genome. Will they prevail as insulated islands, or will various recombination mechanisms gradually turn them into mosaics? One way of addressing this issue is to investigate a non-microcystin operon in a microcystin-producing lineage, preferably an operon from a class with some other genetically investigated members. Using MS screening, we have found that the strain Microcystis cf. wesenbergii NIVA-CYA 172/5 (Microcystis N-C 172/5) produces a cyanopeptolin. Cyanopeptolins are a class of protease-inhibiting peptides characterized by the presence of the amino acid 3-amino-6-hydroxy-2-piperidone (Ahp) and the cyclization of the peptide by an ester bond between the
-hydroxy group of threonine and the carboxy group of the terminal amino acid (Martin et al., 1993
). Next to microcystins, cyanopeptolins appear to be the most common peptide produced by Microcystis species and were detected in 60 % of single Microcystis colonies in a recent survey (Welker et al., 2006
). Gene clusters responsible for the production of these two peptide families therefore must coexist in many Microcystis species, as they do for example in Anabaena strain 90 (Rouhiainen et al., 2004
). Further information about cyanobacterial peptides and NRPSs, including cyanopeptolins, can be found in a recent review (Welker & von Döhren, 2006
). We have determined the structure of the N-C 172/5 peptide. In addition we have sequenced a complete cyanopeptolin operon and its flanking sequences and addressed the evolution of the cyanopeptolin gene clusters from Microcystis and Anabaena in a wider context to gain insight into the evolutionary mechanisms that shape the combinational nature of peptide synthetase operons.
| METHODS |
|---|
|
|
|---|
Chemical analyses.
The chemical analysis of Microcystis N-C 172/5 included an initial search for a compound with the anticipated molecular mass and a subsequent structural elucidation. In both cases a methanol extract of lyophilized N-C 172/5 cells was used as basis. The initial screening was done by LC/MS using a Waters LC system (600 S controller, Waters 626 pump, Waters Symmetry column 2.1x150 mm, particle size 5 µm) set to run a gradient starting with 100 % solvent A (84.9 % water, 15 % acetonitrile, 0.1 % acetic acid) and ending with 100 % solvent B (80 % acetonitrile, 19.9 % water, 0.1 % acetic acid) after 30 min. The flow rate was 0.5 ml min1. The LC system was connected through a 1 : 10 flow splitter to a Micromass Platform 2 MS detector equipped with an electrospray probe. The detector was run in the positive ion mode at a cone voltage of 50 V. A total ion scan from 500 to 2000 Da was performed during the entire length of the LC gradient. Once a candidate compound had been identified, its structure was analysed by MS fragmentation using a ThermoFinnigan LC/MS/MS system (Surveyor LC solvent delivery system at a flow rate of 0.2 ml min1, TS quantum MS/MS detector). The LC column and gradient were identical to those described above. The MS/MS detector was run in positive electrospray ionization mode. The candidate compound was set as parent molecule. All settings of the source, collision cell as well as quadrupoles 1 and 3 were optimized automatically at collision energies of 30, 40, 50 and 60 eV. Fragments with molecular masses of 30 to 1000 Da were recorded during the entire length of the LC gradient. The fragmentation spectra were screened for immonium ions of amino acids and other typical small indicative fragments to be expected when fragmenting peptides. Larger fragments were identified by comparison with a fragment library of purified cyanobacterial oligopeptide maintained at NIVA. The identification was further assisted by comparison with theoretical fragment spectra calculated based on the structural predictions of the genetic analyses. Finally, all information on fragments was used to gradually develop a model of the compound's structure. This method of structural elucidation has previously been used extensively and successfully for cyanobacterial oligopeptides (e.g. Fastner et al., 2001
; Welker et al., 2006
).
The amounts of Ile and Leu in the peptide were determined by means of an amino acid analyser (model 421, Applied Biosystems) after acid hydrolysis.
DNA isolation and manipulations.
A Wizard Genomic DNA Purification Kit (Promega) was used for isolation of genomic DNA. For preparation and transformation of competent Escherichia coli cells, digestion and ligation of DNA, amplification and purification of plasmids, standard procedures were used (Sambrook & Russell, 2001
).
Identification of the putative cyanopeptolin biosynthetic gene cluster and its flanking regions.
Genes encoding putative peptide synthetases were identified using consensus primers derived from highly conserved regions in NRPS modules, followed by primer walking. To screen the Microcystis strain for peptide synthetase A-domains, we performed PCR with the primer pair MTF2 and MTR (Neilan et al., 1999
) and BD Advantage 2 polymerase (BD Biosciences). The resulting amplicons were cloned in the vector pGEM-T-Easy (Promega), and 23 random clones were sequenced. Three different peptide synthetase gene fragments were obtained, and the rest of the gene cluster was amplified using specific PCR and hemidegenerate PCR. In the hemidegenerate PCR approach, specific PCR primers were used together with degenerate A-domain PCR primers (MTF2 and MTF; Neilan et al., 1999
). For amplification of flanking regions, we performed PCR using a single, specific primer and a low annealing temperature (Malo et al., 1994
). This approach successfully generated a series of overlapping fragments. To avoid possible artefacts introduced during PCR, several pGEM-T-Easy clones were sequenced for each PCR. DNA sequencing was performed on both strands using a DYEnamic ET Dye Terminator Cycle Sequencing Kit (GE Healthcare UK) and an automated DNA sequencer (MegaBACE 1000, GE Healthcare UK). The nucleotide sequence of the cyanopeptolin synthetase gene cluster described in this paper has been submitted to GenBank under accession number DQ075244.
RNA extraction and reverse transcription-PCR (RT-PCR).
Total RNA was extracted from 50 ml culture as described previously (Kaebernick et al., 2000
). After extraction with Trizol (Invitrogen), RNA was column purified using the SV total-RNA Isolation System (Promega). DNA was removed by treatment with DNase I (Fermentas). Reverse transcription for production of cDNA for RT-PCR was carried out using the SuperScript III First-Strand Synthesis System for RT-PCR (Invitrogen). The presence of cyanopeptolin synthetase transcripts was verified by RT-PCR. Four different primer pairs from different parts of the gene cluster were used: mcnA (mcnA-F, 5'-ACTCCAATCTTCAATCATTACCTGTAG-3'; mcnA-R, 5'-TACGGAATCCTCGTATTTTTACTTG-3'), mcnC (mcnC-F, 5'-GATTTTACCGAGAACTAAGGATACAGA-3'; mcnC-R, 5'-ACGGGTTAATAACTGACTAAAACTCG-3'), mcnD, encoding a halogenase (mcnD-F, 5'-TTAGTCCAGAAGCTGATACCACAG-3'; mcnD-R, 5'-ATAAACTTCTGCTTCACCTAAATCCTC-3'), and mcnF, encoding an ABC transporter (mcnF-F, 5'-TTTTATTTTGCTCGATATCTTGGAC-3'; mcnF-R, 5'-CGGGAAAGTCTCTTTTAACTATCATC-3'). PCR without reverse transcription was used to check for DNA contamination in the total RNA used in RT-PCR.
Phylogenetic analysis of A-, T- and C-domains.
Amino acid sequences encoded by A-, T- and C-domains from the mcn gene cluster were aligned with publicly available sequences using CLUSTALW (Thompson et al., 1997
) and manual editing. After discarding ambiguously aligned characters, the A-domain data consisted of 352 characters and 40 sequences, the C-domain data of 300 characters and 39 sequences, and the T-domain data of 95 characters and 41 sequences. The sequences from the A-, T - and C-domains were used to infer the phylogeny in a Bayesian framework applying the program MrBayes v2.01, v3.0 and v3.1 (Ronquist & Huelsenbeck, 2003
) on the freely available Bioportal computer resources website (http://www.bioportal.uio.no). Variable substitution rates across sites were accounted for by allowing each site to evolve with a rate defined as a random variable drawn from a gamma distributed rate probability (G), which was approximated using four discrete rate categories. The gamma distribution was used together with a category of invariable sites (I; together: G+I). In addition, the WAG stationary amino acid substitution model was used for A-, C- and T-domain datasets. Priors for all other model parameters were set to default values. Metropolis coupling was used with three heated (temperature parameter 0.2) and one cold chain. The Markov chain Monte Carlo (MCMC) chains started from random trees and lasted for 2 000 0004 000 000 generations. Sampling of trees was done every 100 generations. Burn-in of trees was set to 5000 trees based on assessment of the likelihood plots. The consensus of the remaining trees was used to calculate the posterior probabilities of the clades. Two separate runs were performed for each dataset, and the posterior probability and likelihood of the trees were compared to confirm the stationarity of the MCMC chains. DNA sequences from all datasets were used to construct neighbour joining (NJ) distance trees using Kimura two-parameter and uniform rates among the sites. The statistical support for the internal branch topologies was analysed with bootstrapping using 1000 pseudoreplicates.
| RESULTS |
|---|
|
|
|---|
Determination of the cyanopeptolin structure by chemical analyses
The signal at 984 Da identified by peptide screening was selected for fragmentation experiments, the results of which are shown in Table 1
. To obtain a model of the structure, the fragments were gradually identified and combined, beginning from the lower end of the mass scale. The occurrence of the typical Ahp fragment at 114 Da supported the identification of the compound as cyanopeptolin. Typical amino acid fragments and immonium ions suggested the occurrence of Gln, Tyr, Thr and Ile or Leu in the molecule. Fragments with a mass of 127 and 198 Da implied a chlorination of Tyr, which is not unusual in cyanopeptolins (Welker & von Döhren 2006
). However, the actual position of Cl within the Tyr ring could not be elucidated. Besides being chlorinated, Tyr was also methylated at its amino group (183.1 Da fragment). A number of larger fragments indicated that the cyanopeptolin ring had the amino acid sequence Thr-Gln-ClMeTyr-Phe-Ahp-Ile(or Leu). Other fragments (e.g. 254.1, 243.1, 195.1, 72.0 and 70.0 Da) implied that the side chain consisted of acetic acid bound to a Gln residue. Finally, the amino acid analysis confirmed that the cyanopeptolin contains Ile or allo-Ile) and not Leu (data not shown). Taken together, the chemical analyses strongly suggested that Microcystis N-C 172/5 produces a cyanopeptolin with a molecular mass of 984 Da [M+H]+and a structure as shown in Fig. 1
(a). We name the peptide cyanopeptolin-984 (cypep-984).
|
|
|
70-like 10 element (TATATG, consensus TATAAT) was identified 133 bp upstream of the ATG start codon. The corresponding 35 element is less conserved (TTGCGA, consensus TTGACA). The second ORF (mcnB) has two putative start codons: one ATG codon is located 3 bp and the other 15 bp downstream of the preceding TAA stop codon of the mcnA. The third ORF (mcnC) overlaps mcnB by 4 bp and starts with an ATG start codon, while the ATG start codon of the fourth ORF (mcnD) is located 4 bp upstream of the TGA stop codon of mcnC. The fifth ORF (mcnE) has two possible start codons: the first is GTG and is located 4 bp upstream of the TGA stop codon of mcnD; the second is an ATG codon 93 bp downstream of the stop codon of mcnD.
Analysis of the amino acid sequences of the mcn gene products revealed seven modules typical of peptide synthetases: one in McnA, one in McnB, four in McnC and one in McnE. All modules contain a C-, an A- and a T-domain (Fig. 2a
). The presence of a C-domain at the N-terminal end of the peptide synthetase is unusual, since the first module in the NRPS usually starts with an A-domain. However, a similar arrangement has been observed in nostopeptolide A synthetase (Nos) of Nostoc sp. GSV224 (Hoffmann et al., 2003
) and lichenycin synthetase of Bacillus licheniformis (Konz et al., 1999
). In these two systems, the N-terminal C-domain appears to be involved in N-acylation of the N-terminal amino acid.
The fourth module in McnC contains an insert of 427 amino acids between conserved regions A8 and A9 in the A-domain. This domain has extensive similarity to N-methyltransferase (NMT) domains from different cyanobacterial peptide synthetases, especially to the NMT-domain in module 4 of ApdB in anabaenopeptilide synthetase, with 70 % identity over 427 amino acids. The last module contains a thioesterase (TE) domain located at the C-terminal end of the McnE, and its function is most likely to offload the linear peptide intermediate from the NRPS and catalyse its cyclization. A BLAST search with this TE-domain found similarities to other NRPS TE-domains, especially to ApdD, with 76 % identity over 282 amino acids, and to NosD, with 49 % identity over 285 amino acids.
The fourth ORF, mcnD, encodes a 625 amino acid polypeptide with no similarity to peptide synthetases. We found two highly conserved motifs characteristic of FADH2-dependent halogenases (see Supplementary Fig. S2): a nucleotide binding motif GxGxxG (Scrutton et al., 1990
) and a motif WxWxIP, which might be involved in halide binding (van Pee & Unversucht, 2003
). The amino acid sequence of McnD shows striking similarity to the halogenase ApdC, with 86 % identity over 625 amino acids, while other halogenases associated with secondary metabolite biosynthetic gene clusters (Chiu et al., 2001
; Hammer et al., 1997
; Nowak-Thompson et al., 1999
; Pelzer et al., 1999
) show only 1215 % identity.
To confirm that the peptide synthetase cluster is active, the presence of cyanopeptolin synthetase transcripts was demonstrated by RT-PCR. Four different primer pairs from different parts of the gene cluster were used, and all gave positive results in the RT-PCR analyses; negative results when reverse transcription was omitted indicated no significant DNA contamination (data not shown).
Peptide structure information gained from operon structure and the derived amino acid sequence
Having obtained the entire operon at the DNA and protein sequence level and assuming that the encoded NRPS belongs to the linear class of NRPSs (Finking & Marahiel, 2004
), we could deduce information about the resulting cyclic peptide structure using general knowledge about the modular structure of NRPSs, similarities between the A-domains in the operon and A-domains from the sequence databases, and the binding pocket theory of Stachelhaus et al. (1999)
. The presence of seven modules in the Mcn sequence indicates that the peptide consists of seven amino acid residues. The seven A-domains of the mcn gene product were compared with database sequences using phylogenetic analyses (see details below), and also aligned with the Phe-activating A-domain of GrsA (Conti et al., 1997
) to identify the binding-pocket residues. Based on the comparisons and the binding pocket signatures, the amino acids likely to be activated by A-domains of different modules are shown in Table 2
. According to this prediction, the A-domain in module 4 activates the precursor to the unusual amino acid 3-amino-6-hydroxy-2-piperidone (Ahp). The prediction is based on the high degree of similarity with the corresponding domain in the Anabena operon sequence, including identical binding pocket residues. Based on this, the peptide produced from the operon must be a cyanopeptolin, since Ahp is characteristic for this class of cyclic peptides (Namikoshi & Rinehart, 1996
). The other invariant amino acid residue in all cyanopeptolins is Thr, which fits well with our prediction that the A-domain of module 2 activates this amino acid (Fig. 1b
, Table 2
). The lack of epimerization domains or a racemase gene in the gene cluster indicates that all amino acids incorporated into the peptide are in the L-configuration.
|
The presence of a halogenase in the NRPS indicates halogenation of an amino acid residue, as shown for anabaenopeptilide (Rouhiainen et al., 2000
) and for other cyanopeptolins when the amino acid in position R2 is Tyr (R positions according to Namikoshi & Rinehart, 1996
). In the mcn gene cluster the ORF (mcnD) encoding the halogenase is located between modules 6 and 7, suggesting that the amino acid activated by module 6 is halogenated.
The C-domain at the N-terminal end of peptide synthetase module 1 indicates that the first amino acid residue in the peptide is acylated by a fatty acid as described for other peptides (Hoffmann et al., 2003
; Konz et al., 1999
).
The only A-domain in the cluster for which there is a clear disagreement between phylogenetic analyses and binding pocket signatures is A-domain 7 in McnE, for which phylogenetic analysis predicts Ile and the binding pocket signature suggests Gln. This discrepancy will be addressed further in the Discussion, but we will point out here that Ile is the amino acid activated by the corresponding A-domain in the Anabaena gene cluster, while the main peptide detected in the present Microcystis strain contains Gln in this position.
Based on the operon organization and sequence, the predicted structure of the peptide synthesized by this operon from Microcystis N-C 172/5 is as shown in Fig. 1(b)
. Since our initial screening had shown that this strain most likely produces a single peptide, the similarities between the peptide structure determined by chemical analyses and the structural information deduced from the operon were not unexpected and indicate that the peptide is synthesized by the enzymes encoded by this operon.
Analysis of flanking regions
To what extent do the immediate flanking regions of mcn contain similar or homologous ORFs compared to flanking regions of other described peptide synthetase operons? We sequenced about 1000 bp downstream and 4000 bp upstream of the putative cyanopeptolin synthetase to look for other genes commonly found in the clusters of characterized peptide synthetases. The upstream region contains two ORFs, mcnF and mcnG, transcribed in the opposite direction from the cyanopeptolin synthetase gene cluster. The downstream region did not display any ORFs with notable lengths.
The putative start codon (ATG) of mcnF (2070 bp) is located 557 bp upstream of the start codon of mcnA. mcnF is preceded by a putative promoter region with an E. coli
70-like 10 element (TATAAT) 104 bp upstream of the ATG start codon. The corresponding 35 element is less conserved (TTAGCA). mcnF encodes a protein of 689 amino acids revealing extensive similarity to ABC transporters associated with cyanobacterial NRPS gene clusters: NosG (69 % identity) and NcpC (66 % identity) from Nostoc spp., McyH from the mcy gene clusters from Anabaena strain 90 (69 % identity), P. agardhii (63 % identity) and Microcystis aeruginosa (62 % identity); and NdaI from N. spumigena (66 % identity). A detailed sequence analysis of McnF revealed the presence of five putative transmembrane helices at the N-terminal end between residues 49 and 334. The putative C-terminal ATP-hydrolysing domain contains three characteristic conserved motifs (Supplementary Fig. S3): Walker motifs A (GxxGxGKT/S) and B (hhhhDEAT) and the linker peptide (LSGGQQQR) (Walker et al., 1982
). The EEA motif, characteristic of the ABC import systems (Dassa & Hofnung, 1985
), was not present, indicating that the ABC transporter encoded by mcnF is an ABC exporter.
The translation start of the putative 1167 bp ORF (mcnG) lies 62 bp downstream of the TAG stop codon of mncF. We observed no obvious promoter region in front of this gene, which encodes a 389 amino acid hypothetical protein with no obvious similarity to any proteins with known functions.
Phylogeny of the different cyanopeptolin domains
Phylogenetic analyses of the A-, T- and C- domains were performed separately at both the amino acid and nucleotide level (only protein trees are shown since the results were the same). We compared the Mcn domains from Microcystis N-C 172/5 to domains in microcystin synthetase (Mcy) from M. aeruginosa (Tillett et al., 2000
), anabaenopeptilide synthetase (Apd) from Anabaena strain 90 (Rouhiainen et al., 2000
), nostopeptolide A synthetase (Nos) from Nostoc sp. GSV224 (Hoffmann et al., 2003
), nostocyclopeptide synthetase (Ncp) from Nostoc sp. ATCC 53789 (Becker et al., 2004
) and nodularin synthetase (Nda) from N. spumigena (Moffitt & Neilan, 2004
).
The A-domains from the different NRPSs revealed clustering largely according to the type of amino acid activated. In the tree in Fig. 3
, A-domains from the same species/operon generally do not form clades. Intraspecies clades are formed only in the few instances where two A-domains within the same operon activate the same amino acid (such as NosA4-Leu and NosC1-Leu; ApdB3-Thr and ApdA2-Thr). Domains activating aromatic amino acids form a well-supported clade (PP 1.00; BS 96 %). Interestingly, within this clade A-domains activating the same amino acid do not group together. The Ahp precursor domains form a distinct clade. Some notable exceptions to this general picture are also seen. The A-domain from McnE forms a well-supported clade with the A-domains activating Ile (from ApdD, NcpB and NosA), despite having a binding site signature that suggests that this domain activates Gln (see above). Also, the A-domain in module 1 in McnC, with an Ile binding site signature, clusters basally to the Thr and Ile branches with low/moderate support.
|
|
| DISCUSSION |
|---|
|
|
|---|
Biosynthesis of a new cyanopeptolin
Comparison of cypep-984 with other cyanopeptolins isolated from Microcystis spp. reveals several similarities, such as acylation of the side chain amino acid (Gln) and presence of an N-methylated aromatic amino acid in position R2 (ClMeTyr). One difference from other Microcystis cyanopeptolins concerns the C-terminal amino acid (position R1), which normally is either Ile or Val (Bister et al., 2004
), whereas in cypep-984 Gln is found in this position. Interestingly, the phylogenetic analysis of A-domains (Fig. 3
) showed clustering of the corresponding A-domain in the gene cluster described here with those activating Ile in other peptide synthetases, while the binding site signature suggests that this domain activates Gln. If we accept that cypep-984 is a product of the gene cluster, this could indicate that this A-domain activated Ile in the common ancestor (as it still does in cyanopeptolin synthetases of other characterized cyanopeptolins of Microcystis) and has been altered to a Gln-activating A-domain in the peptide synthetase producing cypep-984. If so, the fact that the binding pocket signature is identical to that of at least one other Gln-activating domain, LchAA (Yakimov et al., 1998
), while the rest of the sequence is Ile-like, may constitute an interesting example of convergent evolution.
All amino acids in known cyanopeptolins isolated from Microcystis spp. are in the L-configuration (Bister et al., 2004
). The chemical methods we have used to predict the peptide structure do not discriminate between L- and D-isoforms, but in silico analysis of the mcn gene cluster suggested that this is also so in cypep-984, except that phylogenetic analysis showed that the C-domain in McnB clusters with those condensing D-amino acids (Fig. 4
) suggesting that the amino acid in the side chain (Gln) is in the D-configuration. Also the T-domain (McnA) upstream of this C-domain clusters with TE-domains, even though there is no epimerization domain in the mcn gene cluster (Supplementary Fig. S4). This T-domain also has a conserved motif typical for TE domains (GGDSI), rather than the GGHSL that is typical for TC-domains (Linne et al., 2001
). This unexpected juxtaposition of a TE-domain and a D-amino acid condensing C-domain without an intervening E-domain might indicate that a deletion of an E-domain at the end of mcnA has occurred.
The mcn operon of Microcystis has an overall structure similar to the Anabaena apd operon
The cyanopeptolin synthetase from Microcystis N-C 172/5 is the second cyanopeptolin synthetase so far described at the gene level. Previously, Rouhiainen et al. (2000)
characterized an anabaenopeptilide synthetase gene cluster (apd) from Anabaena strain 90. A comparison of the apd and mcn gene clusters reveals overall similarities, even though the two first modules in Mcn are encoded by separate ORFs. The general arrangement and transcriptional orientation are the same, and both clusters contain genes encoding halogenases. A flanking ABC transporter is found in both. Differences between mcn and apd include the lack in mcn of the two genes (apdE and F) encoding putative tailoring proteins, ApdE (a putative methyltransferase) and ApdF (a putative 3-oxoacyl reductase). In addition, mcnG, encoding a protein with unknown function, has no counterparts in the apd gene cluster.
The ABC transporter is located on opposite sides of the operon in Anabaena and Microcystis. Furthermore, in the mcn operon the ABC transporter gene is immediately flanking mcnA, while two ORFs (apd E and F) are located between the ABC transporter gene and apdD in Anabaena. Also, the Microcystis ABC transporter gene is transcribed in the opposite direction to the operon, while the Anabaena transporter gene is transcribed in the same direction as the operon. These findings indicate that the flanking regions, although containing some common elements, are structurally less conserved than the operons themselves.
Ahp: an unusual amino acid present in all cyanopeptolins
The pathway leading to formation of Ahp, the unusual amino acid at position 4 in the cyanopeptolin ring, is not yet resolved. Rouhiainen et al. (2000)
have proposed that Gln may be the precursor of Ahp since the binding-pocket sequence shows the highest similarity to those activating Gln. Our phylogenetic analyses of A-domains (see Fig. 3
) show that the A-domains activating the Ahp precursor do not cluster together with Gln- or Glu-activating domains, but form an independent clade. Extrapolating from what is known from prokaryotic A-domains (see Challis et al., 2000
) and what this study has indicated, namely that A-domains cluster according to function, it may be that the Ahp precursor is not Gln/Glu (or at least not activated by an A-domain with a Glu/Gln binding site signature). The lack of phylogenetic congruence between the Ahp A-domains and the Glu A-domains from Mcy and Nda may, however, be because the latter ones most likely activate the D-form, and not the L-form, of Glu (Nishizawa et al., 2001
; Sielaff et al., 2003
; Tillett et al., 2000
). More work is needed to resolve this issue.
Rouhiainen et al. (2000)
have suggested that both the methyltransferase domain of module 5 in the anabaenopeptilide synthetase and ApdF a protein with similarity to 3-oxoacyl [acyl-carrier protein] reductases may be involved in Ahp formation. In Mcn there is no methyltransferase present in module 5, and a gene with similarity to apdF was not found near the mcn gene cluster. Thus, none of the genes or domains within the mcn cluster seems to be involved in the formation of Ahp. It may be that genes involved in Ahp formation are located elsewhere in the genome.
Halogenases and halogenated cyanopeptolins
In many cases, the number, type and position of halogen groups in organic compounds are important for biological activity (see Nogami et al., 1990
; Otsuka et al., 2004
; Sancelme et al., 1994
). Concerning cyanopeptolins, both chlorinated and non-chlorinated peptides have been described, e.g. the congeners anabaenopeptilide 90A (nonchlorinated) and anabaenopeptilide 90B (chlorinated) (Fujii et al., 1996
). Rouhiainen et al. (2000)
showed the presence of a halogenase gene in the gene cluster responsible for the synthesis of these two variants. In addition, we have recently sequenced a cyanopeptolin gene cluster from P. agardhii with high overall similarity to mcn, but without a halogenase, indicating that chlorination activity is not an enzymic feature of all cyanopeptolin synthetases (T. B. Rounge, T. Rohrlack, A. Tooming-Klunderud, T. Kristensen and K. S Jakobsen, unpublished). These findings indicate that halogenation is not essential from a functional perspective. Cyanopeptolins inhibit serine proteases, and in the two known structures of peptidase active sites with bound cyanopeptolins, the aromatic amino acid in position R2 (in one case Tyr, in the other chlorinated Tyr) is located in a hydrophobic pocket near the active site (Lee et al., 1994
; Matern et al., 2003
). Obviously, the presence or absence of a chlorine atom on the Tyr ring may influence this interaction and modulate the inhibiting activity of the peptide, so that the production of a set of variant peptides might lead to inhibition of a wider selection of proteases. All speculations regarding the effects of chlorination of this class of peptides are, however, hampered by the fact that we do not actually know the real biological functions of these peptides. The existence of both halogenated and non-halogenated cyanopeptolins may itself indeed play a role in regulation of the biological activity, or point towards multiple functions for cyanopeptolins. An experimental setup including targeted knockout of the halogenase of Mcn is likely to be rewarding.
The evolution of NRPS clusters
From an overall perspective, the two cyanopeptolin operons (Anabaena and Microcystis) are quite similar when it comes to module order and architecture. Taking a closer look at the C- and T-domains, which tend to cluster according to gene affiliation or position in the operon, we note that the first modules (up to the C-domain in module 5) are less similar between the two operons than module 6, the halogenase and module 7 (see Figs 1 and 4![]()
and supplementary data).
The phylogenetic analyses display different topologies for the A-domains compared to the C- and T-domains. Whatever the mechanisms may be for the difference between the A and C/T-domains, our findings indicate that the evolution seen for these two domain groups can to some extent be regarded to be independent.
The phylogenetic analyses do not show any clear evidence for genetic recombination between the two cyanopeptolin gene clusters. This is in contrast to the mcy genes, where data suggest that the nda cluster in Nodularia was formed by transposition of mcy from Microcystis, followed by deletion of two NRPS modules (Moffitt & Neilan, 2004
). Likewise, there is no clear evidence for recombination between cyanopeptolin operons and other NRPS genes clusters such as mcy. Since cyanopeptolins and microcystins are likely to have co-existed for a long evolutionary period in the same lineage, this may indicate that the mcn and mcy classes of peptide synthetases have evolved independently. Consequently, this may be an indication of distinct and non-overlapping functions and strong functional constraints. Previously, recombination has been found to be prevalent in the microcystin class of NRPS (Kurmayer et al., 2004
; Mikalsen et al., 2003
; Moffitt & Neilan, 2004
; Tanabe et al., 2004
). The lack of evident recombination in the mcn operon may be attributed to lack of information about potential donor and acceptor sequences, since unlike Kurmayer et al. (2004)
and Mikalsen et al. (2003)
we have investigated only a single strain of the genus here. Furthermore, the Anabaena and Microcystis strains compared here for cyanopeptolin gene clusters may be too diverse from an evolutionary point of view, thus making Anabaena/Microcystis recombination less likely. This would be in agreement with the findings of Rantala et al. (2004)
indicating that the mcy operons in Anabaena and Microcystis represent ancient evolutionary clades. In line with the predictions by Rudi et al. (1998)
that recombination will increase between closely related strains (i.e. within the same genus), recently supported by whole-genome analyses of cyanobacteria (Zhaxybayeva et al., 2006
), we expect that analysis of a panel of mcn genes from related Microcystis strains most likely would detect recombination between the strains.
| ACKNOWLEDGEMENTS |
|---|
Edited by: K. Forchhammer
| REFERENCES |
|---|
|
|
|---|
Billich, A. & Zocher, R. (1987). Enzymatic synthesis of cyclosporin A. J Biol Chem 262, 1725817259.
Birk, I. M., Dierstein, R., Kaiser, I., Matern, U., König, W. A., Krebber, R. & Weckesser, J. (1989). Nontoxic and toxic oligopeptides with D-amino acids and unusual residues in Microcystis aeruginosa PCC 7806. Arch Microbiol 151, 411415.[CrossRef][Medline]
Bister, B., Keller, S., Baumann, H. I., Nicholson, G., Weist, S., Jung, G., Süssmuth, R. D. & Jüttner, F. (2004). Cyanopeptolin 963A, a chymotrypsin inhibitor of Microcystis PCC 7806. J Nat Prod 67, 17551757.[CrossRef][Medline]
Challis, G. L., Ravel, J. & Townsend, C. A. (2000). Predictive, structure-based model of amino acid recognition by nonribosomal peptide synthetase adenylation domains. Chem Biol 7, 211224.[CrossRef][Medline]
Chiu, H. T., Hubbard, B. K., Shah, A. N., Eide, J., Fredenburg, R. A., Walsh, C. T. & Khosla, C. (2001). Molecular cloning and sequence analysis of the complestatin biosynthetic gene cluster. Proc Natl Acad Sci U S A 98, 85488553.
Christiansen, G., Fastner, J., Erhard, M., Börner, T. & Dittmann, E. (2003). Microcystin biosynthesis in Planktothrix: genes, evolution, and manipulation. J Bacteriol 185, 564572.
Christiansen, G., Kurmayer, R., Liu, Q. & Börner, T. (2006). Transposons inactivate biosynthesis of the nonribosomal peptide microcystin in naturally occurring Planktothrix spp. Appl Environ Microbiol 72, 117123.
Conti, E., Stachelhaus, T., Marahiel, M. A. & Brick, P. (1997). Structural basis for the activation of phenylalanine in the non-ribosomal biosynthesis of gramicidin S. EMBO J 16, 41744183.[CrossRef][Medline]
Dassa, E. & Hofnung, M. (1985). Homologies between integral proteins of the inner membrane of binding protein transport systems in enterobacteria. Ann Inst Pasteur Microbiol 136A, 281288.[CrossRef]
Fastner, J., Erhard, M. & von Döhren, H. (2001). Determination of oligopeptide diversity within a natural population of Microcystis spp. (cyanobacteria) by typing single colonies by matrix-assisted laser desorption ionization-time of flight mass spectrometry. Appl Environ Microbiol 67, 50695076.
Finking, R. & Marahiel, M. A. (2004). Biosynthesis of nonribosomal peptides. Annu Rev Microbiol 58, 453488.[CrossRef][Medline]
Fujii, K., Harada, K.-I., Suzuki, M., Kondo, F., Ikai, Y., Oka, H. & Sivonen, K. (1996). Novel cyclic peptides together with microcystins produced by toxic cyanobacteria. In Harmful and Toxic Algal Blooms, pp. 559562. Edited by T. Yatsumoto, Y. Oshima & Y. Fukuyo. Paris: Intergovernmental Oceanographic Commission of UNESCO.
Hammer, P. E., Hill, D. S., Lam, S. T., Van Pee, K. H. & Ligon, J. M. (1997). Four genes from Pseudomonas fluorescens that encode the biosynthesis of pyrrolnitrin. Appl Environ Microbiol 63, 21472154.[Abstract]
Hoffmann, D., Hevel, J. M., Moore, R. E. & Moore, B. S. (2003). Sequence analysis and biochemical characterization of the nostopeptolide A biosynthetic gene cluster from Nostoc sp. GSV224. Gene 311, 171180.[CrossRef][Medline]
Ishida, K., Murakami, M., Matsuda, H. & Yamaguchi, K. (1995). Micropeptin-90, a plasmin and trypsin-inhibitor from the blue-green-alga Microcystis aeruginosa (NIES-90). Tetrahedron Lett 36, 35353538.[CrossRef]
Ishida, K., Matsuda, H. & Murakami, M. (1998). Micropeptins 88-A to 88-F, chymotrypsin inhibitors from the cyanobacterium Microcystis aeruginosa (NIES-88). Tetrahedron 54, 55455556.[CrossRef]
Ishitsuka, M. O., Kusumi, T., Kakisawa, H., Kaya, K. & Watanabe, M. (1990). Microviridin. A novel tricyclic depsipeptide from the toxic cyanobacterium Microcystis viridis. J Am Chem Soc 112, 81808182.[CrossRef]
Kaebernick, M., Neilan, B. A., Börner, T. & Dittmann, E. (2000). Light and the transcriptional response of the microcystin biosynthesis gene cluster. Appl Environ Microbiol 66, 33873392.
Konz, D., Doekel, S. & Marahiel, M. A. (1999). Molecular and biochemical characterization of the protein template controlling biosynthesis of the lipopeptide lichenysin. J Bacteriol 181, 133140.
Kurmayer, R., Christiansen, G., Fastner, J. & Börner, T. (2004). Abundance of active and inactive microcystin genotypes in populations of the toxic cyanobacterium Planktothrix spp. Environ Microbiol 6, 831841.[CrossRef][Medline]
Lee, A. Y., Smitka, T. A., Bonjouklian, R. & Clardy, J. (1994). Atomic structure of the trypsin-A90720A complex: a unified approach to structure and function. Chem Biol 1, 113117.[CrossRef][Medline]
Linne, U., Doekel, S. & Marahiel, M. A. (2001). Portability of epimerization domain and role of peptidyl carrier protein on epimerization activity in nonribosomal peptide synthetases. Biochemistry 40, 1582415834.[CrossRef][Medline]
Malo, M. S., Srivastava, K., Andresen, J. M., Chen, X. N., Korenberg, J. R. & Ingram, V. M. (1994). Targeted gene walking by low stringency polymerase chain reaction: assignment of a putative human brain sodium channel gene (SCN3A) to chromosome 2q24-31. Proc Natl Acad Sci U S A 91, 29752979.
Marahiel, M. A., Stachelhaus, T. & Mootz, H. D. (1997). Modular peptide synthetases involved in nonribosomal peptide synthesis. Chem Rev 97, 26512674.[CrossRef][Medline]
Martin, C., Sivonen, K., Matern, U., Dierstein, R. & Weckesser, J. (1990). Rapid purification of the peptide toxins microcystin-LR and nodularin. FEMS Microbiol Lett 56, 15.[Medline]
Martin, C., Oberer, K., Ino, T., König, W. A., Busch, M. & Weckesser, J. (1993). Cyanopeptolines, new depsipeptides from the cyanobacterium Microcystis sp. PCC 7806. J Antibiot 46, 15501556.[Medline]
Matern, U., Schleberger, C., Jelakovic, S., Weckesser, J. & Schulz, G. E. (2003). Binding structure of elastase inhibitor scyptolin A. Chem Biol 10, 9971001.[CrossRef][Medline]
Mikalsen, B., Boison, G., Skulberg, O. M., Fastner, J., Davies, W., Gabrielsen, T. M., Rudi, K. & Jakobsen, K. S. (2003). Natural variation in the microcystin synthetase operon mcyABC and impact on microcystin production in Microcystis strains. J Bacteriol 185, 27742785.
Moffitt, M. C. & Neilan, B. A. (2004). Characterization of the nodularin synthetase gene cluster and proposed theory of the evolution of cyanobacterial hepatotoxins. Appl Environ Microbiol 70, 63536362.
Murakami, M., Kodani, S., Ishida, K., Matsuda, H. & Yamaguchi, K. (1997). Micropeptin 103, a chymotrypsin inhibitor from the cyanobacterium Microcystis viridis (NIES-103). Tetrahedron Lett 38, 30353038.[CrossRef]
Namikoshi, M. & Rinehart, K. L. (1996). Bioactive compounds produced by cyanobacteria. J Ind Microbiol Biotechnol 17, 373384.[CrossRef]
Neilan, B. A., Dittmann, E., Rouhiainen, L., Bass, R. A., Schaub, V., Sivonen, K. & Börner, T. (1999). Nonribosomal peptide synthesis and toxigenicity of cyanobacteria. J Bacteriol 181, 40894097.
Nishizawa, T., Ueda, A., Asayama, M., Fujii, K., Harada, K., Ochi, K. & Shirai, M. (2000). Polyketide synthase gene coupled to the peptide synthetase module involved in the biosynthesis of the cyclic heptapeptide microcystin. J Biochem 127, 779789.
Nishizawa, T., Asayama, M. & Shirai, M. (2001). Cyclic heptapeptide microcystin biosynthesis requires the glutamate racemase gene. Microbiology 147, 12351241.
Nogami, T., Shigihara, Y., Matsuda, N., Takahashi, Y., Naganawa, H., Nakamura, H., Hamada, M., Muraoka, Y., Takita, T. & other authors (1990). Neopyrrolomycin, a new chlorinated phenylpyrrole antibiotic. J Antibiot 43, 11921194.[Medline]
Nowak-Thompson, B., Chaney, N., Wing, J. S., Gould, S. J. & Loper, J. E. (1999). Characterization of the pyoluteorin biosynthetic gene cluster of Pseudomonas fluorescens Pf-5. J Bacteriol 181, 21662174.
Otsuka, M., Ichinose, K., Fujii, I. & Ebizuka, Y. (2004). Cloning, sequencing, and functional analysis of an iterative type I polyketide synthase gene cluster for biosynthesis of the antitumor chlorinated polyenone neocarzilin in Streptomyces carzinostaticus. Antimicrob Agents Chemother 48, 34683476.
Pelzer, S., Süssmuth, R., Heckmann, D., Recktenwald, J., Huber, P., Jung, G. & Wohlleben, W. (1999). Identification and analysis of the balhimycin biosynthetic gene cluster and its use for manipulating glycopeptide biosynthesis in Amycolatopsis mediterranei DSM5908. Antimicrob Agents Chemother 43, 15651573.
Rantala, A., Fewer, D. P., Hisbergues, M., Rouhiainen, L., Vaitomaa, J., Börner, T. & Sivonen, K. (2004). Phylogenetic evidence for the early evolution of microcystin synthesis. Proc Natl Acad Sci U S A 101, 568573.
Ronquist, F. & Huelsenbeck, J. (2003). MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19, 15721574.
Rouhiainen, L., Paulin, L., Suomalainen, S., Hyytiäinen, H., Buikema, W., Haselkorn, R. & Sivonen, K. (2000). Genes encoding synthetases of cyclic depsipeptides, anabaenopeptilides, in Anabaena strain 90. Mol Microbiol 37, 156167.[CrossRef][Medline]
Rouhiainen, L., Vakkilainen, T., Siemer, B. L., Buikema, W., Haselkorn, R. & Sivonen, K. (2004). Genes coding for hepatotoxic heptapeptides (microcystins) in the cyanobacterium Anabaena strain 90. Appl Environ Microbiol 70, 686692.
Rudi, K., Skulberg, O. M. & Jakobsen, K. S. (1998). Evolution of cyanobacteria by exchange of genetic material among phyletically related strains. J Bacteriol 180, 34533461.
Sambrook, J. & Russell, D. W. (2001). Molecular Cloning: a Laboratory Manual, 3rd edn. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory.
Sancelme, M., Fabre, S. & Prudhomme, M. (1994). Antimicrobial activities of indolocarbazole and bis-indole protein kinase C inhibitors. J Antibiot 47, 792798.[Medline]
Scrutton, N. S., Berry, A. & Perham, R. N. (1990). Redesign of the coenzyme specificity of a dehydrogenase by protein engineering. Nature 343, 3843.[CrossRef][Medline]
Sieber, S. A. & Marahiel, M. A. (2005). Molecular mechanisms underlying nonribosomal peptide synthesis: approaches to new antibiotics. Chem Rev 105, 715738.[CrossRef][Medline]
Sielaff, H., Dittmann, E., Tandeau De Marsac, N., Bouchier, C., Von Döhren, H., Börner, T. & Schwecke, T. (2003). The mcyF gene of the microcystin biosynthetic gene cluster from Microcystis aeruginosa encodes an aspartate racemase. Biochem J 373, 909916.