Filip J.
Wyszynski
a,
Andrew R.
Hesketh
b,
Mervyn J.
Bibb
*b and
Benjamin G.
Davis
*a
aDepartment of Chemistry, University of Oxford, Chemistry Research Laboratory, 12 Mansfield Road, Oxford, OX1 3TA, UK. E-mail: ben.davis@chem.ox.ac.uk; Fax: +44 (0)1865 285002; Tel: +44 (0)1865 275652
bDepartment of Molecular Microbiology, John Innes Centre, Norwich Research Park, Norwich, NR4 7UH, UK. E-mail: mervyn.bibb@bbsrc.ac.uk; Fax: +44 (0)1603 450778; Tel: +44 (0)1603 450773
First published on 10th September 2010
Tunicamycin nucleoside antibiotics were the first known to target the formation of peptidoglycan precursor lipid I in bacterial cell wall biosynthesis. They have also been used extensively as inhibitors of protein N-glycosylation in eukaryotes, blocking the biogenesis of early intermediate dolichyl-pyrophosphoryl-N-acetylglucosamine. Despite their unusual structures and useful activities, little is known about their biosynthesis. Here we report identification of the tunicamycin biosynthetic genes in Streptomyces chartreusis following genome sequencing and a chemically-guided strategy for in silico genome mining that allowed rapid identification and unification of an operon fractured across contigs. Heterologous expression established a likely minimal gene set necessary for antibiotic production, from which a detailed metabolic pathway for tunicamycin biosynthesis is proposed. These studies unlock a comprehensive and unusual toolbox of biosynthetic machinery with which to create variants of this important natural product, allowing possible improved understanding of the mode of action and facilitating future redesign. We anticipate that these results will enable the generation of altered specific inhibitors of diverse carbohydrate-processing enzymes, including improved targeting of lipid I biosynthesis.
Fig. 1 (a) Structures of the tunicamycins and (b) sites of label incorporation from prior feeding experiments with labelled D-glucose.13 See ESI† for a detailed interpretation of possible label incorporation pathways. |
High molecular weight genomic DNA was isolated from the two bacterial strains known to produce tunicamycin - S. chartreusis NRL3882 and S. lysosuperificus ATCC31396.13 The latter was shown to contain a high copy number plasmid and since this would bias subsequent sequencing data towards plasmid sequences, we elected to work with S. chartreusis NRL3882. The genome was sequenced to 36× coverage, generating 3112 contigs19 with a maximum size of 53.9 kb and an N50 average size20 of 4.6 kb. The contigs covered 7.95 Mb of the S. chartreusis chromosome with a G+C content of 70 mol%, consistent with existing data for Streptomyces genomes.21
Using bioinformatics tools tBLASTn22 and Artemis,23 these contigs were scanned for the presence of candidate tun genes. The tunicamycins contain many unique structural motifs (Figure 1) created by proteins with functions for which no similar examples or parallels are known. We therefore selected from a wide range of existing gene products with possible, chemically-similar function to putative members of the tunicamycin biosynthetic cluster. These focused on unique features and included: 1→1-linking glycosyltransferases (such as those from 3,3′-neotrehalosadiamine biosynthesis24 and from the OtsA-OtsB, TreY-TreZ and TreS pathways of trehalose biosynthesis25), N-acetyl-hexosamine N-deacetylases (e.g., from mycothiol, glycophosphatidylinositol, neomycin and teicoplanin biosynthesis26), lipid-processing proteins and numerous examples from the abundant NDP-hexose epimerase and dehydratase families. Using the combined presence of more than one of these reactivities (putative activity indicated by high similarity scores in tBLASTn to known enzyme activities) coupled with a knowledge of the required enzymology as a sifting strategy by which to mine the genome sequence data, a single contig was identified as the only ‘hit’ containing homologues of inosityl-GlcNAc deacetylase, hexose epimerase/dehydratase and acyl carrier protein encoded by a cluster of eleven open reading frames (ORFs). The 3′-end of this operon coincided with the end of the contig and terminated with the partial sequence of a putative GT-1 family glycosyltransferase (termed tunD), another key element of our reactivity filter. The full length sequences of glycosyltransferases homologous to TunD were screened against the S. chartreusis genome and an additional contig detected, containing the remaining portion of tunD and three further ORFs. In this way, the use of a bioinformatics filter, based on chemical logic, uniquely suggested the unification of an operon fractured across two distinct contigs. This unification was confirmed experimentally by generating a PCR product to bridge the gap between the two contigs, using primers matching the 5′ and 3′ ends of tunD and a genomic DNA template; its sequence confirmed that the two contigs were indeed adjacent on the chromosome and revealed that there were no bases between them. This joined contig contained 14 ORFs that appear to lie in a single operon, with many translationally coupled to the preceding gene (Figure 2). Analysis of the 5′ upstream region of each ORF in the cluster revealed that most had at least four bases of the GGAGG ribosomal binding site motif; regions preceding tunJ, tunM and tunN possess a GGA motif. Translational coupling to tunK likely allows effective expression of tunL and tunM. Further bioinformatic analysis revealed that these genes are likely to constitute a tunicamycin biosynthetic gene cluster and the predicted function of genes tunA-N and flanking ORFs are presented in Table 1 and Figure 2.
Fig. 2 Genetic organisation of the tunicamycin biosynthetic gene cluster in S. chartreusis and its homologues in S. clavuligerus and A. mirum. |
ORF | aaa | Proposed function in tunicamycin biosynthesis | Closest protein homologued, origin, (%Id/Si)b, Acc.c | Homologue in S. clavuligerus ATCC27064, aa, (%Id/Si), Acc. | Homologue in A. mirum DSM43827, aa, (%Id/Si), Acc. |
---|---|---|---|---|---|
a Number of amino acids. b (% Identity/Similarity). c Accession number. d Homologues from S. clavuligerus ATCC27064 and A. mirum DSM43827 were omitted from this column. | |||||
ORF-2 | 406 | Integrase, Streptomyces sviceus ATCC 29083, (52/65), ZP_05020140 | |||
ORF-1 | 297 | Transposase, Rhodococcus jostii RHA1, (50/64), YP_700005 | |||
TunA | 321 | UDP-GlcNAc epimerase/dehydratase | NAD-dependent epimerase/dehydratase, Streptomyces sp. Mg1, (42/58), ZP_04996782 | SCLAV_4287, 276, (72/80), ZP_06773762 | Amir_2816, 322, (54/65), YP_003100592 |
TunB | 338 | Uridine oxidoreductase | Radical SAM domain protein, Haloterrigena turkmenica DSM5511, (32/48), YP_003405396 | SCLAV_4286, 338, (90/95), ZP_06773761 | Amir_2817, 340, (78/87), YP_003100593 |
TunC | 318 | N-Acyltransferase | GCN5-related N-Acetyltransferase, Fervidobacterium nodosum, (33/50), YP_001410548 | SCLAV_4285, 322, (60/72), ZP_06773760 | Amir_2818, 318, (43/57), YP_003100594 |
TunD | 474 | Glycosyltransferase | Group 1 family glycosyltransferase, Thermococcus barophilus, (28/45), ZP_04876510 | SCLAV_4284, 461, (63/77), ZP_06773759 | Amir_2819, 451, (47/58), YP_003100595 |
TunE | 234 | N-Deacetylase | GlcNAc-phosphatidylinositol de-N-acetylase, Cylindrospermopsis raciborskii, (43/57), ZP_06309433 | SCLAV_4283, 236, (77/85), ZP_06773758 | Amir_2820, 230, (63/76), YP_003100596 |
TunF | 327 | UDP-GlcNAc- 4-epimerase | UDP-glucose 4-epimerase, Paenibacillus sp. oral taxon 786, (46/63), ZP_04852226 | SCLAV_4282, 327, (76/83), ZP_06773757 | Amir_2821, 332, (58/70), YP_003100597 |
TunG | 203 | UMP phosphatase | Phosphoglycerate mutase, Frankia sp. CcI3, (29/47), YP_481446 | SCLAV_4281, 208, (65/73), ZP_06773756 | Amir_2822, 223, (50/60), YP_003100598 |
TunH | 515 | UDP-tunicaminyluracil pyrophosphatase | Type I nucleotide pyrophosphatase, Burkholderia sp. 383, (35/50), YP_370731 | SCLAV_4280, 518, (66/76), ZP_06773755 | Amir_2823, 510, (53/65), YP_003100599 |
TunI | 304 | ABC transporter ATP-binding subunit | Putative ABC transporter ATP-binding subunit, Streptomyces scabiei 87.22, (41/61), YP_003492364 | SCLAV_4279, 302, (77/88), ZP_06773754 | Amir_2824, 302, (60/73), YP_003100600 |
TunJ | 262 | ABC transporter permease subunit | ABC-2 type transporter, Thermobaculum terrenum, (32/51), YP_003322218 | SCLAV_4278, 261, (76/83), ZP_06773753 | Amir_2825, 253, (61/78), YP_003100601 |
TunK | 81 | Acyl carrier protein | Phosphopantetheine-binding protein, Catenulispora acidiphila DSM 44928, (32/61), YP_003117493 | SCLAV_4277, 81, (65/87), ZP_06773752 | Amir_2826, 79, (34/54), YP_003100602 |
TunL | 229 | Phospholipid phosphatase | Phosphoesterase PA-phosphatase, Micromonospora aurantiaca, (33/42), ZP_06217896 | SCLAV_4276, 223, (52/67), ZP_06773751 | - |
TunM | 216 | Radical SAM protein | Methyltransferase family protein, Saccharomonospora viridis, (48/63), YP_003133112 | SCLAV_4274, 212, (54/67), ZP_06773749 | Amir_2815, 232, (30/54), YP_003100591 |
TunN | 152 | UTP pyrophosphatase | NUDIX hydrolase, Nakamurella multipartita, (36/55), YP_003200035 | SCLAV_4275, 170, (68/77), ZP_06773750 | - |
ORF1 | 213 | Secreted protein, Streptomyces viridochromogenes, (81/90), ZP_05533938 | |||
ORF2 | 573 | Secreted protein, Streptomyces viridochromogenes, (90/94), ZP_05533937 | |||
ORF3 | 606 | Secreted protein, Streptomyces viridochromogenes, (82/89), ZP_05533936 |
Fig. 3 Evidence of heterologous production of tunicamycins in S. coelicolor. (A): Bioassay showing heterologous expression of (i) a genomic library-derived cosmid harboring the tun gene cluster introduced into a S. coelicolor M1152 host (giving recombinant strains S. coelicolor M1027 and M1028 derived from library cosmids 6N9 and 7C3, respectively) and control strain S. coelicolor M1030 (containing the same cosmid but without any insert sequence) and (ii) the minimal tun gene cluster cloned into pRT802 in S. coelicolor M1146 (giving recombinant strain S. coelicolor M1035) and control strain S. coelicolor M1031 (containing the empty pRT802 cosmid); (B): LC/MS analysis of (i) an authentic tunicamycin sample and mycelium extracts of these recombinant S. coelicolor strains (see ESI† for further chromatograms) (ii) M1031, (iii) M1035, (iv) M1027 and (v) M1030. See also Fig. S8, ESI† for 1H NMR analysis of the extracts. |
The ORFs flanking the proposed tun gene cluster are clearly not required for tunicamycin biosynthesis. The three flanking genes downstream of the tun cluster (ORF1-3) have close homologues in many Streptomyces genomes and encode conserved housekeeping genes. The two upstream flanking genes (ORF−1 and ORF−2) are homologous to transposase and integrase genes respectively, lending support to the hypothesis that S. chartreusis acquired the tun gene cluster by lateral gene transfer. The 1.9 kb region between ORF−1 and tunA contains a putative ORF with multiple frameshifts (“junk DNA”), again consistent with recent evolutionary acquisition.
Bioinformatic analysis revealed other potential tunicamycin producers. Homologous gene clusters were identified in A. mirum DSM4382730 and S. clavuligerus ATCC27064; the latter has been reported to produce the closely related antibiotic MM19290.5 Only minor differences were observed between the three gene clusters, suggesting a recently shared evolutionary heritage (Figure 2, Table 1).
Proteins encoded by the S. chartreusis gene cluster exhibit greatest similarity to those from S. clavuligerus, with amino acid sequence identities ranging from 52 to 90%. It is highly likely that genes annotated SCLAV_4274 to SCLAV_4287 are responsible for MM19290 biosynthesis in S. clavuligerus ATCC27064. Although the structure of MM19290 has not been reported, the high degree of homology with the tun genes from S. chartreusis strongly suggests that, like the streptovirudins and corynetoxins, this compound shares its core carbohydrate skeleton with tunicamycin.
Proteins encoded by the A. mirum gene cluster exhibit amino acid sequence identities with Tun proteins from S. chartreusis that range from 30 to 78%. While no full-length homologues of TunN or TunL were found, closer inspection of the A. mirum sequence revealed a truncated version of TunL that contained a number of frameshift mutations. Since this organism has not been reported to produce any antibiotics structurally related to the tunicamycins, it is probable that we have uncovered a silent gene cluster that has lost the ability to produce its tunicamycin-like metabolite.
Fig. 4 Proposed biosynthetic pathway for the tunicamycins. See also Fig. S2 and section S2, ESI† for a more detailed discussion of the roles of the tun gene products. |
The tun gene cluster described is relatively small in size, although a previous suggestion that as few as five genes would be necessary for the biosynthesis of tunicamycin has proved too conservative.35 Of the nine additional genes not originally predicted, two are involved in the generation of free uridine from UTP, contrary to suggestions that uridine would be obtained directly from primary metabolism.13 Two further genes are implicated in formation of UDP-tunicaminyl-uracil, one coding for a sugar epimerase supplementary to the dehydratase catalyzing UDP-4-keto-5,6-ene-GlcNAc formation, and one which potentially mediates the radical coupling event alongside the gene responsible for uridine oxidation. Hydrolysis of UDP from the undecose intermediate has also been suggested to require enzyme catalysis. Although the acyl side chains are likely to originate from cellular pools of fatty acids, consistent with the lack of a fatty acid synthase, the tun gene cluster still encodes two enzymes that may provide sufficient fatty acid flux and are involved in sequestering lipids and processing them prior to attachment. Finally, the last two additional tun genes are not directly involved in tunicamycin biosynthesis, but are likely to be crucial in conferring self-resistance to the producing organism. tunI and tunJ together encode for a putative ABC transporter, homologues of which are responsible for rapid ATP-driven efflux of antibiotics from cells in a large number of antibiotic-producing organisms.36
No regulatory genes were found in the tun gene cluster, suggesting that tunicamycin production may be subject to global control associated with growth rate reduction. The presence of rare TTA leucine codons (only 2% of S. coelicolor genes contain a TTA codon) in tunA and tunM may well reflect an element of translational regulation. In S. coelicolor, the accumulation of LeutRNAUUA is temporally regulated, and translation of mRNAs containing this codon may be largely confined to later stages of growth.37
The findings presented here will allow detailed studies of tunicamycin biosynthesis. Functional characterization of individual enzymes will provide insight into how some of the unique linkages in tunicamycin are constructed. In addition, armed with this comprehensive toolbox of biosynthetic machinery, tunicamycin analogues with altered selectivity for bacterial MraY versus human GPT can now be sought, potentially leading to future therapeutic antibiotics with improved antibacterial activity and reduced cytotoxicity. Importantly, the mode of action of tunicamycin is orthogonal to all existing antibiotic drugs. Tunicamycin also provides a unique natural product template for inhibition of carbohydrate-processing enzymes. It represents a possible substrate and/or transition state mimic and hence transition state mimics of other important nucleotide sugar-dependent carbohydrate-processing enzymes might also be targeted by precursor-driven biosynthesis or chemoenzymatic methods, exchanging terminal functionalities of the tunicamycin structure.
Footnotes |
† Electronic supplementary information (ESI) available: Detailed description of individual tun genes, a summary of prior studies into tunicamycin biosynthesis, experimental procedures, chromatograms and NMR spectra. See DOI: 10.1039/c0sc00325e/ |
‡ The sequences reported in this publication for the tun cluster are available in the Genbank database, accession code HQ172897. |
§ We thank a referee for this very useful suggestion. |
This journal is © The Royal Society of Chemistry 2010 |