Kate M. J.
de Mattos-Shipley‡
*ab,
Catherine E.
Spencer‡
a,
Claudio
Greco
a,
David M.
Heard
a,
Daniel E.
O'Flynn
a,
Trong T.
Dao
a,
Zhongshu
Song
a,
Nicholas P.
Mulholland
c,
Jason L.
Vincent
c,
Thomas J.
Simpson
a,
Russell J.
Cox
d,
Andrew M.
Bailey
*b and
Christine L.
Willis
*a
aSchool of Chemistry, University of Bristol, Cantock's Close, Bristol, BS8 1TS, UK. E-mail: Chris.Willis@bristol.ac.uk; andy.bailey@bristol.ac.uk; kd4495@bristol.ac.uk
bSchool of Biological Sciences, University of Bristol, 24 Tyndall Avenue, Bristol, BS8 1TQ, UK
cSyngenta, Jealott's Hill International Research Centre, Bracknell, RG42 6EY, UK
dInstitute for Organic Chemistry and BMWZ, Leibniz University of Hannover, Schneiderberg 38, 30167, Hannover, Germany
First published on 7th October 2020
Maleidrides are a class of bioactive secondary metabolites unique to filamentous fungi, which contain one or more maleic anhydrides fused to a 7-, 8- or 9- membered carbocycle (named heptadrides, octadrides and nonadrides respectively). Herein structural and biosynthetic studies on the antifungal octadride, zopfiellin, and nonadrides scytalidin, deoxyscytalidin and castaneiolide are described. A combination of genome sequencing, bioinformatic analyses, gene disruptions, biotransformations, isotopic feeding studies, NMR and X-ray crystallography revealed that they share a common biosynthetic pathway, diverging only after the nonadride deoxyscytalidin. 5-Hydroxylation of deoxyscytalidin occurs prior to ring contraction in the zopfiellin pathway of Diffractella curvata. In Scytalidium album, 6-hydroxylation – confirmed as being catalysed by the α-ketoglutarate dependent oxidoreductase ScyL2 – converts deoxyscytalidin to scytalidin, in the final step in the scytalidin pathway. Feeding scytalidin to a zopfiellin PKS knockout strain led to the production of the nonadride castaneiolide and two novel ring-open maleidrides.
Results of extensive studies on selected nonadrides using isotopic labelling, gene knock outs and heterologous expression experiments have led to the proposal that the biosynthetic pathway begins with the assembly of an unsaturated precursor 11via an iterative highly reducing polyketide synthase (hrPKS), where the chain length varies according to the structure of the natural product (Scheme 1). Coupling of 11 with oxaloacetate is catalysed by citrate synthase-like enzymes and is followed by dehydration catalysed by 2-methylcitrate dehydratases to generate the maleidride monomer 12.15,16 Decarboxylation of 12 then gives the tautomeric compounds 13 and 14 (Scheme 1A).14,17 The second stage of the biosynthetic pathway, coupling the monomers in various modes and cyclisation to the carbocyclic ring, involves ketosteroid isomerase-like (KSI) and phosphatidylethanolamine binding protein-like (PEBP) enzymes.16 The mode of cyclisation determines the carbon framework of the maleidride (Scheme 1B), and finally tailoring modifications decorate these structures leading to the observed diversity of natural products.
Scheme 1 The universal pathway to maleidrides16 involves the production of maleic anhydride monomers (A) followed by dimerisation (B) – the mode of which determines the carbon framework of the final compound. |
Octadrides, with 8-membered rings, are less common than nonadrides and include viburspiran 918 and zopfiellin 1519 (Fig. 1 and 2). The biosynthetic origin of the 8-membered ring is unknown (§see footnote). Zopfiellin was first isolated from Zopfiella curvata and the structure determined by a combination of spectroscopic methods.19–21 The relative and absolute configurations of zopfiellin have been indicated in several publications,22–24 but we were unable to find any justification of the configuration of the side chain secondary alcohol. Hence, with our continuing interest in the biosynthesis of fungal natural products, and in particular bioactive maleidrides, our goal was to confirm the structure of zopfiellin 15 and determine the biosynthetic origin of the 8-membered ring, with the longer-term goal of engineering the pathway to new antifungal targets.
Whilst this manuscript was in preparation Minami, Oikawa and co-workers reported investigations into the biosynthesis of zopfiellin using isotopic labelling, heterologous expression of candidate genes in Aspergillus oryzae, combined with structure elucidation of minor metabolites. These studies showed that a ring contraction occurs from a nonadride to an octadride.25 Herein we describe our recent studies which confirm the absolute and relative configurations of the nonadride scytalidin 16 and octadride zopfiellin 15 and reveal that they share a common biosynthetic precursor, deoxyscytalidin 17 (Fig. 2). Our biosynthetic investigations involved a different approach from those previously reported25 and used a combination of gene disruptions in Diffractella curvata and Scytalidium album, feeding studies to mutant strains and full structure elucidation of known and novel compounds. These studies establish the biosynthetic relationships between zopfiellin 15, scytalidin 16, and deoxyscytalidin 17.
Feeding [1,2-13C2]-acetate to cultures of D. curvata showed mainly intact incorporation of acetate into zopfiellin 15 (Fig. 3). Enhancement of C-5 in the 13C NMR spectrum was observed, but there was no coupling to an adjacent 13C, indicating that it originates from a cleaved acetate and therefore that a carbon atom is lost during zopfiellin biosynthesis.
Fig. 3 Incorporation patterns in octadrides and nonadrides from feeding studies with [1,2-13C2]-acetate. |
A possible biosynthetic precursor to zopfiellin is therefore a nonadride which on rearrangement with the loss of one carbon atom would give the corresponding octadride. Such ring contractions are well precedented in fungal biosynthesis, for example, in the contraction of ring B of the kaurenoids to the gibberellins,26 the conversion of the 14-membered polyketide macrodiolide colletotriene to the 13-membered ring of bartanol27 and the oxidative ring contractions seen in xenovulene biosynthesis.28 Conceivably the known natural products scytalidin 1629 (C22H28O7), deoxyscytalidin 1730 (C22H28O6) or castaneiolide 611 (C22H28O8) could be precursors. To obtain samples of 16 and 17, cultures of S. album strain UAMH 3620 were grown, and after 14 days compounds 16 and 17 were isolated in similar titres (ca. 10 mg L−1). X-ray analysis confirmed that the alkyl side chains are syn in both metabolites (Fig. 2). The results of feeding studies using [1,2-13C2] acetate to cultures of S. album were in accord with the predicted biosynthetic pathway to nonadrides and that one or both could possibly be biosynthetic precursors of zopfiellin 15 (Fig. 3).
De novo genome sequencing was performed for D. curvata CBS591.74 and S. album UAMH3620, and an initial screen of the genome data, searching for homologues to the byssochlamic acid biosynthetic gene cluster (BGC),16 revealed putative maleidride gene clusters within both genomes. The gene clusters were analysed in detail using FGeneSH31 to predict coding sequences, and manually annotated in Artemis32 to produce full draft BGCs (Fig. 4 and Table 1). Putative gene functions were assigned via the identification of homologues in the NCBI database and the detection of conserved domains using InterPro33 (Tables S1 and S2†). Transcriptomic analysis for D. curvata under a range of zopfiellin 15 production and non-production conditions aided identification of the cluster boundaries by clearly identifying a co-transcribed region that includes genes from zopL9 to zopR4 (Fig. 4, S1 and S2†). In addition, detailed analysis of the transcriptomic data confirmed the positions of all introns and exons in the predicted ORFs. However, despite repeated attempts, it was not possible to generate quality RNAseq data for the putative scytalidin gene cluster in S. album.
Fig. 4 (A) RNAseq data mapped to the zopfiellin BGC showing coregulation of the genes under four different conditions (Fig. S1 and S2†). Pink: GN. Purple: CDB. Green: PDB-day 5. Black: PDB-day 8. (B) The zopfiellin gene cluster, as annotated in Artemis, which contains genes zopL9 to zopR4. (C) The scytalidin gene cluster, as annotated in Artemis, which contains genes scyL2 to scyR12. ‘L’ in the gene names denotes genes to the left, or upstream, of the PKS. ‘R’ denotes genes to the right, or downstream, of the PKS. Putative gene functions are shown in Table 1. |
Gene | Putative function | Homologue | Gene | Putative function | Homologue |
---|---|---|---|---|---|
zopL9 | Hydroxylase/Desaturase | DES34 | scyL2 | Hydroxylase | AclN36 |
AsaB35 | MfR137 | ||||
zopL8 | Hypothetical protein | — | scyL1* | PEBP | BfL516 |
zopL7 | Transcription factor (TF) | AlnR38 | scyPKS | hrPKS | Tox1-PKS39 |
zopL6 | Major facilitator superfamily (MFS) transporter | Itp140 | scyR1 | DUF341 hydrolase/esterase | Fub441 |
zopL5 | 2-Methylcitrate dehydratase (2MCDH) | PrpD42 | scyR2 | Enoyl CoA hydratase | Ech143 |
zopL4* | Ketosteroid isomerase-like protein (KSI) | BfL616 | scyR3 | Citrate synthase | MfR344 |
zopL3 | Major facilitator superfamily (MFS) transporter | Itp140 | scyR4 | Isochorismatase-like hydrolase/amidohydrolase | CSHase45 |
zopL2 | Isochorismatase-like hydrolase/amidohydrolase | NicF46 | scyR5 | MFS transporter | MfM644 |
scyR6* | KSI | BfL616 | |||
zopL1 | Phosphatidylethanolamine-binding protein (PEBP) | Tfl147 | scyR7 | 2-Methylcitrate dehydratase | PrpD42 |
zopPKS | hrPKS | Tox1-PKS39 | scyR8 | Transcription factor | AlnR38 |
zopR1 | DUF341 hydrolase/esterase | Fub441 | scyR9 | Histone H2A | H2A.Z48 |
zopR2 | Enoyl-CoA hydratase | Ech143 | scyR10 | MFS transporter | MfR544 |
zopR3 | Citrate synthase | CshA49 | scyR11 | Hypothetical protein | — |
zopR4 | FAD-dependent oxidoreductase | YanF50 | scyR12* | PEBP | BfL916 |
Sol551,52 |
To confirm the identity of the two gene clusters, protoplast-mediated transformation protocols were developed for both fungal species and the PKS genes; zopPKS and scyPKS, were disrupted using the bipartite gene knock-out approach developed by Nielsen et al.53 In both cases this led to a total loss of maleidride biosynthesis, with no scytalidin 16 or deoxyscytalidin 17 being present in the crude extracts of the ΔscyPKS strains (Fig. 5e), and no zopfiellin 15 being present in the crude extracts of the ΔzopPKS strains (Fig. 5h).
An ACT (Artemis Comparison Tool)54 comparison of the zopfiellin and scytalidin BGCs (Fig. 6) highlighted significant homologies. Both clusters contain genes encoding the essential hrPKS, DUF341 hydrolase, citrate synthase (CS) and 2-methylcitrate dehydratase (2MCDH),12,13,15,16 as well as the ‘dimerisation’ genes encoding a ketosteroid-isomerase-like protein (KSI) and a PEBP-like protein.16 In the case of the scytalidin BGC a second PEBP gene is present, which has been observed previously in the byssochlamic acid and rubratoxin BGCs.12,16 A domain analysis of the two PKS genes identified all of the domains typically present in a highly-reducing PKS: KS, AT, DH, CMeT, ER, KR and ACP. Additional genes identified in both clusters encode an enoyl-CoA hydratase, an isochorismatase, a small hypothetical protein (zopL8 and scyR11), a GAL4 type transcription factor and two MFS (major facilitator superfamily) transporters. The enoyl-CoA hydratase encoded by the zopfiellin cluster (zopR2) was not identified in the work by Oikawa and colleagues,25 but RNAseq data confirmed its annotation.
The proteins encoded by the clusters also have high sequence identity (Table S3†). The KSI and PEBP-like proteins, for example, are far more similar to one another (80.7% and 66.2% identity respectively, Table S3†) than to those identified in other maleidride BGCs (Fig. S4–S7†). In the case of the transporters, ZopL6 and ScyR5 are highly homologous (80.4% identity) and thus may be expected to transport similar compounds. The other two transporters, ZopL3 and ScyR10, are more diverse, sharing only 38.5% identity. All other homologues encoded by the gene clusters share over 50% identity, with many sharing over 80% (Table S3†).
The only proteins which are uniquely encoded by the zopfiellin gene cluster are a putative hydroxylase/desaturase (ZopL9), which shares homology with DES from the gibberellin pathway, and an FAD-dependent oxidoreductase (ZopR4). The scytalidin gene cluster encodes only one unique protein, a putative hydroxylase (ScyL2). ZopL9 and ScyL2 are both predicted to be α-ketoglutarate (αKG) dependent enzymes based on the presence of conserved iron and αKG binding residues (Fig. S10†), but they were not considered to be homologues due to the low level of sequence identity (approximately 25% identity), suggesting that the exact catalytic function is unlikely to be the same. Also located within the scytalidin BGC is a gene (scyR9) encoding a highly conserved H2A.F/Z family histone,55 but this is unlikely to play a direct role in scytalidin biosynthesis (Table S3†).
The striking similarity between the two gene clusters (Fig. 6) suggests that zopfiellin 15 and scytalidin 16 have very similar biosynthetic origins, and that either deoxyscytalidin 17 or scytalidin 16 may be precursors of zopfiellin. To test this theory, 16 and 17 were purified from S. album cultures and separately fed to D. curvata ΔzopPKS cultures on days 3 and 5 post-inoculation (5 mg per day per 100 ml of culture). In cultures fed with 17, zopfiellin production was restored (Scheme 2 and Fig. 7e), demonstrating that deoxyscytalidin 17 is a precursor of zopfiellin. The extent of conversion of deoxyscytalidin to zopfiellin varied between individual feeds, presumably due to varying expression levels for the zopfiellin BGC. In cultures where the conversion was partial, the HPLC trace showed a peak for deoxyscytalidin, as well as a novel compound with a mass of 404 eluting at 16.1 minutes (Fig. 7e). This metabolite was isolated, and the structure confirmed by NMR spectroscopy to be 5-hydroxy-deoxyscytalidin 19 (Table S9 and Fig. S44–S49†). The signal assigned to 5-H (at δ 4.90) appeared as a broad singlet and NOE studies were in accord with hydroxylation occurring on the same face as the butyl side chain at C-6. To determine whether 19 is a shunt product or an intermediate on the biosynthetic pathway, it was fed to D. curvata ΔzopPKS cultures and again this restored zopfiellin biosynthesis (Scheme 2 and Fig. 7f).
Scheme 2 Feeding studies to D. curvata strain ΔzopPKS identified 17 and 19 as intermediates in zopfiellin biosynthesis and scytalidin was metabolised to novel products. |
When scytalidin 16 was fed to cultures of D. curvata ΔzopPKS, no zopfiellin was detected, but several additional compounds were present when compared to controls (Fig. 7g). The mixture was purified by HPLC giving the known11 nonadride castaneiolide 6 as one of the metabolites, presumably formed by 5-hydroxylation of scytalidin 16 (Scheme 2). In addition, two novel compounds 20 and 21 were isolated (3.2 mg and 0.7 mg respectively) and their structures elucidated by MS and extensive NMR studies (Table S10 and Fig. S50–S63†). The 13C NMR spectrum of compound 20 (C22H28O8 from HRMS) showed downfield signals assigned to the two maleic anhydrides, as well as an additional signal at δC 205 ppm assigned to a ketone. The 1H-NMR showed characteristic AB signals assigned to the hydroxymethylene. Based on this evidence, and combined with further analysis of 2D NMR data, the new metabolite was assigned as the ring cleaved alcohol 20. The spectral data of 21 was also in accord with a ring cleaved metabolite with a ketone in the side chain, but in this case the primary alcohol was oxidised to a carboxylic acid and one of the carbonyls of the anhydride reduced to give a mixture of epimers (δC 96.9/97.4 ppm, δH 6.19/6.23 ppm). Such reductions of anhydrides are commonly observed in maleidride biosynthesis.14,56,57 We propose that ketone 20 may be formed via cleavage of the 5,6-diol 6 to give an intermediate 22, which would tautomerise to generate ketone 20. Oxidation of the primary alcohol of 20 to a carboxylic acid and selective reduction of the anhydride would give 21. Such linear bis(maleic anhydride) structures are not without precedent, for example the fungal metabolite cordyanhydride A 23.58 However, 23 has previously been proposed to arise via the linear coupling of two distinct maleic anhydride monomers rather than via cleavage of a cyclic intermediate.15
Based on the bioinformatic analysis of the BGCs and the identification of deoxyscytalidin 17 as an intermediate to both scytalidin 16 and zopfiellin 15, it is a reasonable inference that the two biosynthetic pathways proceed in an identical manner to 17, after which they diverge. Hydroxylation at C-6 would produce scytalidin whereas hydroxylation at C-5, ring contraction and hydroxylation of the side chain would produce zopfiellin.
ScyL2, being the only unique catalytic protein encoded by the scytalidin BGC, was identified as a likely candidate for catalysing the final step in the scytalidin biosynthetic pathway. This is supported by a bioinformatic analysis identifying ScyL2 as a likely α-ketoglutarate dependent hydroxylase (Table S2†), which shares homology with enzymes known to catalyse hydroxylations, such as MfR1 and MfR2 from the squalestatin pathway,37 and RbtB and RbtG from the rubratoxin pathway12 (Fig. S8–S10†). To confirm this role, scyL2 knock-out strains of S. album were generated. These strains did not produce scytalidin 16, but accumulated deoxyscytalidin 17 (Fig. 5f), confirming the role of ScyL2.
Recent work by Shiina et al.25 on the zopfiellin pathway of Z. curvata, using heterologous expression as well as in vitro assays, has begun to elucidate the later stages of zopfiellin biosynthesis. ZopK, which is equivalent to ZopL9 of the D. curvata BGC reported here, was shown to catalyse the oxidative ring contraction in the zopfiellin pathway to produce deoxyzopfiellin 24.25 This proceeds via the intermediate 19 (which we now know to be 5-hydroxy-deoxyscytalidin) and is consistent with our identification, via feeding studies, of 19 as an intermediate in the zopfiellin pathway. In vitro assays using purified recombinant ZopL9 gave results which are entirely consistent with the findings of Shiina et al.25 In the presence of α-ketoglutarate, ZopL9 was capable of catalysing highly efficient hydroxylation of deoxyscytalidin 17 to give 5-hydroxy-deoxyscytalidin 19 (Fig. S68†). A minor compound was also detected which was purified and identified as deoxyzopfiellin 24. Repeating the assays using 5-hydroxy-deoxyscytalidin 19 as the substrate also gave rise to deoxyzopfiellin 24 (Fig. S68†), in accord with the proposal that ZopL9 is implicated in the ring contraction in the zopfiellin pathway of D. curvata, by first hydroxylating deoxyscytalidin to give 19, then catalysing the ring contraction to give the octadride core.
Having identified 17, 19 and 24 as intermediates in the zopfiellin pathway – through both feeding studies and in vitro assays – an extract of wild-type D. curvata was re-analysed, which revealed the presence of all three compounds as minor metabolites (Fig. S25†). Taken together with the bioinformatic analysis and the discovery that ScyL2 catalyses the final step of the scytalidin pathway, a branching biosynthetic route for the production of deoxyscytalidin, scytalidin and zopfiellin can now be proposed as shown in Scheme 3.
Scheme 3 Proposed biosynthetic pathways of the maleidrides deoxyscytalidin 17, scytalidin 16 and zopfiellin 15. |
Intriguingly there is a homologue to ZopL9 (PhiK) encoded by the phomoidride BGC (44.2% homology – Fig. S11†). The structure of the phomoidrides, e.g.5, suggests that multiple oxidations must occur during their biosynthesis, but no ring contraction of the nine-membered core occurs in this pathway, and the role of PhiK has not yet been investigated.
Footnotes |
† Electronic supplementary information (ESI) available. CCDC 2011262–2011264. Crystallographic data for compounds 16, 17 and 18. GenBank accession numbers; MT724049 (zopfiellin BGC) and MT724050 (scytalidin BGC). For ESI and crystallographic data in CIF or other electronic format see DOI: 10.1039/d0sc04309e |
‡ These authors contributed equally. |
§ The numbering systems used for the maleidrides varies greatly in the literature and shows no consistency. We propose a common more systematic system based on the size of the ring (1–9, 1–8, 1–7 as appropriate) beginning at the carbon alpha to the maleic anhydride ring, which gives the lowest numbers to the side chains. The maleic anhydride carbons would be numbered with a prime, appropriate to the ring numbering, hence 3′4′ and 8′9′ for byssochlamic acid, and 1′′, 2′′, etc. for the first side chain, numbering from the ring junction and 1′′′, 2′′′, etc. for the second side chain. |
This journal is © The Royal Society of Chemistry 2020 |