Shan
Wang
*ab,
Fleurdeliz
Maglangit
c,
Qing
Fang
b,
Kwaku
Kyeremeh
d and
Hai
Deng
*b
aState Key Laboratory of Microbial Technology, Shandong University, Qingdao 266237, China. E-mail: shan.wang@sdu.edu.cn
bDepartment of Chemistry, School of Natural and Computing Sciences, University of Aberdeen, Aberdeen AB24 3UE, UK. E-mail: h.deng@abdn.ac.uk
cDepartment of Biology and Environmental Science, College of Science, University of the Philippines Cebu, Lahug, Cebu City 6000, Philippines
dMarine and Plant Research Laboratory of Ghana, Department of Chemistry, University of Ghana, P.O. Box LG56, Legon-Accra, Ghana
First published on 30th September 2024
The Baeyer–Villiger monooxygenase (BVMO), LgnC, plays a crucial role in the biosynthesis of bacterial pyrrolizidine alkaloids, legonmycins. It processes bicyclic indolizidine substrates generated from the coordinative action of two non-ribosomal peptide synthetases (LgnB and LgnD) and the standalone type II thioesterase-like enzyme (LgnA). It has been demonstrated that the enzyme selectively inserts molecular oxygen into the carbon–carbon bond adjacent to the carbonyl group in legonindolizidines to form bicyclic 1,3-oxazepine carbamate intermediates. After ring opening and contraction, the most advanced products, prelegonmycins, are formed. However, factors controlling the final hydroxylation step and how the enzyme handles the substrates have remained elusive. In this study, we show that the final hydroxylation at the activated carbon of the electron-rich pyrrole system is attributed to either spontaneous oxidation or the action of an endogenous redox reagent. Substrate docking on the structural model of LgnC combined with site-directed mutagenesis allows the identification of several key amino acids that are essential for substrate/intermediate binding and a mechanism of LgnC-catalysed transformation is proposed.
Pyrrolizidine alkaloids (PAs) are a group of heterocyclic specialised metabolites, featured by two-fused 5-membered rings with a nitrogen atom at the bridgehead. While PAs have been mainly found as plant metabolites, less than 40 PAs have been discovered from bacterial origins, many of which exhibit potent anticancer and antimicrobial activities.14 The representatives include clazamycins 1,15,16 jenamidine B 2,17 legonmycins 3,18 pyrrolizixenamide 419 and azetidomonoamide B20,21 (or azabicyclene) 5 (Fig. 1(A)). Some of these PAs (i.e.1–3) exist as isomers as they possess an achiral hydroxyl group at the C7 position on the carbon bridgehead of their bicyclic rings (Fig. 1(A)). It has been proposed that, although the hydroxylation at C7 may occur through enzymatic reactions in a stereospecific manner, the products undergo spontaneous racemization at neutral pH.15,18
Recent biosynthetic studies demonstrated that the biosynthesis of bacterial PAs starts with the production of bicyclic [5+6] indolizidine intermediates 6, resulting from the action of bimodular non-ribosomal peptide synthetases (NRPSs) in most bacterial PA pathways18–22 (Fig. 1(B)). These indolizidine intermediates are further modified by a group of pathway-specific BVMO enzymes, which catalyse a Baeyer–Villiger ring expansion from 6 to [5+7] 1,3-oxazepine carbamates 718 (Fig. 1(B)). Subsequent non-enzymatic hydrolysis leads to ring opening and decarboxylative contraction, ultimately yielding [5+5] PA ring systems, as demonstrated in a recent report (Fig. 1(B)).23 In the case of legonmycins, their biosynthesis, however, requires four enzymes, where LgnB and LgnD are NRPSs, possessing domain arrangements of A1–T0 and C1–T1–C2–A2–T2–TE, respectively (Fig. 1(C)). It has been demonstrated that LgnA, an unusual type II thioesterase (TEII), catalyses the essential aminoacyl translocation from LgnB-T0 to LgnD-T1, facilitating downstream reactions24 (Fig. 1(C)). Biochemical analysis also showed that the LgnD-C2 domain, a member of unusual C domains,25 catalyses the essential dehydration reaction on LgnD-T1-tethered acyl-Thr to provide the corresponding acyl-dehydrobutyrine intermediates24 (Fig. 1(C)).
Interestingly, the pathways of bacterial PAs share the same bimodular NPRS architectures with the lipocyclocarbamates, brabantamides 9, to provide intermediates with identical [5+6] indolizidine ring systems 7b (Fig. 1(B)).26,27 However, a homologue of PA BVMOs, BraC, encoded in the corresponding biosynthetic gene cluster, catalyses a diverse ring re-arrangement of the [5+7] 1,3-oxazepine carbamates to provide 9 (Fig. 1(B)).23 In some cases, the [5+7] 1,3-oxazepine carbamates produced by these BVMOs could be trapped or further modified by other enzymes to form stable products, which are then released into the fermentation broth. This can be exemplified in clipibicyclenes 10,28 cyclocarbamides,29 legoncarbamate,30 and SB-31502131 (Fig. S1, ESI†), which were found to be potent antimicrobial agents. Recently, it has been shown that 10 is a potent covalent inhibitor of bacterial caseinolytic proteases (ClpPs),28 which are part of bacterial highly conserved proteolytic complexes that are, in many cases, essential for growth.
In this study, we report that the recombinant LgnC can efficiently mediate the biotransformation of legonindolizidine A 8 to prelegonmycin A 11 (Fig. 1(C)). However, the hydroxylation at the C7 position of 11 to form 3 in our in vitro assays likely results from spontaneous oxidations. Assays of legonindolizidine A or purified prelegonmycin A with E. coli whole cells where the recombinant LgnC was overexpressed along with the necessary cofactors, demonstrate that legonmycin A is efficiently produced, strongly indicating that the hydroxylation at the C7 position of 11 is mediated by an endogenous hydroxylase. Structural modelling of LgnC with its cofactor FAD and subsequent substrate docking analysis allow the identification of key amino acid residues in its putative active site. Site-directed mutagenesis (SDM) experiments coupled with bioinformatic analysis indicate that H200, Y219, and W221, which are highly conserved in PA BVMOs, play important roles in the BV biotransformation from 8 to 11. Finally, a mechanism of LgnC-mediated transformation is proposed.
Efforts were then made to optimize the reaction conditions to improve the efficient biotransformation from 8 to 3. Incubation with the recombinant LgnC-TF in the presence of 8 (acquired from our previous studies18) together with FAD (0.1 mM), NADP+ (1 mM), and the NADPH recycling system18 has only resulted in a trace amount of 3 (by HPLC) (Fig. 2(A)) as also observed in MS and MS fragmentation analysis (Fig. S2, ESI†). However, a major compound with the same UV absorption has emerged in our HPLC analysis, as indicated in our time course assays (Fig. 2(A)). LC-MS and MS fragmentation analysis indicated that this accumulated compound is prelegonmycin A 11 that lacks the hydroxyl group at the C7 position (Fig. S3, ESI†). It is worth noting that there was a large portion of substrate 8 that remained even after extensive reaction time (6 h) (Fig. 2(A)). Although, to the best of our knowledge, there is no literature supporting that recombinant proteins with TF tags affect the overall enzymatic activities,32 attempts were made to remove the TF tag from the recombinant LgnC. However, such cleavage resulted in the complete loss of soluble proteins and enzymatic activities (data not shown).
To investigate this further, we cloned the lgnC into pGM1190, a self-replicative shuttle vector with a thiostrepton-inducible promoter PtipA used for gene expression in Streptomyces.33 The subsequently isolated pGM1190-lgnC was directly introduced into Streptomyces coelicolor M1152, a heterologous host for recombinant protein production,34via conjugation for heterologous expression without further modification. The protein extract containing overexpressed recombinant LgnC was used for enzyme assays. Addition of 8 into this protein extract, together with exogenous FAD (0.1 mM) and NADPH (1 mM), resulted in the accumulation of 11 with only a trace amount of 3 in our time course experiments (Fig. 2(B)). It appears that all of 8 was consumed after a 4-h assay as observed in HPLC traces. Interestingly, the NADPH recycling system was no longer required to complete the transformation from 8 to 11. Similar to what was observed in the assays of LgnC-TF, only a trace amount of 3 was produced during these time-course experiments. However, purification of recombinant LgnC from the Streptomyces host proved to be difficult due to the low expression level, often resulting in protein contamination.
Taken together, it is likely that, while the recombinant LgnC is able to efficiently catalyse the conversion of 8 to 11, the final hydroxylation in our in vitro assays may result from spontaneous oxidation on the activated C7 position of the electron-rich pyrrole scaffolds, similar to the observation found during the synthesis of legonmycins.35
To further investigate the hydroxylation, we set out to isolate compound 11via large-scale biotransformation and multi-rounds of semi-preparative HPLC separations. MS and MS2 fragmentation analysis confirmed the molecular identity of 11, displaying a similar UV profile to the ones of 3 (Fig. S4, ESI†). In a controlled experiment where the boiled recombinant LgnC (inactivated enzyme) was used, 11 remained unchanged with a small amount of 3 presented, which likely originated from the spontaneous oxidation of 11 during this overnight incubation (Fig. 3(A) trace iv). An assay with recombinant LgnC-TF and 8 showed a similar level of 11 produced compared to the control experiment with a small amount of 3 (Fig. 3(A) trace v). This strongly indicated that LgnC is only responsible for the conversion of 8 to 11 (Fig. 3(A), trace vi), but it is unable to install the hydroxyl group on C7 of 11 to provide 3. However, the slow spontaneous oxidation rate on 11 observed in our in vitro assays was only able to result in a trace amount of 3 and may not be able to account for our previous observations where only the hydroxylated products were isolated in the original producing isolate, Streptomyces sp. MA37 (MA37) and the heterologous expression strain, S. albus:7G1.18 This suggests that the final hydroxylation at the C7 position in some bacterial PAs results from an endogenous redox reagent present (i.e. a promiscuous hydroxylase or even hydrogen peroxide) in the host cells.
To confirm this, we set out to investigate whether we are able to observe efficient hydroxylation on 3 in the E. coli whole cell biotransformation. The benefit of cell-based biotransformation lies in the ability to utilize multistep cascade reactions linked to oxidative pathways, as many biochemicals require consecutive enzymatic modifications for effective synthesis and functionality.36 Another benefit during the synthesis is that coenzymes can be provided through intracellular metabolism, or reducing potential can be supplied through cofactor generation in cells with high volumetric activity.36 To this end, we first utilized E. coli BL21(DE3) harbouring the overexpression construct pColdTF-lgnC as the catalyst.18 Overnight incubation of compound 8 in the cell-based system led to its complete conversion to compounds 11 and 3, as confirmed by HPLC analysis (Fig. 3(B), trace iv). In a control experiment, where BL21(DE3) cells harboring the empty plasmid (BL21/pCold-TF) were used, no conversion of 8 was observed, indicating that the transformation to 11 and 3 requires recombinant LgnC (Fig. 3(B), trace v). Interestingly, when 11 was added to both the BL21/pCold-TF-LgnC (Fig. 3(B), trace ii) and BL21/pCold-TF (Fig. 3(B), trace iii) systems, it was efficiently converted to compound 3. This demonstrates that the final hydroxylation of 11 to 3 is independent of recombinant LgnC but can be mediated by endogenous redox reagents. The 11-to-3 ratio in the cell-based system was approximately 3:1, suggesting that the conversion to 3 is more efficient in the whole-cell system compared to in vitro assays.
In contrast, incubation of 8 in cell-free lysates containing overexpressed LgnC-TF resulted primarily in the accumulation of 11, with only trace amounts of 3 (Fig. S5, ESI†). Additionally, no conversion of 8 to 11 or 3 was observed in control lysates from BL21(DE3) containing the empty pColdTF plasmid, in line with previous in vitro observations (Fig. 3(A)).
These results suggest that the complete conversion of 8 to 11 is catalyzed by recombinant LgnC, whereas the final hydroxylation step to form 3 is likely driven by an endogenous hydroxylase or hydrogen peroxide produced by E. coli BL21(DE3) under aerobic conditions.37 The diminished production of hydroxylated products in cell-free lysates may be due to reduced activity of this endogenous redox system.
Attempts to crystallize recombinant LgnC-TF were made owing to its unusual multistep catalysis. Despite repeated trials, we were unable to obtain a crystal structure of recombinant LgnC-TF. Nonetheless, we sought to understand the ability of LgnC to have a bound FAD cofactor by using the recently developed AlphaFold 3 algorithm.38 The predicted structure of LgnC with a bound FAD cofactor was then submitted to the Dali server39 for 3D structural comparison. The predicted LgnC structure displays 26% structural similarity (Z-scores of 35.2, rmsd value of 2.8) to AbsH3, an FAD-dependent reductase from the abyssomicin biosynthetic pathway,40 the highest %id among other structures of FAD-dependent oxidoreductases. Similar to AbsH3, LgnC appears to have two “domains” (Fig. S6, ESI†), site A containing residues 5–73, 90–176, 270–345, and 370–381, and site B containing residues 70–90, 177–269 and 346–369 (Fig. S6, ESI†). The arrangement of bound FAD in site A is similar to AbsH3 (Fig. S7, ESI†). One cavity was predicted by CAVER software41 in close proximity to the quinone of the FAD cofactor (Fig. S8, ESI†). This void appears to link channels to facilitate the transport of substrates and products in site B (Fig. S8, ESI†). We then conducted docking studies on legonindolizidine A 8 and the transient 5+7 7a to gain insights into how LgnC handles the substrate and intermediate.42 In substrate 8, the hydrophobic 5+6 ring system appears to be inserted into a pocket between the indole ring of W221 and the quinone moiety of the FAD cofactor through hydrophobic interaction with W221 (Fig. 4(A)). The oxygen amide at the isovaleryl motif of 8 also engages in hydrogen bonding interactions with H200 and NH of W221 (Fig. 4(A)). Such an insertion of 8 between FAD and W221 is likely to be further supported by the hydrogen bonding interactions between the oxygen at C1 of 8 and the hydroxyl group of Y219 (Fig. 4(A)). While the overall position of 1,3 oxazepine carbamate intermediate 7a remains similar to 8, and Y219 also forms H-bonding with the oxygen at C1 of 7a, but there was variability observed in the docking results in the initial trajectory and binding position of 7a (Fig. 4(B)). This variability could account for the ring expansion from the original 8 to the transient 1,3-oxazepine carbamate intermediate 7a. We observed that, while the amide oxygen of the isovaleryl motif is likely to form the H-bonding interaction with S48, NH of W221 and H200 instead forms the H-bonding interactions of the carbonyl oxygen of the carbamate motif (Fig. 4(D)). This suggests that NHs of H200 and W221 may form an oxyanion hole to stabilise the transient intermediate after the nucleophilic attack initiated by the FAD-C4a-hydroperoxide ion for the subsequent structural rearrangement (Fig. S9, ESI†).
Interestingly, we also observed that residues R207, T295, and Q350 are in close proximity to the exit tunnel and within 6 Å distance of the H200 and W221 (Fig. S8D, ESI†). Such hydrophilic surroundings may position one water molecule close to the carbonyl carbon of 1,3 oxazepine carbamate intermediate 7a to facilitate the ring opening, followed by the final decarboxylative ring contraction.
To investigate the catalytic roles of the above identified residues, we performed site-directed mutagenesis (SDM). Changing His200 and Trp221 to Ala, resulting in protein variants H200A and W221A (Fig. S10, ESI†), respectively, completely abolished the enzymatic activities of LgnC from 8 to 11 (Fig. 4(C)). The protein variant, Y219F, lost 95% activity compared to the wild-type (WT) LgnC. While these three residues are absent in other BVMOs, they are highly conserved in other LgnC BVMO homologues (Fig. S11, ESI†), strongly suggesting that they indeed are key to the biotransformation of oxygen insertion from [5+6] indolizidines to [5+7] 1,3 oxazepine carbamate intermediates. While protein variants S48A and T295A (Fig. S10, ESI†) retained almost the same activity as the WT LgnC, the enzyme activities were reduced to approximately 50% and 30% compared to the WT LgnC in the assays of protein variants R207Q and Q350A (Fig. S10, ESI†), respectively. Partial loss of enzymatic activities in protein variants, R207Q and Q350A, suggested that these residues may have some impact on the enzyme activity (possibly water activation).
In conclusion, we characterized the enzyme activity of the BVMO, LgnC, that catalyses the conversion of legonindolizidine A 8 to prelegonmycin A 11, the last intermediate in the biosynthesis of legonmycin A. The final hydroxylation at the C7 position of 11 is likely to originate from either spontaneous oxidation by hydrogen peroxide or an endogenous hydroxylase in bacterial strains. Structural modelling of LgnC with FAD cofactor and subsequent substrate docking enabled the identification of key amino acid residues in the putative active sites. Our SDM experiments revealed that Y219F, H200A, and W221A play important roles in biotransformation, which are likely to position legonindolizidine A 8 to facilitate the proposed Baeyer–Villiger ring expansion. Further insights into the molecular mechanism will require protein structures of LgnC or its homologues and the protein complexes with the corresponding substrates.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4cb00186a |
This journal is © The Royal Society of Chemistry 2024 |