Aurelio Mollo‡
a,
A. Nikolai von Krusenstiern‡a,
Joshua A. Bulosa,
Veronika Ulrichb,
Karin S. Åkerfeldta,
Max J. Cryle*bcd and
Louise K. Charkoudian*a
aDepartment of Chemistry, Haverford College, Haverford, PA 19041, USA. E-mail: lcharkou@haverford.edu
bDepartment of Biomolecular Mechanisms, Max Planck Institute for Medical Research, Jahnstrasse 29, 69121 Heidelberg, Germany. E-mail: max.cryle@monash.edu
cEMBL Australia, Monash University, Clayton, Victoria 3800, Australia
dThe Monash Biomedicine Discovery Institute, Department of Biochemistry and Molecular Biology, ARC Centre of Excellence in Advanced Molecular Imaging, Monash University, Clayton, Victoria 3800, Australia
First published on 12th July 2017
Complestatin is a non-ribosomal heptapeptide belonging to the glycopeptide antibiotic family that displays anti-complement, anti-HIV integrase, neuroprotective, anti-apoptotic, and antibacterial activities. Development of complestatin as a pharmaceutical agent and biological probe has been stymied by the difficulty in isolating the molecule from its natural source, Streptomyces lavendulae, as well as by the uneconomical and environmentally unfriendly nature of current total synthetic routes. Of particular challenge to the organic chemist is the installation of the biaryl and aryl–ether–aryl linkages that provide complestatin with the structural rigidity responsible for its potent pharmacological properties. Herein, we demonstrate that ComJ, a P450 monooxygenase from the complestatin biosynthetic gene cluster, can catalyse phenolic cross-linking of amino acid side chains in vitro. ComJ acts with high efficiency and low substrate stereoselectivity, a finding which paves the way towards the use of ComJ as a biocatalyst for the chemo-enzymatic synthesis of complestatin and other related molecules. The ability of ComJ to accept peptides of alternative stereochemistries raises intriguing questions about the evolutionary origins of glycopeptide antibiotic biosynthesis and the capacity of S. lavendulae to produce different conformations of complestatin.
Fig. 1 The complestatin NRPS is composed of 29 distinct domains distributed across four proteins (ComA–D). The domains are arranged into seven functional modules (M1–M7), each responsible for the incorporation of one amino acid into the growing chain; putative intermediates found at each module are shown tethered to their cognate carrier protein domains. When the growing chain reaches the terminal module as an α-ketoacyl hexapeptide, it is chlorinated by in trans-acting halogenases (not shown),35 then cyclised by the in trans-acting P450s ComI and ComJ to yield fully active complestatin (boxed). Bioinformatic analysis suggests that M6 contains an additional E domain (dashed) that was overlooked in the original annotation (see text):32 the structure of complestatin is shown as published, but the stereochemistry of the tyrosine residue remains unclear. A = adenylation domain; PCP = peptidyl carrier protein domain; C = condensation domain; E = epimerisation domain; Mt = N-methyltransferase domain; L = extended linker domain; TE = thioesterase domain; M = module. |
Among NRPs, GPAs are unique in that they are characterised by a high percentage of non-proteinogenic amino acids as well as synthetically challenging biaryl and biaryl ether linkages.1,2 These unusual cross-links, absent from traditional ribosome-assembled peptides, play important roles in rigidifying the peptide scaffold into potent bioactive conformations. In vivo, the biaryl and biaryl ether linkages are installed stereo- and regioselectively by dedicated P450 monooxygenases (known as “Oxy enzymes”),3 which are recruited to the terminal module of the NRPS by the adjacent X-domain, a conserved domain present in the terminal module of all GPA NRPSs.4 Recent results indicate that the Oxy enzymes undergo a continuous association/dissociation process with the X-domain, only to “lock in” upon recognition of their cognate substrate and thus catalysing side chain cross-linking in a defined order.5 Due to their potential use as biocatalysts, Oxy enzymes from many different GPA systems have been structurally and biochemically characterised to date.6–11 In the future, these enzymes could prove to be useful tools for enabling access to novel structural and chemical diversity, including molecules that contain biaryl motifs that would otherwise be challenging to synthesise via classical approaches.12
Complestatin (Fig. 1) is an atypical GPA isolated in the early 1980s from the soil bacterium Streptomyces lavendulae.13 It possesses a remarkable range of pharmacological properties, including anti-complement,13 anti-HIV integrase,14,15 neuroprotective,16,17 anti-apoptotic,18 and antibacterial19 effects. Because of the presence of an aryl–ether–aryl–aryl moiety cross-linking three of its amino acid side chains, total synthetic approaches towards complestatin remain arduous. Current routes in fact require in excess of 20 steps and involve the use of large amounts of metal catalysts, together rendering these routes economically and environmentally unsustainable for industrial scale productions.2,20–22 A greener, more efficient route towards the synthesis of this molecule and its analogues, which may be used as biological probes or as lead compounds in drug development, is greatly needed.
Complestatin's regulatory and biosynthetic gene cluster (BGC) was first sequenced and annotated in 2001. At the time it was inferred based on homology that two enzymes, called ComI and ComJ, are involved in cyclising the linear heptapeptide to yield fully active complestatin (Fig. 1).23 More recently, it was shown through gene disruption studies and subsequent analysis of cell extracts that ComI and ComJ are indeed responsible for installing the biaryl and phenolic bridges, respectively, with ComJ acting prior to ComI.24 In spite of these advances, however, ComI and ComJ have not been shown to be active in vitro, a result critical to further developing these enzymes into useful biocatalysts. Previous work in the field has suggested that an environmentally friendly route might involve the synthesis of the linear form of complestatin, followed by P450-mediated cross-linking,4,25 thus exploiting the remarkable biosynthetic capability of P450 enzymes to install the synthetically-challenging crosslinks. Towards this goal, we herein demonstrate that ComJ is an active P450 enzyme that catalyses aryl–ether–aryl bond formation in vitro with high efficiency and low substrate stereoselectivity, defining it as a potential biocatalyst for the chemo-enzymatic reconstitution of complestatin and related molecules.
In preparation for spectral assays, ComI and ComJ were further purified in a two-step process consisting of anion exchange chromatography (AEC) and size-exclusion chromatography (SEC). Initially, the proteins were buffer exchanged with AEC buffer A (50 mM Tris–HCl pH 7.4, 20 mM NaCl) using Illustra NAP-25 columns (GE Healthcare, Chalfont St Giles, UK) and then concentrated using Vivaspin® centrifugal concentrators (30000 MWCO; Sartorius, Göttingen, Germany). AEC was performed using a 6 mL Resource™ Q column (GE 17–1179) equilibrated with AEC buffer A and connected to an Äkta purifier 10 system (GE) at RT. The protein was applied onto the column, which was then washed with 5 column volumes (CV) of AEC buffer A, before the protein was eluted over a gradient from 0 to 100% AEC buffer B (50 mM Tris–HCl pH 7.4, 1 M NaCl) over 20 CV. The elution fractions were analysed by SDS-PAGE, with relevant fractions pooled and concentrated prior to SEC. For SEC, protein was loaded onto a Superose 12300 mL column (GE 17-5173) equilibrated using SEC buffer (50 mM Tris–HCl pH 7.4, 150 mM NaCl) and connected to an Äkta purifier 10 system at RT. The protein was eluted using 50 mM Tris–HCl pH 7.4, 150 mM NaCl, and analysed by SDS-PAGE. Relevant fractions were pooled, and the protein portioned into aliquots prior to being flash-frozen in liquid N2 and stored at −80 °C.
We next evaluated whether ComI and ComJ interact with PCP domains, X-domains, and/or PCP-X didomains from the 7th module of the complestatin NRPS, thus opening up the possibility of a biomimetic route to synthesis of the molecule.
Monomeric Xcom (X-domain from M7 of complestatin NRPS; see Fig. 1), PCP7com (PCP domain from M7 of complestatin NRPS; see Fig. 1), and heterodimeric PCP7-Xcom constructs were designed and cloned into modified pET vectors, containing maltose-binding protein (MBP) to enhance solubility and expression titers, as detailed in Section 2.2.
Protein–protein interactions were next evaluated by analytical size-exclusion chromatography (SEC) with simultaneous monitoring of the absorption at λ = 280 nm for protein absorption and at λ = 415 nm for the heme-specific (P450) absorption. The results (Fig. 3) indicate that ComI and ComJ both interact tightly with MBP-PCP7-Xcom. In addition to the above-mentioned interaction, both P450s exhibited tight interaction with monomeric Xcom constructs (Fig. S2†) but not with monomeric PCP7com constructs (Fig. S3†). Collectively, these data suggest that interaction of ComI and ComJ with the NRPS assembly line in vivo occurs primarily through the X-domain.5
Having established the ability of the catalytically competent P450s to dock onto the PCP7-Xcom didomain, we proceeded to testing the ability of ComJ to catalyse a cyclisation reaction (Fig. 4). Previous research by the Robinson Lab found that OxyBvan, the ComJ homologue from the vancomycin biosynthetic pathway, was able to successfully cyclise a tripeptide substrate in vitro representing the core of the cognate heptapeptide.31 Based on this precedent, we investigated the ability of ComJ to cyclise a tripeptide substrate representing the core of the complestatin heptapeptide (Fig. 4).
Fig. 4 Summary of the reaction flow from starting linear peptide to cyclised product. (A) Complestatin is a bismacrocyclic heptapeptide. Each ring is installed by a separate P450 enzyme (see Fig. 1). Inspired by the left-hand macrocycle, containing a biaryl ether bond (boxed), we constructed epimeric linear tripeptides (1, 2) by solid-phase peptide synthesis (SPPS). Following cleavage from the resin, the peptides (R = OH) were converted to CoA conjugates (R = SCoA) via a thiophenol intermediate (R = SPh), before being loaded onto an MBP fusion construct of the PCP7-X didomain from the complestatin NRPS (R = S-Ppant-PCP7-Xcom; Ppant structure shown in part B). (B) Peptidyl-PCP, shown top, was treated with ComJ and other proteins required to provide a sustained electron flow to the P450 heme. Following biaryl ether bond installation over a 1 h period, the cyclised product was cleaved using methylamine, purified by solid-phase extraction (SPE), and detected by LC-MS. See Section 2.7 for complete procedure. |
In addition to testing the cyclisation efficiency of D-Hpg-D-Hpg-L-Tyr (1)—the tripeptide substrate based on the published structure of complestatin—we also evaluated the epimeric tripeptide D-Hpg-D-Hpg-D-Tyr (2). The rationale behind this was two-fold. First, a recent bioinformatics analysis of the complestatin BGC revealed the presence of an additional, predicted epimerisation domain in module 6 of the complestatin NRPS that was not reported in the original annotation32 (Fig. 1), which suggests possible enzymatic conversion of the L-Tyr residue to D-Tyr during biosynthesis. Second, alternative conformations of complestatin and the existence of structurally related natural products from Streptomyces isolates have been reported in the literature.2 Together, these findings inspired us to explore the stereoselectivity of ComJ.
The target peptides were constructed using Fmoc/tBu-based SPPS on the Wang resin. Due to the high susceptibility of Hpg residues to racemisation during peptide synthesis, the Fmoc-deprotection and coupling protocols were optimised to minimise the length and intensity of exposure of the peptides to basic conditions (see Section 2.6).28 The peptides were cleaved from the resin, then converted to their CoA conjugates via a thiophenol intermediate before being loaded onto the PCP7-Xcom didomain in an Sfp-dependent reaction.33 After incubating peptidyl-PCP with the relevant cytochrome P450 enzyme for 1 h at 30 °C, turnover was evaluated via methylamine-assisted cleavage, followed by solid-phase extraction (SPE) and LC-QTOF-MS detection (Fig. 4).
The robustness of the coupled turnover and LC-MS detection protocol was tested by performing a positive control for turnover based on the results of Robinson and co-workers. Thus, the L-Tyr-containing tripeptide (1) was loaded onto MBP-PCP7-Xvan25 and OxyBvan added as the P450. This resulted in 79% turnover (Fig. S4†) in agreement with published results.31
Next, the biocatalytic activity of ComJ was tested by performing turnover of 1 loaded onto MBP-PCP7-Xcom. As shown in Fig. 5, 1 was turned over by ComJ in 48% efficiency, with spinach ferredoxin (SpFd) providing the reducing equivalents to ComJ. This constitutes the first direct observation of ComJ activity in vitro. In addition to turning over 1, ComJ was also found to turn over the D-Tyr-containing tripeptide 2 with a slightly higher efficiency of 60% (P < 0.001, Fig. 5E), suggesting that ComJ may possess a slight preference for D- over L-Tyr. The finding that ComJ can accept an epimerised peptide substrate suggests that an alternative conformation of complestatin is in fact a feasible product of the complestatin gene cluster.
Fig. 5 Results of turnover studies on 1 (D-Hpg-D-Hpg-L-Tyr) and 2 (D-Hpg-D-Hpg-D-Tyr). (A) Schematic showing turnover of PCP-X—bound tripeptide by ComJ. The linear form of the peptide (post-methylamine cleavage) has an m/z of 533.2 (negative mode), whereas the cyclised form has an m/z of 531.2, 2 Da lower owing to the loss of two hydrogen atoms. (B) Left: UV trace showing the decrease in linear peptide concentration in the presence of ComJ and either ComK (red dotted line) or SpFd (green dashed line) for 1 loaded on MBP-PCP7-Xcom. Right: extracted ion count (EIC) showing the concomitant increase in cyclised product. (C) Left: UV trace showing the decrease in linear peptide concentration in the presence of ComJ and either ComK (blue dotted line) or SpFd (orange dashed line) for 2 loaded on MBP-PCP7-Xcom. Right: EIC showing the concomitant increase in cyclised product. In both B and C, solid black lines represent negative (no ComJ) controls. (D) Mass spectra for linear peptide (left, m/z = 533.203, expected: 533.204) and cyclised product (right, m/z = 531.187, expected: 531.188). (E) Bar graph summarising the turnover percentages of 1 and 2 by ComJ in the presence of SpFd or ComK; the presenting PCP construct for all ComJ reactions was MBP-PCP7-Xcom. The positive control with OxyBvan as the P450 and MBP-PCP7-Xvan as the peptide presentation platform is shown for comparison (see Fig. S4† for relevant spectra). All experiments were performed in triplicate and turnover percentages computed by quantifying the decrease in linear starting material concentration relative to the negative (no Oxy) controls. Significant differences between means were computed using the unpaired t-test. *P < 0.05, ***P < 0.001, ****P < 0.0001. |
We next tested the possibility of a tight ComJ-Xcom interaction hindering catalysis, as we observed slightly lower extent of conversion of the linear starting materials (1, 2) into cyclised products (3, 4) by ComJ compared to the OxyBvan control (P < 0.0001 for both 1 and 2, Fig. 5E). It was recently discovered in fact that the cytochrome P450 enzyme StaH involved in the biosynthesis of the GPA A47934 possesses low catalytic efficiency, and that when Xsta (X-domain from A47934 NRPS) is exchanged with Xtei (X-domain from teicoplanin NRPS), the P450 activity increases dramatically, which suggests that an overly tight interaction between StaH and Xsta hinders catalytic efficiency in vitro.27 However, in our case, exchanging MBP-PCP7-Xcom with MBP-PCP7-Xvan did not result in any significant increase in turnover (Fig. S5†), allowing us to exclude this hypothesis in this particular case.|| Increasing the reaction time for turnover also did not result in more cyclised product (Fig. S5†).
The fully annotated complestatin BGC (GenBank: AF386507.1) encompasses comK, whose protein product was predicted based on homology to be a ferredoxin.23 Thus, we decided to test the relative efficiency of ComK versus SpFd in transferring reducing equivalents to ComJ. Expressing ComK in TB medium supplemented with 50 μM FeSO4 (see Section 2.2) resulted in brown protein fractions, consistent with the presence of a chromophoric [2Fe–2S] cluster absorbing at 422 nm (Fig. S6†).34 As shown in Fig. 5, replacing SpFd with ComK resulted in ∼20% turnover for both L- and D-Tyr-containing tripeptides, with the turnover rate for the latter being slightly higher (P < 0.05, Fig. 5E). This represents the first observation of ComK acting as a ferredoxin, although spinach ferredoxin remains superior with respect to turnover efficiency. One hypothesis suggested by Khosla and co-workers23 is that in vivo, ComK is likely to interact with a specific electron source encoded outside the complestatin BGC. We speculate that the characterisation and subsequent inclusion of this oxidoreductase in the turnover mixture might enable higher rates of turnover, comparable if not superior to those achieved with spinach ferredoxin.
Turnover was also attempted using longer, non-chlorinated heptapeptides, but no turnover could be detected (data not shown). Recent results suggest that chlorine atoms may play a role in helping orient peptide substrates in the polar P450 active site,35 and therefore it is possible that the non-chlorinated heptapeptides were simply too non-polar to interact favourably with the P450 enzyme. With that in mind, future studies should investigate the activity on substrates that more closely resemble the natural substrates for cyclisation.
The experiments presented here focused on ComJ as a biaryl ether-installing P450, but the UV/vis and SEC data reveal that ComI also possesses all the characteristics of a competent P450 that is able to interact productively with the NRPS machinery. Future studies will need to investigate ComI activity—especially in conjunction with prior cyclisation by ComJ—thus completing the picture of complestatin biosynthesis and paving the way towards a chemo-enzymatic synthesis of the molecule and potential derivatives.
A | Adenylation domain |
ACN | Acetonitrile |
AEC | Anion exchange chromatography |
BGC | Biosynthetic gene cluster |
C | Condensation domain |
CoA | Coenzyme A |
DBU | 1,8-Diazabicyclo[5.4.0]undec-7-ene |
DCM | Dichloromethane |
DIC | N,N′-Diisopropylcarbodiimide |
DIPEA | N,N-Diisopropylethylamine |
DMAP | 4-Dimethylaminopyridine |
DMF | N,N-Dimethylformamide |
E | Epimerisation domain |
EcoFlvR | E. coli flavodoxin reductase |
EIC | Extracted ion count |
FA | Formic acid |
GPA | Glycopeptide antibiotic |
G6P | Glucose-6-phosphate |
G6P-DH | Glucose-6-phosphate dehydrogenase |
HOBt | Hydroxybenzotriazole |
L | Extended linker domain |
M | Module |
MALDI | Matrix-assisted laser desorption ionisation (mass spectrometry) |
MBP | Maltose-binding protein |
Mt | N-Methyltransferase domain |
NADPH | Nicotinamide adenine dinucleotide phosphate |
NRP | Non-ribosomal peptide |
NRPS | Non-ribosomal peptide synthetase |
PCP | Peptidyl carrier protein domain |
Ppant | 4′-Phosphopantetheine |
PyBOP | (Benzotriazol-1-yl-oxy)tripyrrolidinophosphonium hexafluorophosphate |
Q-TOF LC/MS | Quadrupole time-of-flight liquid chromatography/mass spectrometry |
SEC | Size exclusion chromatography |
SPE | Solid-phase extraction |
SpFd | Spinach ferredoxin |
SPPS | Solid-phase peptide synthesis |
tBu | tert-Butyl (protecting group) |
TE | Thioesterase domain |
TEA | Triethylamine |
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c7ra06518c |
‡ These authors contributed equally to this work. |
§ It was observed that expressing ferredoxins in TB medium resulted in higher yields compared to expressing the same proteins in LB medium. |
¶ The decision to start from free Wang resin instead of D-Tyr-loaded Wang resin in the synthesis of 2 was based on the much lower purchase price of free Wang resin compared to the corresponding pre-loaded resin. |
|| Although PCP7van was exchanged for PCP7com, effects from direct PCP-Oxy binding can be excluded given that ComJ does not appear to bind to the PCP7 domain but only to the X-domain (Fig. S2 and S3†). |
This journal is © The Royal Society of Chemistry 2017 |