Tan
Liu‡
a,
Xiaojie
Ma‡
a,
Jiahui
Yu
a,
Wensheng
Yang
b,
Guiyang
Wang
a,
Zhengdong
Wang
a,
Yuanjie
Ge
a,
Juan
Song
a,
Hua
Han
b,
Wen
Zhang
b,
Donghui
Yang
a,
Xuehui
Liu
*c and
Ming
Ma
*a
aState Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University, 38 Xueyuan Road, Haidian District, Beijing 100191, China. E-mail: mma@bjmu.edu.cn
bSchool of Medicine, Tongji University, 1239 Siping Road, Shanghai 200092, China
cCAS Research Platform for Protein Sciences, Institute of Biophysics, Chinese Academy of Sciences, 15 Datun Road, Chao-yang District, Beijing 100101, China. E-mail: xhliu@ibp.ac.cn
First published on 10th August 2021
Lasso peptides are a unique family of natural products whose structures feature a specific threaded fold, which confers these peptides the resistance to thermal and proteolytic degradation. This stability gives lasso peptides excellent pharmacokinetic properties, which together with their diverse reported bioactivities have garnered extensive attention because of their drug development potential. Notably, the threaded fold has proven quite inaccessible by chemical synthesis, which has hindered efficient generation of structurally diverse lasso peptides. We herein report the discovery of a new lasso peptide stlassin (1) by gene activation based on a Streptomyces heterologous expression system. Site-directed mutagenesis on the precursor peptide-encoding gene is carried out systematically, generating 17 stlassin derivatives (2–17 and 21) with residue-replacements at specific positions of 1. The solution NMR structures of 1, 3, 4, 14 and 16 are determined, supporting structural comparisons that ultimately enabled the rational production of disulfide bond-containing derivatives 18 and 19, whose structures do not belong to any of the four classes currently used to classify lasso peptides. Several site-selective chemical modifications are first applied on 16 and 21, efficiently generating new derivatives (20, 22–27) whose structures bear various decorations beyond the peptidyl monotonicity. The high production yields of these stlassin derivatives facilitate biological assays, which show that 1, 4, 16, 20, 21 and 24 possess antagonistic activities against the binding of lipopolysaccharides to toll-like receptor 4 (TLR4). These results demonstrate proof-of-concept for the combined mutational/chemical generation of lasso peptide libraries to support drug lead development.
However, different from many other peptides that can be prepared by solid-phase peptide synthesis (SPPS), lasso peptides have proven quite inaccessible by chemical synthesis. Nevertheless, the identification and engineering of lasso peptide biosynthetic pathways from native producer organisms—as well as their subsequent chemical modifications—should help to generate potentially many structurally diverse lasso peptides for basic studies and for drug development. Notably, genome mining studies have suggested that the number of candidate biosynthetic gene clusters (BGCs) far exceeds the number of lasso peptides structurally characterized to date (currently around 80 structures).12–14 As with the BGCs of many natural products, it can be challenging to culture the native producer cells; moreover, the lack of relevant biological understanding often prevents the transcriptional activation of a BGC in its native producer.
In lasso peptide biosynthesis, the precursor peptide A (consisting of a leader peptide and a core peptide) is first processed by a protease B, which cleaves off the leader peptide; in the biosynthesis of some lasso peptides from Actinobacteria and Firmicutes phyla, protease B is an enzyme complex comprising protein B1 (an example of the RiPP precursor peptide recognition element, RRE)14,15 and protein B2. After cleavage of the leader peptide, the so-called “core peptide” is the substrate for N-terminal macrolactam ring formation as catalyzed by a cyclase C enzyme, thereby generating the mature lasso peptide product (Fig. 1B). Although the precise catalytic mechanisms employed by the B/C proteins remain uncharacterized, there are reports indicating that they display some degree of substrate promiscuity.16–22 This presents an opportunity for using genetic engineering approaches to efficiently generate lasso peptide derivatives; that is, it should be possible to simply substitute amino acid residues of the core peptide, provided such substitutions can be tolerated by downstream processing B/C proteins.
Fig. 1 The biosynthetic gene cluster stla and the proposed biosynthesis of stlassin (1). (A) The biosynthetic gene cluster stla from the marine Streptomyces sp. PKU-MA01240. The amino acid sequence of the precursor peptide StlaA is displayed, with the leader peptide in black, the N-terminal macrolactam residues of the core peptide in blue (except the macrolactam-forming residue Asp8 in red), and the C-terminal residues of the core peptide in green. The functional annotations of numbered genes are shown in ESI Table S1.† The region between two BglII restriction sites indicates the DNA fragment used in the heterologous expression (some parts of genes −13 and 4 were not included). (B) The proposed biosynthetic pathway of 1, with StlaB1/B2/C together responsible for the pre-folding of the precursor peptide StlaA (leader peptide in black, and core peptide in blue, red and green), StlaB2 responsible for proteolysis of StlaA to cleave the leader peptide from the core peptide, and StlaC responsible for the formation of the N-terminal macrolactam ring from the core peptide to generate mature stlassin. |
Recalling the aforementioned problems when working with BGCs from native producers, it is unsurprising that such peptide-encoding gene mutations have been carried out using E. coli as a heterologous host. However, the production yields for lasso peptides (and engineered derivatives) are often very low in E. coli, limiting the amount of material available for biological activity assays and synthetic modification studies. Thus, and given the reports about the successful site-selective chemical modification of peptides and proteins under mild conditions (e.g., O-glycosylation of phenolic functionality and S-arylation of cysteine residues in aqueous solution at room temperature),23–25 we were interested in the idea of generating substantial structural diversity in lasso peptide derivatives by combining an efficient heterologous expression system with site-selective chemical modifications of biosynthetic products.
Pursuing this, we report here the discovery of a new class II lasso peptide (stlassin, 1) based on heterologous expression and activation of a BGC (stla) from the marine Streptomyces sp. PKU-MA01240. Using a Streptomyces heterologous expression system, we mutationally substituted each position of the core peptide StlaA and thereby simultaneously generated new lasso peptide derivatives (2–17 and 21), and identified key residues involved in the post-translational processing of stlassin peptides by the B/C enzymes of the stla BGC. Moreover, we determined solution NMR structures for 1, 3, 4, 14, and 16, which supported structural comparisons that ultimately enabled the rational design and successful production of new disulfide bond-containing derivatives 18 and 19, whose structures do not belong to any of the four classes currently used to classify lasso peptides.5,14 Our structural insights also supported the identification of modification-accessible residues (tyrosine or cysteine) at specific core peptide positions, and we successfully achieved site-selective chemical modifications at these sites, generating new derivatives (20 and 22–27) with O-glycosylation, S-glycosylation, S-arylation, disulfide bond formation, in addition to both the cleavage and formation of C–S bonds. Finally, we harnessed the high production yields from our heterologous production and chemical modification efforts and conducted biological activity assays: these revealed antagonistic activities of 1, 4, 16, 20, 21, and 24 against the binding of lipopolysaccharides (LPS) to toll-like receptor 4 (TLR4). Our study thus identifies an exciting class of lasso peptides and illustrates the powerful combination of biosynthetic gene-manipulations and chemical modifications for the generation of structurally diverse and bioactive lasso peptide molecules.
Given that no lasso peptide product was detected in cultures of S. sp. PKU-MA01240 assessed with HPLC or LC-MS, and considering that this strain does not readily accept exogenous plasmids, we explored the heterologous expression of the stla cluster in experimentally tenable Streptomyces hosts. Using RecET direct cloning and the Redαβ recombineering system,30,31 we constructed a plasmid (pMM2002) bearing the 20.3 Kb fragment covering the stla gene cluster (Fig. 1A and S2†). pMM2002 was integrated into S. coelicolor A3(2) and S. lividans K4-114,28 but no lasso peptide was detected by HPLC or LC-MS analyses of cultures grown using three different media (Fig. S3†), suggesting that the stla cluster may not be expressed in these heterologous hosts.
Seeking to activate the stla cluster, a constitutive promoter KasOp*, which has been used for the activation of other natural product biosynthetic gene clusters,32–34 was cloned into the upstream region of the stlaA locus, generating the plasmid pMM2004 (Fig. S4†). pMM2004 was integrated into S. coelicolor A3(2) and S. lividans K4-114 and HPLC analysis showed a product from both strains (∼80% amount in the supernatant and ∼20% amount in cells, Fig. S5†) cultured in all three tested media, with the highest yield detected for S. coelicolor A3(2) cultured in M2 medium (ESI and Fig. S3†). MS analysis of the product showed an [M–H]− ion at m/z 1694.8662 (Table S4 and Fig. S16†). This ion would be consistent with the aforementioned proposed product from the stla cluster. Subsequently, a 5 L fermentation and isolation using a combination of chromatographic methods yielded 51 mg of this compound (named stlassin, 1, Fig. 2). Currently, heterologous expression of actinobacterial lasso peptide clusters in E. coli is difficult. Given the increasing discovery of lasso peptides from Streptomyces,35–41 the Streptomyces expression system here could benefit the investigation of interesting clusters from this bacterial genus.
The sufficient amount of 1 generated from the heterologous expression enabled us to determine its solution NMR structure. The structure of 1 shows a typical right-handed lasso fold. The N-terminal macrolactam ring is formed between Leu1 and Asp8, and the C-terminal residues thread through the macrolactam ring (Fig. 3A). Note that in so-called class II lasso peptides, a defining feature is the steric lock that keeps the C-terminal tail from escaping the confines of the macrolactam ring;5 this differs from the disulfide bond present in class I, III, and IV lasso peptides (Fig. 4B). The structure of 1 obeys the conserved feature in class II, with bulky residues including Trp14 and Phe15, which protrude in opposite directions under the macrolactam ring (Fig. 3A).
Cells expressing the L1A and A7G variants produced lasso peptide derivatives L1A (10) and A7G (11), which the HPLC data suggested to be produced at much reduced yields compared to 1. Even lower product yields were detected from cells expressing the D8E, W9A, and W9R variants (these putative products could only be detected by LC-MS, Fig. 2, S8 and S27†). Later experiments showed that cells expressing W9F and W9Y variants produced higher yields of products (named stlassins W9F (12) and W9Y (13)) compared to products from cells expressing the W9A and W9R variants. These results suggest that an aromatic side chain at this position may somehow support the post-translational processing of these peptides, or help maintain the threaded lasso products to avoid degradation before export out of cells.
No product was detected for cells expressing G13A or G13S variants (Fig. 2 and S8†). The solution NMR structure of 1 shows that the Gly13 residue is located exactly in the middle of the macrolactam ring (Fig. 3A). Thus, any residue with a side chain at this position may generate a steric effect, which may prevent proper pre-folding. Mutations of the W14 and F15 residues suggested that aromatic side chains may be required for the pre-folding step of stlassin production: briefly, only low-abundance products were detected from cells expressing the W14A and F15A variants, whereas cells with W14F, W14Y, F15Y and F15W produced lasso peptide derivatives named stlassins W14F (14), W14Y (15), F15Y (16) and F15W (17). Collectively, these mutational studies revealed a rich diversity of stlassin derivatives and highlighted individual residues of the core peptide that exert pronounced effects on the post-translational processing of the precursor peptide StlaA or the stability of the mature threaded lasso products.
Our NMR data indicated similar right-handed lasso folds for 3, 4, 14, and 16 (Fig. 3B–E) to that in 1, and structural superimposition indicated that the five main chains overlap well (Fig. S12A†). In contrast, the side chains of Gln6, Trp9, Asn10, Trp14 (or Phe14), and Phe15 (or Tyr15) have distinct positions in these various structures (Fig. S12B†). Interestingly, among these stlassin derivative structures (except 3), the side chains of Trp9 and Trp14 (or Phe14) are invariably positioned facing each other (Fig. 3), which likely help keep the C-terminal tail from escaping the confines of the macrolactam ring.
The 20 solution structures with the lowest total energy in each of 1, 3, 4, 14 and 16 are generally the same, with Trp9, Trp14, and Phe15 (or Tyr15 in 16) displaying the highest extent of flexibility (Fig. S11†). We tested the thermal and proteolytic stabilities of 1, 3, 4, 14, and 16 by heating experiments and carboxypeptidase Y-treatment experiments,5,21,42–44 and found that all of these compounds are stable, with the exception of stlassin V3A (3) which could be converted to its unthreaded counterpart upon heating to 95 °C for 3 h (ESI†). In the solution NMR structures of 1, 4, 14 and 16, the side chains of Val3 and Pro12 are positioned facing each other (with their shortest distances ranging from 3.7 Å to 4.4 Å) (Fig. 3A and 3C–E). However, for 3 there was a larger distance (5.5 Å) between Ala3 and Pro12 (Fig. 3B). More NOESY correlations are observed between Val3 and Pro12 in each of 1, 4, 14 and 16, whereas only one NOESY correlation is observed between Ala3 and Pro12 in 3, due to the mutation from Val3 to a smaller side chain-containing Ala3 (Fig. S13†). The V3A mutation has additional structural consequences: Gly13 in 3 is not located in the middle of the macrolactam ring, and the indole ring of 3's Trp14 is not facing the indole ring of Trp9 any more. Ultimately, these differences suggest weaker steric locking in 3 than in 1, 4, 14 and 16. This is another example of a residue in the macrolactam ring contributing to thermal stability, which is additional to the case that a specific P8A mutation in lasso peptide caulosegnin II causes higher macrolactam ring flexibility and subsequent loss of thermal stability.45
We generated double mutant StlaA variants by introducing two cysteine residues, seeking to generate derivatives of 1 having one disulfide bond positioned between the macrolactam ring and the C-terminal loop. If successful, this should generate a type of lasso fold that is outside the traditional four classes of lasso peptides. We excluded several residues including Asp8, Trp9, and Gly13 as potential cysteine residue replacement sites based on our results from the single mutant-variants. Given that the lengths of C–S and S–S bonds are about 1.82 Å and 2.07 Å, respectively, the distance between the two β-carbons of two cysteine residues in a disulfide bond should be smaller than 5.71 Å (1.82 Å × 2 + 2.07 Å).
We checked the solution NMR structures of 1, 3, 4, 14, and 16 and found four pairs of residues Leu1/Ala11, Val2/Ala11, Val3/Pro12, and Ala7/Pro12 whose β-carbons distances are smaller than 5.71 Å (Fig. S9†). We then generated double mutations of L1C/A11C, V2C/A11C, V3C/P12C and A7C/P12C, and fermented the strains bearing these mutations (MM20030-MM20033). HPLC and LC-MS analyses showed that cells expressing the V2C/A11C and V3C/P12C double cysteine mutant variants produced derivatives 18 and 19, with [M–2H]2− ions at m/z 863.8757 and 850.8723, respectively (Fig. S8, S34 and S35†). No products were detected in cells expressing the L1C/A11C and A7C/P12C variants (Fig. S8†), perhaps owing to the negative effects from the mutations of Leu1 and Ala7.
Large-scale fermentation and isolation of 18 supported solution NMR analysis: 18 has a right-handed lasso fold with a similar C-terminal steric lock to 1 (Fig. 3F). The disulfide bond between Cys2 and Cys11 causes distinct positions of most residues in 18 from those of 1, 3, 4, 14, and 15. The thermal and proteolytic stabilities of 18 were tested, showing similarly high stability to that of 1 (Fig. S6G†). The disulfide bond may decrease the flexibilities of both the macrolactam ring and C-terminal residues, thereby contributing to the thermal stability of 18. This is supported by the invariably positioned residues around the disulfide bonds in the 20 solution structures of 18 with the lowest total energy, particularly the C-terminal residues from Pro12 to the steric lock Trp14 (Fig. S11†).
Fig. 5 HPLC analysis of the reactions in the generation of 20 and 22–27 by the indicated site-selective chemical modifications (all achieved under mild conditions). |
Seeking to introduce a cysteine residue in the lasso peptide, we checked the solution NMR structures of 1, 3, 4, 14, 16, and 18, and selected Ile4 for mutation because (i) the I4A mutation gave a high yield of the product (4) and (ii) its position far from the C-terminal loop region may avoid potential steric effects. Cells expressing a I4C variant produced stlassin I4C (21) ([M–H]− ion at m/z 1684.7978) (Fig. S8 and S35†), which was isolated and incubated with α-D-fluoroglucose and Ca(OH)2 at room temperature in water for 1 h.24 We detected the glucosylation product stlassin I4C-I (22) ([M–H]− ion at m/z 1846.8480), but in a low yield (3%) (Fig. 4A and 5, lane V; Fig. S38†). Interestingly, a dimerization product stlassin I4C-II (23, [M–2H]2− ion at m/z 1683.7888) with an intermolecular disulfide bond was produced in this reaction, which suggests that the reaction conditions may provide an oxidative environment facilitating the disulfide bond formation (Fig. 4A and 5, lane V; Fig. S39†). To improve the glucosylation efficiency of 21, TCEP was added to prevent the dimerization, which led to an increased yield (24%) of 22 (Fig. 5, lane VI).
Beyond these glycosylation modifications, we also explored arylation of 21 by using the recently developed nickel(II)-promoted cysteine S-arylation reaction with 2-nitro-substituted arylboronic acid.25 Compound 21 was incubated with 2-nitrophenylboronic acid and Ni(OAc)2 in N-methylmorpholine (NMM) buffer at 37 °C for 30 min, generating the arylation product stlassin I4C-III (24, [M–H]− ion at m/z 1805.8148) in a high yield (93%) (Fig. 4A and 5, lane VIII; Fig. S40†). Note that a small amount of the aforementioned dimerization product 23 was also produced in this reaction (6%).
Previous studies have shown that some RiPPs such as lanthipeptides, thiopeptides, and goadsporins feature dehydroalanine (Dha) residue(s).48 Generation of Dha residues can result from the dehydration of serine by a dehydratase, but no dehydratase-encoding gene is present in lasso peptide biosynthetic gene clusters reported to date. However, it is possible to generate Dha residues through sulfur elimination from cysteine residues under mild conditions.49 Thus, we carried out the bis-alkylation–elimination reaction of 21 for the conversion of Cys4 to Dha4 using the reagent α,α′-dibromo-adipyl(bis)amide. When compound 21 was treated with α,α′-dibromo-adipyl(bis)amide and K2CO3 for 30 min at room temperature, followed by an additional 4 h at 37 °C, the Dha4-containing product stlassin I4C-IV (25, [M + H]+ ion at m/z 1652.8258) was generated in a high yield (91%) (Fig. 4A and 5, lane XI; Fig. S41†).
Dha residues are highly amenable to a diverse range of modifications, including phosphorylation, glycosylation, methylation, acetylation, and lipidation.49,50 We further modified 25 by Michael addition-like reactions with thiol molecules.50 The conjugate addition of 2-mercaptoethanol generated stlassins I4C-V (26a, [M + H]+ ion at m/z 1728.81) and I4C-VI (26b, [M + H]+ ion at m/z 1728.81) (Fig. 5, lane XIII and Fig. S42 and S43†), while conjugate addition of N-acetylcysteamine generated stlassins I4C-VII (27a, [M + H]+ ion at m/z 1771.89) and I4C-VIII (27b, [M + H]+ ion at m/z 1771.83) (Fig. 5, lane XV and Fig. S44 and S45†). Products 26a/26b and 27a/27b are hypothesized as two pairs of epimer products from the thiol additions, and similar epimer products from the addition of thiols to Dha have been mentioned in previous literature;49 these were generated in high yields at room temperature after only 2 h. The reactions of 21 with 2-mercaptoethanol or N-acetylcysteamine generated no products (Fig. 5, lanes XII and lane XIV), confirming the site-selective modifications on Dha4 in 26a, 26b, 27a, and 27b. Thus, along with the generation of 20 and 22–27, chemical modifications under mild conditions can be employed to expand the structural diversity beyond the limitation of lacking biosynthetic modifying genes.
To evaluate the antagonistic activities of other stlassin derivatives, we carried out large-scale fermentations of the corresponding mutant strains, followed by chemical modifications of the isolated products, to prepare for assays with 2–12, 14–16, 18–21, and 24. Our LPS antagonistic activity assays showed that stlassins I4A (4), I4C (21), I4C-III (24), F15Y (16), and F15Y-I (20) exerted comparable inhibitory activities to stlassin (1). The other stlassin derivatives showed no or less inhibitory activity (Fig. 6). Dose–response experiments were carried out for 1, 4, 21, 24, 16, and 20, showing increased antagonistic activities with increased concentrations of each compound (Fig. S15†). IC50 values were determined based on the dose–response curves (Fig. S15†), in which 16 (IC50 value of 0.273 ± 0.022 μM) and 20 (IC50 value of 0.501 ± 0.057 μM) gave smaller IC50 values than 1 (IC50 value of 0.753 ± 0.015 μM). The different inhibitions of these stlassin derivatives provide important information useful for the generation of more potent inhibitors, e.g. more mutations and chemical modifications on Ile4 and Phe15 positions in the future.
Mutations of Val2–Gln6 and Asn10–Pro12 in the core peptide were well tolerated and enabled production of structurally diverse products, as did substitution of Trp9, Trp14, and Phe15 with other aromatic residues. These results pave the way for carrying out more mutations at these positions, which could be an alternative way to random isolation from natural resources for obtaining new lasso peptides. Seventeen stlassin derivatives (2–17 and 21, Fig. S10†) were generated, and six solution NMR structures of stlassin derivatives (1, 3, 4, 14, 16 and 18) were determined. The solution NMR structural comparisons of 1, 3, 4, 14 and 16 revealed the structural differences related to the loss of thermal stability of 3. We harnessed this structural insight to support rational double mutations that enabled the production of 18 and 19, each of which contains both an intramolecular disulfide bond and a C-terminal steric lock; these structural features extend beyond the current classification of lasso peptides (Fig. 4B). Notably, our productions of 18 and 19 support that disulfide bonds in class I, III and IV lasso peptides may form spontaneously.
Moreover, we performed site-selective chemical modifications at specific positions in the stlassin peptides, generating 20 and 22–27. It should be possible to further expand the structural diversity of stlassin-type lasso peptides by for example introducing tyrosine, cysteine, or other residues into the core peptide to support chemical modifications. Finally, we found that stlassin derivatives with mutations and chemical modifications at the Ile4 and Phe15 positions (4, 16, 20, 21 and 24) showed antagonistic activity against the binding of LPS to TLR4. To the best of our understanding, no previous studies have reported this activity for lasso peptides. In sum, our application of biosynthetic gene mutations and site-selective chemical modifications sets the stage for constructing a unique lasso peptide library, which can support further efforts for bioactivity-guided structural optimization and anti-inflammatory agent development.
Footnotes |
† Electronic supplementary information (ESI) available: All experimental and spectroscopic details. See DOI: 10.1039/d1sc02695j |
‡ These authors contributed equally. |
This journal is © The Royal Society of Chemistry 2021 |