Jacob A.
Iannuzzelli
and
Rudi
Fasan
*
Department of Chemistry, University of Rochester, Rochester, New York 14627, USA. E-mail: rfasan@ur.rochester.edu
First published on 27th May 2020
The macrocyclization of recombinant polypeptides by means of genetically encodable non-canonical amino acids has recently provided an attractive strategy for the screening and discovery of macrocyclic peptide inhibitors of protein–protein interactions. Here, we report the development of an expanded suite of electrophilic unnatural amino acids (eUAAs) useful for directing the biosynthesis of genetically encoded thioether-bridged macrocyclic peptides in bacterial cells (E. coli). These reagents are shown to provide efficient access to a broad range of macrocyclic peptide scaffolds spanning from 2 to 20 amino acid residues, with the different eUAAs offering complementary reactivity profiles toward mediating short- vs. long-range macrocyclizations. Swapping of the eUAA cyclization module in a cyclopeptide inhibitor of streptavidin and Keap1 led to compounds with markedly distinct binding affinity toward the respective target proteins, highlighting the effectiveness of this strategy toward tuning the structural and functional properties of bioactive macrocyclic peptides. The peptide cyclization strategies reported here expand opportunities for the combinatorial biosynthesis of natural product-like peptide macrocycles in bacterial cells or in combination with display platforms toward the discovery of selective agents capable of targeting proteins and protein-mediated interactions.
This progress notwithstanding, and with the notable exception of SICLOPPS libraries,26,37,38 none of the aforementioned methodologies allows for the production of macrocyclic peptides of arbitrary sequences in living cells, thus limiting the screening of these macrocyclic peptide libraries to in vitro affinity-based selection procedures.27–34 To expand capabilities in this area, our group has introduced methodologies useful for mediating the synthesis of macrocyclic peptides (a.k.a. macrocyclic organo-peptide hybrids or MOrPHs)39 in living cells via the spontaneous, post-translational cyclization of recombinant polypeptides by means of genetically encoded non-canonical amino acids (ncAA).40–43 These methodologies have enabled the creation and screening of macrocyclic peptide libraries which can be produced in cellulo42,44 or displayed on phage particles45 for the discovery of chemical agents capable of disrupting protein–protein interactions. So far, however, only a few ncAAs40,42 were made available for these applications, thus limiting opportunities for structural diversification of the resulting macrocyclic peptides through variation of the ncAA module. Here, we report the development and characterization of an expanded suite of non-canonical amino acids suitable for directing the biosynthesis of macrocyclic peptides constrained by a non-reducible, inter-side-chain thioether linkage in bacterial cells. These cyclization strategies provide efficient access to a broader range of macrocyclic peptide scaffolds than previously possible. In addition, variation of the ncAA module is shown to furnish an effective means to modulate the functional properties of bioactive macrocyclic peptides.
Conveniently, two engineered variants of Methanocaldococcus jannaschii tyrosyl-tRNA synthetase (MjTyrRS) were recently made available for the genetic incorporation of both pVsaF and pAaF for their use in the context of intermolecular protein–protein crosslinking via aza-Michael reactions.54 Accordingly, these AARS variants were prepared by site-directed mutagenesis of MjTyrRS and cloned into a pEVOL-based vector56 for expression of pVsaF-RS (Tyr32Gly, Leu65Tyr, Phe108His, Gln109Gly, Asp158Gly, Ile159Leu, Leu162Gln, Asp286Arg) or pAaF-RS (a.k.a. TyrRS42: Tyr32Val, Leu65Tyr, Phe108His, Gln109Gly, Asp158Gly, Leu162Glu, Asp286Arg) in conjunction with the cognate suppressor MjtRNATyrCUA. Both AARS/tRNA pairs were found to enable the selective incorporation of pVsaF (or pAaF) into a protein, as determined using a reporter yellow fluorescent protein containing an amber stop codon within its N-terminal region (YFP(TAG))42 (Fig. 2a). To identify a suitable AARS for genetic incorporation of pCaaF, we surmised that this ncAA may be also recognized by pVsaF-RS and/or pAaF-RS, in view of the structural similarity between pCaaF and pVsaF/pAaF and considering the polyspecificity often exhibited by engineered AARS.42,57,58 Gratifyingly, both AARSs and in particular pAaF-RS, were found to effectively charge the suppressor tRNA with pCaaF, enabling its efficient incorporation into the reporter YFP protein (Fig. 2a). In fact, both AARS showed a marked polyspecificity within this set of ncAAs, which may be facilitated by specific recognition of the amido/sulfonamido group of these substrates within the enzyme active site. To identify a AARS for the genetic encoding of O4bbY, we screened a broader in-house collection of MjTyrRS variants previously engineered for recognition of various non-canonical tyrosine and phenylalanine derivatives, including O-propargyl-tyrosine (OpgY),59 3-amino-tyrosine (3AmY),60 and 2-naphthylalanine (2NapA), among others. Using the aforementioned YFP(TAG)-based fluorescence assay, the MjTyrRS variant O2beY-RS, which we previously engineered for recognition of O2beY,40 was found to provide a viable AARS for incorporation of O4bbY in response to an amber stop codon (Fig. 2b). Interestingly, O2beY-RS differs from the OpgY-specific aminoacyl-tRNA synthetase OpgY-RS by a single mutation, Ala32Gly, which expands the active site cavity in proximity of the para position of the tyrosine substrate.40 Unlike for O2beY, no detectable incorporation of O4bbY was observed using OpgY-RS (Fig. 2b), indicating that this cavity-enlarging mutation becomes critical for accommodating the more extended para substituent in this ncAA compared to O2beY and OpgY.
With a functional set of orthogonal AARS/tRNA pairs for the genetic encoding of the target ncAAs in hand, we set out to test the ability of these eUAAs to promote peptide cyclization according to the strategy outlined in Fig. 1a. To this end, we prepared a series of model polypeptides encompassing a macrocycle precursor sequence in which the eUAA and Cys residues are spaced from each other by an increasing number of intervening residues and leading to a i/i + 1 to i/i + 20 macrocycle, where i and n correspond to the eUAA and Cys residue, respectively (Table 1). In these constructs, the macrocycle precursor sequence was N-terminally fused to a FLAG tag for ease of detection by mass spectrometry and in the binding assays (vide infra) and C-terminally fused to a chitin-binding domain (CBD) containing a poly-histidine tag. The latter was introduced to facilitate purification of the recombinantly produced constructs and isolation of any product resulting from potential reactions of the genetically incorporated eUAA with other nucleophiles in cells. In addition, a Factor Xa cleavage site was introduced between the macrocycle precursor sequence and the CBD tag to allow for proteolytic cleavage of the tag for LC-MS analysis of cyclization efficiency.
Entry | Name | Macrocycle precursor sequencea | Cysb |
---|---|---|---|
a The full protein construct contains an N-terminal FLAG tag (MDYKDDDDK-GSGSG-) and a C-terminal Factor Xa site (-GIEGR-) followed by chitin binding domain from B. circulans chitinase A1 containing a poly-His tag. b Relative position of cysteine with respect to the electrophilic unnatural amino acid (eUAA) (=position ‘i’). | |||
1 | Z1C | i + 1 | |
2 | Z2C | i + 2 | |
3 | Z3C | i + 3 | |
4 | Z4C | i + 4 | |
5 | Z5C | i + 5 | |
6 | Z6C | i + 6 | |
7 | Z8C | i + 8 | |
8 | Z10C | i + 10 | |
9 | Z12C | i + 12 | |
10 | Z15C | i + 15 | |
11 | Z20C | i + 20 | |
12 | C6Z | i − 6 | |
13 | C8Z | i − 8 | |
14 | Strep-m3(Z) | i − 8 | |
15 | KDD-m1(Z) | i + 7 |
The target constructs were produced in E. coli BL21(DE3) cells containing a dual plasmid system for expression of the precursor polypeptide and the orthogonal AARS/tRNA pair in the presence of the desired eUAA. The recombinant constructs were purified via Ni-affinity chromatography and analyzed by MALDI-TOF mass spectrometry. For each of the ncAAs, the corresponding proteins were obtained in high yields (e.g., pVsaF constructs (avg): ∼25 mg protein per L culture; pAaF constructs (avg): ∼65 mg L−1 culture; pCaaF constructs (avg): ∼26 mg L−1 culture; O4bbY constructs (avg): ∼72 mg L−1 culture). In addition, MALDI-TOF MS analysis of both the purified proteins and the N-terminal peptides obtained after proteolytic (Factor Xa) cleavage of the CBD tag (Fig. S2–S5†) show a m/z signal corresponding to the desired ncAA-containing construct with no detectable amounts of by-products resulting from amino acid misincorporation. These results thus confirmed the functionality and specificity, respectively, of the AARS enzymes applied for incorporation of these eUAAs. To measure the extent of eUAA-induced cyclization, the N-terminal peptides were analysed by LC-MS to quantify the amount of the desired thioether-linked macrocyclic product vs. the linear peptide (see Fig. S8–S11†). pCaaF- and O4bbY-mediated cyclization results in the loss of a HX group (X = Cl or Br), enabling direct discrimination of cyclic vs. linear peptide based on the mass difference. Since the cyclized and acyclic forms of the pVsaF- and pAaF-containing peptides are isobaric, these compounds were treated with iodoacetamide prior to LC-MS analysis. Under these conditions, the free Cys in the linear peptide is quantitatively alkylated (+57 mass units; Fig. S1†) enabling its discrimination from the cyclic form.
As shown in Fig. 3 and S2–S5,† these experiments showed that all the eUAAs are able to induce efficient macrocyclization across variable target sequence lengths, thus demonstrating their functionality and efficiency toward directing the biosynthesis of macrocyclic peptides in E. coli. Previously, O2beY-mediated cyclization was found to occur efficiently (80–100%) only across inter-residue distances ranging from i/i + 2 to i/i + 8, where i is the eUAA and the Cys residue is located at the i + n positon (Fig. 3a).40 In stark contrast, all of the new eUAAs investigated here undergo efficient cyclization from i/i + 2 up to i/i + 20 inter-residue distances, enabling the formation of peptide macrocycles spanning 3 to 21 amino acid residues in high to quantitative yields (>80–100%; Fig. 3a–d). As the only exception, modest cyclization (30%) was observed for the 16mer construct (Z15C) with pVsaF (Fig. 3c). This result, however, likely arises from a sequence-dependent effect since significantly higher macrocycle yield (85%) was observed for the longer (21mer) target sequence using this eUAA. Thus, complementing the reactivity profile of O2beY,40 the pCaaF-, O4bbY-, pVsaF-, and pAaF-based cyclization modules greatly expand the range of cyclopeptide ring sizes accessible using this methodology (Fig. 1) and provide an effective means to achieve long-range peptide macrocyclizations (i.e., i/i + 10 to i/i + 20) in addition to medium- and short-range ones (i.e., i/i + 2 to i/i + 8). To assess the impact of the orientation of the eUAA/Cys pair on cyclization efficiency, two additional constructs were examined in which the Cys/eUAA residues are installed in a i/i − 6 and i/i − 8 arrangement (Table 1, entries 12 and 13). In both cases and across all of the eUAAs, the desired macrocyclic peptide was obtained in comparably high yields as for the i/i + 6-linked and i/i + 8-linked counterparts (94–100%; Fig. 3a–d and S2–S5†), indicating that both orientations of the eUAA/Cys pair are compatible with efficient macrocyclization.
Fig. 3 Cyclization efficiency for the different macrocycle precursor sequences with varying distances and orientations of the eUAA/Cys pair. For each graph, the eUAA cyclization module is indicated. Proteins were isolated after expression for 16–18 hours at 27 °C and the extent of cyclization was determined by LC-MS analysis after proteolytic cleavage of the CBD tag (see ESI for details†). |
Whereas the eUAAs performed similarly across i/i + 2 to i/i + 20 crosslinking distances, insights into their differential reactivity could be gained from analysis of the i/i + 1 constructs (Z1C(eUAA); Table 1; entry 1). Indeed, in contrast to pCaaF, pAaF, and O2beY, which undergo no to inefficient macrocyclization (0–26%) with an adjacent cysteine (Fig. 3a and b), both pVsaF and O4bbY are able to form the corresponding i/i + 1-linked macrocycles with substantially higher efficiency (75–88%; Fig. 3c and d). Notably, these reactions generate 16-membered macrocycles containing five sp2 carbon centers, whose formation is expected to be quite unfavorable due to ring strain.61 Compared to the structurally similar pAaF, the higher efficiency of pVsaF toward i/i + 1 cyclization (75% vs. 26%) can be rationalized based on the higher reactivity of vinylsulfonamides vs. acrylamides toward Michael addition reactions with thiols.62 Another noteworthy aspect is the accumulation of a certain amount of glutathione adduct or hydrolysis product for the constructs unable to undergo efficient cyclization, namely Z1C(pCaaF), Z1C(pAaF), and Z15C(pCaaF) (Fig. S2 and S3†). While this indicate that these reactions can potentially compete with intramolecular crosslinking within the cell, the high extent of cyclization observed for the vast majority of the constructs and reactions tested (i.e., 48/52 = 92%; Fig. S2–S5†) highlights the fact that eUAA-mediated cyclization effectively outcompete these side-reactions.
The different types of side-chain electrophilic groups carried by the eUAA modules result in the formation of peptide macrocycles constrained by structurally different thioether linkages (Fig. 4). Since subtle backbone modifications are known to have a large impact on the conformational (and thus functional) properties of cyclic peptides,4,63 we envisioned the different eUAAs could also provide a means to tune the functional properties of the corresponding peptide macrocycles. To investigate this aspect, we selected as testbed two bioactive cyclopeptides we have recently isolated from the screening of phage displayed libraries of O2beY-based macrocycles.45 These correspond to Strep-m3, a (i/i − 8)-linked macrocycle with nanomolar affinity for streptavidin (KD: 20 nM), and KDD-m1, a (i/i + 7)-linked macrocyclic peptide capable of targeting Kelch-like ECH-associated protein 1 (Keap1) (KD: 107 nM), which is implicated in regulating cytoprotective responses to oxidative stress in human cells.64 To examine the impact of the thioether linkage on the function of these peptides, a series of macrocyclic peptides were generated by introducing each of the new eUAAs in place of O2beY within these peptide sequences (Table 1, entries 14 and 15). Consistent with the structure–activity analyses of Fig. 3, all of the desired macrocycles containing the different eUAAs were obtained in quantitative yields as confirmed by MS analysis (Fig. S6 and S7†). After purification, the binding affinity of these macrocyclic peptides toward the respective target proteins (streptavidin or Keap1) was determined using an in vitro assay. As shown in Fig. 5, these experiments revealed a pronounced effect of the nature of the thioether linkage on the binding properties of these molecules. In the case of the Strep-m3 derived series, pAaF- and pCaaF-mediated cyclization resulted in cyclic peptides displaying a streptavidin binding affinity comparable to that of the O2beY-linked peptide (KD: 31–35 nM vs. 20 nM) (Fig. 5c), indicating that the corresponding thioether linkages (–NHC(CO)(CH2)n–, where n = 1 and 2, respectively) are isofunctional to that generated via O2beY-mediated cyclization (–O(CH2)2–) in the context of this macrocyclic scaffold (Fig. 5a). In contrast, a 17-fold lower binding affinity (KD: 343 vs. 20 nM) was measured for the pVsaF-cyclized peptide, despite the fact its thioether linkage is identical in length to that given by pAaF and differs from it by only a subtle structural feature, namely the presence of a sulfonamide (–NHSO2–) vs. amide (–NHCO–) bond (Fig. 5a). Only a modest reduction in affinity (2.5-fold) was observed instead for the O4bbY-cyclized peptide, which features an alkyl thioether linkage extended by two methylene units compared to that of O2beY. Interestingly, a completely different scenario emerged from the analysis of the Keap1-targeting peptides derived from KKD-m1 (Fig. 5b and c). Here, the best Keap1 binder among this series is the pVsaF-cyclized macrocycle, which binds Keap1 with only a slightly reduced affinity (3-fold) compared to the reference O2beY-linked macrocycle (KD: 327 nM vs. 107 nM; Fig. 5c). Whereas the O4bbY-cyclized peptide also maintained substantial affinity for Keap1 (KD: 1200 nM), both the pAaF- and pCaaF-cyclized peptide showed a dramatic drop (>100-fold) in the strength of their binding interaction with Keap1 compared to the O2beY-based peptide. Taken together, these studies illustrate the ability of the different eUAA cyclization modules and corresponding thioether linkages to tune the functional properties of a bioactive macrocyclic peptide, also displaying a differential effect in the context of different cyclopeptide scaffolds. In this regard, particularly instructive is the markedly distinct behaviour of the pVsaF- vs. pAaF-based macrocycles, which is dependent upon a small variation (–SO2– vs. –C(O)– unit) within their thioether linkages. Given that the rest of these molecules remain invariant, this effect likely arises from a subtle yet functionally important change in the conformation of the macrocycles as influenced by the nature of the intramolecular crosslink.
Footnote |
† Electronic supplementary information (ESI) available: Detailed experimental procedures, synthetic procedures, additional MS spectra and LC-MS chromatograms, NMR spectra. See DOI: 10.1039/d0sc01699c |
This journal is © The Royal Society of Chemistry 2020 |