Shuhe
Wang
a,
Johannes
Sigl
a,
Lars
Allmendinger
a,
Victor
Maurizot
b and
Ivan
Huc
*a
aDepartment Pharmazie, Ludwig-Maximilians-Universität München, Butenandtstraße 5–13, Munich D-81377, Germany. E-mail: ivan.huc@cup.lmu.de
bUniversité de Bordeaux, CNRS, Bordeaux Institut National Polytechnique, CBMN (UMR 5248), Institut Européen de Chimie Biologie, 2 Rue Escarpit, Pessac 33600, France
First published on 4th December 2024
Starting from the solid state structure of C3-symmetrical homochiral parallel trimolecular bundle of three aromatic helices held together by intermolecular hydrogen bonds, we have used simple rational principles and molecular modelling to design a similar heterochiral structure where one helix had an opposite orientation and handedness. A rigid and a flexible linker to connect these helices and transform the bundle into a unimolecular object were designed and synthesized. Model sequences with two helices and one linker were then prepared. Their conformations were investigated in solution by nuclear magnetic resonance and circular dichroism, in the solid state by X-ray crystallography, and by molecular dynamics simulations, overall supporting the initial design. A final 6.9 kDa unimolecular three-helix bundle was then prepared using a fragment condensation approach. Solution studies support the formation of the targetted tertiary fold in the case of the rigid linker, thereby validating the overall approach.
In peptides, sophisticated functions are rarely associated with an isolated α-helix or β-strand. Instead, most protein functions emerge at the level of tertiary structures that are complex objects consisting of several helices or sheets. While the design of artificial proteins has been thriving,32–35 tertiary structures based on abiotic backbones are still in their infancy. Our own efforts have consisted in assembling secondary motifs based on oligoamides of 8-amino-2-quinoline carboxylic acid bearing different solubilising side chains in position 4 (QD, QB, QM in Fig. 1a). Oligomers of these δ-amino acids fold into extremely stable 2.5 aromatic helices in all types of solvents (Fig. S1†).36,37 Such helices are easily produced by automated solid phase synthesis,38 and constitute convenient building blocks to be assembled into larger structures.39
In α-peptides, the bundling of several α-helices by multimolecular self-assembly or by folding of a single sequence containing multiple α-helical segments separated by loops is one of the best understood motifs. Such structures can be designed reliably,40–42 and have inspired the development of β-peptidic43–45 and urea-based46–48 foldamer helix bundles as well as bundles of 310 helices.49 We have followed the same path and used helix–helix interactions to produce the first abiotic tertiary structures.50 Monomer X, an analogue of Q, was developed for this purpose (Fig. 1a). The 4-hydroxy group of X protrudes from the aromatic helix and may hydrogen bond to the amide carbonyl group of another helix in chlorinated solvents. Similarly, P and Y constitute analogues of Q and X, respectively, in which the quinoline benzene ring has been trimmed to avoid possible steric clashes within helix bundles.
By carefully arranging the hydroxy groups of X and Y units at the helix surface and choosing a proper turn unit to covalently link two helices, a first helix-turn-helix motif was designed in which two helices are held with their axes parallel to each other by inter-helix hydrogen bonds (Fig. 2a and S2†).50 In this first generation, the helices of the bundle were by design set to both have the same right-handed (P) or left-handed (M) handedness while in a second generation of helix-turn-helix motif, a P helix was linked to an M helix (Fig. S2†).51 These tertiary structures are robust enough to undergo further assembly into quaternary motifs upon introducing additional hydroxy groups to form intermolecular hydrogen bonds.52 Nevertheless, we discovered that the parallel arrangement of two helices is not the most stable. When the connecting turn element (T1 or T2 in Fig. S2†) is absent as, for example, in sequence 1 (Fig. 1b), single helices associate in parallel trimers (Fig. 2c) or in titled dimers (Fig. 2b), not in parallel dimers.50 These different aggregates coexist in solution making the selective formation of one or the other challenging. The reason for their prevalence is that they both allow the release of some unfavorable helix torsion present in the helix-turn-helix structures without hampering hydrogen bonding.53,54
Fig. 2 (a) Schematic representation of a homochiral helix-T1-helix structure. After removing the T1 turn unit, single helices may assemble in tilted dimers (b) or in a parallel trimer (c). As shown in (c), the three helices are parallel in the trimer (black arrows) and the N termini (blue balls) are on the same side of the structure. (d) An oligoethylene glycol-based T3 turn unit was introduced to stabilize an intramolecular tilted arrangement.65 (e) Model of a trimer after inverting the orientation and the handedness of one helix. P helices and M helices are colored in blue and red, respectively. The C terminus of helix ② is then close to the N termini of both helices ① and ③. (f) Illustration of the similar arrangement of hydrogen bond donors (yellow balls) and acceptors (red balls) after inverting the handedness and orientation of a helix. (g) Connection of the C terminus of helix ② to the N terminus of helix ③ and of the N terminus of helix ② to the C terminus of helix ①. (h) Connection of the C terminus of helix ② to the N terminus of helix ① and of the N terminus of helix ② to the C terminus of helix ③. (i) Hydrogen bonding motif between X units and between Y units in the parallel trimer shown in (c). (j) Hydrogen bonding motif between X units and between Y units in the trimer shown in (e). The color code is the same as in (e). (k) “Open-book” view of the hydrogen bond patterns of the homochiral trimer. (l) “Open-book” view of the hydrogen bond patterns of the representation shown in (g). In (k) and (l), the helix face is represented as a plane, with the N terminus shown as a white ball, the C terminus shown as a black ball and the linker T6 shown as a green stick. Planes with a blue and red frame correspond to P and M helices, respectively. Dashed-arrows indicate hydrogen bonds. |
The parallel trimer was a serendipitous discovery and the first motif in which three abiotic helices were assembled in such a way that each helix interacts with the two others (Fig. 2i). Here we present how we successfully transformed this trimolecular object into a unimolecular helix-turn-helix-turn-helix tertiary structure analogous to a pattern common in proteins, for example in the B domain of protein A (Fig. S3†).55,56 We first explored the behavior of shorter models 2–6 before implementing the same design principle in 7 and 8 which contain three helical segments. This achievement not only generated the most complex abiotic tertiary structure known to date. It also allowed us to streamline the design principles, computational approach, as well as synthetic and purification strategies. Routinely accessing such 6.9 kDa abiotic folds may become a realistic prospect in the near future.
The linker should preferably be relatively rigid to reduce the possibility of arrangements other than those intended. The linker should also not create strain that would risk destabilizing the structure. Keeping this in mind, models with different connections between the helix termini (green arrow in Fig. 3b) were built and energy-minimized. When strain was not apparent in the minimized structure under the form of e.g. helix distortion, a further test was to cut a bond in the linker and energy-minimize again. Any important conformational change at this stage would be interpreted as strain and the corresponding linker would be discarded or subjected to improvement. Finally, the synthetic accessibility was also considered to select the linker. A number of trials led to the design of two analogous turn units, T6f and T6r (Fig. 1a, 3c and d). T6f is a diamide of 4-aminomethyl pyridine 2-carboxylic acid and 5-amino quinoline 2-carboxylic acid, while T6r is a diamide of 5-amino isoquinoline 3-carboxylic acid and 5-amino quinoline 2-carboxylic acid. T6f is more flexible due to the presence of an additional rotatable bond at the CH2 group. The linker units were produced with a free acid function and an Fmoc-protected amine, that is, ready for solid-phase synthesis (Fig. 4, see ESI† for details).
The synthesis of 5-nitroquinoline 10 is similar to that of the 8-nitro precursor of QB,66 starting from 3-nitroaniline instead of 2-nitroaniline. Of note, the cyclization that produces 9 yields an equal amount of the 7-nitro isomer which can be used in other foldamer architectures.67 The two isomers can be separated by selective crystallization. Similarly, 14 represents a regioisomer of Fmoc-P-OH, albeit obtained by a different route. Hydrogenation of the commercially available ethyl 4-cyano-2-pyridinecarboxylate in MeOH and in presence of Boc2O produced 12 without observable transesterification. The synthesis of precursor 20 starts from commercially available tetrahydroisoquinoline 3-carboxylic acid. The nitration step is selective and the 5-nitro regioisomer was isolated in 80% yield by precipitation. The acylation of 11 by 14 using PyBOP activation or by 20 produced Fmoc-T6f-OH and Fmoc-T6r-OH, respectively. Overall, these syntheses worked quite well and needed not be repeated for this study. About half of the steps required chromatographic purification. We surmise that a full optimization would allow to improve some yields further, to increase the scales, and to avoid some chromatographic steps.
We thus prepared three sequences equivalent in length to Q3-turn-Q3, 2b, 3b and 4b. Sequences 2b and 3b are achiral, with T6f and T6r as turn units, respectively. Sequence 4b is a chiral analogue of 3b, bearing a (1S)-camphanyl group so that the N-terminal helix has P handedness.68 All three sequences have a single X monomer in position two of the first helical domain. The hydroxy group of this X unit can potentially form an intramolecular hydrogen bond with a carbonyl group of the second Q3 helical domain. In these short oligomers, the second residue after the turn is a QM monomer in replacement of X in 7b and 8b, where this residue should interact with the third helix.
Sequences 2a, 3a and 4a (Fig. 1b), the protected precursors of 2b, 3b and 4b, were synthesized on solid phase using previously reported methods.38 The synthesis was performed on a super acid sensitive resin (SASRIN™) so that mildly acidic resin cleavage preserved tBu-ether protection of the monomer. The C-terminal carboxy group was then methylated and the sequences were purified in their protected form. After TFA-mediated cleavage of the tBu group, they were purified in their deprotected form. The 1H NMR spectra of 2a, 3a and 4a in CDCl3 showed one set of sharp signals (Fig. S5†), and so did the spectra of 2b, 3b and 4b (Fig. 5a). However, the signal patterns of the CH2 protons belonging to iBu side chains indicate that P/M handedness interconversion of the helical segments of 2a and 3a is fast on the NMR time scale at 298 K whereas it is slow for 2b and 3b, indicating higher conformational stability of the latter (Fig. S6†). For 2b, 3b and 4b, 1H,15N Heteronuclear Single Quantum Coherence (HSQC) spectra allowed for the indirect assignment of the OH resonance as an exchangeable proton that does not correlate with 15N (Fig. 5a and S7–9†). In all cases, the OH proton signal is found above 10 ppm indicating that this proton is involved in a hydrogen bond.
The solid-state structures of 2b and 3b were elucidated using single crystal X-ray diffraction analysis. Their conformations are similar and both are very close to the initial molecular models. The helix-turn-helix motif is heterochiral, that is, PM or MP, meaning that T6r and T6f promote a reversal of helix handedness as desired. The hydroxy group of X is involved in the expected intramolecular hydrogen bond (Fig. 5b and c). A slight deviation from the initial design concerns the relative orientation of the hydrogen bond donor and acceptor (Fig. 5d and e). The observed orientation, with the hydroxy group pointing to the position 3 of the quinoline ring to which it belongs (Fig. 5d), is similar to that of a previously described arrangement that we called “shifted” interface (Fig. S10†).69 In contrast, in the predicted arrangement (Fig. 5e) and in the structure of (1)3 (Fig. 2i),50 the hydroxy group points toward the position 5 of the quinoline ring. Molecular Dynamics (MD) simulations starting from the solid-state structure of 3b showed that the two hydrogen bond patterns alternate within a simulation time as short as 1 ns, suggesting fast dynamics in these smaller model compounds (Fig. S11†).
The CD spectra of chiral sequences 4a and 4b in solution were consistent with the solid-state structure of 3b (Fig. 5f). The relatively weak CD band of 4b is in agreement with its two helical segments having opposite handedness and largely cancelling each other's CD contribution. In contrast, the bulky tBu ether of 4a may disfavor a P-turn-M arrangement with both helices on the same side (Fig. 5g). Having both Q3 helices on opposite sides of the turn unit would then favor a P-turn-P arrangement,70,71 as reflected in its intense positive CD band. Adding DMSO to a CHCl3 solution of 4b disrupts the intramolecular hydrogen bond, resulting in an increase of the proportion of the P-turn-P conformation and thus in an enhancement of CD intensity (Fig. 5f and g). Altogether, these results hint at T6r and T6f performing well at promoting the desired conformations. We thus proceeded with the examination of longer model systems.
Sequences 5b (with T6f) and 6b (with T6r) are equivalent in length to Q8-turn-Q8 and represent achiral analogues of the two N-terminal helix-turn-helix segments of 7b and 8b, respectively. In 5b and 6b, the OH groups intended to promote interactions between helices ③ and ② of 7b and 8b have been preserved, and the OH groups intended to promote interactions with helix ① have been removed. Thus, 5b and 6b contain three X units in total, compared to three X units and two Y units for the first two helices of 7b and 8b. The protected precursors 5a and 6a (Fig. 1b) were synthesized on solid phase using the same protocol as for 2a, 3a and 4a, but on a different resin. With a Gly-HMBA AM resin (4-(hydroxymethyl)benzoyl-aminomethyl polystyrene with a preloaded glycine), resin cleavage in presence of methanol under basic conditions directly yields the C-terminal methyl ester while preserving tBu-ether protection on the monomers. Sequences 5 and 6 thus have a glycine methyl ester at their C terminus.
Handedness interconversion in each of the longer helices of 5a and 6a is slow on the NMR timescale, unlike in the shorter helices of, e.g., 2a. This allows for the observation of diastereomeric conformers as distinct sets of signals at 298 K (Fig. S5†). Thus, the 1H NMR spectrum of 5a showed two species in a 1:0.8 ratio, which correspond to PP/MM and PM/MP diastereomers. In contrast, the spectrum of 6a shows one set of signals assigned to the PP/MM conformers by extension of the preference of 4a for the PP conformation. The different spectra of 5a and 6a reflect different behaviors of T6f and T6r. Due to its rigidity, the latter conveys handedness from one helix to the next, whereas the former disrupts helix handedness communication.70,71 Nevertheless, after side chain deprotection, the 1H NMR spectra of 5b and 6b both show a single set of signals in CDCl3 (Fig. 6b), indicating that a single diastereomeric conformer prevails and thus that quantitative helix handedness communication through the turn units takes place, as observed above for 2b and 3b. OH resonances were assigned as for 2a and 3a (Fig. 6b, S12 and 13†) and appeared above 10 ppm, indicating their involvement in hydrogen bonds. Diffusion-Ordered Spectroscopy (DOSY) shows that 6b has a larger diffusion coefficient than 6a (Fig. S14†). Since 6a is a monomer, this suggests that 6b is monomeric as well and thus that all hydrogen bonds are intramolecular. The prevalent conformers of 5b and 6b in CDCl3 were also prevalent in CD2Cl2 (Fig. S15 and 16†). This was examined because the relative stability of (1)3 had been shown to differ in these two solvents.50
We then evaluated the stability of these conformers by 1H NMR upon adding DMSO-d6 to CDCl3 solutions (Fig. 6c, S17 and 18†). DMSO competes for hydrogen bonding and is known to disrupt tertiary folds mediated by hydrogen bonds, resulting in variations of chemical shift values.50,51,53,72 The conformational change was observed in both species, yet their transition points differed slightly: around 8% (vol/vol) DMSO for 5b, and 12% for 6b. The sequence having the more rigid linker thus appears to be more stable.
The results above are consistent with 5b and 6b folding as a U-shaped helix-turn-helix structure stabilized by the turn unit geometry and three intramolecular inter-helix hydrogen bonds. Crystals of 5b and 6b were obtained but did not diffract at atomic resolution and their solid-state structure could not be solved. Their conformations were thus further investigated by MD simulations. Building on what was observed for 3b, two different models of 6b were built. In one model, the OH groups were made to point towards the position 5 of the quinoline ring they belong to, as in the models of 7b and 8b and in the structure of (1)3 (Fig. 6a). This conformation was termed 6b-T, T standing for “Three helix pattern”. In the other model, the OH groups were made to point towards the position 3 of the quinoline ring they belong to, as in the crystal structures of 2b and 3b. This conformation was termed 6b-S, S standing for “Shifted pattern”. Both models could be energy-minimized while preserving the orientation of the OH groups and the three inter-helix hydrogen bonds. In 6b-T, a view of the helix down its axis showed a 15-crown-5-like shape of the inner rim, suggesting that the helix held its preferred curvature of 2.5 units per turn (Fig. S19†). This was less obvious in 6b-S, hinting at a possible distortion of the main chain. Furthermore, the two helices had their axes parallel in 6b-T and at an angle in 6b-S. During MD simulation, the 6b-S was found to convert to 6b-T within 2 ns while the reverse process was not observed (Fig. S20†), suggesting a higher stability of 6b-T.
In summary, the evidence gathered so far suggests that T6f and T6r indeed mediate the helix-turn-helix arrangement desired to form three-helix bundles. This entails preventing the formation of aggregates and of other hydrogen-bonded arrangements that may occur in the absence of linker.50 The shorter helix-T6-helix models may also adopt a conformation with a different orientation of the hydrogen bonds without changing the donors and acceptors involved, but we expect these alternate conformations to be disfavored with the longer helices. Some differences were observed between T6f and T6r, with the more rigid T6r emerging as a better option to control conformation.
Sequences 7a and 8a both have three identical octaamide segments linked by T6f or T6r turns. However, fragment condensation with aromatic helices works much better on aliphatic amines and this led to adopting different strategies for the two targets (Fig. 7). T6f is terminated by an aliphatic amine where segments can be coupled. Accordingly, we prepared the Fmoc-QQQQQ-OH repeat motif (fragment A in Fig. 7a) on SASRIN™ resin and purified it in its fully protected form. Sequence 7a was then assembled on a Gly-HMBA AM resin using alternatively fragment A activated with BOP and T6f activated as an acid chloride. The terminal Fmoc group was removed with 2% DBU/NMP and (1S)-camphanic chloride was used to finally cap the N terminus and induce a P handedness in the N terminal helix. After cleavage from the resin as a methyl ester using the same conditions as for 5a and 6a, 7a was purified by GPC and isolated in high purity in an overall yield of 10%.
Fig. 7 Scheme representation of fragment condensation strategies for the synthesis of 7a, 7b (a) and 8a, 8b (b). (c) Extracts of 1H NMR spectra (500 MHz, CD2Cl2) of 7b and 8b. (d) Extract of 1H,15N HSQC (500 MHz, CD2Cl2) of 8b. The region above 8.8 ppm is shown, see Fig. S25† for a full spectrum. Only NH resonances correlate, red diamonds indicate the signals of hydrogen-bonded OH protons. |
Because T6r is terminated by an aromatic amine, a different strategy was required to prepare 8a. The sequence was divided into four fragments of three different kinds (B, C and D in Fig. 7b) with the plan to condense the fragments at the benzylic amine of units. The synthesis proceeded as for 7a with the prior synthesis of the fragments on SASRIN™ resin and their purification as a free carboxylic acid, followed by their assembly on Gly-HMBA AM resin. The final isolated yield after GPC purification was 9%.
Consistent with the behavior of 5a, the 1H NMR spectrum of 7a showed four sets of signals with similar intensities (ratio of 1/0.9/0.85/0.8) that can be explained by the coexistence of PPP, PMP, PPM and PMM diastereomeric conformers (Fig. S5†). In contrast, and in agreement with the behavior of 6a, the 1H NMR spectrum of 8a showed one set of signals assigned to the PPP conformer (Fig. S5†). After side chain deprotection, the 1H NMR spectra of 7b in both CDCl3 and CD2Cl2 showed numerous overlapping signals indicating the presence of several species (Fig. 7c and S21†). No significant changes were observed after allowing the solution to stand for 2 weeks (Fig. S22†). It is not clear whether the species correspond to aggregates, severely kinetically trapped states, alternate folds or a combination of these. Adding DMSO-d6 to a solution of 7b to disrupt the hydrogen bonds led to a simplification of the spectrum (Fig. S23†).
Sequence 8b behaved differently from 7b. Its spectrum in CD2Cl2 showed one set of signals indicating a well-defined unimolecular species or a symmetrical aggregate (Fig. 7c and S24†), while the spectrum in CDCl3 was complex, reflecting the coexistence of several conformers or aggregates in solution. The 1H,15N HSQC spectrum in CD2Cl2 allowed us to assign all twenty-nine 15N resonances and, indirectly, the nine hydrogen bonded OH resonances (Fig. 7d and S25†). The DOSY spectrum of an 8a + 8b mixture in CD2Cl2 confirms the smaller size of 8b and thus its probable monomeric nature (Fig. S26†). Moreover, the 1H NMR spectrum did not significantly change upon adding 4% DMSO-d6 even at concentrations as low as 50 μmol (Fig. 8b and S27†). This proportion of DMSO is usually insufficient to disrupt tertiary folds but potentially destabilizing for some aggregates.50,53,54 The effect of further increasing the proportion of DMSO was monitored by 1H NMR and CD. The 1H NMR spectra showed that a transition occurred between 12 and 16% of DMSO (Fig. S28†), that is, more than required for the disruption of the folded conformer of 6b, hinting at a higher stability of 8b. The CD spectra corroborated our interpretation of NMR data. The spectrum of protected precursor 8a showed a strong positive band consistent with a PPP conformation (Fig. 8c). In comparison, the CD spectrum of 8b showed a relatively weak band in CH2Cl2 which we assigned to the desired PMP conformer in which the contributions of the helical segments partly cancel each other. As described above for 4b, the intensity of this band increased upon adding DMSO as the PMP conformer is at least in part disrupted, leading to an increase of the proportion of PPP conformer (Fig. 8c–e).
Taken altogether, these data point to 8b being a unimolecular tertiary fold with two helix handedness reversals held together by nine intramolecular hydrogen bonds consistent with the initially proposed molecular model (Fig. 8a). Well-defined folding of 8b, however, appears to be restricted to CH2Cl2. In CDCl3, its 1H NMR spectrum is broad and shows multiple species. These species already emerge upon adding CDCl3 to a solution of 8b in CD2Cl2 (Fig. S24†). Whether these species correspond to alternate folds or aggregates is unknown at this stage. One may point to other examples that we previously reported of changes in aggregation behavior of aromatic helices between CDCl3 and CD2Cl2.69 It is probably because of other possibilities these molecules have to fold and aggregate that the rigid T6r turn brought a critical advantage to control the conformation of 8b in CD2Cl2. Indeed, the PMP three-helix bundle of 8b does not form at all in the absence of linker – sequence 1 trimerizes in a PPP/MMM three-helix bundle. Apparently, the more flexible T6f turn of 7b failed to bring a sufficient advantage. One important lesson to draw from this study is thus that the design of a well-defined tertiary fold amounts as much to stabilizing the desired fold as to preventing the formation of alternate structures.
Footnote |
† Electronic supplementary information (ESI) available: Supplementary figures, detailed experimental protocols and characterization of new compounds. CCDC 2391393 and 2391429. For ESI and crystallographic data in CIF or other electronic format see DOI: https://doi.org/10.1039/d4sc07336c |
This journal is © The Royal Society of Chemistry 2025 |