Lucia
Tapia
a,
Yolanda
Pérez
b,
Jordi
Solà
a,
Santiago V.
Luis
c,
Ignacio
Alfonso
*a and
Cristian
Vicent
*d
aDepartment of Biological Chemistry, Institute for Advanced Chemistry of Catalonia, IQAC-CSIC, Jordi Girona 18-26, 08034, Barcelona, Spain. E-mail: ignacio.alfonso@iqac.csic.es
bNMR Facility, Institute for Advanced Chemistry of Catalonia, IQAC-CSIC, Jordi Girona 18-26, 08034, Barcelona, Spain
cDepartamento de Química Inorgánica y Orgánica, Universitat Jaume I, Av. de Vicent Sos Baynat s/n, 12071 Castellón, Spain
dServei Central d'Instrumentació Científica (SCIC), Universitat Jaume I, Avda. Sos Baynat s/n, 12006 Castellón, Spain. E-mail: barrera@uji.es
First published on 28th October 2022
Complexation of the glutamic–tyrosine–glutamic tripeptide (EYE) with a series of pseudopeptidic cages has been thoroughly investigated using different analytical techniques. The stoichiometry and affinities of the supramolecular host:guest complexes both in aqueous solution and in the gas-phase were obtained from a suitable combination of fluorescence spectroscopy, NMR, and mass spectrometry (MS) methods. The cages bearing basic groups (lysine, ornitine and histidine) display the tightest EYE binding in aqueous media following the order CyHis > CyLys>CyOrn, thus suggesting that Tyr side chain encapsulation is additionally modulated by the identity of the cage side chains and their ability to be engaged in polar interactions with the EYE peptide. Similarly, binding affinities estimated by MS methods clearly point towards a reduced affinity for the Cy cages with acidic pendant groups and a higher affinity of the CyHis cage over CyLys and CyOrn. Ion mobility spectrometry (IMS)-MS, assisted by molecular modelling, has been used to uncover the structural and conformational characteristics of the pseudopeptidic hosts and their supramolecular adducts with the EYE peptide. The cages display a collisional cross-section increase upon EYE inclusion that is associated with the expansion of the binding pocket of the cage cavity, thus constituting a unique example of conformational pseudopeptidic host adaptation to accommodate the inclusion of the guest.
An important challenge in the design of new and improved peptide receptors is their structural and conformational analysis. As specific host modifications are introduced, the structure–function relationship of pseudopeptidic cages may become difficult to explore by solution-phase or solid state techniques.19–22 In the present case, the cages shown in Chart 1 display highly symmetric NMR spectra (providing limited structural insights) in aqueous biomimetic media. Furthermore, suitable single-crystals for X-ray diffraction studies could not be attained so far; thus structural and conformational details remain limited. Therefore, it is highly desirable to develop new analytical toolsets and strategies to assess the characterization of the Cy cages and their cage:guest complexes.
In this context, mass spectrometry (MS) soft ionization methods, such as electrospray ionization (ESI), have long been used as an analytical method to study biological noncovalent macromolecular complexes.23–26 Properties like stoichiometry, binding affinities or stability of host–guest complexes in the gas phase can be determined, with a very low sample consumption. In addition, ion mobility spectrometry (IMS) MS has emerged in recent years as a powerful tool to gain insights into the conformational dynamics of biological systems,27–30 offering a unique means of characterizing molecular dimensions, structure, flexibility and folding mechanisms.31–33 IMS separates gaseous ions based on their sizes and shapes as the ions travel through a chamber filled with a buffer gas in an electric field. Based on the IMS-MS experimental conditions and the arrival time distribution (ATD) of ions, their mobility (K) and the collision cross section (CCS) derived property can be determined. However, IMS-MS is still highly underexploited in the field of synthetic cages or macrocycles and their host:guest chemistry.34–43 Herein, a combination of MS methods (single-stage, collision induced dissociation (CID) and IMS-MS experiments), fluorescence spectroscopy, NMR and molecular modelling of the supramolecular complexes of the Cy cages and the peptide EYE is reported to decipher the intimate details of the binding event, both in aqueous solution and in the gas-phase.
To overcome the inherent limited structural and conformational information gathered from NMR, we turned our attention to MS methods as an alternative analytical tool. The ESI mass spectra of the cages shown in Chart 1, investigated from aqueous 10 μM solutions, are very much alike displaying protonated adducts of the general formula [M + nH]n+ (n = 2–5) with a unimodal distribution. For CyOrn, CyLys and CyHis, the average charge states (zaverage) range from +3.6 to +3.8 (see the Experimental section). For CyAsp and CyGlu, average charge states are similar (close to +3.5) under identical ESI conditions. Hence, the distribution of charge states of the Cy cages produced in the gas phase by ESI do not reflect either the number of acid/base groups on the side chain or the solution-phase charge state of CyLys and CyOrn (close to +5). These ESI characteristics are reminiscent of those of globular biomolecules for which it has been shown that the extent of ionization correlates well with the solvent-accessible surface area of the tridimensional structure,44–47 rather than the number of acid/base groups on the side chains. We hypothesize that the close zaverage values for Cy cages, regardless of the peripheral acid (CyGlu and CyAsp) or basic (CyLys, CyOrn and CyHis) groups, can be explained by the common cage topology across the Cy series that should result in similar solvent-accessible surface area.
For IMS-MS experiments, we employed an ESI travelling wave ion mobility (TWIM)-MS instrument with nitrogen as the buffer. Arrival time distributions (ATDs) for each protonated adduct were extracted from their corresponding ESI IMS mass spectra. Illustrative ATDs for [CyLys + nH]n+ (n = 2–4), [CyOrn + nH]n+ (n = 2–4) and [CyHis + nH]n+ (n = 2–4) are shown in Fig. 1.
Irrespective of the charge state, narrow and Gaussian-shaped ATD profiles were observed for the various m/z selected cage ions [Cy + nH]n+ (n = 2–4), confirming low conformational dispersion (see also Fig. S3†). Such a narrow conformational ensemble can be ascribed to the rigidity imparted by the DACH groups and the capping aromatic rings to the macrobicyclic structure. The TWCCSN2 values (TWCCSN2 refers to the determined CCS values using a TWIM-MS instrument and nitrogen as buffer gas)48 were determined by the calibration approach.49 There is a slight increase in TWCCSN2 (see Table 1) values as a function of charge state across the whole series that is associated with the charge repulsion effects together with the impossibility of such medium-sized and rigid cages to expand/fold to a large extent. For the CyLys/CyOrn and CyAsp/CyGlu pairs, the TWCCSN2 values increase simultaneously with the molecular weight in agreement with the six extra methylenes of the Lys (or Glu) pendant groups that increase the area colliding with the buffer gas giving corresponding larger CCS values.
Adduct/cages | CyOrn | CyLys | TWCCSN2a (Å2) | CyGlu | |
---|---|---|---|---|---|
CyHis | CyAsp | ||||
a Values obtained by calibrating the drift time scale of the TWIM device with standards of known DTCCSN2 (DTCCSN2 refers to the determined CCS values using a drift tube instrument and nitrogen as buffer gas) cross-sectional data from the literature. Samples were measured by triplicate, and standard deviations were below 0.5%. b The supramolecular [CyAsp + EYE + 3H]3+ and [CyGlu + EYE + 3H]3+ adducts were not observed experimentally; (n.d. means not detected). | |||||
[Cy + 3H]3+ | 414 | 467 | 437 | 411 | 415 |
[Cy + 2H]2+ | 401 | 448 | 395 | 377 | 388 |
[Cy + EYE + 3H]3+ | 480 | 515 | 506 | n.d.b | n.d.b |
The spray solvent, pH or counterion are known to induce distinctive conformation of macrocycles and cages, and IMS-MS is an excellent tool to uncover such conformational characteristics.35–37,40,50 For the investigated Cy cages, changing the solvent from water to CH3CN, CH3OH or water mixtures with both solvents produces ESI mass spectra in which charge envelopes shift towards lower charge states, yet the ATD of each identified protonated [Cy + nH]n+ adduct remains unchanged. The ESI mass spectra of formic acid-acidified (pH 4) aqueous solutions closely resemble those found in water, both in terms of the identity of the species as well as the narrow ATD for each protonated adduct.‡51 As pH of the solvent is raised, charge envelopes are slightly shifted towards lower charge states. Such a minor shift of charge states contrasts with the large charge state variation estimated in water solutions (see Fig. S1†) on going from acidic to basic pH. It is additional evidence of the small dependence on the solution-phase charge state of the Cy cages and the charge state observed upon ESI of aqueous solutions.52 Nonetheless, the ATD profiles for the various m/z selected cage ions [Cy + nH]n+ (n = 2, 3 and 4) do not vary, thus clearly evidencing that the inherent low conformational dispersion of [Cy + nH]n+ cations holds for a wide range of experimental conditions.
Entry | Cage | logβ | Selectivity | |
---|---|---|---|---|
a The weak interaction in this case leads to a less reliable fitting since in both cases CyAsp and CyGlu less than 60% of complexation is reached during the titration. b From ref. 14. | ||||
1 | CyOrn | 2.94(1) | 1150 ± 30b | 0.10 |
2 | CyLys | 3.26(2) | 550 ± 25b | 0.21 |
3 | CyHis | 3.94(1) | 115 ± 3 | 1.00 |
4 | CyAsp | 2.69(1) | 2040a ± 50 | 0.056 |
5 | CyGlu | 2.37(1) | 4270a ± 100 | 0.026 |
Table 2 shows that CyHis (entry 3) displays the strongest binding towards the guest with a Kd of 115 μM followed by CyLys and CyOrn. CyAsp and CyGlu show a much weaker binding with a Kd in the millimolar range. The corresponding selectivity, defined as the relative affinity towards the EYE substrate, is especially remarkable for CyHis in comparison with the other hosts. This trend implies that the Tyr inclusion within the cage is additionally modulated by the secondary side chain–side chain cage–EYE polar interactions. Intuitively, CyHis, CyLys and CyOrn show a stronger binding due to the basic R substituents, which can better interact with the anionic glutamate groups next to the Tyr in EYE, whereas the carboxylate residues of CyAsp and CyGlu would produce a repulsion effect reducing their affinity. The superior affinity of CyHis is associated with the unique characteristics of imidazole that can be engaged in a wide range of non-covalent interactions (ionic, H bonds as donor or acceptor, and through its π aromatic cloud) which play crucial structural roles in protein structure, interactions and function.53–59
The CyHis–EYE interaction was also studied by NMR in buffered D2O (deuterated Tris, pH 7). Despite the strong interaction determined by fluorescence titrations, modest chemical shift perturbations were observed in the 1H NMR signals. Such observed CSPs (see Fig. S9†) follow the trends previously observed in closely related host–guest systems.15 Attempts to observe host-to-guest intermolecular NOEs (or ROEs) were unsuccessful most likely due to unsuitable relaxation properties.60
Moved by the somehow limited structural information obtained by NMR, and the impossibility to grow single-crystals for X-ray diffraction studies, the EYE complexation was also investigated by MS methods. Increasing amounts of EYE, hereafter G (0.5, 1 and 4 equivalents), were added to 1 mM aqueous solutions of the host receptors. The resulting mixtures were diluted with water at different pH values to a final 10 μM concentration and directly analysed by ESI-MS. Two distinctive MS complexation behaviours are identified for hosts either with acid (CyAsp and CyGlu) or basic (CyOrn, CyLys and CyHis) pendant groups. For CyAsp and CyGlu, supramolecular 1:1 [CyGlu + G + 2H]2+ and [CyAsp + G + 2H]2+ adducts are barely detected in the positive ESI mass spectra. The negative ESI mass spectra did not show adducts assigned to supramolecular cage:EYE complexes. Supramolecular 1:1 adducts are identified as prominent peaks to a much larger extent for CyLys, CyOrn and CyHis as compared with the hosts with acidic pendant groups. Fig. 3 shows the positive ESI mass spectrum of CyLys and 1 equivalent of EYE recorded in water.
The supramolecular 1:1 [CyLys + G + 3H]3+ (m/z 607.1) adduct can be observed along with barely detected doubly-charged [CyLys + G + 2H]2+ after addition of 0.5 equivalent of G. These mass spectral peaks grow smoothly as the number of equivalents is increased. Apart from the deprotonated [Cy − H]− species, peaks corresponding to host:guest assemblies could not be observed in the negative ESI mass spectrum. The ESI mass spectrum recorded from acidified (pH 4) solutions closely resembles that obtained in aqueous solutions, both in terms of the identity of the detected supramolecular species, namely [CyLys + G + nH]n+ (n = 2 and 3) as well as their relative ratio. However, the ESI mass spectrum recorded at pH 9 shows a shift of the charge states of CyLys towards lower values accompanied by a significant reduction of the relative abundance of the 1:1 adducts with respect to the unbound cage. This result indicates a less favourable recognition of G under basic conditions and points to a pivotal role of cage protonation to attain EYE peptide recognition. Positive-mode ESI mass spectra of aqueous solutions of CyOrn and CyHis hosts with G are very much alike to those found for CyLys, resulting in mass spectra showing doubly and triply charged 1:1 [Cy + G + nH]n+ species (see Fig. S10 and S11†) and a closely related pH-dependent behaviour.
The intrinsic gas-phase stability of the 1:1 supramolecular complexes was investigated by CID experiments. Illustrative CID mass spectra upon mass-selection of [CyHis + G + 3H]3+ (m/z 625.7) are shown in Fig. 4a at different fragmentation conditions. Partial G dissociation is initially clear even at the lowest collision energy used under CID conditions. Each supramolecular adduct follows the same mechanism upon dissociation (dissociation by a charge separation mechanism into a singly charged guest [G + H]+ (m/z 481.2) and the doubly charged host [Cy + 2H]2+ according to eqn (1)). Dissociation efficiency curves (plots of percent dissociation versus centre-of-mass collision energy) were generated.61,62 The charge splitting dissociation depicted in eqn (1) is presumably facilitated by coulombic repulsion of the triply charged precursor ion together with the presence of mobile protons in the supramolecular complex.63,64 Hence, upon collisional activation, mobile protons may be transferred from the sites of higher gas-phase basicity from the cages to several carboxylate or backbone amides of the EYE guest to form energetically less favoured protonated species that eventually dissociate by disrupting non-covalent bonds. The involvement of mobile protons under CID conditions also implies that a population of precursor and product ions heterogeneous with respect to charge localization can be accessed. All attempts to identify other protomers in the precursor and product ions on the basis of CID-IM experiments were unsuccessful.
[Cy + G + 3H]3+ → [Cy + 2H]2+ + [G + H]+ | (1) |
Percent dissociation is calculated as the abundance of the precursor ion in proportion to the sum of the abundance of the host [Cy + 2H]2+, the guest [G + H]+ (m/z 481.2) and the product ion due to the successive fragmentation of the guest, namely [G + H-NH3]+ (m/z 464.2). The collision energy in the centre of mass required to dissociate 50% of the parent complex ions (referred to as CECOM50%) can be considered to compare the stability of the complexes.16 The dissociation efficiency curves for the [Cy + G + 3H]3+ (Cy = CyLys, CyOrn and CyHis) complexes are shown in Fig. 4b. Both CyOrn and CyLys supramolecular complexes are more readily dissociated than that containing CyHis (CECOM50% values close to 0.30 eV). Moreover, the absence of supramolecular adducts of the EYE tripeptide with CyAsp and CyGlu upon ESI-MS suggests a reasonable agreement with the trends observed in aqueous solutions in these particular cases. However, the complexes with CyLys and CyOrn display virtually identical CECOM50% values (close to 0.20 eV), despite the binding affinity towards the EYE guest in solution being favoured for CyLys (see Table 2). A plausible explanation for such higher affinity observed for CyLys can be ascribed to its higher basicity estimated in an aqueous solution (see Fig. S1†) whereas basicities for Lys and Orn are undistinguishable in the gas-phase.65
For the IMS-MS studies, we focused on the EYE complexation with CyOrn, CyLys and CyHis. The ATDs of the corresponding 1:1 adducts, namely [Cy + G + 3H]3+ display only a narrow and Gaussian-shaped ATD profile. Like their empty cage homologues, namely [Cy + 3H]3+ and [Cy + 3H]2+, they evidence a low conformation dispersion (see Fig. S10–S12†). Estimated TWCCSN2 values are 480 Å2, 506 Å2 and 515 Å2 for [CyOrn + G + 3H]3+, [CyHis + G + 3H]3+ and [CyLys + G + 3H]3+, respectively, which reflects a clear TWCCSN2 increase ranging from 50 to 70 Å2 upon EYE binding (see Table 1). The structural and conformational details that accounts for such increase are addressed in the next section on the basis of molecular modelling and comparison of experimental TWCCSN2 with those CCS calculated from the minimized structures.
Despite the soft desolvation and ion transfer conditions typically used in the present work, changes in the stability of the inter- and intramolecular interactions as a result of solvent removal cannot be excluded, even without separation of the host:guest partners. Solvent removal eliminates the competition of water for hydrogen bonds and reinforces electrostatic interactions in the host:guest supramolecular complex. The partial flexibility of the cages is reflected in the folded geometries of the triply charged ions of the hosts, stabilized by a network of intramolecular H-bonds between the amides/amines of the core and the amine or imidazole groups from the side chains. For the corresponding cage-EYE complexes, in both cases, the Tyr residue remains included within the cage cavity, stabilized by either π-stacking (CyHis) or π–cation (CyOrn) interactions. Besides, the phenolic OH of EYE is H-bound to the polar groups of the hosts. The side chains of the hosts additionally interact with the anionic Glu residues and the backbone of EYE, through salt bridges and polar H-bonds. In this case, the number of non-covalent interactions identified in the complex with CyHis is sensibly higher than for CyOrn (Fig. 5).
The CyOrn and CyHis cages suffer a remarkable distortion upon EYE binding as it was anticipated from the TWCCSN2 enhancement upon EYE inclusion. Specifically, the binding pocket defined by the two capping phenyl rings and the three DACH groups is expanded on going from [Cy + 3H]3+ to [Cy + EYE + 3H]3+. Such distortions are illustrated in Fig. 6 by the overlapping CyOrn and CyHis cages both in their respective [Cy + 3H]3+ and the 1:1 [Cy + EYE + 3H]3+ trications (aligning one of the capping aromatic rings in each case). It clearly shows how the binding pocket in both cages is expanded on going from the free cage (orange structures) to the complex (purple structures) species. Such distortion can be gauged by considering the size of the triangle defined by the centroid of the three DACH groups as well as the distance of the centroids of the two capping phenyl groups. If we consider the triangle defined by the centroids of the DACH moieties, the side lengths vary from 9.1–15.2–16.0 Å to 11.4–15.2–15.7 Å for [CyOrn + 3H]3+ to [CyOrn + EYE + 3H]3+ whereas this triangle size remains virtually unchanged for the [CyHis + 3H]3+ and [CyHis + EYE + 3H]3+ pair. However, the distance between the centroids of the capping phenyl groups moves from 5.6 and 7.1 Å in [CyOrn + 3H]3+ and [CyHis + 3H]3+ to 7.0 and 9.1 Å in the 1:1 supramolecular adducts [CyOrn + EYE + 3H]3+ and [CyHis + EYE + 3H]3+.
The suitability of the structural models considered was checked against the experimental TWCCSN2 values using two computational CCS methods, namely projection superposition approximation (PSA) and trajectory methods (TM). The percentage differences between experimental and theoretical values using different methods are shown in Fig. 7. Predicted CCS values using the PSA method, parameterized for nitrogen drift gas, are within good accuracy with the experimental values. The PSA method accounts for effects of size and shape of the ions and has proved to yield accurate ion mobility cross sections for complex peptide systems with great computational efficiency.67–70 Deviations lower than 4% for the [CyOrn + 3H]3+/[CyOrn + G + 3H]3+ pair and 3% for [CyHis + 3H]3+/[CyHis + G + 3H]3+ are estimated compared to experimentally obtained TWCCSN2 values. The TM constitutes the most physically rigorous approach to estimate CCS as it considers the potential between the drift gas and the polyatomic ion. Like the PSA method, the agreement between the calculated CCS via TM approximation using IMos and the experiment is notable (within 4% and 7% for the CyOrn and CyHis members, respectively).71 Overall, the very good agreement between calculated and experimental values suggests that the considered models represent a good approximation to the intrinsically complex molecular recognition event.
For collision induced dissociation (CID) experiments, the complete isotopic envelope of the 1:1 host:guest adducts of interest was mass-selected with the first quadrupole and interacted with argon in the T-wave collision cell while analyzing the ionic fragments with the TOF analyzer. The collision energy (CElab) was systematically stepped up in the CElab = 1–17 eV range. Relative abundance of the precursor ion was calculated as Ip/(Ip + ∑Ifrag), where Ip is the peak intensity of the precursor ion, and ∑Ifrag is the sum of the peak intensities corresponding to all fragments. This ratio is used to monitor the internal energy added to the complex ions to dissociate. The maximum energy that ions can acquire in the collision cell is the center-of-mass collision energy (CECoM) of each collision summed overall. Laboratory frame collision energies (CElab) are converted to center-of-mass energies using the equation CECoM = z × CElab × (m/m + M) where z is the charge state of the precursor ion and m and M are the mass of the target gas and of the complex ion, respectively. The guest ([EYE + H]+) and host [Cy + 2H]2+ product ions and the precursor [Cy + EYE + 3H]3+ ion were thus monitored. Their relative abundances from CID experiments were determined from mass spectra averaged over 120 scans. There was little variation (max. 3%) in the relative product ion abundances from three consecutive CID mass spectra.
A series of MS methods was used to characterize the supramolecular G:cage complexes and prove their intrinsic gas-phase stability. The negligible guest complex formation with CyAsp and CyGlu shown by single-stage ESI-MS suggests weaker guest binding than that observed for the CyHis, CyOrn and CyLys cages in agreement with the solution affinities determined by fluorescence spectroscopy. Thus, as in solutions, the selectivity in G recognition is mainly due to polar (ionic or H-bonding) interactions with the basic pendant groups, which are retained and even strengthened in the gas phase. This observation, together with the reduced binding abilities upon pH raising points to a crucial role of cage protonation in the binding event. However, the correlation between gas-phase dissociation (CECOM50%) and solution dissociation constants (Kd) is not perfect for CyHis, CyLys and CyOrn. CyHis features the tightest binding in both media. CyLys and CyOrn display virtually identical gas-phase dissociation characteristics whereas in solution the Kd value for CyLys is ca. half of that for CyOrn. Although soft ionization MS methods can be used to study a wide spectrum of host:guest systems based on biomolecules, the extent to which the technique is useful for studying binding affinities depends critically on the inherent nature of the noncovalent interactions involved in the host:guest complex. For the investigated Cy:G complexes, where electrostatic and a complex network of hydrogen bonding has been shown to play a predominant role in noncovalent complexation, the energetics of gas-phase binding more closely reflect aqueous solution behaviour. Thus, we envision that MS methods may be used as a high-throughput assay for Cy-guest complexation or in screening peptide libraries.
The use of novel analytical approaches such as IMS-MS was also introduced to obtain structural and conformational insights on the empty cages and their corresponding complexes with the EYE tripeptide. These pseudopeptidic hosts display a series of favourable characteristics, such as robustness, ESI-amenability and most importantly shape-adaptability that prompt us to undertake its study by ESI-IMS-MS aimed to link structures to unique functions and potentially add significant value to the field. In the present work, IMS-MS analysis has provided the first physical analysis related to the dimensions of the Cy cages in the gas phase, allowing us to monitor the change in the CCS in response to the scaffold alteration upon binding. Moreover, the inner cavity of the Cy cages is flexible enough to adjust its shape to accommodate the EYE guest according to a mechanism reminiscent of the induced-fit enzyme model formulated by Koshland77 in which allosteric regulation through conformational changes upon binding attains a tight binding between a molecular host (both synthetic or natural) and a guest.78 Hence, it is demonstrated that combining rigid and flexible moieties in the Cy cages circumvents the characteristic lower binding affinities attained by synthetic receptors that display strong rigidity (typically lower peptide affinity because the impossibility to adjust its shape to match its target) or large flexibility (that often display low pre-organization). These IMS-MS results provide a basis for refining the conformational and structural characteristics of the Cy cages and their supramolecular complexes otherwise inaccessible by other analytical techniques.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2an01091g |
‡ Because of the inherent electrolytic nature of the ESI process, aqueous solution pH may be decreased in the positive scan mode as a result of the electrolytic oxidation of water (see ref. 52). |
This journal is © The Royal Society of Chemistry 2022 |