Shohei
Tashiro
*a,
Kosuke
Nakata
a,
Ryunosuke
Hayashi
a and
Mitsuhiko
Shionoya
*ab
aDepartment of Chemistry, Graduate School of Science, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan. E-mail: tashiro@chem.s.u-tokyo.ac.jp
bResearch Institute for Science and Technology, Tokyo University of Science, 2641 Yamazaki, Noda, Chiba 278-8510, Japan. E-mail: shionoya@rs.tus.ac.jp
First published on 2nd June 2025
The arrangement of amino acids within crystalline porous materials represents a unique approach to enhance their functionalities such as catalysis, separation and sensing. In particular, the simultaneous arrangement of distinct residues in crystalline frameworks, i.e., shape sorting, remains one of the most important challenges. Here, we demonstrate the shape sorting of two distinct amino acid residues, tryptophan and serine, within porous metal-macrocycle framework-1 via precise molecular recognition at multiple binding sites on the pore surface. Single-crystal X-ray diffraction analysis showed that the indole ring of an N-protected tryptophan molecule was effectively encapsulated within a binding pocket located at the bottom corners of the nanochannel, forming multipoint hydrogen bonds and CH–π interactions. In addition to tryptophan, N-protected serine was adsorbed onto the ceiling sites via multipoint hydrogen bonds. Moreover, modifying their protecting groups allowed us to control the relative positions of the two residues in the nanochannel. We further tested the co-adsorption of both residues in selective separation experiments. The results suggest that designing porous crystals with multiple binding sites is an effective strategy for precise heteroleptic arrangement of amino acid residues, resulting in enhanced functionalization of porous materials.
Amino acids, with more than 20 distinct residues, provide another attractive category as guest species to arrange within porous crystals because their arranged structures exhibit several unique functions that mimic enzymatic behavior. Conventional methods for arranging amino acids within porous crystals typically involve post-synthetic modification of the pore surfaces through the formation of covalent bonds, and these approaches have been successfully applied to catalysis, separation and storage.22–26 More recently, amino acids have also been arranged within porous frameworks via non-covalent or coordination bonds in addition to covalent arrangements. For example, Li et al. demonstrated the non-covalent arrangement of tryptophan at well-defined binding sites within MOFs, and their structures were determined by single-crystal X-ray diffraction (ScXRD).27 Furthermore, Liu et al. achieved post-synthetic modification of MOFs with two distinct amino acids, primarily via coordination bonds. The resulting supramolecular structure was characterized using spectroscopic and theoretical methods and then applied to a unique sensing system.28 However, the precise arrangement of multiple amino acid residues remains a major challenge (Fig. 1a) due to the difficulty in determining the crystal structure, even though it is essential for further functionalization of the porous materials.
Here, we present the shape sorting of two distinct amino acid residues within a crystalline nanochannel and the crystal structure determination of their co-located structures. Our group previously developed porous metal-macrocycle frameworks (MMFs) via the self-assembly of helically-twisted trinuclear PdII macrocycles.29 Among the MMF series, MMF-1 is composed of four stereoisomers of PdII macrocycles and features nanochannels with multiple binding sites on the enantiomeric (P)- and (M)-pore surfaces (Fig. 1b).30 Shape sorting of multiple aromatic compounds has been demonstrated in previous studies,30–32 and more recently, site-selective arrangement of the amino acid residue, serine, has also been demonstrated.33 In this study, based on these previous findings for MMF-1, the simultaneous arrangement of serine and tryptophan residues on the pore surfaces was confirmed by ScXRD analysis (Fig. 1). These residues were effectively recognized at their complementary binding sites via multipoint non-covalent interactions, and the relative positions of the two residues in the nanochannel were controlled by altering the N-terminal protecting groups. Previously, pairwise recognition of two amino acids has been achieved in solution using discrete host molecules,34 which has also been applied to control protein–protein interactions.35,36 Recently, heteroleptic pairwise recognition of distinct residues has finally been accomplished within the cavity of cucurbit[8]uril.37 However, to the best of our knowledge, simultaneous recognition of two amino acids in crystalline hosts, specifically heteroleptic pairwise recognition and its crystal structure determination, has not been realized.
To understand the binding mode specific to the tryptophan residue, the binding structure was further analyzed by focusing on the non-covalent interactions between the guests, the framework and solvent molecules. For example, the N–H group of the indole ring formed hydrogen bonds with one of the three PdCl2 moieties of the syn-isomers exposed on the pore surfaces. Furthermore, the electron-rich π-surface of the indole ring interacted with multiple C–H moieties of the framework and with acetonitrile molecules co-adsorbed in the pockets, forming CH–π interactions. Moreover, the water molecules trapped on the pore surface formed weak hydrogen bonds with the Ar–H moieties of the indole ring (Fig. 2c). These efficient multipoint non-covalent interactions may provide specific molecular recognition for the tryptophan residue in the corner pockets, while the solvent molecules may play a cooperative role in stabilizing the binding of the indole ring.
Notably, the occupancies of the guests in the enantiomerically paired pockets are significantly different (93.0% and 55.0% for the (M)- and (P)-pockets, respectively), which was also supported by the Flack parameter (0.16(4)). This difference may be due to the slightly different binding modes of the indole rings at the (M)- and (P)-sites. For example, the indole ring in the (M)-pocket was positioned close to the pore surface and formed effective CH–π interactions with the framework (Fig. 2d). The different binding modes at the (M)- and (P)-sites were verified by their distinct fingerprint plots from the Hirshfeld surface analysis (Fig. 2e).38 This analysis also suggested that weak non-covalent interactions, such as CH–π and van der Waals forces, act cooperatively in the molecular recognition. Furthermore, the main-chain structure of the amino acid was partially observed in the (M)-site but not in the (P)-site due to disorder. These differences are likely the result of steric repulsion around the chiral carbon center, suggesting that the enantiomeric binding pockets can recognize the chiral structure of tryptophan even when the chiral carbon is positioned far from the indole ring.
The uptake of Z-L-tryptophan into MMF-1 was further confirmed by 1H nuclear magnetic resonance (NMR) spectroscopy of the guest@MMF-1 crystals after dissolution in DMSO-d6/DCl (Fig. S19†). The 1H NMR spectra showed that approximately three molecules of Z-L-tryptophan were encapsulated into the nanochannel per unit space. Similar experiments were conducted with Z-L-phenylalanine or Z-L-tyrosine, and these non-adsorbed amino acid derivatives were also encapsulated within the channel (approximately 3 molecules per unit space) (Fig. S20 and S21†). These results indicate that the uptake of amino acids was comparable among these residues, but the adsorption efficiency to the pore surface, especially the corner pockets, varied significantly due to the different interaction modes with the pore surface.
Based on this finding, we next investigated the shape-sorting of tryptophan and serine in a single nanochannel. MMF-1 crystals were soaked in a mixed solution containing Z-L-tryptophan (950 mM) and Z-L-serine (300 mM) in acetonitrile at room temperature for 5 days and then analyzed by ScXRD at −180 °C. As a result, both serine and tryptophan residues were concurrently observed at the ceiling and bottom corner sites, respectively (Fig. 3). Remarkably, the binding modes of both residues were nearly identical to those observed in the single-guest experiments. For example, the indole rings were identified at the bottom corner pockets via hydrogen bonds and CH–π interactions, while the main-chain structure was observed only on the (M)-side, similar to the single bonding case (Fig. 3b). Hirshfeld surface analysis also verified the binding modes similar to those observed in the single-guest experiments (Fig. S9–S11†). Moreover, the arrangement of serine residues was successfully observed at the ceiling via multipoint hydrogen bonds, which is similar to our previous study that showed a single binding of Z-L-serine at the ceiling (Fig. 3b).33
However, closer inspection of the co-adsorbed structure revealed that the Z-L-tryptophan on the (M)-side was substitutionally disordered for steric reasons with Z-L-serine (Fig. S12†), preventing them from coexisting in the same unit space (Fig. 3c). The occupancies of tryptophan and serine on the (M)-side (50.4% and 49.6%, respectively), optimized to prevent coexistence, indicated that all the (M)-sides were equally occupied by tryptophan and serine. Therefore, to co-adsorb two different amino acids in the same unit space while avoiding steric interference, we used another serine derivative with a different N-protecting group, Boc-L-serine, because Boc-protected serine is known to occupy only one site on the ceiling, whereas Z-protected serine occupies both the enantiomeric (M)- and (P)-sites on the ceiling.33
To test this hypothesis, MMF-1 crystals were soaked in a mixed solution containing Z-L-tryptophan and Boc-L-serine in acetonitrile at room temperature for 5 days and then analyzed by ScXRD at −180 °C. As expected, both amino acids were co-adsorbed on the pore surface (Fig. 4a), specifically, the indole rings of the tryptophan residues were observed in the both corner pockets via hydrogen bonds and CH–π interactions (Fig. 4b). As observed in single-guest experiments, their binding modes in both enantio-paired pockets were slightly different from each other (Fig. S18†), which was also verified by Hirshfeld surface analysis (Fig. 4d). Notably, the molecular structure of Z-L-tryptophan at the (M)-site was fully assigned without loss of structure due to disorder, in contrast to the partial structure observed in the single-guest experiments. In addition to tryptophan residues, Boc-L-serine was also observed at the ceiling. Essentially, similar to our previous study,33 only the (P)-ceiling part was diastereoselectively occupied by the L-serine residue via multipoint hydrogen bonds.
Therefore, we investigated the possibility that both residues could exist simultaneously in the same unit space by carefully examining the obtained structure. The steric collision between both residues was evaluated on the (M)-side, and the closest interatomic distance between them was 2.96 Å (Fig. 4c), indicating that no substitutional disorder occurred between them, unlike in the combination of Z-L-tryptophan and Z-L-serine. Moreover, the occupancies of tryptophan and serine residues were optimized to 68.1% and 59.6%, respectively, further demonstrating that both amino acids could exist in the same unit space. This cooperative arrangement is due to the fact that Boc-L-serine diastereoselectively adsorbs only at the (P)-site,33 creating enough space for Z-L-tryptophan to adsorb at the (M)-site. These results demonstrated that by appropriately selecting amino acids, different amino acid residues can be precisely arranged as predicted beforehand. In addition, the C-terminal carboxy group of each residue is asymmetrically located but relatively close to each other, making such a structure potentially useful for a variety of applications such as asymmetric catalysis and chiral separations.
Finally, the simultaneous binding of tryptophan and serine residues was tested in a preliminary selective separation experiment involving multiple amino acids. For example, MMF-1 crystals were soaked in a CD3CN solution containing equal concentrations of six different Z-protected residues (Z-L-serine, Z-L-tryptophan, Z-L-phenylalanine, Z-L-leucine, Z-L-valine and Z-L-alanine) at 20 °C for 1 day. The crystals were collected by filtration and then re-soaked in CD3CN for one more day to extract the encapsulated residues from the crystals (Fig. 5a). When the extract was analyzed by 1H NMR spectroscopy, it was observed that both serine and tryptophan were preferentially adsorbed to MMF-1, likely due to their simultaneous binding to their respective recognition sites, as discussed earlier (Fig. 5b). Therefore, this result supports the effective recognition of the two residues observed by ScXRD. Since the amount of amino acid uptake estimated by 1H NMR analysis is similar whether observed by ScXRD or not, the selective adsorption is likely due to the inhibition of the uptake of other residues by the two residues that preferentially occupy the nanospace of MMF-1.
![]() | ||
Fig. 5 Selective separation experiment of Z-protected amino acids. (a) Schematic of the experiment. (b) Adsorption ratios of the six residues in MMF-1 estimated by 1H NMR analysis of the extract. |
Footnote |
† Electronic supplementary information (ESI) available: Spectroscopic analysis, preparation and characterization of the crystals, and crystallographic data could be found in the ESI. CCDC 2255578–2255580, 2453192. For ESI and crystallographic data in CIF or other electronic format see DOI: https://doi.org/10.1039/d5sc00795j |
This journal is © The Royal Society of Chemistry 2025 |