Jake H.
Schwab
,
Jake B.
Bailey
,
Milan
Gembicky
and
Julia M.
Stauber
*
Department of Chemistry and Biochemistry, University of California, 9500 Gilman Dr, La Jolla, San Diego, CA, USA. E-mail: jstauber@ucsd.edu
First published on 20th December 2022
Multivalency plays a key role in achieving strong, yet reversible interactions in nature, and provides critical chemical organization in biological recognition processes. Chemists have taken an interest in designing multivalent synthetic assemblies to both better understand the underlying principles governing these interactions, and to build chemical tools that either enhance or prevent such recognition events from occurring in biology. Rationally tailoring synthetic strategies to achieve the high level of chemical control and tunability required to mimic these interactions, however, is challenging. Here, we introduce a systematic and modular synthetic approach to the design of well-defined molecular multivalent protein-binding constructs that allows for control over size, morphology, and valency. A series of supramolecular mono-, bi-, and tetrametallic Fe(II) complexes featuring a precise display of peripheral saccharides was prepared through coordination-driven self-assembly from simple building blocks. The molecular assemblies are fully characterized, and we present the structural determination of one complex in the series. The mannose and maltose-appended assemblies display strong multivalent binding to model lectin, Concanavalin A (Kd values in μM), where the strength of the binding is a direct consequence of the number of saccharide units decorating the molecular periphery. This versatile synthetic strategy provides chemical control while offering an easily accessible approach to examine important design principles governing structure–function relationships germane to biological recognition and binding properties.
Fig. 1 Multivalent glycoassemblies including nanoparticles, polymers and dendrimers, and this work, which introduces a new class of glycosylated supramolecular multivalent complexes. |
The use of coordination-based supramolecular systems offers a platform to explore new classes of structurally complex, yet well-defined multivalent assemblies that are amenable to architectural tuning. Coordination-driven self-assembly has provided a foundation for the development of nano-scale supramolecular systems with wide ranging applications22 including but not limited to catalysis,23 molecular sensing,24 nanomedicine,25,26 and fundamental investigations.27,28 Work pioneered by Fujita,29,30 Stang,31,32 Raymond,33 and others34,35 has led to the development of a variety of three-dimensional scaffolds by providing a rational and versatile approach to the design of discrete frameworks with predictable directionality and structural morphology.36 Through judicious choice of organic subcomponents and metal ions, chemists have and continue to develop synthetic strategies that enable the preparation of complex supramolecular assemblies while maintaining precise control over their spatial, topological, stability, and solubility properties.
Here, we introduce a new approach to the synthesis of structurally well-defined multivalent glycosylated architectures through coordination-driven self-assembly. Employing this versatile and modular strategy, we have prepared a series of programmable and robust saccharide-grafted supramolecular Fe(II) complexes from simple organic building blocks. Mono, di-, and tetrametallic Fe(II) assemblies (Fig. 1) that feature distinct spatial arrangements of three, six and twelve appended carbohydrates, respectively, have been synthesized and spectroscopically characterized. We also report the single-crystal X-ray structure of one complex, which enabled us to gain a key understanding of the molecular structure on the atomic level. We investigated the binding of these hybrid assemblies to plant-based protein, Concanavalin A, and showcase how this strategy offers chemical tools for systematic and rational control over molecular recognition properties. This series has the potential to provide insight into how modular tuning of the valency and morphology of hierarchical constructs affects their binding behaviour, while offering a rational approach to building systems of high affinity and specificity.
With these considerations in mind, we initially selected benzidinedisulfonate as a polar, charged, ditopic amine subcomponent that has been shown to form tetrahedral assemblies in aqueous solution.47–49 Accordingly, treatment of glucose-substituted subcomponent A with commercial benzidinedisulfonic acid and Fe(BF4)2·6H2O in the presence of [NMe4][OH] in water (Scheme 1) resulted in the formation of one major species as assayed by 1H NMR spectroscopy. Notably, these reaction conditions were optimized to include an excess of hydroxide ion, which was used to both deprotonate benzidinedisulfonic acid, and to achieve de-O-acetylation of acetate-protected subcomponent Ain situ. The one-pot self-assembly and ester hydrolysis of A conveniently and cleanly enabled rapid access to the desired deprotected product and simplified the synthetic procedure by eliminating an independent de-O-acetylation step. After precipitation from the crude reaction mixture, the product was easily purified away from the low molecular weight by-products by size-exclusion chromatography.
The [NMe4]4[1-glc] product contains Fe(II) in the low-spin, S = 0, state as indicated by resonances exclusively located in the diamagnetic region of the 1H NMR spectrum (Fig. 2a). The intense magenta colour (λ = 505 nm [ε 20000 M−1 cm−1], 545 nm [ε 24000 M−1 cm−1], Fig. 2b) of [NMe4]4[1-glc] is also ascribed to strong metal-to-ligand charge transfer excitations characteristic of low-spin Fe(II) in an iminopyridine coordination environment.47,50,51 Only one set of aromatic signals were observed by 1H and 13C NMR spectroscopy, which is in good agreement with the isolation of a single, highly symmetric species in solution. The 1H NMR aromatic resonances of [NMe4]4[1-glc] are slightly broadened (Δν = 7–18 Hz) when compared with the signals of the A precursor, which is attributed to slow molecular tumbling of the rigid cage core in solution, and typical of large molecular-weight molecules (MW of [1-glc]4– = 6009 g mol−1).18,52 Imine bond formation during self-assembly was supported by the disappearance of the aldehyde 1H NMR signal of subcomponent A (Fig. 2a, δ 9.95 ppm, CDCl3), and the appearance of a new up-field shifted signal attributed to the imine proton located at δ 9.09 ppm in the 1H NMR spectrum of the product (Fig. 2a, D2O). Infrared spectroscopy also confirmed the formation of an imine bond and coordination of the nitrogen donor atom to the Fe centre as evidenced by the absence of an aldehyde vibration in the IR spectrum of [NMe4]4[1-glc], and the appearance of a strong vibration ascribed to the imine CN stretch (νCN 1558 cm−1, ESI Fig. S33†).53,54 Formation of the twelve-fold glucose-substituted assembly was unequivocally confirmed by high-resolution electrospray mass spectrometry (HR-ESI-MS(–)) by the presence of the tetraanionic molecular ion peak (1502.2307 m/z, Fig. 2c), which established both the charge and the intact Fe4L6 metal–ligand stoichiometry of [1-glc]4–.
Fig. 2 (a) 1H NMR spectra of subcomponent A (CDCl3 (*), 25 °C, 400 MHz), [NMe4]4[1-glc], [NMe4]2[2-glc], and [3-glc][SO4] (D2O, 25 °C, 400 MHz); (b) UV-vis spectra of [NMe4]4[1-glc], [NMe4]2[2-glc], and [3-glc][SO4] (50 μM, H2O); (c) HR-ESI-MS of [1-glc]4−, [2-glc]2−, and [3-glc]2+; (d) solid-state molecular structure of [2-glc]2− (Fe, orange; N, blue; O, red; S, yellow; C, grey; H, white). |
With a protocol for the clean synthesis of [NMe4]4[1-glc] in place, we next conducted an analogous self-assembly procedure employing either NaOH or KOH as the hydroxide source to prepare the sodium and potassium salts of [1-glc]4–. The facile preparation of Na4[1-glc] and K4[1-glc] (ESI Section S2.3.2 and S2.3.3†) showcases the versatility of this methodology and the ease of which salts of various countercations can be prepared. Ready access to alkali metal salts of [1-glc]4– also provides an excellent opportunity to study this system and others with higher potential biological compatibility when compared with the [NMe4]+ analogues.
Previous reports have demonstrated that the reaction between 2-formylpyridine and 4,4′-diaminodiphenyl ether-based subcomponents in the presence of divalent transition metal ions results in the formation of bimetallic triple-stranded helicates as opposed to tetrahedral assemblies on account of the enhanced flexibility in the ligand backbone.56,57 With this precedent, we employed a protocol analogous to the one described for the synthesis of [NMe4]4[1-glc] to prepare a bimetallic glucose-functionalized helicate. Treatment of 4,4′-diaminodiphenyl ether-2,2′-disulfonic acid with A in the presence of Fe(BF4)2·6H2O and [NMe4][OH]·5H2O resulted in clean conversion to the triple-stranded helix, [NMe4]2[2-glc], as judged by 1H NMR spectroscopic characterization of the crude product. Complex [NMe4]2[2-glc] was purified via size-exclusion chromatography, and the purified product was characterized by 1H (Fig. 2a) and 13C NMR spectroscopy (ESI, Fig. S37†). NMR characterization revealed two sets of resonances that are likely due to the presence of the right-handed, ΔΔ, and left-handed, ΛΛ, configurations (one set of resonances),58 and the presence of the ΛΔ isomer (second set of resonances), which contains two different configurations at the metal centres (Fig. 2a). Indeed, mixtures of ΔΔ, ΛΛ, and ΛΔ configurations are reported for other dinuclear M2L3 helicates in the literature.59,60 The presence of both ΔΔ and ΛΛ configurations in this product was confirmed by an X-ray diffraction study conducted on a single crystal of the salt (vide infra and ESI Section S7†). Complex [NMe4]2[2-glc] was also characterized by UV-vis (Fig. 2b) and infrared (ESI Fig. S41†) spectroscopy, with spectroscopic signatures consistent with those observed for the tetrametallic [NMe4]4[1-glc] analogue. The identity of [2-glc]2– was confirmed as the sixfold-glucose substituted bimetallic complex by HR-ESI-MS(−) (Fig. 2c).
To establish the structures of this new class of glycosylated molecules, we focused efforts on growing single crystals of the isolated assemblies for X-ray diffraction analysis. Crystals of [NMe4]2[2-glc] of sufficient quality for structural determination were grown by vapor diffusion of acetone into a saturated aqueous solution of the complex at 25 °C over the course of 48 h. An X-ray diffraction study of these crystals permitted structural determination of the [2-glc]2− anion, and the solid-state structure is displayed in Fig. 2d. As shown in the molecular structure, the framework of [2-glc]2− spans a 3 nm distance from end-to-end, in which each end of the helix is capped by three glucose fragments. The solid-state structure also displays all six sulfonate groups directed towards the exterior of the helicate, which enforces the C3 symmetry of the molecular core that is reflected in the high symmetry observed by NMR spectroscopy. Such crystallographic characterization is critical to this work by enabling a more complete understanding of the molecular structure of this class of assemblies.
We next expanded the series of glycosylated assemblies to include a monometallic Fe(II) complex that would enforce a trivalent saccharide framework. Treatment of tris(2-aminoethyl)amine with subcomponent A in the presence of stoichiometric amounts of [NBu4][OH] and FeSO4 in H2O at 70 °C resulted in the formation of mononuclear, [3-glc][SO4] (Scheme 1), as evidenced by 1H NMR spectroscopic analysis of the product. This self-assembly reaction is straightforward and proceeds cleanly to generate C3-symmetric [3-glc]2+ as supported by the high symmetry observed in the 1H and 13C NMR spectra of the product. Monometallic [3-glc][SO4] was also characterized by UV-vis and infrared spectroscopy, and the identity of this product as the triply-substituted glucose complex was confirmed by HR-ESI-MS(+) (Fig. 2c).
With a successful self-assembly procedure established, we extended this synthetic approach to the preparation of analogous glycosylated complexes employing subcomponents B (α-D-mannose), C (β-D-galactose), and D (β-D-maltose) as shown in Scheme 1. The mannose-, galactose- and maltose-substituted tetra-, bi-, and monometallic analogues were prepared following similar synthetic procedures described for the glucose-derivatives. This versatile synthetic strategy allowed us to easily and rapidly establish a molecular library of well-defined saccharide-appended assemblies that includes twelve unique examples. All complexes were characterized by NMR spectroscopy (1H and 13C), IR and UV-vis spectroscopy, and the identities of all products were confirmed by HR-ESI-MS (ESI Section 2.3†).
Con A and multivalent binders have been shown to form noncovalent, cross-linked aggregates that can precipitate out of solution depending on the strength of the binding interaction.15,18,66,67 Since Con A binds both terminal mannose and maltose groups, we evaluated the binding between the series of mannose and maltose-tethered systems with Con A through assessing the lectin–ligand aggregation by turbidimetric analysis. When dodeca- ([NMe4]4[1-man], [NMe4]4[1-mal]) and hexa-substituted ([NMe4]2[2-man], [NMe4]2[2-mal]) complexes were mixed with Con A at pH 7.0, the solutions became opaque within seconds of mixing, indicating the formation of colloidal lectin–ligand aggregates and direct evidence of protein binding. The turbidity of the mixtures was monitored by measuring the increase in absorbance at λ 545 nm over the course of 180 min. In the case of dodeca-, and hexa-substituted complexes, the absorbance reached a plateau after ca. 20 min, and remained constant until the end of the measurement (Fig. 3), suggesting these assemblies were able to cross-link and precipitate nearly all of the Con A in solution. In contrast, negligible changes in absorption were observed upon addition of trivalent [3-man][SO4] and [3-mal][SO4] to solutions of Con A. The lack of aggregate formation for the trivalent derivatives is attributed to lower binding affinity of [3-man]2+ and [3-mal]2+ with Con A, and indicative that tri-substituted complexes do not have suitable valency or peripheral substitution to crosslink lectin binding sites. This result is consistent with the report from Toone et al.,65 in which agglutination assays demonstrated no multivalency effect for bi- and trivalent dendrimers with Con A, whereas glycomolecules of higher valency showed a considerably enhanced cluster glycoside effect.3,68
After 180 min, an excess (1000 equiv.) of methyl-α-D-mannose was added to each mixture (shown in Fig. 3 with arrow), which resulted in immediate deaggregation (deagglutination) of the protein clusters. This result indicates that methyl-α-D-mannose can serve as a competitive inhibitor and that the multivalent crosslinking is reversible. Aggregate formation was not observed in the case of experiments conducted with complexes bearing β-glucose and β-galactose groups, which is consistent with the lack of binding between these assemblies and Con A (vide infra), and the known selectivity of the protein for α-mannose and α-glucose.3
Thermodynamic parameters associated with the binding of maltose and mannose-grafted complexes to Con A were next determined via isothermal titration calorimetry (ITC). It is well-known that the precipitation of cross-linked aggregates during ITC experiments has a significant negative effect on the reliability of thermodynamic data.64,69 Therefore, in order to minimize the potential of precipitation during measurements, all experiments in this study were performed at low protein concentration, low salt concentrations (NaCl, CaCl2, MnCl2), and in acidic conditions (pH 4.8) with Con A in its predominantly dimeric form.64
Calorimetric data for the titration of dimeric Con A with [NMe4]4[1-man] are shown in Fig. 4a, which display a monotonic decrease in the exothermic heat of binding with each addition of analyte. The titration was conducted a total of three times, and consistent results were obtained throughout the measurements (ESI Section S6.1†). The dissociation constant, Kd, for the interaction between [NMe4]4[1-man] and Con A was determined to be 2.4 μM, which corresponds to a 350-fold increase in affinity when compared with the Kd value determined for the monovalent binding epitope, D-mannose (Kd = 0.83 × 10−3 M, ESI Section S6.7†).70 The significant difference in Kd values reveals that the multivalent presentation of mannose groups on [1-man]4– results in a considerably stronger binding profile per saccharide equivalent. These findings are consistent with data reported for other systems that exhibit affinity enhancements of up to three to four orders of magnitude relative to the binding of monovalent D-mannose due to multivalency effects.15,71–74
We next investigated the impact of saccharide density on binding strength by determining the dissociation constants for the hexavalent, [2-man]2–, and trivalent, [3-man]2+, analogues. A Kd value of 12.7 μM was determined for the binding of [2-man]2– with Con A (Fig. 4b) under similar experimental conditions used for measurements with [1-man]4–. These data display that dodecavalent [1-man]4– exhibits an over five-fold enhancement in binding to Con A when compared with the affinity of [2-man]2–, which bears half the number of mannose units. We next evaluated the binding of the trivalent derivative, [3-man]2+, with Con A. Measurements indicate an over 30-fold decrease in binding strength of [3-man]2+ (Kd = 80.3 μM, Fig. 4c) when compared with the binding of [1-man]4–. The Kd values determined from ITC experiments demonstrate that the multivalent binding event strength increases exponentially as the valency associated with the molecular architecture increases, which is a result consistent with the cluster glycoside effect.3,5,68 These experiments establish the markedly enhanced binding avidity, which is the overall cooperative binding force,75 that results from the multivalent presentation of saccharides, and highlight the relationship between molecular architecture and affinity. The observed trend in Kd values of [1-man]4– < [2-man]2– < [3-man]2+ is governed by rational, systematic, and programmable molecular design and provides a valuable opportunity to expand these concepts to higher multivalent binding events through the development of even more effective constructs.
We note that the overall charge of each complex in this series varies from 4− to 2+, and that there is the potential for charge to play a role in the binding strength of each system with Con A. A benefit of the current synthetic approach is that the interchangeable nature of organic subcomponents provides an excellent opportunity to systematically modify only the charge of a molecule without significantly affecting the size, shape, or valency by using neutral ditopic amines in the case of 1, and 2, and a charged tritopic amine for 3. The influence of charge on binding strength is a question that is currently under investigation using the presented synthetic approach.
The ITC measurements also allowed us to calculate the stoichiometry of binding, n, which is defined as the number of analyte molecules per protein binding site. For example, the n value for the binding of methyl-α-D-mannose and trimannose to Con A are both 1.0,64 indicating one mannose epitope binds to a single Con A binding site.76 However, n values less than 1.0 are typical for the binding of multivalent carbohydrates to Con A.63,72 ITC-derived values of n that are less than 1.0 suggest crosslinking between multivalent proteins and multivalent analytes in solution. In agreement with this known behaviour, the n values for [1-man]4–, [2-man]2–, and [3-man]2+ binding to each Con A monomer were determined to be 0.22, 0.30, and 0.73, respectively, which are all considerably lower than 1.0.
It is important to note that the functional valence, which is defined as the exact number of epitopes participating in binding (N = 1/n), is lower than the structural valence, or the actual number of epitopes available for binding for the present systems.69 Although the structural valence of [1-man]4– is twelve, its functional valence is four, which indicates that one-third of the available mannose units of [1-man]4– participate in binding to each Con A monomer, and that [1-man]4– is involved in tetravalent cross-linking interactions with the lectin. A functional valence of four agrees well with the structure of [1-man]4– and suggests the three mannose groups at each of the four Fe-vertices of the tetrahedron engage in monovalent binding to Con A. This result is expected given the distance between each monomeric binding site of a Con A is ca. 6.5 nm,12 which is significantly greater than the distance between the three mannose units at each Fe vertex of [1-man]4– (ca. 2–3 nm). In comparison, the functional valence of [2-man]2– and [3-man]2+ are lower than that of [1-man]4–, which is in line with the lower structural valence and weaker binding of these complexes.63,64,72,77,78
The functional valence values for [1-man]4–, [2-man]2–, and [3-man]2+ of 4.55, 3.33, and 1.37, respectively, are slightly higher than the expected values of 4, 2, and 1, which suggests that each mannose epitope of these systems can interact with more than one Con A binding site at a given time by crosslinking proteins. In fact, it has been reported that short mannoside chains can crosslink two Con A binding sites such that the proteins can approach without making substantial contact.65,79
We further probed the binding capabilities of the current system by determining the Kd values of the maltose-decorated complexes ([1-mal]4–, [2-mal]2–, [3-mal]2+) with Con A by ITC.80 Direct comparison between the dissociation constants determined for the mannose and maltose derivatives indicate the maltose systems bind Con A with approximately three times lower affinity (ESI Section S6.2† and Fig. 5). These results compare well with the known selectivity exhibited by Con A, which binds D-mannose six times stronger than it binds D-maltose (Kd(mal)/Kd(man) 6:1).81,82
The extent of nonspecific binding was probed by investigating the interaction between the β-glucose ([1-glc]4–) and β-galactose ([1-gal]4–) substituted Fe4 tetrahedra with Con A. Even at high concentration, [1-glc]4– and [1-gal]4– fail to engage in binding with Con A as evidenced by negligible heat release upon titration. Similarly, minimal heat release was observed when a solution of the unfunctionalized parent Fe4 cage, [NMe4]4[Fe4L6] (L = bis[2-sulfonato-4(2-pyridylmethyleneamino)benzene]),27,47 was analysed at high injection concentration as a negative control. These findings highlight the high affinity and selectivity of the current system.
Footnote |
† Electronic supplementary information (ESI) available. CCDC 2190699. For ESI and crystallographic data in CIF or other electronic format see DOI: https://doi.org/10.1039/d2sc05689e |
This journal is © The Royal Society of Chemistry 2023 |