Carlos A.
Sanhueza
a,
Jonathan
Cartmell
a,
Amr
El-Hawiet
a,
Adam
Szpacenko
a,
Elena N.
Kitova
a,
Rambod
Daneshfar
a,
John S.
Klassen
a,
Dean E.
Lang
b,
Luiz
Eugenio
b,
Kenneth K.-S.
Ng
b,
Pavel I.
Kitov
a and
David R.
Bundle
*a
aDepartment of Chemistry, University of Alberta, Edmonton, Canada. E-mail: dave.bundle@ualberta.ca; Fax: (+1)-780-492-7705
bDepartment of Biological Sciences, University of Calgary, Calgary, Canada
First published on 21st October 2014
A focused library of virtual heterobifunctional ligands was generated in silico and a set of ligands with recombined fragments was synthesized and evaluated for binding to Clostridium difficile toxins. The position of the trisaccharide fragment was used as a reference for filtering docked poses during virtual screening to match the trisaccharide ligand in a crystal structure. The peptoid, a diversity fragment probing the protein surface area adjacent to a known binding site, was generated by a multi-component Ugi reaction. Our approach combines modular fragment-based design with in silico screening of synthetically feasible compounds and lays the groundwork for future efforts in development of composite bifunctional ligands for large clostridial toxins.
CDI manifests itself in symptoms ranging from mild diarrhea to life threatening pseudomembranous colitis that can lead to death in elderly and immuno-compromised patients. Growing resistance to antibiotics leaves few options for treatment other than the administration of either metronidazole or vancomycin.7 However, due to the ability of the bacterium to form endospores, the treatment with broad spectrum antibiotics is often inefficient since the suppression of regular intestinal microflora provides a competitive advantage to C. difficile, which often results in high recurrence rate (up to 20% of patients).2 Recently approved Fidaxomicin, a narrow scope antibiotic, causes less collateral damage to microflora and is indicated to high risk patients.8
Pathogenic strains of C. difficile express at least two UDP-glucosylating toxins A and B (TcdA and TcdB), and, in 6% of strains, also ADP-ribosylating binary toxin, which may increase severity but is unlikely to cause the disease in the absence of large clostridial toxins TcdA and TcdB.9 In contrast to TcdA and TcdB, the binary toxin does not have an internal cysteine protease domain and relies on trypsin for its activation. The cell surface receptor for TcdA and TcdB was identified in rodents as Gal(α1–3)Gal(β1–4)GlcNAc(β1-O) trisaccharide (αGalLacNAc).10 This glycan sequence, also known as “Galili antigen”11 for its role in xenotransplantation organ rejection, is found on the cell membrane of rabbit erythrocytes and other tissues in mammals but not in humans. Several oligosaccharide structures found in human milk12 or decorating the surface of human epithelial cells13,14 have been reported to bind clostridial toxins with weak affinities similar to that of the Galili antigen. The high potency of the toxins is attributed to multivalency-enhanced avidity due to simultaneous interaction of multiple carbohydrate-recognition domains located at the C-terminus with the host cell surface.15,16 Binding of the C-terminus initiates ligand-mediated endocytosis followed by auto-catalytic cleavage of the enzyme domain, which is translocated to the cytosol and glucosylates target proteins.17 Clostridial toxins covalently modify large number of proteins17 including Rho GTPases causing their functional inactivation and inducing disassembly of the actin cytoskeleton, which in turn leads to morphological changes and death of susceptible cells, and, ultimately, results in erosion and perforation of the intestinal epithelium.15,18
Considering the emergence of more virulent strains and resistance to antibiotics,19,20 for opportunistic pathogens such as Clostridia, nonlethal anti-infection therapies are required that target virulence factors rather than suppress bacterial growth. Passive immunotherapy21 and manipulation of the human microbiome by repopulating intestine with non-toxigenic strain of C. difficile22 are new approaches in managing CDI. Large clostridial toxins TcdA and TcdB also appear an attractive therapeutic target for development of both antibodies and chemical antidotes. A non-specific affinity agent, polystyrene-based sulfonate polymer Tolevamer that passively absorbs and neutralizes CD toxins in the gut is considered a proof-of-principle for such an approach. Although it has been shown to have lower efficacy than broad range antibiotics in clinical trials, treatment with Tolevamer led to dramatically reduced recurrence of CDI.23 Inhibitors of the glucosyltransferase24 and autocatalytic protease domain in TcdB have recently been reported.25
The crystal structure of TcdA-A2 fragment in a complex with Gal(α1–3)Gal(β1–4)GlcNAc(β1-O)26 (pdb code: 2G7C) provides a structural basis for the rational design of potential inhibitors. Inspection of the structure reveals a shallow groove located near the reducing terminus of the bound trisaccharide. The close proximity of this groove suggests an opportunity for design of heterobifunctional ligands, in which a variable fragment is linked to the trisaccharide via a short spacer (Fig. 1). Here we present a fragment-based strategy for developing ligands for clostridial toxins TcdA and TcdB, which interrogate a putative supplementary binding site on the protein surface adjacent to the known carbohydrate-binding site.
To avoid uncertainty in stereo configuration of products we decided to vary only two components, the carboxylic acid and amine in construction of a virtual library of Ugi products and limit the third component to formaldehyde. Lists of fragments R1 and R2 were compiled from structures available from the Aldrich catalogue. Approximately 500 low molecular weight fragments of each type were chosen. Combinatorial library of 3D structures was generated with in-house scripts using Open Babel software.27 During preparation of the structures for docking appropriate positional constrains were used to prevent distortions in carbohydrate portion of the virtual ligands. Of the total library (∼250000 structures), approximately 20% representative randomly chosen structures were processed with Autodock Vina.28 Due to the large number of rotatable bonds multiple docking sessions were required for the majority of compounds to obtain poses with low RMSD with respect to the trisaccharide moiety found in the crystal structure. The resulting poses with RMSD less than 1.5 Å were sorted by Vina score and six hits containing different fragments were selected (Fig. 2). The fragments were recombined providing 36 structures for synthesis and activity evaluation.
The synthesis started with the installation of the linker moiety into the oligosaccharide structure by boron trifluoride-catalyzed glycosylation of 2-(2-azidoethoxy)ethanol with lactose octaacetate 1 (Scheme 1). Deacetylation of the resulting azido-lactoside 2 and subsequent enzymatic processing followed by acetylation led to trisaccharide glycoside 3. Conversion of azido derivatives 2 and 3 to the corresponding isocyanides 6 and 7 was accomplished by a one-pot procedure including reduction, N-formylation and, finally, dehydration of the N-formate (Scheme 1). Although reduction of azide during the first step is coupled to in situ catalytic oxidation of formic acid31 we found that using a hydrogen atmosphere facilitates the conversion and improves the overall yield.
Condensation of lactosyl isocyanide 6 with variable components, amines and carboxylic acids, in the presence of formaldehyde provided 27 Ugi products as shown in Scheme 1. One carboxylic acid, C6, failed to give the target products, presumably, due to the high reactivity of the pyrazolone ring, which represents a cyclic Schiff-type base and may also undergo condensations with isocyanides. However, it's amine counterpart N6 was utilized in the synthesis of the combinatorial products. The subsequent deacetylation and enzymatic treatment of lactosyl derivatives afforded the final products in up to 80% yield (Scheme 1). The remaining 3 derivatives of this series, UN1C1, UN4C4 and UN6C5, were obtained via the trisaccharide isocyanide 7 (Scheme 1).
Our observations for N-aromatic amides are consistent with the previously reported trend in the conformational preferences for N-aryl substituted peptoids and ab initio calculations for substituted N-methylacetanilides.34 The observed NOE pattern for compound UN1C2 matches that obtained by MD simulation at 300 K for the trans conformer using the Amber force field (Table 1, Fig. 4). More bulky aliphatic substituents (N2 and N5) caused the corresponding Ugi derivatives to exist in both conformations. Up to 40% of the cis conformers were observed in the corresponding NMR spectra. This effect should be considered in virtual screening of peptoid structures and highly populated cis conformers should be tested for compounds containing amide bonds with aliphatic substituents at the amide nitrogen.
Listed in Table 2 are the results of the ESI-MS binding measurements performed on the thirty compounds. Where binding was detected, the affinities were determined based on at least six measurements performed at multiple ligand concentrations. In all cases, the ESI mass spectra were corrected for nonspecific binding using the reference protein method.36 Low affinities have prevented an accurate measurement of the affinity for a number of ligands. For the purpose of data analysis, apparent binding constants for these ligands were assigned the value of 500 M−1 as was previously determined for the reference trisaccharide, Gal(α1–3)Gal(β1–4)GlcNAc(β1-O(CH2)8COOMe).11
Compound | Apparent affinity to TcdA-A2 | Apparent affinity to TcdB-B3C |
---|---|---|
a No binding indicates low affinity and large error, not necessary complete absence of the complex in the spectrum. | ||
Gal(α1–3)Gal(β1–4)GlcNAc(β1-O(CH2)8COOMe) | 50011 | |
UN1C1 | NBa | 6500 ± 200 |
UN1C2 | 3000 ± 600 | 9400 ± 1300 |
UN1C3 | NB | 8300 ± 1700 |
UN1C4 | 1800 ± 400 | 6300 ± 400 |
UN1C5 | NB | 5700 ± 300 |
UN2C1 | NB | 6000 ± 300 |
UN2C2 | NB | 5800 ± 500 |
UN2C3 | 900 ± 200 | 5000 ± 800 |
UN2C4 | 3100 ± 400 | 6500 ± 200 |
UN2C5 | NB | 6700 ± 1200 |
UN3C1 | NB | 3900 ± 400 |
UN3C2 | 900 ± 300 | 4900 ± 800 |
UN3C3 | NB | 5600 ± 200 |
UN3C4 | 1800 ± 200 | 8700 ± 300 |
UN3C5 | NB | 6800 ± 600 |
UN4C1 | NB | 7500 ± 800 |
UN4C2 | NB | 4100 ± 200 |
UN4C3 | NB | 5600 ± 700 |
UN4C4 | 300 ± 200 | 5700 ± 600 |
UN4C5 | NB | 4100 ± 700 |
UN5C1 | 2300 ± 300 | 11300 ± 300 |
UN5C2 | 2600 ± 200 | 7000 ± 400 |
UN5C3 | 1800 ± 300 | 11000 ± 600 |
UN5C4 | 2200 ± 300 | 11000 ± 500 |
UN5C5 | 1300 ± 400 | 10100 ± 800 |
UN6C1 | NB | 6000 ± 600 |
UN6C2 | 2300 ± 600 | 7300 ± 800 |
UN6C3 | NB | 7500 ± 100 |
UN6C4 | 2300 ± 500 | 7000 ± 500 |
UN6C5 | NB | 5700 ± 300 |
Fig. 4 Mapping of binding sites of large clostridial toxins. The residues contacting the carbohydrate ligand are highlighted in red. Peripheral groove area, the putative supplementary binding site, is highlighted in green. Lower: amino acid sequence alignment of TcdA binding sites in fragment (pdb code: 2G7C). Sequence consistency index (CI) was calculated for strain 630 and includes 7 homologous sites in TcdA and 4 sites in TcdB. Symbol (*) designates highest rank of 10. |
The electron density for only one of the four sites is shown in Fig. 5, but the electron density in the region expected for the aglycone in the other three sites is equally weak and uninterpretable, even though the electron density for the trisaccharide is quite strong in all sites. The lack of clear electron density for most of the aglycone suggests the presence of a significant amount of flexibility in the linker and the lack of a well-defined, single mode of binding. These observations are consistent with the results from NMR and molecular dynamics simulations described above, which confirm the flexibility of the linker and indicate that a fairly wide range of conformations can be adopted by the aglycone.
The presence of multiple carbohydrate binding sites with homologous but not quite identical sequences of the repeats adds complexity to structure-based design of inhibitors for large clostridial toxins. Sequence alignment suggests the presence of 7 putative carbohydrate binding domains (CBDs) in TcdA and 4 CBDs in TcdB. Side chains of amino acids that form the binding sites differ significantly from site to site, and from strain to strain. This variation is even more significant between the binding sites found in TcdA versus TcdB. Polar side chains can be replaced with aromatic or hydrophobic ones, and charged side chains can be replaced with those carrying the opposite charge. The remarkable ability to maintain such diversity without loss of function may be beneficial for the bacteria for both evading the immune system and targeting a wide range of cell-surface receptors, but this significant degree of variation presents a formidable challenge for development of efficient antagonists for the receptor-binding domain of the large clostridial toxins.
The area on the protein surface that spans approximately 20 Å in diameter and can be accessed from the reducing terminal of the bound trisaccharide in the crystal structure 2G7C contains a groove composed of a relatively more conserved set of amino acids (Fig. 4, green highlighting). From a pharmacological prospective, targeting the relatively conserved region outside the native binding site appears to be an attractive strategy. Interaction with this supplementary site may be more uniform than with the highly variable principal binding site.
Even though only a short linker is required to bridge the trisaccharide and the putative ligand in this groove, the resulting structures of heterobifunctional composite ligands are very flexible with approximately 20 rotatable bonds (∼30 if OH groups are included). Docking of such structures presents a considerable challenge; selection of the docked poses cannot be guided solely by a scoring function and additional information about conformational preferences of the structural elements has to be considered. For instance, the conformational behaviour of oligosaccharides is governed by the exo-anomeric effect37 as is apparent from thousands of available glycan crystal structures and NMR studies.38 In this work, we reduced the conformational search during the automated docking by restraining all internal degrees of freedom in the trisaccharide fragment. Additionally, the pose of the trisaccharide in crystal structure 2G7C was used as a reference for selection of plausible docked poses. Applying these constrains should permit evaluation of possible contributions by the auxiliary binding site assuming the carbohydrate fragment binds consistently.
Exhaustive computational evaluation of the entire combinatorial library may be hindered due to large computational expense while sampling of a fraction of the library may still permit identification of promising fragments. We tested this by recombining the fragments found in a small subset of original virtual hits and evaluating activities of these recombinant compounds. Out of five synthesized ligands that were originally identified by virtual screening against TcdA-A2, only one (UN5C5) has shown greater affinity than the reference trisaccharide while recombination of the Ugi reaction components gave significantly better results with more than one third of recombinant ligands showing higher activity than the reference trisaccharide (Table 1). Interestingly, all derivatives containing an amine fragment N5 were active when combined with any tested carboxylic acid fragment and against both TcdA and TcdB. Some fragments produce more active Ugi derivatives than the others, however, no synergy was observed and combination of “active” amines with “active” carboxylic acid did not produce a compound with outstanding activity.
UN1C1. Ugi coupling and subsequent deacetylation of trisaccharide isocyanide 7 (81.5 mg, 0.08 mmol) afforded UN1C1 (45.9 mg, 0.05 mmol, 63%) as a white solid, Rf 0.36 (DCM–MeOH–H2O, 7:3:0.5), [α]D +55.2 (c 0.5, MeOH); IR (cm−1, solid): 3376, 2929, 2883, 1653; 1H NMR (D2O, 600 MHz), δ (ppm): 7.25 (m, 1H), 6.97 (m, 1H), 6.89 (m, 1H), 6.82 (m, 1H), 6.79 (m, 1H), 6.72 (m, 2H), 5.13 (s, 1H, H-1′′), 4.49 (d, 1H, J1,2 7.2 Hz, H-1′), 4.43 (d, 1H, J1,2 7.2 Hz, H-1), 4.30 (m, 2H), 4.25 (m, 4H), 4.17 (m, 2H), 4.00 (m, 1H), 3.95 (m, 3H), 3.84 (m, 1H), 3.75 (m, 8 H), 3.65 (m, 3H), 3.61 (m, 2H), 3.57 (m, 5H), 3.37 (m, 2H), 3.30 (m, 1H); 13C NMR (D2O, 150 MHz), δ (ppm): 175.1, 171.1, 164.1–114.4 (aromatic Cs), 103.8 (C-1′), 103.0 (C-1), 96.3 (C-1′′), 79.6, 78.1, 75.9, 75.6, 75.3, 73.6, 71.7, 70.5, 70.4, 70.2, 70.0, 69.7, 69.7, 69.1, 65.7, 65.4, 65.3, 61.9, 61.8, 61.0, 53.8, 41.2, 39.9; ESI-HRMS: m/z: calcd for C40H55FN2NaO21: 941.3174; found: 941.3166.
The general procedure for Ugi coupling lactose-derived isocyanide 6 (126.3 mg, 0.17 mmol) followed by transesterification and enzymic α-galactosylation was exemplified to prepare compound UN1C2. Final product UN1C2 (40.0 mg, 0.045 mmol, 41%) was obtained as a white solid, Rf 0.36 (DCM–MeOH–H2O, 7:3:0.5), [α]D +53.4 (c 0.9, MeOH); IR (cm−1, solid): 3372, 2940, 2873, 2494, 1650; 1H NMR (D2O, 500 MHz), δ (ppm): 6.92 (d, 1H, J 8.5 Hz), 6.89 (s, 1H), 6.8 (d, 1H, J 8.5 Hz), 5.12 (d, 1H, J1,2 3.5 Hz, H-1′′), 4.49 (d, 1H, J1,2 8.0 Hz, H-1′), 4.44 (d, 1H, J1,2 8.0 Hz, H-1), 4.30 (s, 2H), 4.28 (s, 4H), 4.17 (m, 2H, H-4′, H-5′′), 3.96 (m, 4H, H-3′′, H-4′′, H-6a, H-6b), 3.84 (dd, 1H, J2,1 3.5, J2,3 10.5 Hz, H-2′′), 3.77 (m, 3H, H-3′, H-5, H-5′), 3.72 (m, 5H, H-6a′, H-6b′, H-6a′′, H-6b′′, 1H), 3.64 (m, 5H, H-2′, H-3, CH2, 1H), 3.57 (m, 3H, H-4, CH2), 3.38 (m, 2H), 3.31 (dd, 1H, J2,1 ≈ J2,3 8.0 Hz, H-2), 2.19 (d, 2H, J 7.5 Hz), 2.07 (m, 1H), 1.65 (m, 2H), 1.42 (m, 4H), 0.95 (m, 2H); 13C NMR (D2O, 125 MHz), δ (ppm): 177.7, 171.4, 144.4, 144.2, 136.5, 122.0, 118.9, 117.6, 103.9, (C-1′), 103.1 (C-1), 96.4 (C-1′′), 79.7, 78.2, 76.0, 75.7, 75.4, 73.7, 71.8, 70.6, 70.5, 70.3, 70.1, 69.8, 69.8, 69.2, 65.8, 65.5, 61.9, 61.9, 61.1, 53.7, 40.1, 39.9, 37.6, 32.8, 32.8, 25.3, 25.3; ESI-HRMS: m/z: calcd for C39H60N2NaO21: 915.3581; found: 915.3574.
UN1C3. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (173.6 mg, 0.24 mmol) afforded final product UN1C3 (83.7 mg, 0.092 mmol, 75%) as a white solid, Rf 0.36 (DCM–MeOH–H2O, 7:3:0.5), [α]D +51.9 (c 0.5, MeOH); IR (cm−1, solid): 3338, 2921, 2877, 1646; 1H NMR (D2O, 500 MHz), δ (ppm): 7.11 (d, 2H, J 8.0 Hz), 6.91 (m, 3H), 6.81 (m, 2H), 5.21 (d, 1H, J1,2 4.0 Hz, H-1′′), 4.57 (d, 1H, J1,2 8.0 Hz, H-1′), 4.50 (d, 1H, J1,2 8.0 Hz, H-1), 4.34 (s, 2H), 4.25 (m, 6H, H-4′, H-5′′, 2 × OCH2), 4.08 (d, 1H, J4,3 3.0 Hz, H-4′′), 4.01 (m, 3H, H-3′′, H-6a, H-6b), 3.93 (dd, 1H, J2,1 3.5, J2,3 10.0 Hz, H-2′′), 3.87 (m, 3H, H-3′, H-5, H-5′), 3.79 (m, 5H, H-6a′, H-6b′, H-6a′′, H-6b′′, 1H), 3.72 (m, 5H, H-2′, H-3, CH2, 1H), 3.62 (m, 3H, H-4, CH2), 3.54 (m, 2H), 3.44 (m, 2H), 3.39 (dd, 1H, J2,1 ≈ J2,3 9.0 Hz, H-2), 2.28 (s, 3H); 13C NMR (D2O, 125 MHz), δ (ppm): 175.6, 171.2, 144.4, 144.3, 137.7, 136.1, 132.6, 130.1, 130.1, 129.9, 129.9, 122.0, 118.8, 117.7, 103.9 (C-1′), 103.1 (C-1), 96.4 (C-1′′), 76.7, 78.2, 76.0, 75.7, 75.4, 73.7, 71.8, 70.6, 70.5, 70.3, 70.1, 69.8, 69.8, 69.2, 65.8, 65.5, 65.4, 61.9, 61.9, 61.1, 53.8, 41.0, 39.9, 21.1; ESI-HRMS: m/z: calcd for C41H58N2NaO21: 937.3424; found: 937.3418.
UN1C4. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (136.9 mg, 0.19 mmol) afforded final product UN1C2 (73.2 mg, 0.083 mmol, 75%) as a white solid, Rf 0.36 (DCM–MeOH–H2O, 7:3:0.5), [α]D +55.1 (c 0.3, MeOH); IR (cm−1, solid): 3399, 2932, 2878, 1663, 1068; 1H NMR (D2O, 600 MHz), δ (ppm): 6.90 (d, 1H, 3J 9.0 Hz), 6.84 (d, 1H, 4J 1.8 Hz), 6.80 (d, 1H, 3J 9.0, 4J 1.8 Hz), 5.68 (s, 1H), 5.14 (d, 1H, J1,2 3.6 Hz, H-1′′), 4.50 (d, 1H, J1,2 7.8 Hz, H-1′), 4.46 (d, 1H, J1,2 7.8 Hz, H-1), 4.42 (s, 2H), 4.29 (s, 4H), 4.19 (m, 2H, H-4′, H-5′′), 4.01 (d, 1H, J4,3 3.0 Hz, H-4′′), 3.95 (m, 3H, H-6a, H-6b, H-3′′), 3.85 (dd, 1H, J2,1 4.8, J2,3 10.2, H-2′′), 3.78 (m, 3H, H-3′, H-5, H-5′), 3.75 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b′′, 1H), 3.67 (m, 3H, H-2′, H-3, 1H), 3.64 (m, 2H), 3.58 (m, 3H, H-4, 2H), 3.04 (m, 2H), 3.32 (m, 1H, H-2), 1.90 (s, 2H), 1.59 (s, 3H), 0.72 (s, 3H); 13C NMR (D2O, 125 MHz), δ (ppm): 171.4, 171.4, 144.0, 143.5, 140.0, 130.1, 121.5, 118.5, 117.0, 103.9 (C-1′′), 103.1 (C-1), 96.4 (C-1′′), 79.6, 78.2, 76.0, 75.7, 75.4, 73.7, 71.8, 70.6, 70.5, 70.3, 70.1, 69.8, 69.8, 69.8, 69.2, 65.8, 65.5, 65.5, 65.5, 61.9, 61.9, 61.1, 39.9, 21.5, 14.0, 13.0; ESI-HRMS: m/z: calcd for C38H58N2NaO21: 901.3424; found: 901.3421.
UN1C5. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (156.2 mg, 0.21 mmol) afforded final product UN1C5 (69.0 mg, 0.074 mmol, 63%) as a white solid, Rf 0.34 (DCM–MeOH–H2O, 7:3:0.5), [α]D +50.5 (c 0.5, MeOH); IR (cm−1, solid): 3375, 2929, 2879, 1645; 1H NMR (D2O, 500 MHz), δ (ppm): 8.23 (s, 1H), 8.14 (d, 1H, J 8.0 Hz), 7.80 (d, 1H, J 8.0 Hz,), 7.51 (dd, 1H, J 8.0, J 8.0 Hz) 6.86 (s, 1H), 6.79 (d, 1H, J 9.0 Hz), 6.71 (d, 1H, J 8.5 Hz), 5.21 (d, 1H, J1,2 4.0 Hz, H-1′′), 4.66 (s, 2H, NHCH2), 4.58 (d, 1H, J1,2 7.5 Hz, H-1′), 4.54 (d, 1H, J1,2 8.0 Hz, H-1), 4.25 (m, 2H, H-4′, H-5′′), 4.17 (m, 4H), 4.09 (d, 1H, J4,3 3.0 Hz, H-4′′), 4.04 (m, 3H, H-3′′, H-6a, H-6b), 3.93 (dd, 1H, J2,1 4.0, J2,3 10.5 Hz, H-2′′), 3.86 (m, 3H, H-3′, H-5, H-5′), 3.81 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b′′), 3.78 (m, 6H, H-2′, H-3, 2 × CH2), 3.70 (m, 2H, H-4, 1H), 3.63 (m, 1H), 3.52 (m, 2H, CH2), 3.40 (dd, 1H, J2,1 ≈ J2,3 8.5 Hz, H-2); 13C NMR (D2O, 125 MHz), δ (ppm): 171.4, 171.9, 148.2, 144.2, 143.8, 136.9, 136.5, 135.5, 130.5, 125.9, 124.2, 122.1, 118.6, 117.5, 103.9 (C-1′), 103.2 (C-1), 96.4 (C-1′′), 79.7, 78.2, 76.0, 75.7, 75.4, 73.8, 71.8, 70.6, 70.5, 70.3, 70.1, 69.9, 69.8, 69.2, 65.8, 65.3, 61.9, 61.9, 61.9, 61.2, 54.3, 40.0; ESI-HRMS: m/z: calcd for C39H53N3NaO23: 954.2962; found: 954.2960.
UN2C1. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (164.1 mg, 0.22 mmol) afforded final product UN2C1 (86.0 mg, 0.094 mmol, 59%) as a white solid, Rf 0.36 (DCM–MeOH–H2O, 7:3:0.5), [α]D +34.3 (c 1.2, MeOH); IR (cm−1, solid): 3371, 2930, 2497, 1636; 1H NMR (D2O, 500 MHz), δ (ppm): 7.33 (m, 0.3H), 7.27 (m, 0.7H), 7.07 (m, 5H), 6.99 (m, 0.3H), 6.95 (m, 0.7H), 6.89 (d, 0.3H, J 8.0 Hz), 6.81 (d, 0.7H, J 8.0 Hz), 5.67 (dd, 0.3H, J 6.5, J 8.5 Hz), 5.16 (dd, 0.7H, J 6.5, J 8.5 Hz), 5.12 (d, 1H, J1,2 4.0 Hz, H-1′′), 4.48 (d, 1H, J1,2 7.5 Hz, H-1′), 4.43 (d, 0.7H, J1,2 8.0 Hz, H-1), 4.37 (d, 0.3H, J1,2 8.0 Hz, H-1), 4.17 (m, 2H, H-4′, H-5′′), 3.99 (d, 1H, J4,3 3.0 Hz, H-4′′), 3.92 (m, 5H, H-3′′, H-6a, H-6b, CH2), 3.84 (dd, 1H, J2,1 4.0, J2,3 10.5 Hz, H-2′′), 3.76 (m, 3H, H-3′, H-5, H-5′), 3.73 (s, 2H), 3.69 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b′′), 3.67 (m, 3H, H-2′, H-3, 1H), 3.56 (m, 6H, H-4, 2 × CH2, 1H), 3.33 (m, 2H), 3.25 (m, H-2), 2.66 (m, 2H), 1.98 (m, 1H), 1.84 (m, 1H), 1.61 (m, 2H); 13C NMR (D2O, 125 MHz), δ (ppm): 176.0, 175.2, 171.5, 171.5, 164.6–114.8 (aromatic Cs), 103.9 (C-1′), 103.1 (C-1), 96.4 (C-1′′), 79.7, 78.2, 76.0, 75.7, 75.4, 73.7, 71.8, 70.5, 70.5, 70.4, 70.3, 70.1, 69.8, 69.7, 69.2, 65.8, 61.9, 61.9, 61.1, 59.1, 55.4, 47.9, 41.0, 40.8, 39.9, 39.8, 29.7, 29.5, 29.4, 28.3, 22.2, 22.1; ESI-HRMS: m/z: calcd for C42H59FN2NaO19: 937.3588; found: 937.3585.
UN2C2. Ugi coupling and subsequent Zemplen deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (181.1 mg, 0.25 mmol) afforded final product UN2C2 (67.0 mg, 0.075 mmol, 45%) as a white solid, Rf 0.40 (DCM–MeOH–H2O, 7:3:0.5), [α]D +42.5 (c 0.6, MeOH); IR (cm−1, solid): 3362, 2934, 2866, 1628; 1H NMR (D2O, 500 MHz), δ (ppm): 7.32 (m, 3H), 7.16 (d, 0.6H, J 7.3 Hz), 7.07 (d, 0.4H, J 7.3 Hz), 5.80 (dd, 0.4H, J 5.9, J 9.2 Hz), 5.39 (dd, 0.4H, J 5.9, J 9.2 Hz), 5.21 (d, 1H, J1,2 3.9 Hz, H-1′′), 4.57(d, 1H, J1,2 7.8 Hz, H-1′′), 4.55 (d, 0.6H, J1,2 8.0 Hz, H-1′), 4.53 (d, 0.4H, J1,2 8.0 Hz, H-1), 4.26 (m, 2H, H-4′, H-5′′), 4.08 (d, 1H, J4,3 3.0 Hz, H-4′′), 4.00 (m, 3H, H-3′′, H-6a, H-6b), 3.93 (dd, 1H, J2,1 3.9, J2,3 10.3 Hz, H-2′′), 3.85 (m, 3H, H-3′, H-5, H-5′), 3.76 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b′′), 3.71 (m, 6H, H-2′, H-3, 2 × CH2), 3.65 (m, 3H, H-4, CH2), 3.44 (m, 2H), 3.38 (m, H-2), 2.84 (m, 2H), 2.73 (m, 0.6H), 2.52 (m, 0.4H), 2.34 (m, 2H), 2.09 (m, 1H), 1.89 (m, 4H), 1.71 (m, 3H), 1.62 (m, 2H), 1.30 (m, 2H); 13C NMR (D2O, 125 MHz), δ (ppm): 178.9, 178.4, 172.0, 172.0, 140.5, 140.3, 135.2, 135.0, 130.6, 130.5, 128.7, 128.5, 128.1, 127.7, 127.3, 127.2, 103.8 (C-1′), 103.1 (C-1), 96.4 (C-1′′), 79.6, 78.2, 76.0, 75.7, 75.4, 75.4, 73.7, 71.8, 70.5, 70.5, 70.4, 70.3, 70.1, 69.8, 69.8, 69.7, 69.2, 65.8, 61.9, 61.9, 61.1, 58.9, 47.8, 40.4, 40.0, 39.8, 39.7, 37.7, 37.5, 33.2, 33.2, 33.0, 33.0, 29.7, 29.6, 29.5, 28.3, 25.5, 25.5, 25.4, 25.4, 22.3, 22.2; ESI-HRMS: m/z: calcd for C41H64N2NaO19: 911.3995; found: 911.3990.
UN2C3. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (185.1 mg, 0.25 mmol) afforded final product UN2C3 (114.0 mg, 0.13 mmol, 52%) as a white solid, Rf 0.40 (DCM–MeOH–H2O, 7:3:0.5), [α]D +33.7 (c 0.8, MeOH); IR (cm−1, solid): 3339, 2928, 2502, 1653; 1H NMR (D2O, 600 MHz), δ (ppm): 7.13–6.70 (m, 8H), 5.65 (m, 0.4H), 5.13 (d, 1H, J1,2 4.2 Hz, H-1′′), 5.06 (m, 0.6H), 4.48 (d, 1H, J1,2 7.8 Hz, H-1′), 4.41 (d, 0.6H, J1,2 7.8 Hz, H-1), 4.35 (d, 0.4H, J1,2 7.8 Hz, H-1), 4.18 (m, 2H, H-5′′, H-4′), 3.99 (d, 1H, J4,3 3.0 Hz, H-4′′), 3.94 (m, 3H, H-3′′, H-6a, H-6b), 3.86 (dd, 1H, J2,1 3.6, J2,3 10.2 Hz, H-2′′), 3.80 (m, 5H, H-3′, H-5, H-5′, CH2), 3.71 (m, 5H, H-6a′, H-6b′, H-6a′′, H-6b′′, 1H), 3.68 (m, 6H, H-2′, H-3, 2 × CH2), 3.55 (m, 4H, H-4, CH2, 1H), 3.31 (m, 2H), 3.26 (m, H-2), 2.26 (m, 2H), 2.17 (s, 1.2H), 2.10 (s, 1.8H), 1.87 (m, 2H), 1.54 (m, 2H); 13C NMR (D2O, 125 MHz), δ (ppm): 176.2, 175.5, 171.4, 171.3, 140.3–126–9 (aromatic Cs), 103.9 (C-1′), 103.2 (C-1), 96.4 (C-1′′), 79.7, 78.3, 76.0, 75.7, 75.4, 73.7, 71.8, 70.6, 70.5, 70.4, 70.3, 70.1, 69.9, 69.7, 69.2, 65.8, 61.9, 61.9, 61.2, 59.1, 48.7, 47.8, 41.3, 41.0, 39.8, 29.7, 29.6, 29.4, 28.3, 22.4, 21.3, 21.3; ESI-HRMS: m/z: calcd for C43H62N2NaO19: 933.3839; found: 933.3834.
UN2C4. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (122.0 mg, 0.17 mmol) afforded final product UN2C4 (65.2 mg, 0.075 mmol, 64%) as a white solid, Rf 0.42 (DCM–MeOH–H2O, 7:3:0.5), [α]D +46.0 (c 0.5, MeOH); IR (cm−1, solid): 3370, 2929, 2874, 2492, 1662; 1H NMR (D2O, 500 MHz), δ (ppm): 7.24 (m, 2H), 7.19 (m, 1H), 7.11 (m, 1H), 5.81 (ddd, 0.8H, J 1.5, J 7.5, J 7.5 Hz), 5.71 (ddd, 0.2H, J 1.5, J 7.5, J 7.5 Hz), 5.59 (dd, 0.2H, J 6.0, 10.5 Hz), 5.24 (dd, 0.8H, J 6.0, 10.5 Hz), 5.13 (d, 1H, J1,2 4.0 Hz, H-1′′), 4.49 (d, 1H, J1,2 8.0 Hz, H-1′), 4.46 (d, 0.8H, J1,2 8.0 Hz, H-1), 4.44 (d, 0.2H, J1,2 8.0 Hz, H-1), 4.17 (m, 2H, H-4′, H-5′′), 4.00 (d, 1H, J4,3 3.5 Hz, H-4′′), 3.95 (m, 5H, H-3′′, H-6a, H-6b, CH2), 3.84 (dd, J2,1 3.5, J2,3 10.0 Hz, H-2′′), 3.77 (m, 3H, H-3′, H-5, H-5′), 3.68 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b′′), 3.61 (m, 6H, H-2′, H-3, 2 × CH2), 3.53 (m, 3H, H-4, CH2), 3.36 (m, 2H), 3.29 (m, H-2), 2.74 (m, 2H), 2.11 (m, 3H), 1.94 (m, 1H), 1.89 (s, 3H), 1.75 (m, 2H), 0.95 (dd, 3H, J 8.0, J 8.0); 13C NMR (D2O, 125 MHz), δ (ppm): 178.9, 178.8, 171.8, 171.5, 140.4, 140.3, 135.9, 135.1, 135.1, 135.0, 130.6, 130.5, 130.3, 128.7, 128.5, 127.9, 127.8, 127.4, 127.3, 103.9 (C-1′), 103.1 (C-1), 96.4 (C-1′′), 79.7, 78.2, 76.0, 75.7, 75.4, 73.7, 71.8, 70.6, 70.5, 70.3, 70.1, 69.8, 69.2, 65.8, 61.9, 61.9, 61.1, 60.3, 55.5, 50.4, 46.9, 39.8, 29.7, 29.6, 29.5, 28.2, 22.3, 22.2, 21.5, 21.5, 14.5, 13.5; ESI-HRMS: m/z: calcd for C40H62N2NaO19: 897.3839; found: 897.3836.
UN2C5. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (152.1 mg, 0.21 mmol) afforded final product UN2C5 (73.5 mg, 0.079 mmol, 52%) as a white solid, Rf 0.38 (DCM–MeOH–H2O, 7:3:0.5), [α]D +14.4 (c 0.5, MeOH); IR (cm−1, solid): 3378, 2933, 2883, 2504, 1628; 1H NMR (D2O, 500 MHz), δ (ppm): 8.30–6.85 (m, 8H), 5.67 (m, 0.2H), 5.13 (d, 1H, J1,2 4.0 Hz, H-1′′), 4.77 (m, 0.8H), 4.48 (d, 1H, J1,2 7.5 Hz, H-1′), 4.44 (d, 0.8H, J1,2 8.0 Hz, H-1′), 4.37 (d, 0.2H, J1,2 8.0 Hz, H-1′), 4.17 (m, 2H, H-5′′, H-4′), 4.00 (d, 1H, J4,3 3.0 Hz, H-4′′), 3.94 (m, 5H, H-3′′, H-6a, H-6b, CH2), 3.84 (dd, 1H, J2,1 4.0, J2,3 10.5 Hz, H-2′′), 3.77 (m, 3H, H-3′, H-5, H-5′), 3.69 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b′′), 3.62 (m, 6H, H-2′, H-3, 2 × CH2), 3.59 (m, 3H, H-4, CH2), 3.52 (m, 1H), 3.39 (m, 1H), 3.28 (dd, 0.8H, J2,1 8.0, J2,3 9.5 Hz, H-2), 3.26 (dd, 0.2H, J2,1 8.0, J2,3 9.5 Hz, H-2), 2.54 (m, 2H), 2.07 (m, 1H), 1.73 (m, 2H), 1.27 (m, 2H); 13C NMR (D2O, 125 MHz), δ (ppm): 173.6, 172.9, 170.8, 173.7, 148.8–122.1 (aromatic Cs), 103.8 (C-1′), 103.0 (C-1), 96.3 (C-1′′), 79.6, 78.1, 75.9, 75.6, 75.5, 75.2, 73.6, 71.7, 70.4, 70.3, 70.2, 70.1, 69.8, 69.6, 69.5, 69.4, 69.1, 65.7, 61.8, 61.8, 61.0, 60.5, 47.3, 39.8, 39.6, 29.6, 29.3, 22.0; ESI-HRMS: m/z: calcd for C41H57N3NaO21: 950.3377; found: 950.3371.
UN3C1. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (162.6 mg, 0.22 mmol) afforded final product UN3C1 (84.9 mg, 0.094 mmol, 76%) as a white solid, Rf 0.40 (DCM–MeOH–H2O, 7:3:0.5), [α]D +55.4 (c 0.5, MeOH); IR (cm−1, solid): 3389, 2892, 1655, 1075; 1H NMR (D2O, 600 MHz), δ (ppm): 7.27 (m, 1H), 6.98 (m, 1H), 6.87 (s, 1H), 6.85 (s, 1H), 6.79 (m, 2H), 6.76 (d, 1H, J 1.8 Hz), 5.99 (s, 2H), 5.13 (d, 1H, J1,2 4.2 Hz, H-1′′), 4.49 (d, 1H, J1,2 7.8 Hz, H-1′), 4.44 (d, 1H, J1,2 7.8 Hz, H-1), 4.32 (s, 2H), 4.19 (dd, 1H, J5,6a ≈ J5,6b 6.0 Hz, H-5′′), 4.17 (d, 1H, J4,3 3.0 Hz, H-4′), 4.01 (d, 1H, J4,3 3.6 Hz, H-4′′), 3.96 (m, 3H, H-3′′, H-6a, H-6b), 3.85 (d, 1H, J2,1 4.2, J2,3 10.2, H-2′′), 3.77 (m, 3H, H-3′, H-5, H-5′), 3.72 (m, 5H, H-6a′, H-6b′, H-6a′′, H-6b′′, 1H), 3.66 (m, 3H, H-2′, H-3, 1H), 3.59 (m, 6H, H-4, 2 × CH2, 1H), 3.54 (m, 1H), 3.39 (m, 2H), 3.30 (m, 1H, H-2); 13C NMR (D2O, 150 MHz), δ (ppm): 175.2, 171.2, 163.4–109 (aromatic Cs), 103.8 (C-1′), 103.0, 102.9 (C-1), 96.3 (C-1′′), 79.6, 78.1, 75.9, 75.6, 75.3, 73.6, 71.7, 70.5, 70.4, 70.2, 70.0, 69.7, 69.7, 69.1, 61.9, 61.8, 61.0, 53.8, 41.1, 39.9; ESI-HRMS: m/z: calcd for C39H53FN2NaO21: 927.3017; found: 927.3011.
UN3C2. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (122.6 mg, 0.17 mmol) afforded final product UN3C2 (54.4 mg, 0.062 mmol, 81%) as a white solid, Rf 0.36 (DCM–MeOH–H2O, 7:3:0.5), [α]D +56.9 (c 0.6, MeOH); IR (cm−1, solid): 3388, 2944, 1658, 1074; 1H NMR (D2O, 600 MHz), δ (ppm): 6.90 (d, 1H, J 8.4 Hz), 6.89 (d, 1H, J 1.8 Hz), 6.85 (dd, 1H, J 1.8, J 8.4 Hz), 6.02 (s, 2H), 5.14 (d, 1H, J1,2 4.2 Hz, H-1′′), 4.50 (d, 1H, J1,2 8.4 Hz, H-1′), 4.46 (d, 1H, J1,2 7.8 Hz, H-1), 4.32 (s, 2H), 4.18 (m, 2H, H-5′′, H4′), 4.00 (m, 2H, H-4′′, H-6a), 3.95 (m, 2H, H-6b, H-3′′), 3.85 (dd, 1H, J2,1 3.9, J2,3 10.2, H-2′′), 3.78 (m, 3H, H-5, H-5′, H-3′), 3.73 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b′′), 3.68 (m, 2H, H-2′, H-3), 3.62 (m, 4H), 3.60 (dd, 1H, J4,5 ≈ J4,3 8.4 Hz, H-4), 3.57 (m, 2H), 3.41 (m, 2H), 3.31 (m, 1H), 2.23 (d, 2H, J 7.2 Hz), 2.09 (m, 1H), 1.67 (m, 2H), 1.44 (m, 4H), 0.99 (m, 2H); 13C NMR (D2O, 150 MHz), δ (ppm): 177.8, 171.4, 148.8, 148.1, 136.7, 122.4, 109.5, 109.4, 103.7 (C-1′), 103.1, 102.9 (C-1), 96.3 (C-1′′), 79.6, 78.1, 75.9, 75.6, 75.3, 73.7, 71.7, 70.5, 70.4, 70.2, 70.0, 69.7, 69.7, 69.1, 65.7, 61.9, 61.8, 61.1, 53.7, 40.1, 39.8, 37.5, 32.7, 32.7, 25.2, 25.2; ESI-HRMS: m/z: calcd for C38H58N2NaO21: 901.3424; found: 901.3420.
UN3C3. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (182.7 mg, 0.25 mmol) afforded final product UN3C3 (78.0 mg, 0.087 mmol, 54%) as a white solid, Rf 0.38 (DCM–MeOH–H2O, 7:3:0.5), [α]D +55.8 (c 0.7, MeOH); IR (cm−1, solid): 3369, 2890, 1650; 1H NMR (D2O, 500 MHz), δ (ppm): 7.12 (d, J 8.0 Hz, 2H), 6.93 (d, J 8.0 Hz, 2H), 6.86 (d, J 8.0 Hz, 1H), 6.78 (dd, J 2.0, J 8.0 Hz, 1H), 6.74 (d, J 2.0 Hz, 1H), 5.99 (s, 2H), 5.12 (d, J1,2 4.0 Hz, H-1′′), 4.48 (d, J1,2 7.5 Hz, H-1′), 4.43 (d, J1,2 7.5 Hz, H-1), 4.13 (s, NHCH2), 4.17 (m, H-4′, H-5′′), 4.00 (d, J4,3 3.0 Hz, H-4′′), 3.94 (m, H-3′′, H-6a, H-6b), 3.84 (dd, J2,1 3.5, J2,3 10.0 Hz, H-2′′), 3.75 (m, H-3′, H-5, H-5′), 3.70 (m, H-6a′, H-6b′, H-6a′′, H-6b′′), 3.64 (m, H-2′, H-3, 2 × CH2), 3.58 (m, H-4, 2 × CH2), 3.39 (m, CH2), 3.29 (dd, J2,1 8.0, J2,3 9.5 Hz, H-2), 2.27 (s, 3H); 13C NMR (D2O, 125 MHz), δ (ppm): 175.8, 171.3, 148.8, 148.2, 137.9, 136.5, 132.6, 130.1, 130.1, 129.9, 129.9, 122.6, 109.5, 109.5, 103.9 (C-1′), 103.1 (C-1), 102.9, 96.4 (C-1′′), 79.7, 78.2, 76.0, 75.7, 75.4, 73.7, 71.8, 70.5, 70.5, 70.3, 70.1, 69.8, 69.8, 69.2, 65.8, 61.9, 61.9, 61.1, 53.9, 40.9, 39.9, 21.1; ESI-HRMS: m/z: calcd for C40H56N2NaO21: 923.3268; found: 923.3260.
UN3C4. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (141.1 mg, 0.19 mmol) afforded final product UN3C4 (81.5 mg, 0.094 mmol, 70%) as a white solid, Rf 0.34 (DCM–MeOH–H2O, 7:3:0.5), [α]D +56.9 (c 0.7, MeOH); IR (cm−1, solid): 3390, 2928, 2891, 1662, 1066; 1H NMR (D2O, 600 MHz), δ (ppm): 6.87 (d, 1H, J 8.4 Hz), 6.84 (d, 1H, J 1.8 Hz), 6.78 (dd, 1H, J 1.8, J 8.4 Hz), 5.99 (s, 2H), 5.71 (s, 1H), 5.14 (d, 1H, J1,2 3.6 Hz, H-1′′), 4.50 (d, 1H, J1,2 7.8 Hz, H-1′), 4.46 (d, 1H, J1,2 8.4 Hz, H-1), 4.42 (s, 2H), 4.18 (m, 2H, H-5′′, H4′), 4.01 (d, 1H, J4,3 3.6 Hz, H-4′′), 3.95 (m, 3H, H-3′′, H-6a, H-6b), 3.85 (dd, 1H, J2,1 4.2, J2,3 10.8, H-2′′), 3.77 (m, 3H, H-3′, H-5, H-5′), 3.72 (m, 5H, H-6a′, H-6b′, H-6a′′, H-6b′′, 1H), 3.67 (m, 3H, H-2′, H-3, 1H), 3.63 (m, 2H), 3.59 (dd, 1H, J4,5 ≈ J4,3 8.4 Hz, H-4), 3.56 (m, 2H), 3.41 (m, 2H), 3.32 (dd, 1H, J1,2 7.8, J2,3 5.4 Hz, H-2), 1.92 (s, 2H), 1.59 (s, 3H), 0.74 (s, 3H); 13C NMR (D2O, 125 MHz), δ (ppm): 171.3, 171.3, 148.4, 147.4, 140.0, 130.1, 121.8, 109.2, 109.1, 109.0, 103.8 (C-1′), 103.0, 102.7 (C-1), 96.3 (C-1′′), 79.5, 78.1, 75.9, 75.6, 75.3, 73.6, 71.7, 70.4, 70.4, 70.2, 70.0, 70.0, 69.7, 69.7, 69.1, 65.7, 61.8, 61.8, 61.0, 39.8, 21.3, 13.9, 12.9; ESI-HRMS: m/z: calcd for C37H56N2NaO21: 887.3268; found: 887.3263.
UN3C5. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (108.5 mg, 0.15 mmol) afforded final product UN3C5 (24.0 mg, 0.026 mmol, 18%) as a white solid, Rf 0.38 (DCM–MeOH–H2O, 7:3:0.5), [α]D +54.9 (c 0.5, MeOH); IR (cm−1, solid): 3382, 2889, 1646; 1H NMR (D2O, 600 MHz), δ (ppm): 8.21 (s, 1H), 8.14 (d, 1H, J 8.4 Hz), 7.74 (d, 1H, J 7.8 Hz), 7.48 (dd, 1H, J 7.8, J 7.8 Hz), 6.81 (s, 1H), 6.72 (dd, 1H, J 1.8, J 7.8 Hz), 6.66 (d, 1H, J 8.4 Hz), 5.87 (s, 2H), 5.13 (d, 1H, J 3.6 Hz, H-1′′), 4.49 (d, 1H, J 7.8 Hz, H-1′), 4.47 (d, 1H, J 7.8 Hz, H-1), 4.18 (m, 2H, H-4′, H-5′′), 4.00 (d, 1H, J 3.0 Hz, H-4′′), 3.99 (dd, 1H, J3,4 4.2, J3,2 8.4, Hz, H-3′′), 3.96 (m, 2H, H-6a, H-6b), 3.85 (dd, 1H, J2,1 3.6, J2,3 10.2, Hz, H-2′′), 3.78 (m, 5H, H-3′, H-5, H-5′, CH2), 3.73 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b), 3.67 (m, 6H, H-2′, H-3, 2 × CH2), 3.62 (m, 2H, H-4, 1H), 3.57 (m, 1H), 3.46 (m, 2H), 3.31 (dd, 1H, J2,1 ≈ J2,3 9.0, Hz, H-2); 13C NMR (D2O, 125 MHz), δ (ppm): 171.7, 171.0, 148.7, 148.3, 147.8, 136.9, 136.8, 135.4, 130.6, 125.9, 124.2, 122.8, 109.4, 109.3, 103.9 (C-1′), 103.1 (C-1), 102.9, 96.4 (C-1′′), 79.7, 78.2, 76.0, 75.7, 75.4, 73.7, 71.8, 70.5, 70.5, 70.3, 70.1, 69.8, 69.8, 69.2, 65.8, 61.9, 61.9, 61.1, 54.4, 40.0; ESI-HRMS: m/z: calcd for C38H51N3NaO23: 940.2806; found: 940.2799.
UN4C1. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (148.9 mg, 0.20 mmol) afforded final product UN4C1 (57.4 mg, 0.063 mmol, 58%) as a white solid, Rf 0.40 (DCM–MeOH–H2O, 7:3:0.5), [α]D +54.8 (c 0.8, MeOH); IR (cm−1, solid): 3367, 2929, 2508, 1653; 1H NMR (D2O, 500 MHz), δ (ppm): 7.03 (m, 1H), 6.91 (d, 1H, J 7.0 Hz), 6.89 (d, 1H, J 7.0 Hz), 6.69 (m, 3H), 6.35 (d, 1H, J 9.5 Hz), 5.12 (d, 1H, J1,2 3.5 Hz, H-1′′), 4.48 (d, 1H, J1,2 7.5 Hz, H-1′), 4.40 (d, 1H, J1,2 8.0 Hz, H-1), 4.17 (m, 4H), 3.99 (d, 1H, J4,3 3.0 Hz, H-4′′), 3.93 (m, 3H, H-3′′, H-6a, H-6b), 3.84 (dd, 1H, J2,1 3.5, J2,3 10.5 Hz, H-2′′), 3.78 (m, 3H, H-3′, H-5, H-5′), 3.72 (m, 6H, H-6a′, H-6b′, H-6a′′, H-6b′′, CH2), 3.61 (m, 4H, H-2′, H-3, CH2), 3.50 (m, 3H, H-4, CH2), 3.31 (m, 5H), 2.39 (m, 4H), 1.42 (m, 4H); 13C NMR (D2O, 125 MHz), δ (ppm): 173.6, 170.4, 164–114.1 (aromatic Cs), 103.9 (C-1′), 103.2 (C-1), 96.4 (C-1′′), 79.7, 78.3, 76.0, 75.7, 75.4, 73.8, 71.8, 70.6, 70.5, 70.3, 70.1, 69.9, 69.7, 69.2, 65.8, 61.9, 61.9, 61.2, 53.7, 41.2, 39.9, 29.7, 29.5, 23.6, 23.5; ESI-HRMS: m/z: calcd for C42H59FN2NaO10: 937.3588; found: 937.3583.
UN4C2. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (158.8 mg, 0.22 mmol) afforded final product UN4C2 (78.0 mg, 0.088 mmol, 79%) as a white solid, Rf 0.40 (DCM–MeOH–H2O, 7:3:0.5), [α]D +58.0 (c 0.5, MeOH); IR (cm−1, solid): 3352, 2932, 2866, 1653; 1H NMR (D2O, 500 MHz), δ (ppm): 7.09 (m, 2H), 7.00 (d, 1H, J 8.0 Hz), 5.21 (d, 1H, J1,2 4.0 Hz, H-1′′), 4.58 (d, 1H, J1,2 7.5 Hz, H-1′), 4.52 (d, 1H, J1,2 8.0 Hz, H-1), 4.35 (s, 2H), 4.26 (m, 2H, H-4′, H-5′′), 4.08 (d, 1H, J4,3 3.0 Hz, H-4′′), 4.03 (m, 3H, H-3′′, H-6a, H-6b), 3.93 (dd, 1H, J2,1 3.5, J2,3 10.0 Hz, H-2′′), 3.86 (m, 3H, H-3′, H-5, H-5′), 3.81 (m, 5H, H-6a′, H-6b′, H-6a′′, H-6b′′, 1H), 3.72 (m, 5H, H-2′, H-3, CH2, 1H), 3.63 (m, 3H, H-4, CH2), 3.45 (m, 2H), 3.41 (dd, 1H, J2,1 ≈ J2,3 8.5 Hz, H-2), 2.66 (m, 4H), 2.17 (m, 3H), 1.64 (m, 6H), 1.38 (m, 4H), 0.88 (m, 2H); 13C NMR (D2O, 125 MHz), δ (ppm): 175.9, 170.7, 141.2, 139.3, 137.8, 131.1, 129.1, 125.9, 103.9 (C-1′), 103.2 (C-1), 96.4 (C-1′′), 79.7, 78.3, 76.0, 75.8, 75.4, 73.8, 71.8, 70.6, 70.6, 70.3, 70.1, 69.9, 69.7, 69.2, 65.8, 61.9, 61.9, 61.2, 53.7, 40.2, 39.9, 37.6, 33.0, 33.0, 29.9, 29.7, 25.5, 25.5, 23.8, 23.7; ESI-HRMS: m/z: calcd for C41H64N2NaO19: 911.3995; found: 911.3992.
UN4C3. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (167.3 mg, 0.23 mmol) afforded final product UN4C3 (56.5 mg, 0.062 mmol, 56%) as a white solid, Rf 0.40 (DCM–MeOH–H2O, 7:3:0.5), [α]D +49.3 (c 0.5, MeOH); IR (cm−1, solid): 3353, 2926, 1652; 1H NMR (D2O, 500 MHz), δ (ppm): 7.02 (d, 1H, J 7.0 Hz), 6.85 (m, 4H), 6.76 (d, 2H, J 7.5 Hz), 5.21 (d, 1H, J1,2 3.5 Hz, H-1′′), 4.57 (d, 1H, J1,2 8.0 Hz, H-1′), 4.48 (d, 1H, J1,2 8.0 Hz, H-1), 4.26 (m, 4H, H-4′, H-5′′, CH2), 4.08 (d, 1H, J4,3 3.0 Hz, H-4′′), 4.02 (m, 3H, H-3′′, H-6a, H-6b), 3.93 (dd, 1H, J2,1 3.5, J2,3 10.5 Hz, H-2′′), 3.86 (m, 3H, H-3′, H-5, H-5′), 3.78 (m, 6H, H-6a′, H-6b′, H-6a′′, H-6b′′, CH2), 3.70 (m, 4H, H-2′, H-3, CH2), 3.58 (m, 3H, H-4, CH2), 3.40 (m, 4H), 3.34 (m, 1H, H-2), 2.42 (m, 4H), 2.08 (s, 3H), 1.45 (m, 4H); 13C NMR (D2O, 125 MHz), δ (ppm): 174.2, 170.4, 140.8, 139.0, 137.8, 136.8, 132.9, 130.9, 129.8, 129.8, 129.8, 129.3, 125.9, 125.9, 103.9 (C-1′), 103.2 (C-1), 96.4 (C-1′′), 79.8, 78.4, 78.3, 76.0, 75.7, 75.4, 73.8, 71.8, 70.6, 70.3, 70.1, 69.9, 69.7, 69.2, 65.8, 61.9, 61.9, 61.2, 53.8, 40.9, 39.9, 29.7, 29.7, 23.7, 23.5, 21.4; ESI-HRMS: m/z: calcd for C43H62N2NaO19: 933.3839; found: 933.3832.
UN4C4. Ugi coupling and subsequent Zemplen deacetylation of trisaccharide isocyanide 7 (70.1 mg, 0.069 mmol) afforded UN4C4 (40.2 mg, 0.046 mmol, 67%) as a white solid, Rf 0.38 (DCM–MeOH–H2O, 7:3:0.5), [α]D +54.6 (c 0.2, MeOH); IR (cm−1, solid): 3390, 2928, 2891, 1662, 1066; 1H NMR (D2O, 600 MHz), δ (ppm): 7.13 (d, J 9.0 Hz, 1H), 7.00 (m, 2H), 5.70 (s, 1H), 5.14 (d, J1,2 1.8 Hz, H-1′′), 4.51 (d, J1,2 7.8 Hz, H-1′), 4.45 (m, H-1, NHCH2), 4.19 (m, H-5′′, H-4′), 4.02 (m, H-4′′), 3.96 (m, H-3′′, H-6a, H-6b), 3.86 (dd, J2,1 1.8, J2,3 10.2 Hz, H-2′′), 3.79 (m, H-3′, H-5, H-5′′), 3.74 (m, H-6a′, H-6b′, H-6a′′, H-6b′′, 1H), 3.65 (m, H-2′, H-3, –CH2, 1H), 3.58 (m, H-4, CH2), 3.40 (m, CH2), 3.32 (dd, J2,1 ≈ J2,3 7.8 Hz, H-2), 2.73 (m, 4H), 1.90 (m, 2H), 1.75 (m, 4H), 1.59 (s, 3H), 0.71 (s, 3H); 13C NMR (D2O, 125 MHz), δ (ppm): 171.4, 171.4, 141.6, 139.5, 138.0, 130.7, 130.3, 128.4, 124.8, 103.9 (C-1′), 103.1 (C-1), 96.4 (C-1′′), 79.6, 78.2, 76.0, 75.7, 75.4, 73.7, 71.8, 70.6, 70.5, 70.3, 70.1, 69.8, 69.8, 69.8, 69.2, 65.8, 61.9, 61.9, 61.1, 53.8, 39.9, 29.8, 29.5, 23.6, 23.5, 21.5, 14.1, 12.9; ESI-HRMS: m/z: calcd for C40H62N2NaO19: 897.3839; found: 897.3828.
UN4C5. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (143.2 mg, 0.20 mmol) afforded final product UN4C5 (85.0 mg, 0.092 mmol, 69%) as a white solid, Rf 0.38 (DCM–MeOH–H2O, 7:3:0.5), [α]D +56.2 (c 0.8, MeOH); IR (cm−1, solid): 3363, 2930, 1646; 1H NMR (D2O, 500 MHz), δ (ppm): 8.04 (s, 1H), 7.77 (m, 2H), 7.2 (s, 1H), 6.90 (m, 2H), 6.62 (m, 1H), 5.21 (d, 1H, J1,2 4.0 Hz, H-1′′), 4.57 (m, 2H, H-1′, 1H), 4.51 (d, 1H, J1,2 8.0 Hz, H-1), 4.26 (m, 2H, H-4′, H-5′′), 4.08 (d, 1H, J4,3 3.5 Hz, H-4′′), 4.01 (m, 3H, H-3′′, H-6a, H-6b), 3.93 (dd, 1H, J2,1 3.5, J2,3 10.5 Hz, H-2′′), 3.88 (m, 3H, H-3, H-5, H-5′), 3.81 (m, 5H, H-6a′, H-6b′, H-6a′′, H-6b′′, 1H), 3.71 (m, 6H, H-2′, H-3, 2 × CH2), 3.62 (m, 3H, H-4, CH2), 3.47 (m, 2H), 3.41 (dd, 1H, J2,1 ≈ J2,3 8.0 Hz, H-2), 2.40 (m, 4H), 1.41 (m, 4H); 13C NMR (D2O, 125 MHz), δ (ppm): 170.5, 170.2, 147.8, 140.9, 139.3, 137.5, 130.8, 130.4, 128.8, 125.7, 125.6, 125.3, 125.3, 125.1, 103.9 (C-1′), 103.2 (C-1), 96.4 (C-1′′), 79.8, 78.3, 76.0, 75.7, 75.4, 73.7, 71.8, 70.6, 70.6, 70.3, 70.1, 69.9, 69.7, 69.2, 65.8, 61.9, 61.9, 61.2, 53.7, 39.9, 29.7, 29.4, 23.5, 23.2; ESI-HRMS: m/z: calcd for C41H57N3NaO21: 950.3377; found: 950.3368.
UN5C1. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (129.7 mg, 0.18 mmol) afforded final product UN5C1 (88.5 mg, 0.10 mmol, 79%) as a white solid, Rf 0.28 (DCM–MeOH–H2O, 7:3:0.5), [α]D +57.3 (c 0.7, MeOH); IR (cm−1, solid): 3384, 2930, 2882, 1645, 1075; 1H NMR (D2O, 600 MHz), δ (ppm): 8.46 (d, 0.6H, J 4.8 Hz), 8.36 (d, 0.4H, J 4.8 Hz), 7.81 (m, 0.6H), 7.72 (m, 0.4H), 7.31 (m, 3H), 7.05 (m, 0.4H), 7.01 (m, 0.6H), 6.99 (d, 0.6H, J 7.2 Hz), 6.85 (d, 0.4H, J 7.2 Hz), 6.80 (d, 0.6H, J 7.2 Hz), 6.75 (d, 0.4H, J 7.2 Hz), 5.13 (d, 1H, J1,2 4.2 Hz, H-1′′), 4.50 (d, 0.6H, J1,2 7.8 Hz, H-1′), 4.48 (d, 0.4H, J 7.8 Hz), 4.46 (d, 0.6H, J 7.8 Hz, H-1), 4.42 (d, 0.4H, J 7.8 Hz), 4.18 (m, 2H, H-4′, H-5′′), 4.08 (s, 0.8H), 4.05 (s, 1.20H), 3.98 (m, 4H, H-3′′, H-4′′, H-6a, H-6b), 3.86 (m, 2H, H-2′′, 1H), 3.76 (m, 4H, H-3′, H-5, H-5′, 1H), 3.73 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b′′), 3.69 (m, 6H, H-2′, H-3, 2 × CH2), 3.61 (m, 3H, H-4, CH2), 3.55 (m, 1H), 3.52 (m, 3H), 3.34 (dd, 0.6H, J2,1 7.8, J4,3 9.0 Hz, H-2), 3.29 (dd, 0.4H, J2,1 7.8, J4,3 9.0 Hz, H-2), 3.04 (dd, 1.2H, J 6.6, J 6.6 Hz), 2.98 (dd, 0.8H, J 6.6, J 6.6 Hz); 13C NMR (D2O, 125 MHz), δ (ppm): 175.1, 171.7, 171.2, 164.4–114.6 (aromatic Cs), 103.7 (C-1′), 102.9 (C-1), 96.3 (C-1′′), 79.5, 78.1, 75.9, 75.6, 75.3, 73.6, 71.7, 70.4, 70.4, 70.3, 70.2, 69.9, 69.7, 69.6, 69.1, 65.7, 61.8, 61.8, 61.0, 51.9, 50.7, 50.4, 48.5, 40.3, 39.8, 39.4, 36.4, 35.4; ESI-HRMS: m/z: calcd for C39H56FN3NaO19: 912.3384; found: 912.3379.
UN5C2. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (116.9 mg, 0.16 mmol) afforded final product UN5C2 (69.0 mg, 0.08 mmol, 78%) as a white solid, Rf 0.28 (DCM–MeOH–H2O, 7:3:0.5), [α]D +57.8 (c 0.5, MeOH); IR (cm−1, solid): 3399, 2939, 2872, 1640, 1075; 1H NMR (D2O, 600 MHz), δ (ppm): 8.47 (d, 0.6H, J 4.8 Hz), 8.41 (d, 0.4H, J 4.8 Hz), 7.80 (m, 1H), 7.33 (m, 2H), 5.13 (d, 1H, J1,2 3.6 Hz, H-1′′), 4.50 (d, 1H, J1,2 7.8 Hz, H-1′), 4.49 (d, 0.6H, J1,2 8.4 Hz, H-1), 4.47 (d, 0.4H, J1,2 8.4 Hz, H-1), 4.18 (m, 2H, H-4′, H-5′′), 3.96 (m, 6H, H-3′′, H-4′′, H-6a, H-6b, CH2), 3.85 (dd, 1H, J2,1 3.6, J2,3 10.0 Hz, H-2′′), 3.82 (m, 3H, H-3′, H-5, H-5′), 3.76 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b′′), 3.72 (m, 6H, H-2′, H-3, 2 × CH2), 3.62 (m, 5H, H-4, 2 × CH2), 3.57 (m, 1H), 3.41(m, 2H), 3.33 (m, 1H, H-2), 3.06 (dd, 1.2H J 6.6, J 6.6 Hz), 2.99 (dd, 0.8H J 6.6, J 6.6 Hz), 2.26 (d, 0.8H, J 7.8 Hz), 1.97 (d, 1.2H, J 7.8 Hz), 1.96 (m, 0.4H), 1.89 (m, 0.6H), 1.61 (m, 2H), 1.45 (m, 4H), 1.02 (m, 0.8H), 0.87 (m, 1.2H); 13C NMR (D2O, 125 MHz), δ (ppm): 177.9, 177.8, 172.1, 171.8, 159.0, 158.6, 149.6, 149.2, 139.4, 139.1, 125.8, 125.3, 123.7, 123.4, 103.9 (C-1′), 103.1 (C-1), 96.4 (C-1′′), 79.6, 78.2, 76.0, 75.7, 75.4, 73.8, 71.8, 70.6, 70.5, 70.4, 71.3, 70.1, 69.8, 69.8, 69.2, 65.8, 61.9, 61.9, 61.1, 51.9, 50.7, 50.4, 39.9, 39.6, 38.9, 37.2, 36.7, 36.6, 35.7, 32.9, 32.9, 32.9, 25.4, 25.4, 25.3; ESI-HRMS: m/z: calcd for C38H61N3NaO19: 886.3791; found: 886.3788.
UN5C3. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (121.5 mg, 0.17 mmol) afforded final product UN5C3 (77.1 mg, 0.087 mmol, 73%) as a white solid, Rf 0.32 (DCM–MeOH–H2O, 7:3:0.5), [α]D +54.2 (c 0.9, MeOH); IR (cm−1, solid): 3396, 2926, 2886, 1639, 1075; 1H NMR (D2O, 600 MHz), δ (ppm): 8.45 (d, 0.6H, J 4.8 Hz), 8.36 (d, 0.4H, J 4.8 Hz), 7.80 (m, 0.6H), 7.71 (m, 0.4H), 7.36 (m, 0.6H), 7.28 (m, 1H), 7.23 (d, 0.4H, J 7.8 Hz), 7.18 (d, 0.8H, J 7.8 Hz), 7.15 (d, 1.2H, J 7.8 Hz), 6.98 (d, 0.8H, J 7.8 Hz), 6.91 (d, 1.2H, J 7.8 Hz), 5.13 (d, 1H, J1,2 4.2 Hz, H-1′′), 4.49 (d, 0.6H, J1,2 7.8 Hz, H-1′), 4.48 (d, 0.4H, J1,2 7.8 Hz, H-1′), 4.45 (d, 0.6H, J1,2 8.4 Hz, H-1), 4.40 (d, 0.4H, J1,2 8.4 Hz, H-1), 4.17 (m, 2H, H-4′, H-5′′), 4.03 (s, 2H), 3.94 (m, 4H, H-3′′, H-4′′, H-6a, H-6b), 3.85 (dd, 1H, J2,1 4.2, J2,3 10.8 Hz, H-2′′), 3.78 (m, 3H, H-3′, H-5, H-5′), 3.74 (m, 6H, H-6a′, H-6b′, H-6a′′, H-6b′′, CH2), 3.68 (m, 6H, H-2′, H-3, 2 × CH2), 3.55 (m, 4H), 3.51 (m, 1H), 3.39 (m, 3H, H-2, CH2), 3.01 (dd, 1.2H, J 6.6, J 6.6 Hz), 2.97 (dd, 0.8H, J 6.6, J 6.6 Hz), 2.31 (s, 1.2H), 2.28 (s, 1.8H); 13C NMR (D2O, 125 MHz), δ (ppm): 177.5, 177.5, 173.5, 172.9, 160.4, 160.0, 151.2, 150.8, 140.9, 140.6, 139.8, 139.7, 133.8, 133.5, 131.9, 131.9, 131.8, 131.7, 131.6, 131.5, 127.2, 126.9, 125.3, 124.9, 105.4 (C-1′), 104.6 (C-1), 97.9 (C-1′′), 81.2, 79.8, 77.6, 77.3, 76.9, 75.3, 75.3, 73.4, 72.1, 72.1, 71.9, 71.9, 71.7, 71.4, 71.2, 70.8, 67.4, 63.5, 63.5, 62.7, 53.7, 52.4, 52.2, 50.2, 42.2, 41.5, 41.1, 38.1, 37.1, 22.7; ESI-HRMS: m/z: calcd for C40H59N3NaO19: 908.3635; found: 908.3630.
UN5C4. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (150.8 mg, 0.21 mmol) afforded final product UN5C4 (99.2 mg, 0.11 mmol, 57%) as a white solid, Rf 0.30 (DCM–MeOH–H2O, 7:3:0.5), [α]D +57.8 (c 0.8, MeOH); IR (cm−1, solid): 3376, 2930, 2877, 1663, 1075; 1H NMR (D2O, 600 MHz), δ (ppm): 8.44 (m, 1H), 7.79 (m, 1H), 7.32 (m, 2H), 5.37 (dd, 0.5H, J 6.6, J 6.6 Hz), 5.13 (d, 1H, J1,2 4.2 Hz, H-1′′), 5.09 (dd, 0.5H, J 6.6, J 6.6 Hz), 4.49 (m, 2H, H-1′, H-1), 4.19 (m, 2H, H-4′, H-5′′), 4.09 (m, 1H), 4.04 (s, 2H), 4.01 (d, 1H, J4,3 3.6 Hz, H-4′′), 3.95 (m, 2H, H-6a, H-6b), 3.85 (m, 2H, H-3′′, H-2′′), 3.79 (m, 3H, H-3′, H-5, H-5′), 3.74 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b′′), 3.71 (m, 6H, H-2′, H-3, 2 × CH2), 3.64 (m, 5H, H-4, 2 × CH2), 3.57 (m, 1H), 3.41 (m, 2H), 3.34 (m, 1H, H-2), 2.00 (m, 2H), 1.63 (s, 1.5H), 1.51 (s, 1.5H), 0.90 (m, 3H); 13C NMR (D2O, 125 MHz), δ (ppm): 179.8, 179.5, 173.5, 173.2, 160.5, 160.1, 151.2, 150.7, 140.8, 140.7, 137.4, 136.7, 131.6, 131.1, 127.1, 127.0, 125.2, 125.0, 105.4 (C-1′), 104.7 (C-1), 97.9 (C-1′′), 81.2, 79.8, 77.6, 77.3, 76.9, 75.3, 73.4, 72.1, 72.0, 71.9, 71.7, 71.4, 71.4, 70.8, 67.4, 63.5, 63.5, 62.7, 54.5, 53.3, 50.9, 49.1, 41.5, 41.5, 38.1, 36.9, 22.9, 15.5, 15.1, 14.9; ESI-HRMS: m/z: calcd for C37H59N3NaO19: 872.3635; found: 872.3631.
UN5C5. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (125.0 mg, 0.17 mmol) afforded final product UN5C5 (87.1 mg, 0.097 mmol, 57%) as a white solid, Rf 0.32 (DCM–MeOH–H2O, 7:3:0.5), [α]D +54.4 (c 0.5, MeOH); IR (cm−1, state): 3394, 2924, 2886, 1633, 1075; 1H NMR (D2O, 600 MHz), δ (ppm): 8.49 (d, 0.3H, J 4.8 Hz), 8.33 (dd, 0.3H, J 1.2, J 8.4 Hz), 8.27 (dd, 0.7H, J 1.2, J 8.4 Hz), 8.17 (d, 0.7H, J 4.8 Hz), 8.08 (m, 0.3H), 7.84 (m, 0.3H), 7.72 (m, 0.7H), 7.69 (dd, 0.3H, J 8.4, J 8.4 Hz), 7.66 (m, 0.7), 7.63 (m, 0.7H), 7.60 (dd, 0.7H, J 8.4, J 8.4 Hz), 7.47 (m, 0.3H), 7.42 (d, 0.3H, J 7.8 Hz), 7.35 (m, 0.3H), 7.26 (m, 0.7H), 7.17 (d, 0.7H, J 7.8 Hz), 5.13 (d, 1H, J1,2 4.2 Hz, H-1′′), 4.50 (d, 0.3H, J1,2 7.8 Hz, H-1′), 4.49 (d, 0.7H, J1,2 7.8 Hz, H-1′), 4.48 (d, 0.7H, J1,2 8.4 Hz, H-1), 4.44 (d, 0.3H, J1,2 8.4 Hz, H-1), 4.33 (s, 2H), 4.18 (m, 2H, H-4′, H-5′′), 4.02 (d, 1H, J4,3 3.0 Hz, H-4′′), 3.94 (m, 3H, H-3′′, H-6a, H-6b), 3.86 (m, 2H, H-2, 1H), 3.80 (m, 3H, H-3′, H-5. H-5′), 3.76 (m, 6H, H-6a′, H-6b′, H-6a′′, H-6b′′, CH2), 3.69 (m, 4H, H-2′, H-3, CH2), 3.62 (m, 3H, H-4, CH2), 3.54 (m, 3H), 3.31 (m, 1H, H-2), 3.15 (dd, 0.6H, J 6.6, J 6.6 Hz), 2.94 (dd, 1.4H, J 6.6, J 6.6 Hz); 13C NMR (D2O, 125 MHz), δ (ppm): 172.9, 172.8, 171.2, 170.9, 158.9, 157.8, 149.4, 149.3, 148.6, 148.5, 139.1, 139.1, 136.8, 135.9, 133.7, 133.6, 131.4, 131.0, 125.9, 125.8, 125.6, 125.4, 123.6, 123.5, 123.5, 122.6, 122.2, 103.8 (C-1′), 103.0 (C-1), 96.3 (C-1′′), 79.6, 79.5, 78.1, 75.9, 75.6, 75.3, 73.7, 71.7, 70.5, 70.4, 70.3, 70.2, 70.0, 69.8, 69.7, 69.5, 69.1, 65.7, 61.9, 69.8, 61.0, 53.1, 52.2, 50.0, 48.3, 39.9, 39.8, 36.2, 35.3; ESI-HRMS: m/z: calcd for C38H54N4NaO21: 925.3173; found: 925.3165.
UN6C1. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (123.4 mg, 0.17 mmol) afforded final product UN6C1 (40.9 mg, 0.046 mmol, 60%) as a white solid, Rf 0.36 (DCM–MeOH–H2O, 7:3:0.5), [α]D +52.5 (c 0.6, MeOH); IR (cm−1, solid): 3370, 2930, 2532, 1647; 1H NMR (D2O, 500 MHz), δ (ppm): 7.25 (m, 1H), 7.20 (d, J 9.0 Hz, 2H), 7.08 (m, 1H), 6.96 (d, J 9.0 Hz, 2H), 6.82 (m, 1H), 6.75 (m, 1H), 5.12 (d, J1,2 4.0 Hz, H-1′′), 4.48 (d, J1,2 8.0 Hz, H-1′′), 4.42 (d, J1,2 8.0 Hz, H-1), 4.33 (s, NHCH2), 4.16 (m, H-5′′, H-4′), 4.00 (d, J4,3 3.5 Hz, H-4′′), 3.93 (m, H-3′′, H-6a, H-6b), 3.84 (dd, J2,1 3.5, J2,3 10.0 Hz, H-2′′), 3.79 (s, 3H), 3.76 (m, H-3′, H-5, H-5′), 3.69 (m, H-6a′, H-6b′, H-6a′′, H-6b′′, 1H), 3.65 (m, H-2′, H-3, CH2, 1H), 3.55 (m, H-4, 2 × CH2), 3.35 (m, CH2), 3.29 (dd, J2,1 8.0, J2,3 9.5 Hz, H-2); 13C NMR (D2O, 125 MHz), δ (ppm): 175.3, 171.3, 164–114 (aromatics Cs), 103.8 (C-1′), 103.1 (C-1), 96.4 (C-1′′), 79.6, 78.2, 76.0, 75.7, 75.4, 73.7, 71.8, 70.6, 70.5, 70.3, 70.1, 69.8, 69.8, 69.2, 65.8, 61.9, 61.9, 61.1, 56.6, 53.9, 41.2, 39.9; ESI-HRMS: m/z: calcd for C39H55FN2NaO20: 913.3224; found: 913.3218.
UN6C2. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (120.7 mg, 0.17 mmol) afforded final product UN6C2 (41.5 mg, 0.048 mmol, 57%) as a white solid, Rf 0.40 (DCM–MeOH–H2O, 7:3:0.5), [α]D +20.4 (c 1.2, MeOH); IR (cm−1, solid): 3365, 2937, 2869, 2491, 1652; 1H NMR (D2O, 500 MHz), δ (ppm): 7.25 (d, 2H, J 8.9 Hz), 6.99 (d, 2H, J 8.9 Hz), 5.11 (d, 1H, J1,2 3.9 Hz, H-1′′), 4.47 (d, 1H, J1,2 7.9 Hz, H-1′), 4.42 (d, 1H, J1,2 8.0 Hz, H-1), 4.33 (d, 1H, 2J 16.1 Hz), 4.30 (d, 1H, 2J 16.1 Hz), 4.15 (m, 2H, H-4′, H-5′′), 3.98 (d, 1H, J4,3 3.7 Hz, H-4′′), 3.94 (m, 3H, H-3′′, H-6a, H-6b), 3.81 (dd, 1H, J2,1 3.9, J2,3 10.3 Hz, H-2′′), 3.81 (s, 3H), 3.74 (m, 3H, H-3′, H-5, H-5′), 3.67 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b′′), 3.60 (m, 6H, H-2′, H-3, 2 × CH2), 3.54 (m, 3H, H-4, CH2), 3.36 (m, 2H), 3.29 (dd, 1H, J2,1 8.0, J2,3 9.3 Hz, H-2), 2.17 (d, 2H, J 7.5 Hz), 2.05 (m, 1H), 1.64 (m, 2H), 1.42 (m, 4H), 0.96 (m, 2H); 13C NMR (D2O, 125 MHz), δ (ppm): 177.9, 171.5, 159.7, 136.1, 130.0, 130.0, 115.9, 115.9, 103.9 (C-1′), 103.1 (C-1), 96.4 (C-1′′), 79.6, 78.2, 76.0, 75.7, 75.4, 73.7, 71.8, 70.6, 70.5, 70.3, 70.1, 69.8, 69.8, 69.2, 65.8, 61.9, 61.9, 61.1, 56.6, 53.7, 40.2, 39.9, 37.6, 32.8, 32.8, 25.2, 25.2; ESI-HRMS: m/z: calcd for C38H60N2NaO20: 887.3632; found: 887.3626.
UN6C3. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (130.6 mg, 0.18 mmol) afforded final product UN6C3 (84.2 mg, 0.095 mmol, 53%) as a white solid, Rf 0.40 (DCM–MeOH–H2O, 7:3:0.5), [α]D +57.1 (c 0.4, MeOH); IR (cm−1, solid): 3347, 2921, 1644; 1H NMR (D2O, 500 MHz), δ (ppm): 7.23 (d, 2H, J 9.0 Hz), 7.12 (d, 2H, J 8.0 Hz), 6.97 (d, 2H, J 9.0 Hz), 6.93 (d, 2H, J 8.0 Hz), 5.21 (d, 1H, J1,2 4.0 Hz, H-1′′), 4.57 (d, 1H, J1,2 8.0 Hz, H-1′), 4.49 (d, 1H, J1,2 7.5 Hz, H-1), 4.38 (s, 2H), 4.26 (m, 2H, H-4′, H-5′′), 4.08 (d, 1H, J4,3 3.0 Hz, H-4′′), 4.02 (m, 3H, H-3′′, H-6a, H-6b), 3.93 (dd, 1H, J2,1 3.5, J2,3 10.0 Hz, H-2′′), 3.87 (m, 3H, H-3′, H-5, H-5′), 3.83 (s, 3H), 3.79 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b′′), 3.72 (m, 6H, H-2′, H-3, 2 × CH2), 3.62 (m, 3H, H-4, CH2), 3.52 (s, 2H), 3.44 (m, 2H), 3.38 (dd, 1H, J2,1 ≈ J2,3 8.5 Hz, H-2), 2.29 (s, 3H); 13C NMR (D2O, 125 MHz), δ (ppm): 175.8, 171.3, 159.8, 137.8, 135.8, 132.6, 130.1, 130.1, 130.0, 130.0, 129.9, 129.9, 115.8, 115.8, 103.9 (C-1′), 103.1 (C-1), 96.4 (C-1′′), 79.7, 78.2, 76.0, 75.7, 75.4, 73.7, 71.8, 70.6, 70.5, 70.3, 70.1, 69.8, 69.2, 65.8, 61.9, 61.9, 61.1, 56.5, 56.5, 53.9, 40.9, 39.9, 21.1; ESI-HRMS: m/z: calcd for C40H58N2NaO20: 909.3475; found: 909.3569.
UN6C4. Ugi coupling and subsequent deacetylation and enzymatic α-galactosylation of lactose-derived isocyanide 6 (124.0 mg, 0.17 mmol) afforded final product UN6C4 (80.0 mg, 0.094 mmol, 55%) as a white solid, Rf 0.34 (DCM–MeOH–H2O, 7:3:0.5), [α]D +61.7 (c 0.5, MeOH); IR (cm−1, solid): 3352, 2930, 1662; 1H NMR (D2O, 500 MHz), δ (ppm): 7.28 (d, 2H, J 9.0 Hz), 7.04 (d, 2H, J 9.0 Hz), 5.75 (m, 1H), 5.21 (d, 1H, J1,2 4.0 Hz, H-1′′), 4.58 (d, 1H, J1,2 7.5 Hz, H-1′), 4.53 (d, 1H, J1,2 8.0 Hz, H-1′), 4.51 (s, 2H), 4.26 (m, 2H, H-4′, H-5′′), 4.08 (d, 1H, J4,3 3.5 Hz, H-4′′), 4.03 (m, 3H, H-3′′, H-6a, H-6b), 3.93 (d, 1H, J2,1 3.5, J2,3 10.0 Hz, H-2′′), 3.88 (s, 3H), 3.84 (m, 3H, H-3′, H-5, H-5′), 3.80 (m, 4H, H-6a′, H-6b′, H-6a′′, H-6b′′), 3.71 (m, 6H, H-2′, H-3, 2 × CH2), 3.63 (m, 3H, H-4, CH2), 3.45 (m, 2H), 3.39 (d, 1H, J2,1 ≈ J2,3 8.5 Hz, H-2′′), 1.95 (m, 2H), 1.62 (s, 3H), 0.76 (s, 3H); 13C NMR (D2O, 125 MHz), δ (ppm): 171.4, 171.4, 159.0, 130.3, 129.4, 129.4, 115.5, 103.9 (C-1′), 103.1 (C-1), 96.4 (C-1′′), 79.6, 78.2, 76.0, 75.7, 75.4, 73.7, 71.8, 70.6, 70.5, 70.3, 70.1, 69.8, 69.8, 69.8, 69.2, 65.8, 61.9, 61.9, 61.1, 56.5, 56.5, 39.9, 21.4, 14.1, 13.0; ESI-HRMS: m/z: calcd for C37H58N2NaO20: 873.3475; found: 873.3469.
UN6C5. Ugi coupling and subsequent Zemplen deacetylation of trisaccharide isocyanide 7 (69.3 mg, 0.068 mmol) afforded UN6C5 (46.7 mg, 0.052 mmol, 76%) as a white solid, Rf 0.36 (DCM–MeOH–H2O, 7:3:0.5), [α]D +54.0 (c 0.4, MeOH); IR (cm−1, solid): 3349, 2931, 1636; 1H NMR (D2O, 600 MHz), δ (ppm): 8.18 (s, 1H), 8.13 (d, 1H, J 8.4 Hz), 7.71 (d, 1H, J 7.8 Hz), 7.47 (dd, 1H, J 7.8, 7.8 Hz), 7.18 (d, 2H, J 8.4 Hz), 6.82 (d, 2H, J 8.4 Hz), 5.13 (d, 1H, J 3.6 Hz, H-1′′), 4.64 (s, 2H, NHCH2), 4.50 (d, 1H, J 7.8 Hz, H-1′), 4.45 (d, 1H, J 7.8 Hz, H-1), 4.18 (m, 2H, H-5′′, H-4′), 3.96 (m, 4H, H-4′′, H-3′′, H-6a, H-6b), 3.85 (dd, J2,1 3.6, J2,3 10.2 Hz, H-2′′), 3.79 (m, 3H, H-3′, H-5, H-5′), 3.72 (m, 5H, H-6a′, H-6b′, H-6a′′, H-6b′′, 1H), 3.69 (s, 3H), 3.67 (m, 3H, H-2′, H-3, 1H), 3.62 (m, 4H, H-4, CH2, 1H), 3.55 (m, 1H), 3.44 (m, 2H), 3.31 (dd, J2,1 ≈ J2,3 7.8 Hz, H-2); 13C NMR (D2O, 125 MHz), δ (ppm): 171.9, 171.1, 159.1, 148.2, 136.9, 136.1, 135.4, 130.7, 130.0, 130.0, 125.9, 124.3, 115.7, 115.7, 103.9 (C-1′′), 103.1 (C-1), 96.4 (C-1′′), 79.7, 78.2, 76.0, 75.2, 75.4, 73.7, 71.8, 70.6, 70.5, 70.3, 70.1, 69.8, 69.8, 69.2, 65.8, 61.9, 61.9, 61.1, 56.5, 54.2, 40.0; ESI-HRMS: m/z: calcd for C38H53N3NaO22: 926.3013; found: 926.3008.
Footnote |
† Electronic supplementary information (ESI) available: Mass and NMR spectra, Python scripts for virtual library screening. See DOI: 10.1039/c4ob01838a |
This journal is © The Royal Society of Chemistry 2015 |