David
Montoir
a,
Mehdi
Amoura
b,
Zine El Abidine
Ababsa
a,
T. M.
Vishwanatha
b,
Expédite
Yen-Pon
a,
Vincent
Robert
c,
Massimiliano
Beltramo
c,
Véronique
Piller
b,
Mouad
Alami
a,
Vincent
Aucagne
*b and
Samir
Messaoudi
*a
aBioCIS, Univ. Paris-Sud, CNRS, Univ. Paris-Saclay, Châtenay-Malabry, France. E-mail: samir.messaoudi@u-psud.fr; Tel: +33 0146835887
bCentre de Biophysique Moléculaire, CNRS, Orléans, France. E-mail: aucagne@cnrs-orleans.fr; Tel: +33 0238255577
cUMR Physiologie de la Reproduction et des Comportements, INRA, CNRS, Univ. Tours, IFCE, Nouzilly, France
First published on 20th September 2018
We describe herein a Pd-catalyzed methodology for the thioglycoconjugation of iodoaryl peptides and aminoacids. This operationally simple process occurs under semi-aqueous conditions and displays wide substrate scope. The strategy has been successfully applied to both the thioglycosylation of unprotected peptides and the generation of thioglyco-aminoacid building blocks, including those suitable for solid phase peptide synthesis. To demonstrate the broad potential of this technique for late stage functionalization, we successfully incorporated challenging unprotected β-S-GlcNAc- and α-S-GalNAc-derivatives into very long unprotected peptides. This study opens the way to new applications in chemical biology, considering the well-recognized advantages of S-glycosides over O-glycosides in terms of resistance towards both enzymatic and chemical degradation.
From a synthetic chemistry point of view, glycosyl thiols have become useful building blocks for the synthesis of S-linked glyco-aminoacids, glycopeptides and glycoproteins, exploiting the exceptional nucleophilicity of the thiol group. Glycosyl thiols can react chemoselectively through conjugate addition,10 nucleophilic substitution of halogenides,11 ring opening of aziridines12 and cyclic sulfamidates,13 or disulfide formation followed by a desulfurative rearrangement.14 Conversely, reaction of the thiol group of cysteine with a carbohydrate derivative through nucleophilic substitution of glycosyl bromides,15 free radical thiol–ene coupling16 or Ferrier reaction17 has also been described. While these methods have provided robust access to S-linked glycopeptides, in most cases the sugar moiety is attached to the peptide through cysteine or sometimes homocysteine residues. Strikingly, to the best of our knowledge there is no report concerning S-linked glycopeptides analogues of O-glycosylated tyrosines,18 despite the relevance of such compounds for glycobiological investigations. Tyrosine O-glycosylation,19 is indeed a rather underexplored PTM, and O-GalNAc Tyr residues were identified only recently in human20 and in murine proteins.21 Although this modification is rare, it was found associated with pathogenic conditions for the amyloid precursor protein (APP) in Alzheimer's disease. Characterizing the O-glycosylation landscape of human plasma, platelets and endothelial cells also showed the presence of such O-GalNAc Tyr linkages in several proteins involved in hemostasis.22 Very recently, pathogenic bacteria toxins able to inactivate eukaryotic Rho GTPases by catalyzing the O-GlcNAcylation of a Tyr residue have also been characterized.23 These recent findings prompted us to develop synthetic routes to O-glycotyrosine thioanalogues, by either preparing glycosylaminoacid building blocks suitable for peptide synthesis, or using a post-synthetic protocol based on the glycoconjugation of unprotected peptide under aqueous conditions. Both approaches are based on the reaction of para-iodophenylalanine (p-IPhe) residues with glycosyl thiols using highly chemoselective Pd-catalysis conditions.
In our efforts to functionalize sugars under transition-metal catalysis,24 we recently reported an efficient protocol for the palladium-catalyzed coupling of aryl halides with various α- and β-glycosyl thiols.25 The C–S bond-forming reaction was achieved within minutes at room temperature by using G3-XantPhos palladacycle pre-catalyst26 (1 mol%), in the presence of Et3N in THF. In order to expand the synthetic utility of this transformation onto more complex systems such as peptides and proteins, we herein extend the scope of this reaction for (i) the preparation of glycoaminoacid building blocks (ii) the late stage functionalization of unprotected peptides under aqueous conditions and (iii) application to a wide variety of protected and unprotected glycosyl thiols. In particular we demonstrate for the first time the application of this chemistry to the reputedly challenging 2-deoxy-2-N-acetamido sugar α-thioGalNAc.
This work thus opens unprecedented opportunities for chemical glycobiology studies of tyrosine glycosylation through precise oxygen-to-sulfur substitution. Besides such possible application, we also showed the utility of this straightforward methodology for the synthesis of various complex neoglycoconjugates, a growing area of research where the saccharide moiety can be exploited to modulate the bioactivity or physicochemical properties of a given peptide or protein.27
To establish the appropriate conditions for the coupling reaction, tetra-O-acetylated 1-thio-β-D-glucopyranose 1a and N-Boc-DL-4-iodophenylalanine 2a were initially selected as model coupling substrates (for the optimization of the reaction conditions see ESI†). A full conversion and 96% yield of isolated product 3a was obtained using 3 mol% of G3-XantPhos catalyst.28 Taking into account that suitable conditions for peptides and proteins functionalization include aqueous media and ambient temperature, we performed the coupling of 1a with 2a in different aqueous media (see ESI† for details). Thus, conducting the coupling reaction in THF/H2O (1:2) afforded 3a in an excellent yield of 92% (see ESI†). Motivated by this water-tolerant coupling procedure, we next expanded the scope of this Pd-catalyzed coupling of structurally diverse α- and β-thiol derivatives of mono-, di- and tri-saccharides with various amino acids 2a–f (Table 1). All the coupling reactions proceeded in good yields and without epimerization at anomeric position.29 In addition to 1a, tetra-O-acetylated 1-thio-β-D-galactopyranose 1b, tri-O-acetylated N-Ac-1-thio-β-D-glucosaminopyranose 1c, tetra-O-benzoylated 1-thio-β-D-glucopyranose 1d and tetra-O-acetylated 1-thio-β-D-mannopyranose 1e were efficiently coupled with both N-Boc-DL-p-iodophenylalanine 2a and its methyl ester 2b to give the corresponding thioglycoconjugates 3a–p in good yields. A slightly lower yield was observed with 1c as nucleophilic partner to give thioglycosylated amino acids 3e and 3f in 72% and 59% yields, respectively. Moreover, the reaction is not limited to monosaccharides, but can be applied to more complex di- and trisaccharide derivatives. Thus, hepta-O-acetylated 1-thio-β-D-cellobiose 1f and deca-O-acetylated 1-thio-β-D-maltotriose 1g were reacted with 2a or 2b to give corresponding glycoconjugates 3j–l in good yields. Noteworthy the reaction works also well with unprotected sugars. As an example, the reaction of 2b with 1-thio-β-D-glucose 1h afforded product 3m in a satisfactory 63% yield and with a total retention of the β-anomeric configuration as confirmed from the distinct J1,2 coupling of 9.7 Hz. 1a and 1h were readily coupled with the L-enantiomer of Fmoc-p-iodo-phenylalanine 2d to furnish 3o and 3p in good yields. Interestingly, the latter compounds are two building blocks suitably protected for their use in Fmoc-based SPPS. In addition to para-iodophenylalanine derivatives, N-Boc-3-iodo-D-tyrosine 2e as well as N-Boc p-iodobenzyl L-cysteine 2f were used successfully in this coupling to furnish thioglycoconjugates 3q, r in 82% and 75% yields, respectively.
Having demonstrated the excellent catalytic activity of the G3-XantPhos pre-catalyst with aminoacids, we next turned our attention to the generalization of the method with respect to IPhe-containing N-Boc protected di- and tripeptides (Table 2). In all cases studied, the coupling reactions proceeded selectively and cleanly in high yields (glycoconjugates 5a–f). The established C–S bond formation protocol is compatible with the indole NH of tryptophan (5b), phenol of tyrosine (5c and 5f) and carboxylic acids (5e), indicating a large tolerance towards functional groups.
To open a complementary synthetic pathway towards larger thioglycopeptides, we next assessed the suitability of the synthesized building blocks in standard Fmoc-based SPPS protocols. For a proof-of-concept, we selected lipo-triazolopeptide 6a, a synthetic derivative of kisspeptin-10 (KP10), an endogenous neuropeptide playing a pivotal role in the central control of the reproductive system in mammals and non-bird vertebrates. KP10 acts through the activation of its cognate G-protein-coupled receptor, KiSS1R. KP10 has a very short half-life in blood (<1 min) and 6a was rationally designed with the aim to overcome this limitation, by including both a 1,4-disubstituted 1,2,3-triazole as a protease-resistant amide bond surrogate30 to improve metabolic stability and an N-palmitoylated isoglutamate albumin-binding motif to decrease renal excretion.316a shows a dramatically enhanced in vivo activity as compared with KP10, while also benefiting from an increased in vitro potency towards KiSS1R. We chose the Tyr1 residue (KP10 numbering), that as has been shown by Ala-scanning to be tolerant to modifications, to incorporate a model β-thioglucosyl analogue.32,33 Note that, as many lipopeptides, 6a shows very limited solubility under aqueous conditions, and is prone to hydrogel formation such as the parent KP10.34 Our choice of this peptide as a model compound was in part driven by our curiosity to see if glycoconjugation could improve its physicochemical properties.
The use of the building block 3p, instead of a standard SPPS Fmoc-Tyr(tBu)-OH furnished the expected thioglyco-lipo-triazolopeptide 6b in similar yields as for 6a (Fig. 1). This demonstrates the compatibility of our thioglycoaminoacid building block with standard Fmoc-SPPS protocols, including couplings, piperidine-mediated Fmoc deprotections and TFA-based cleavage. 6b was tested for its agonist activity towards KiSSR, and showed similar activity (EC50 = 0.03–0.3 nM) as compared with 6a (EC50 0.02–0.2 nM, mean and 95% confidence interval). Physicochemical properties of 6b were also briefly evaluated. However, no marked differences were observed as compared to 6a: the glycopeptide is not soluble in water at a concentration of 0.1 mM, and slowly forms a gel after solubilization in DMSO followed by dilution with water (0.1 mM final concentration, 95:5 water/DMSO).
Fig. 1 Lipo-triazolopeptide analogue 6a of the neuropeptide KP10, and its thioglucosylated derivative 6b. |
To assess the utility of our palladium-catalyzed procedure in the synthesis of very long thioglycopeptides through post-SPPS thioglycosylation, we next investigated the synthesis of analogues of the human mucin MUC1. MUC1 is a densely O-glycosylated glycoprotein produced as a membrane-anchored as well as a soluble form, both expressed by many epithelial cells. Their highly enhanced expression by many tumor cells types is accompanied by their aberrant glycosylation giving rise to largely reduced O-glycan chain complexity and density. Therefore, tumor-associated carbohydrate antigens such as Tn and its sialylated form (STn), are frequently found on epithelial tumor cells and are considered as clinically relevant tumor markers.35 The extracellular and soluble domains of MUC1 proteins contain a variable number of tandem repeat sequences of 20 amino acids (APDTRPAPGSTAPPAHGVTS),36 each presenting 5 potential O-glycosylation sites.37 Thus, Tn antigens on MUC1 act as a signature associated with cancer38 and represent promising targets for the development of synthetic antitumor vaccines.39 However, many attempts to produce MUC1-based anticancer-vaccines illustrated the very low immunogenicity of the natural MUC1 glycoprotein. It is thus necessary to generate analogues of MUC1 to improve the immune response against the native oncogenic glycoprotein. As an example, unnatural derivatives were recently synthesized by incorporation of a linker placed between the peptide and the α-GalNAc moiety, in order to favor the presentation of the sugar for molecular recognition events.40 The crystal structure of major histocompatibility complex-I (MHC-I) with such a linker-containing neoglycopeptide indeed showed that while the aglycone part of the antigen binds to the MHC, the carbohydrate moiety can facilitate the recognition of T-cell receptors (TCR) and therefore stimulate immune response.41 Here, we propose to produce new MUC1 mimics in which the GalNAc moieties would also be distant from the peptide chain and could thus present novel properties interesting for the production of cancer vaccines.
We first examined the coupling reactions between tri-O-acetylated β-thio-GlcNAc 1c, its unprotected congener 1i and α-thioGalNAc 1j with two fully-unprotected modified 60-amino-acid sequences of MUC1 (e.g. 3 tandem repeats) in which threonine residues of the immunogenic APDTR sequence were replaced by p-iodo-L-phenylalanine in position 24 (mono-iodo peptide 7a, Fig. 2) or in positions 4, 24 and 44 (tri-iodo peptide 7b).
Fig. 2 Schematic representations of MUC1-derived mono-iodo peptide 7a and tri-iodo peptide 7b. X = p-iodophenylalanine. |
First attempts were initiated to find optimal conditions for Pd-catalyzed coupling with mono-iodo peptide MUC1 7a (1 mM). O-Acetylated β-thioGlcNAc 1c was used as the model thiosugar. It was found that the quantity of G3-XantPhos reagent is crucial as the use of almost 5 equivalents was required to obtain a complete conversion of the starting material over 1 h. Under these conditions thioglycoconjugate 8a was isolated in an excellent 80% yield (Fig. 3). Notably, coupling of unprotected β-thioGlcNAc 1i with 7a under these conditions furnished exclusively 8b without any detectable peptide side products (Fig. 3). Finally, peptide 8c bearing α-thioGalNAc was obtained by coupling of the challenging and unstable α-GalNAc thiol 1j with 7a in a good yield. These selective thioglycosylations of MUC1-derived mono-iodopeptide 7a represent an important preliminary result in view of modifying more complex peptides and proteins. These promising results encouraged us to investigate this coupling reaction with the tri-iodo peptide 7b (Fig. 2). This coupling was effective in all cases to produce conjugates 8d–f in good yields.
Fig. 3 Pd-catalyzed thioglycoconjugation of MUC1-derived mono-iodo peptide 7a and tri-iodo peptide 7b with 1c, 1i and 1j. aIsolated yield. |
In order to characterize, from a biochemical point of view, the obtained thioglycopeptides we next verified if the thioGalNAc-containing 8c and 8f could be recognized by an O-GalNAc-specific lectin. For that purpose, we conjugated a biotin moiety at the N-terminus of each thioglycopeptide 8c and 8f as well as on iodopeptides 7a and 7b used as negative controls (see ESI† for details).42 The biotinylated MUC1 analogues were efficiently immobilized on NeutrAvidin-coated plates in a quantitative and saturable manner, as judged through recognition by specific anti MUC1 antibodies (see ESI† for details).
We then checked if the Vicia villosa plant lectin (VVL) specific for α- or β-linked terminal O-GalNAc could still recognize the GalNAc-thioglycopeptides. The results of our ELISA assays, presented in Fig. 4, show efficient lectin binding since the apparent Kd was 26.0 ± 7.3 nM for the mono-GalNAc thioglycopeptide (biotinylated 8c) and 27.2 ± 12.1 nM for the tri-GalNAc thioglycopeptide (biotinylated 8f). Such values are in very good agreement with those recently obtained on O-GalNAc-Tyr containing glycopeptides43 with apparent Kd values ranging from 1.4 ± 0.1 to 243 ± 113 nM for neoglycoproteins showing different GalNAc densities. In our work the two thioglycopeptides tested gave roughly similar apparent Kd values indicating no effect on the lectin binding of the GalNAc density (1 vs. 3 GalNAc per peptide). These VVL binding results suggest the usefulness of our simple synthetic strategy in order to produce thioglycopeptides which could be used as mimics of Tyr-GalNAc containing peptides.
Fig. 4 ELISA-based assays showing FITC-labelled VVL binding of the different peptides and glycopeptides tested. The fluorescence values are the means of three independent experiments realized at the same peptide concentration and error bars indicate the standard deviation. Apparent Kd values were determined from assays at different peptide concentrations (see ESI† for details). |
Footnote |
† Electronic supplementary information (ESI) available: General, experimental procedures, and characterization of all new compounds. See DOI: 10.1039/c8sc02370k |
This journal is © The Royal Society of Chemistry 2018 |