Liubov
Yakovlieva‡
a,
Thomas M.
Wood‡
bc,
Johan
Kemmink
a,
Ioli
Kotsogianni
b,
Franziska
Koller
d,
Jürgen
Lassak
d,
Nathaniel I.
Martin
*b and
Marthe T. C.
Walvoort
*a
aChemical Biology Group, Stratingh Institute for Chemistry, University of Groningen, Groningen, The Netherlands. E-mail: m.t.c.walvoort@rug.nl
bBiological Chemistry Group, Institute of Biology Leiden, Leiden University, Leiden, The Netherlands. E-mail: n.i.martin@biology.leidenuniv.nl
cDepartment of Chemical Biology & Drug Discovery, Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Utrecht, The Netherlands
dDepartment of Biology I, Microbiology, Ludwig-Maximilians-Universität München Planegg/Martinsried, Germany
First published on 7th December 2020
For canonical asparagine glycosylation, the primary amino acid sequence that directs glycosylation at specific asparagine residues is well-established. Here we reveal that a recently discovered bacterial enzyme EarP, that transfers rhamnose to a specific arginine residue in its acceptor protein EF-P, specifically recognizes a β-hairpin loop. Notably, while the in vitro rhamnosyltransferase activity of EarP is abolished when presented with linear substrate peptide sequences derived from EF-P, the enzyme readily glycosylates the same sequence in a cyclized β-hairpin mimic. Additional studies with other substrate-mimicking cyclic peptides revealed that EarP activity is sensitive to the method used to induce cyclization and in some cases is tolerant to amino acid sequence variation. Using detailed NMR approaches, we established that the active peptide substrates all share some degree of β-hairpin formation, and therefore conclude that the β-hairpin epitope is the major determinant of arginine-rhamnosylation by EarP. Our findings add a novel recognition motif to the existing knowledge on substrate specificity of protein glycosylation, and are expected to guide future identifications of rhamnosylation sites in other protein substrates.
Protein asparagine N-glycosylation is universally present and fairly conserved across species, and it involves the en bloc transfer of an oligosaccharide from a lipid-linked carrier to an acceptor protein, catalyzed by a membrane-associated glycosyltransferase, such as eukaryotic OST and prokaryotic PglB.2,3 The requirements for the primary sequence and structural folds are well-established for canonical N-linked glycosylation. In general, asparagine residues are modified in so-called “sequons” – recognition sequences of N-X-S/T, with X being any amino acid except proline.4 In addition, PglB of C. jejuni recognizes an extended sequon of D/E-Z-N-X-S/T (Z and X ≠ Pro).3 The X residue (+1) and Ser/Thr residue (+2) have been shown to play an important role in acceptor recognition by glycosyltransferases.5 In the co-crystal structure of the bacterial oligosaccharyltransferase PglB, the acceptor peptide was shown to adopt a distinct bound conformation, featuring the recognition sequon in a 180 degree loop.6 This structure would be impossible to adopt with proline in the +1 position, explaining the negative selection for Pro in the glycosylation sequon.5 The +2 hydroxy amino acid has been shown to contribute to recognition by interacting with the conserved WWD motif in PglB. Interestingly, when bound in the active site, Thr in position +2 has been shown to be engaged in more stabilizing interactions than Ser in position +2 which is reflected in faster glycosylation rates of Thr-containing sequons.6,7 Another important aspect of N-linked glycosylation is the mechanism of asparagine activation for nucleophilic attack. One of the early explanations implied the importance of the local secondary structure, namely the Asx turn (Fig. 1A).8 Within this structure, protonation of the Asn-amide carbonyl by the hydroxyl of the +2 Ser/Thr residue, in combination with deprotonation of nitrogen by an enzymatic base would result in the formation of a reactive imidol species capable of carrying out the nucleophilic attack (Fig. 1B). An alternative explanation of carboxamide activation in protein glycosylation is the so-called “twisted amide” hypothesis wherein activation of Asn occurs as a result of twisting the nitrogen lone pair out of conjugation with the carbonyl group (Fig. 1C).6,9 It is postulated that Asp and Glu residues in the OST active site form H-bonds with amide hydrogens of the acceptor Asn. As a result of this H-bond stabilization, de-conjugation of the nitrogen lone pair electrons occurs, resulting in a more nucleophilic nitrogen primed for attack.
Studies have demonstrated that peptides forced into the Asx turn exhibited increased affinity and faster glycosylation rates in comparison to linear peptides or β-turn peptides (Fig. 1D).10
Despite the clear three amino acid sequon in the substrate protein, not all predicted N-X-S/T sites are found to be glycosylated which indicates the role of additional recognition elements. A statistical analysis of eukaryotic N-glycosylation sites revealed that there is a preference for β-turns and -bends around the glycosylation site, whereas α-helices are disfavored.11 Additionally, it has been demonstrated that similarly to eukaryotic protein glycosylation, which occurs co-translationally on unfolded polypeptides, bacterial protein glycosylation preferentially takes place on glycosylation sites located in exposed loops and benefits from moderate structure disorder of the acceptor protein.12 Interestingly, while O-glycosyltransferases do not generally require a specific recognition sequence in their acceptor substrates, several recent examples indicate the preference for properly folded substrate domains, implying a fold recognition mechanism instead.13,14
In addition to the well-established forms of protein glycosylation, novel glycosylation systems have been discovered that are unique to bacteria.15 Recently, arginine rhamnosylation was identified as a novel type of N-glycosylation.16,17 Here the enzyme EarP transfers a rhamnose moiety from dTDP-β-L-rhamnose (TDP-Rha) to a specific arginine residue in the acceptor translation elongation factor P (EF-P) (Fig. 1E).16,18 Arginine glycosylation is a rare modification with only two other examples reported to date, i.e. autocatalytic modification of Arg with glucose (Glc) of sweet corn amygdalin,19 and with N-acetylglucosamine (GlcNAc) of human death receptor domains by the bacterial effector protein NleB.20
Genes associated with the newly discovered EF-P rhamnosylation (earP, efp, and rmlABCD genes for dTDP-β-L-rhamnose (TDP-Rha) donor synthesis) have been identified predominantly in beta- and gamma-proteobacteria, including multiple clinically relevant pathogens, e.g. Pseudomonas aeruginosa, Neisseria meningitidis, and Bordetella pertussis.16 The rhamnose modification has been shown to activate EF-P which alleviates ribosomal stalling during the synthesis of poly-proline stretches in nascent polypeptides.21–23 Abolishing rhamnosylation of EF-P in P. aeruginosa and N. meningitidis led to cellular effects that were detrimental to bacterial fitness and increased susceptibility to antibiotics.18,24 It is hypothesized that these severe effects are associated with importance of polyPro-containing proteins and virulence factors of investigated bacterial pathogens for their survival.18,24
Since the discovery of EF-P rhamnosylation in 2015, a number of studies have contributed to an increased understanding of this unique system. The stereochemistry of the α-glycosidic linkage between Arg and Rha was shown by two research groups independently,25,26 proving that EarP is an inverting glycosyltransferase. An anti-ArgRha antibody has also been developed, allowing for facile (in vitro) monitoring of arginine rhamnosylation.25,27 Several (co)-crystal structures have been reported for EarP (from P. putida,27N. meningitidis28 and P. aeruginosa29), also in complex with its EF-P substrate, providing insight into the specific amino acid interactions and the catalytic mechanism of rhamnosylation.
As only a single Arg residue in EF-P is modified, an important yet unexplored aspect of this novel glycosylation system is the basis for the observed specificity in recognizing this arginine residue. Previous reports indicate that domain I of EF-P (aa 1–65, Fig. 1E) is sufficient for recognition and rhamnosylation by EarP.27 Domain I, commonly referred to as a “KOW-domain”,27 is a conserved domain in various ribosome-associated proteins involved in transcription and translation,30 and it appears to contain all recognition elements necessary to promote Arg rhamnosylation.27–29 A recent study indicates that structural elements are more important than a specific sequon to promote EarP-mediated rhamnosylation.31 However, the precise determinants for recognition by EarP are currently not known. Elucidating the necessary substrate recognition elements will allow us to more fully understand this unique bacterial system, an important step towards exploiting bacterial glycosylation systems for the development of novel anti-virulence strategies.32
Here we report the discovery of a novel β-hairpin recognition element in arginine rhamnosylation of EF-P from P. aeruginosa. Using in vitro rhamnosylation assays and in-depth NMR studies we demonstrate the importance of this key structural motif in acceptor substrate recognition by EarP. Moreover, we report the shortest peptide fragment known to date to be rhamnosylated by EarP. Next to expanding the current knowledge on structural requirements of protein glycosyltransferases, our results have the potential to inform the development of inhibitors and activity assays to screen for inhibitors for EarP based upon the β-hairpin motif of the EF-P KOW domain.
Fig. 2 (A) EarP–EF-P complex from P. aeruginosa (generated with YASARA, PDB 6J7M). EarP is depicted in grey, domain I of EF-P is colored red, EF-P residues involved in binding with EarP are in green. (B) Cyclic peptide mimics of the EF-P β-hairpin. Arg32 is shown in pink, altered residues (with respect to 11mer_Pa) are shown in blue. (C) Zoom-in of the coupled HSQC spectrum of crude Rha-11mer_Pa reaction mixture. JCH = 168 Hz (α-glycosidic bond). (D) Experimentally determined NOE signals that are indicative of a β-hairpin structure are mapped on the 11mer L-Pro–D-Pro_Pa fragment (from 3OYY crystal structure). Hydrogen bonds inferred from the NH temperature studies are shown as red dashed lines. NOE signals are shown as double-ended arrows (magenta: medium NOE; purple: weak NOE). |
Based on the sequences of EF-P proteins from P. aeruginosa, Ralstonia solanacearum (a Gram-negative plant pathogen) and N. meningitidis, the corresponding cyclic 11mer peptides depicted in Fig. 2B were prepared using solid-phase peptide synthesis (SPPS) starting from Gly31 loaded onto 2-chlorotrityl resin with the peptides assembled terminating with Arg32 (see ESI† for details). Following mild acid treatment, the side-chain protected peptides where then cyclized by amide bond formation between Gly31 and Arg32 by activation of the C-terminal Gly to avoid any possible racemization. Solution-phase cyclization of these peptides proceeded cleanly after which side chain deprotection and HPLC purification provided the desired cyclic peptides (Table S1†). The peptides were tested in the in vitro rhamnosylation reaction with EarP_Pa and TDP-Rha, and the rhamnosylated products were identified by an increase in the mass of +146 Da with RP-LCMS. Gratifyingly, the prepared cyclic peptides revealed successful modification by EarP, albeit to varying extents. The best substrate identified was the L-Pro–D-Pro-cyclized 11mer fragment of EF-P from P. aeruginosa (11mer_Pa, 85% conversion overnight). The extent of rhamnosylation was calculated from ion intensities in the MS spectra and corrected for the relative ionization factor (RIF) values (Table S3†),34 as described in the ESI.† A detailed kinetic analysis of the rhamnosylation of 11mer_Pa and native protein substrate EF-P was obtained through a time course study. This analysis revealed that the cyclic 11-mer is indeed rhamnosylated in a time-dependent manner albeit with a lower rate of conversion relative to EF-P (Fig. S2†).
Interestingly, follow-up experiments with both shorter and longer cyclic peptides inspired by the successful 11mer_Pa peptide, revealed that the 11mer peptide (nine native amino acids, plus the L/D-Pro motif) was favored as a substrate, as EarP exhibited no activity towards the shorter (7mer_Pa) and only low levels of conversion (14%) were achieved with the longer (15mer_Pa) fragment. Detailed NMR analysis of the rhamnosylated 11mer_Pa peptide (Rha-11mer_Pa, prepared enzymatically) revealed that the rhamnose-arginine glycosidic linkage was formed in an α-stereoselective fashion (Fig. 2C and S3†), identical to the linkage described for Arg-rhamnosylation of EF-P.25,26 Interestingly, our initial attempts at purifying Rha-11mer_Pa using anion-exchange under basic conditions, led to full epimerization to the β-linked Arg-Rha species (Fig. S4†), in accordance with previous observations by Payne and co-workers.26
The co-crystal structure of EF-P bound to EarP revealed that Ala34 in EF-P undergoes a significant conformational change.29 It appears that upon binding to EarP, the “bulge” formed by Ala34 is significantly reduced, leading to a more narrowly shaped and structured loop than that in free EF-P. To investigate whether Ala34 and the concomitant conformational movement are important for binding, the residue was completely removed in peptide 11mer_A34_Pa. Interestingly, this substrate was not rhamnosylated by EarP, suggesting that the Ala34 bulge is important for recognition. Whereas the co-crystal structure suggests that Ala34 is not directly involved in binding to EarP,29 it is reasonable to assume that this residue contributes to shaping the β-hairpin, which in turn positions Arg32 in the active site. This was further corroborated by substituting Ala34 with glycine (11mer_A34G_Pa), a smaller and more flexible amino acid. This mutant retained its role as a substrate for EarP, albeit with reduced efficiency (41% conversion).
Next, alternate cyclization strategies were compared to assess their impact on recognition by EarP (Table S1, Fig. S5†). Specifically, peptides of varying lengths, based upon the same key EF-P amino acid substrate sequence, were synthesized and cyclized using CLIPS-35 (Chemical Linkage of Peptides onto Scaffolds) and disulfide strategies36 to introduce conformational rigidity and flexibility, respectively. In addition, we also prepared linear peptides containing the same EF-P sequence with Trp and Phe residues to introduce a so-called “tryptophan zipper” motif known to induce interstrand H-bonding as another means of generating a β-turn mimic.37 Notably, the majority of the trp-zip designs (especially longer sequences of 13mers, 17mers and 18mers) led to almost immediate protein precipitation when incubated with EarP, indicating that the hydrophobicity of these substrates mimics is not compatible with forming and maintaining a soluble complex with the enzyme.
Compared to the original 11mer_Pa peptide, the 11mer_A34G_Pa variant features only one of the two H-bonds (NH of Asn28, Table S4†) and several NOE signals are missing (K29HA-V36NH, S30NH-A35HA, Table S5†). It was also shown to exhibit more flexibility, presumably due to the inclusion of the inherently more flexible glycine in the loop. In the case of 11mer_A34_Pa, that showed no conversion to product, only very few structural elements were preserved (sequential NOE signals of P27HA-N28NH and R32NH-N33NH, Table S5†), and the structure was found to lack one of the two H-bonds (NH of Asn33, Table S4†) present in the 11mer_Pa that enable the formation of the β-hairpin structure. The 11mer_Rso variant structure has both H-bonds found in 11mer_Pa (NH of Ser28 and Asn33, Table S4†) and more of β-hairpin characteristic NOEs, although S28NH-V36NH was absent (Table S5†). On the other hand, it was not possible to determine the structure of the 11mer_Nm peptide, as it appears to be a mixture of several structures due to cis/trans isomerization of Pro residues. It is therefore difficult to conclude whether absence of conversion for 11mer_Nm peptide stems from the sequence variation or lack of structure. NMR analysis of the structures of the linear peptide, CLIPS-, disulfide- and Trp-rich peptides showed that these peptides tend to form disordered structures and show close to none of the β-sheet structure (see ESI† NMR spectra of these peptides).
Here we report the successful rhamnosylation of a cyclic peptide fragment, 11mer_Pa, designed to mimic the native EF-P substrate of the EarP rhamnosyltransferase. This 11mer L-Pro–D-Pro_Pa peptide is the smallest fragment of EF-P reported to date to be successfully rhamnosylated by EarP. The combination of enzyme activity assays and NMR structural analysis reveals that activity is directly linked to the propensity of the cyclic peptide to form a structured β-hairpin motif. Moreover, the glycosidic linkage is formed in an α-stereoselective fashion, analogous to the native Rha-EF-P and the resulting α-Rha-11mer epitope is successfully recognized by anti-ArgRha antibody. The results of the various 11mer peptide mutants and different cyclization strategies indicate that substrate recognition by the EarP rhamnosyltransferase is highly dependent on the conformation of the substrate and that some sequence variation is tolerated. Developing a successful structural mimic of EF-P beta-hairpin has proved to be challenging, as the majority of the strategies for secondary structure stabilization investigated led to inactive substrates. It would appear that many of the peptide mimics generated in this project are not able to recapitulate important enzyme–substrate contacts in the enzyme active site. In this regard, our study reveals that both specific amino acid residues and an optimal secondary structure are important for recognition by EarP.
The success of the substrate mimics bearing the L-Pro–D-Pro motif may be attributed to the right-handed twist that brings the strands together to allow the formation of H-bonds that stabilize the secondary structure.39 In contrast, CLIPS-bearing peptides may be too bulky to fit in the narrow active site of EarP, whereas disulfide cyclization may induce too much rotational freedom and a less defined β-hairpin. Trp zippers, although widely reported in literature to form β-sheet structures, did not induce measurable secondary structures in the peptides in our investigation, and proved to be too hydrophobic to be suitable EarP substrates. Alternative methods of secondary structure stabilization, such as N-methylation to reduce flexibility of the structure may be explored in the future.40
As apparent from the co-crystal structure, there are many points of contact between EarP and EF-P that are likely to contribute to substrate recognition (Fig. 2A). Interestingly, while the majority of the EarP–EF-P contacts resides around the Arg32-containing β-hairpin that we chose to mimic, several residues of the neighboring β-strand have also been shown to be involved in binding to EarP.29 Consequently, the low affinity of 11mer_Pa for EarP may be increased by extending the current scaffold to include more contact points. Notably, residue Lys55 is a promising residue to consider, particularly as replacing it by alanine resulted in a 200-fold decrease in affinity for EarP.29 One strategy for improving the affinity for EarP and the rates of conversion of future peptidomimetic substrates might include incorporation of either a fragment of the third strand, or a single Lys residue into the peptide. While it can be expected that expanding the structure to include the β-sheet increases the substrate affinity, β-sheets and other secondary structures are difficult to design as isolated motifs. They are stabilized by a larger protein structure they are found in, and without it, they tend to misfold and aggregate.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d0sc05823h |
‡ Authors contributed equally. |
This journal is © The Royal Society of Chemistry 2021 |