Qiang
Shao
*ab,
Muya
Xiong
ab,
Jiameng
Li
c,
Hangchen
Hu
d,
Haixia
Su
a and
Yechun
Xu
*abcd
aState Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China. E-mail: qshao@simm.ac.cn; ycxu@simm.ac.cn
bUniversity of Chinese Academy of Sciences, Beijing 100049, China
cSchool of Chinese Materia Medica, Nanjing University of Chinese Medicine, Nanjing 210023, China
dSchool of Pharmaceutical Science and Technology, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310024, China
First published on 11th April 2023
Papain-like protease (PLpro) is a promising therapeutic target against SARS-CoV-2, but its restricted S1/S2 subsites pose an obstacle in developing active site-directed inhibitors. We have recently identified C270 as a novel covalent allosteric site for SARS-CoV-2 PLpro inhibitors. Here we present a theoretical investigation of the proteolysis reaction catalyzed by the wild-type SARS-CoV-2 PLpro as well as the C270R mutant. Enhanced sampling MD simulations were first performed to explore the influence of C270R mutation on the protease dynamics, and sampled thermodynamically favorable conformations were then submitted to MM/PBSA and QM/MM MD simulations for thorough characterization of the protease-substrate binding and covalent reactions. The disclosed proteolysis mechanism of PLpro, as characterized by the occurrence of proton transfer from the catalytic C111 to H272 prior to the substrate binding and with deacylation being the rate-determining step of the whole proteolysis process, is not completely identical to that of the 3C-like protease, another key cysteine protease of coronaviruses. The C270R mutation alters the structural dynamics of the BL2 loop that indirectly impairs the catalytic function of H272 and reduces the binding of the substrate with the protease, ultimately showing an inhibitory effect on PLpro. Together, these results provide a comprehensive understanding at the atomic level of the key aspects of SARS-CoV-2 PLpro proteolysis, including the catalytic activity allosterically regulated by C270 modification, which is crucial to the follow-up inhibitor design and development.
While significant progress has been made in the development of SARS-CoV-2 3CLpro inhibitors and two of them, nirmatrelvir and ensitrelvir, have been approved for the treatment of COVID-19,16,17 the development of PLpro inhibitors is currently in early-stage preclinical studies. So far, several types of PLpro inhibitors with moderate inhibitory potency have been identified.18 High-throughput screening experiments4,6,11,19 found GRL0617 to be one of the most potent SARS-CoV-2 PLpro inhibitors (IC50: 2.3 μM)20 and the main starting point for further optimization.10,14 The phenylthiophene derivatives of GRL0617 improved the inhibitory potency to a nanomolar level.14 These small-molecule inhibitors occupied the S3 and S4 subsites of the substrate binding pocket without forming a covalent bond to the catalytic C111. Meanwhile, two covalent peptidomimetic inhibitors, VIR250 and VIR251, fully occupied the S1–S4 subsites of PLpro, but showed weak potency with IC50 values of ∼50 μM.4
PLpro is well conserved in all coronaviruses. For example, SARS-CoV-2 PLpro shares a sequence identity of 83% with SARS-CoV PLpro.21,22 A collection of recently reported SARS-CoV-2 PLpro structures showed that it adopts a “thumb-palm-fingers” architecture that is constructed by a small N-terminal ubiquitin-like (Ubl) domain and a large catalytic domain (Fig. S1 in the ESI†).6,20 The latter is an extended right-handed scaffold with three characteristic subdomains of thumb, palm and fingers. A catalytic triad (C111–H272–D286) is located at the interface between the thumb and palm subdomains, neighboring an important BL2 loop (G266–G271) that closes upon substrate or inhibitor binding. Such a characteristic shadow and narrow binding site of PLpro results in featureless S1 and S2 subsites recognizing two consecutive glycines of substrates, thereby posing a great challenge for inhibitor design. In addition, the fact that human deubiquitinases also bind ubiquitin with the consensus sequence LXGG could raise potential concern about the off-target effects of PLpro inhibitors.18,23
Recently, we have experimentally observed a novel covalent allosteric site (C270) that can be used to regulate the catalytic activity of SARS-CoV-2 PLpro.24 The sidechain distance of C270 to the catalytic C111, H272 and D286 is ∼10, ∼9 and ∼14 Å, respectively, in the crystal structure of SARS-CoV-2 PLpro (Fig. S1†). The C270-target covalent binding of the activator or inhibitor did not directly compete for the binding of the substrate to the catalytic site, making C270 act as an allosteric modulation site. In addition, the mutagenesis of C270 substantially influenced the Km (Michaelis constant) and Vmax (maximum reaction rate) of SARS-CoV-2 PLpro catalyzing the hydrolysis of the fluorogenic substrate compared to the wild-type (WT) SARS-CoV-2 PLpro. Km reflects the protease-substrate binding affinity while Vmax is correlated with the reaction rate of the proteolysis. Among 11 C270 mutants, the C270R lowered the Vmax value the most (14.9 ± 2.2 (WT) → 4.7 ± 0.4 (C270R) 104 RFU per min), but raised the value of Km (566.9 ± 36 (WT) → 864.5 ± 33.6 (C270R) μM), therefore provoking an inhibitory effect on SARS-CoV-2 PLpro. The sequence alignment has revealed that C270 is unique to SARS-CoV-2 and SARS-CoV PLpros and other amino acids such as valine are located at the equivalent position in other PLpros or deubiquitinases.24 The design of allosteric inhibitors targeting C270 of PLpro could not only hopefully bypass the difficulty in targeting the narrow orthosteric site but also alleviate the potential problem of off-target effects. Therefore, a thorough understanding of the molecular mechanism underlying such an allosteric inhibition at an atomic level is of interest from both scientific and drug-design viewpoints.
In contrast to extensively reported in silico studies of 3CLpro including its catalytic cycle with the substrate and interactions with inhibitors,25–29 the computational simulation investigation of PLpro has been limited.30–32 In this context, we aimed to shed light on the molecular mechanism of the SARS-CoV-2 PLpro proteolysis by a combination of multiple computational approaches including molecular dynamics (MD) simulations, quantum mechanics/molecular mechanics (QM/MM) MD simulations and molecular mechanics Poisson Boltzmann surface area (MM/PBSA) calculations. The systems investigated include apo and substrate-bound wild-type protease as well as its C270R mutant, with an accumulated simulation time of ∼20 μs for the classical MD simulations and ∼0.2 μs for the QM/MM MD simulations. Moreover, the allosteric modulation induced by the C270R mutation on the catalytic activity of SARS-CoV-2 PLpro was explored through an exhaustive comparative analysis of the structure, dynamics and the free energy profiles associated with the entire proteolysis process. Accordingly, the present study for the first time provides an overview of the reaction energy profile associated with the SARS-CoV-2 PLpro proteolysis and clues about how C270 modification allosterically modulates the proteolytic activity.
To answer the aforementioned question, we first performed 2-μs Gaussian accelerated MD (GaMD) simulations (with a classical force field) on apo and substrate-bound SARS-CoV-2 PLpro containing the neutral form of C111/H272, respectively. GaMD is a sophisticated enhanced sampling MD method that adds a boosted potential to smoothen the biomolecular potential energy surface that allows for the quantitative measurement of the structural transition of biomolecules with economized computational resources and simulation time.45–47 All the MD simulation systems included in the present study are listed in Table 1 and related characterization of these simulation trajectories is shown in Fig. S2–S4.†
System | PLpro | PLpro-C270R | PLpro-substrate | PLpro-C270R-substrate | PLpro (EIP) | PLpro-C270R (EIP) |
---|---|---|---|---|---|---|
a N atoms: the number of atoms included in each simulation system; tsimult.: MD simulation time; Nwindows: the umbrella sampling windows used in QM/MM MD simulations for each proteolysis step; twindow: the simulation time per umbrella sampling window. EIP means the PLpro containing the ion pair form of C111 and H272. The substrate is a pentapeptide of Ac-RLRGG-ACC. In QM/MM MD simulations, the detailed numbers of Natoms and Nwindows are listed for the systems of C111–H272 proton transfera, acylationb, and deacylation following path Ic and path IId, respectively. | ||||||
GaMD simulations (enzyme structural dynamics evaluation) | ||||||
N atoms | 60265 | 60252 | 60289 | 60276 | 60265 | 60252 |
t simul. (μs) | 2.0 | 2.0 | 2.0 | 2.0 | 2.0 | 2.0 |
Conventional MD simulations (deacylating water measurement) | ||||||
N atoms | 60265 | 61385 | 55255 | 60265 | ||
t simul. (μs) | 2.0 | 2.0 | 2.0 | 2.0 | ||
QM/MM MD simulations | ||||||
N atoms | 60265a | 60252a | 60289a,b/61385c,d | 60276a,b/55255c,d | ||
N windows | 13a | 13a | 14a/24b/49c/35d | 14a/21b/47c/33d | ||
t window (ps) | 500 | 500 | 500 | 500 |
The root-mean-square fluctuations (RMSFs) per residue show that in the entire PLpro structure, the Ubl and fingers subdomains as well as the BL2 loop in the palm subdomain display high dynamic features (Fig. 2a). Previous studies showed that the BL2 loop at the entrance of the active site shows a high plasticity in SARS-CoV PLpro, with a more open conformation responsible for the binding of a larger inhibitor.44 Here, it is also showed that the substrate binding affects the BL2 loop of SARS-CoV-2 PLpro. As shown in the free energy landscape (FEL) produced from the GaMD simulation (Fig. 2c), while the BL2 loop in the crystal structure of apo PLpro is most energetically favored (1.3 Å < protein RMSD < 1.6 Å and 0.2 Å < BL2 loop RMSD < 0.4 Å), it is also capable of visiting a large conformational space. The substrate binding, however, confines its structure into a local minimum slightly different from the apo structure (1.2 Å < protein RMSD < 1.6 Å and 0.5 Å < BL2 loop RMSD < 0.7 Å, Fig. 2e).
Fig. 2 Effects of the substrate binding on the structural dynamics of the BL2 loop and catalytic triad. (a) The comparison of the RMSFs of apo (black) and substrate-bound (red) SARS-CoV-2 PLpro. Transparent column marks the position of the BL2 loop. Inset is the RMSF color coded onto the PLpro structure with RGB space (hues from blue to red indicating the increase of RMSF). (b) Superposition of SARS-CoV-2 PLpro crystal structures indicating the rotation of the H272 sidechain with respect to C111. (c–f) Free energy landscapes (FELs) along the main chain root-mean-square deviations (RMSDs) of protein and BL2 loop, the distance between C111-Sγ and H272-Nδ (dC111-Sγ–H272-Nδ) and the distance between H272-Nε and D286-Oδ (dH272-Nε–D286-Oδ) in GaMD simulations of (c, d) apo and (e, f) substrate-bound PLpro, respectively. The RMSD is according to the apo crystal structure (PDB code 6WRH). The contours in the two-dimensional subspace are spaced at intervals of 1.0 kcal mol−1. |
The substrate binding affects not only the BL2 loop but also the catalytic triad. The neutral H272 is supposed to be protonated at the Nε position in order to form a hydrogen bond with the sidechain carboxy group of D286, thus stabilizing the configuration of the catalytic triad, while its Nδ atom is ready to accept the proton transferred from C111. The distance between C111-Sγ and H272-Nδ atoms (dC111-Sγ–H272-Nδ) and the distance between H272-Nε and D286-Oδ (dH272-Nε–D286-Oδ) were thus used as collective variables (CVs) to evaluate the structural dynamics of the catalytic triad. In the FEL of apo PLpro, two local minima (3.2 Å < dC111-Sγ–H272-Nδ < 3.4 Å and 3.0 Å < dH272-Nε–D286-Oδ < 3.3 Å, 3.6 Å < dC111-Sγ–H272-Nδ < 3.8 Å and 3.0 Å < dH272-Nε–D286-Oδ < 3.3 Å) are presented (Fig. 2d). This suggests that the sidechain distance between the two neutral residues of C111 and H272 is rather flexible, due to the flexible orientation of the H272 imidazole ring. Intriguingly, the fact that the H272 sidechain rotates to alter its distance to C111-Sγ has been shown in multiple crystal structures of SARS-CoV-2 PLpro (Fig. 2b). In contrast, the binding of substrate confines the catalytic triad to a tightly contacted configuration (3.2 Å < dC111-Sγ–H272-Nδ < 3.3 Å and 2.8 Å < dH272-Nε–D286-Oδ < 3.0 Å, Fig. 2f).
In the present study, the most populated structure of either apo or substrate-bound SARS-CoV-2 PLpro for QM/MM MD calculations was extracted from the microsecond-timescale GaMD simulations. The QM/MM free energy profiles associated with the proton transfer and other reactions hereinafter were explored at the DFTB/MM level. The use of a similar tight-binding DFT method (DFTB3) to describe the QM region resulted in a geometrical description of the SARS-CoV-2 3CLpro–inhibitor reaction in good agreement with the DFT (e.g., B3LYP and M06-2X) results and reasonable evaluation of the activation free energies through single-point energy correlations using higher-level QM methods.49 In the QM/MM MD simulations, steered MD (SMD) simulations were run along specific reaction coordinates to collect the structures along the reaction path, and umbrella sampling (US) simulations were performed on the chosen structures to enhance the sampling of the reaction configuration space and explore the free energy profiles.
Before the detailed analysis, we first evaluated the factors affecting the QM/MM MD simulations, which are, namely, the starting structure and the cutoff value used for treating nonbonding interactions. We tested 3 different starting structures randomly chosen from the populated structure cluster identified in the GaMD simulation, and the yielded QM/MM free energy profiles of the C111–H272 proton transfer and acylation reaction are converged with each other, respectively (Fig. S5a and b†). Additionally, the often used QM/MM cutoff value is in the range of 8 ∼ 15 Å.25,26,50,51 We tried different cutoff values (8, 12 and 14 Å) to evaluate the free energy barrier of the E-I1 → E-I2 sub-step of the deacylation reaction, and the resulting free energy profiles are very close to each other (Fig. S5c†). Therefore, the two abovementioned factors might have trivial influence on the QM/MM MD simulations under study.
As shown in Fig. 3a, the QM/MM calculated free energy barrier for the proton transfer from C111 to H272 and the free energy level of the ion pair in apo PLpro are 1.3 and 6.8 kcal mol−1 lower than those in the substrate-bound PLpro, respectively. These results strongly suggest that apo SARS-CoV-2 PLpro predominantly contains the C111−/H272H+ ion pair. Accordingly, the GaMD simulation on apo PLpro containing the ion pair obtained a steady compact catalytic triad configuration (2.8 Å < dC111-Sγ–H272-Nδ < 3.2 Å and 3.1 Å < dH272-Nε–D286-Oδ < 3.6 Å) and a rigid BL2 loop (BL2 loop RMSD < 0.4 Å) (Fig. S6a†). More water molecules were found around the C111−/H272H+ ion pair in apo PLpro compared to the substrate-bound PLpro (Fig. 3b). The worse solvation of the catalytic triad in the presence of the bound substrate, as suggested by the previous QM/MM MD simulations of SARS-CoV-2 3CLpro,26 explains the substrate-binding induced increase in the free energy level of the ion pair form.
The free energy barrier associated with the proton transfer of SARS-CoV-2 PLpro (1.9 kcal mol−1 in apo or 3.2 kcal mol−1 in substrate-bound PLpro) is smaller than the previously reported values for SARS-CoV-2 3CLpro (e.g., ∼5.6 or 6.1 kcal mol−1 in apo or substrate-bound 3CLpro in ref. 26). Precluding the influence from the usage of different methodologies, the lower free energy barrier makes the proton transfer easier to occur in SARS-CoV-2 PLpro compared to SARS-CoV-2 3CLpro, owing to the assistance of D286 in the catalytic triad of PLpro. It is interesting that water molecules have been detected in the active site of SARS-CoV-2 3CLpro. A recent study using the ProBiS H2O approach to investigate water molecules within 72 3CLpro crystal structures observed four conserved water molecules,52 one of which is near H41 and may mediate formation of hydrogen bonds with H41 and D187.16 To evaluate the interactions of H41 with this water molecule, we ran conventional MD simulations on apo SARS-CoV-2 3CLpro containing either the neutral or ion pair form of the catalytic dyad. The resulting FEL (Fig. S7†) shows that the water molecule favors the location between H41 and D187, with short distances from its oxygen to both H41-Nδ and D187-Oδ atoms, but it also moves away from either or both residues, similar to the observed results in previous MD simulations on MERS-CoV and SARS-CoV 3CLpro.36 Therefore, even though the water molecule might help with the general acid–base function of H41 in catalysis by forming water-mediated hydrogen bonds, the stability deficiency limits its influence on H41 in SARS-CoV-2 3CLpro, as compared to the sustained contribution of D286 to H272 in SARS-CoV-2 PLpro.
Fig. 4 Free energy profiles associated with the acylation and deacylation reactions for the SARS-CoV-2 PLpro proteolyzing substrate resulting from the QM/MM MD simulations. The profiles were calculated following the proposed paths I and II in Fig. 1, colored by blue and red, respectively. The profiles are identical for the acylation and a part of deacylation in the two paths. Important states are presented: the PLpro-substrate complex after the C111–H272 proton transfer (EIP:S), the transition state (TSEIP:S→E-I1) and intermediate (E-I1) in the acylation step that are in common in paths I and II (black letters); the transition states (TSE-I1→E-I2 and TSE-I2→EN:P), intermediate (E-I2), and product (EN:P) in the deacylation step of path I (blue letters); the transition state (TSE-I1→EIP:P) and product (EIP:P) in the deacylation step of path II (red letters). |
While proton transfer is a straightforward process, the acylation and deacylation steps are complex. The reaction coordinates (RC) used for PMF calculation are: RC = (dC111-Sγ–P1-C + dH272-Hδ–P1′-N) for the transition of EIP:S → E-I1 (Fig. S9†); RC = (dH272-Nδ–water-H + dwater-O–P1-C − dwater-O–water-H) for E-I1 → E-I2 (Fig. S10†); RC = (dC111-Sγ–P1-C − dC111-Sγ–H272-Hδ + dH272-Hδ–H272-Nδ) for E-I2 → EN:P (Fig. S11†); RC = (dH272-Nδ–water-H + dwater-O–P1-C − dwater-O–water-H − dC111-Sγ–P1-C) for E-I1 → EIP:P. It should be noted that a good reaction coordinate could avoid hysteresis problems.55–57 Using the transition of E-I1 → E-I2 as an example, we tested multiple reaction coordinates to evaluate the hysteresis effects. It can be seen from Fig. S12† that, no matter what reaction coordinates are used, the non-equilibrium work drawing the reactants to products in the SMD simulations is almost identical. These SMD work profiles have features similar to the corresponding E-I1 → E-I2 PMF profile (Fig. S5c†). Moreover, individual distances selected as CVs change with the similar tendencies in the three SMD yielded paths, suggesting that the reaction paths are also identical (Fig. S12†). Thus, the energetic properties of the reaction do not appear to depend on the reaction coordinates used or suffer hysteresis. On the other hand, Fig. S9–S11† show that in each of the three abovementioned transitions, the evolution of individual CVs is asynchronous, implying that there is less probability of the present simulations yielding artificial concerted steps for complex reactions.
The free energy level of the intermediate E-I1 (acyl-enzyme complex) is much lower than that of the EIP:S Michaelis complex and accordingly, the free energy barrier in the deacylation step is significantly higher than that in the acylation step, suggesting deacylation is the rate-limiting step in the entire proteolysis process. Such a feature of the free energy profile of SARS-CoV-2 PLpro is similar to that of SARS-CoV-2 3CLpro resulting from various calculation methods.25–27
Collection of the QM/MM umbrella sampling trajectories shows that in the acylation reaction, the nucleophilic attack of P1-C by C111-Sγ and the associated breakage of the P1-C–P1′-N bond in the substrate follow the proton transfer from H272H+ to the P1′-N atom. As a result, in the TSEIP:S→E-I1 state, the Nδ-H proton in H272H+ approaches the substrate P1′-N atom, slightly increasing the Nδ–H bond distance (1.09 → 1.20 Å, Fig. 5a). Meanwhile, the C111-Sγ atom also stays closer to the P1-C atom (2.56 → 2.20 Å) and the peptide bond P1-C–P1′-N distance is increased from 1.41 to 1.50 Å. This process is roughly identical to the acylation step of SARS-CoV-2 3CLpro described in a previous QM/MM MD simulation.26 The free energy barrier associated with the acylation of SARS-CoV-2 PLpro (13.6 kcal mol−1) is also close to that of SARS-CoV-2 3CLpro (14.6 kcal mol−1) calculated by using the combination of the B3LYP functional and 6-31+G* basis set26 but relatively lower than that of SARS-CoV-2 3CLpro (19.9 kcal mol−1) calculated by using the M06-2X functional and semi-empirical AM1 method.25 The higher free energy barrier in the latter literature is attributed to the suggested different mechanism in which the acylation step begins with the neutral form of the catalytic dyad and the nucleophilic attack of the substrate P1-C atom by C145-Sγ is concomitant with the proton transfer from C145 to H41.25
In addition to the separated C111–H272 proton transfer and the nucleophilic attack of P1-C by C111-Sγ, other possible mechanisms of acylation of SARS-CoV-2 PLpro were also explored. First, the case whether the attack of P1-C by C111-Sγ could be concerted to the C111–H272 proton transfer as the acylation starts from an EN:S complex carrying a neutral form of C111/H272 was examined. In this way, a transient thiohemiketal (THA) state would be formed, which then would undergo P1-C–P1′-N bond breakage assisted by proton transfer from H272H+ to P1′-N, forming an E-I1 intermediate and releasing a P1′-NH2 fragment (Fig. S13a†). The calculated free energy of the THA state is significantly higher than that of EN:S (22.6 kcal mol−1) and the free energy barrier for this stepwise acylation is also large (30.5 kcal mol−1) (Fig. S13b†). The analysis of the reaction path shows that the C111–H272 proton transfer occurs distinctly prior to the attack of P1-C by C111-Sγ (Fig. S13b and c†). Second, an acylation with a C111-Sγ attacking the substrate P1-C concerted to the proton transfer from C111-Hγ to P1′-N was also investigated. The free energy barrier for such a concerted process is also significantly higher than that in the acylation step starting with the ion pair of C111−/H272H+ (20.6 vs. 13.6 kcal mol−1) (Fig. S13e–g†vs.Fig. 4). Similar results have been observed in the QM/MM MD simulations of SARS-CoV-2 3CLpro.26 These results together suggest that a separated C111–H272 proton transfer and nucleophilic attack of P1-C by C111-Sγ occurs in the PLpro proteolysis.
A P′-NH2 fragment (ACC-NH2) is yielded in the acylation step (Fig. 5b). Subsequently, a water molecule at the position neighboring H272 and the substrate P1-C atom is supposed to be involved in the deacylation reaction. This water molecule is activated by hydrogen bonding to H272 and then its oxygen atom attacks the P1-C atom, releasing the P-COOH fragment, and regenerating the free catalytic triad (path II in Fig. 1). Unlike SARS-CoV-2 3CLpro that contains a highly conserved water molecule at a position compatible for serving as the deacylating nucleophile in the high-resolution crystal structure of the acyl-enzyme intermediate,58 such a water molecule has not been found in crystal structures of SARS-CoV-2 PLpro yet.
To inspect whether the P′-NH2 fragment stays steadily around the active site of PLpro and the presence of the potential deacylating water molecule, we ran 2-μs conventional MD (cMD) simulations on the acyl-enzyme (E-I1 state) yielded in the acylation step. It can be seen from the cMD trajectory (Fig. 5c) that the P′-NH2 fragment moves away from PLpro, thus it is not included in the subsequent QM/MM MD calculations. On the other hand, a deacylating water molecule continuously exists in between H272 and the P1-C, with an existing probability of 38.4% or 52.9% in the last half stage of the 2-μs simulation trajectory of the wild-type or C270R mutant of the acyl-enzyme complex (Fig. 5d and S14†), respectively. The existing probability of such a water molecule was also calculated in various other stages, which is close to 0 in the Michaelis complex and slightly increased during the acylation reaction and reaches the maximum in the acyl-enzyme complex (Fig. 5e). This implies the existence of the deacylating water molecule that might come inside the binding pocket when the P′-NH2 fragment product is released after the acylation step.
The deacylation yielding ion-paired catalytic triad proceeds via a single transition state (Fig. 4). In the representative structure of the transition state, the water oxygen atom approaches the P1-C atom (1.87 Å) but the bond is not formed yet while the proton transfer from the water molecule to H272 is completed (Fig. 5f). Finally, at the EIP:P state, the hydroxyl group and the carbonyl carbon is bound with a distance of 1.40 Å and the C111-Sγ–P1-C bond is broken (2.50 Å), regenerating the protease with a C111−/H272H+ ion-paired catalytic triad and yielding the P-COOH fragment (Fig. 5g). This transition step has to overcome a free energy barrier of 22.4 kcal mol−1, a value higher than the counterpart calculated in the B3LYP/6-31+G* simulation (15.6 kcal mol−1)26 but very close to that in the M06-2X/AM1 simulation for SARS-CoV-2 3CLpro (22.8 kcal mol−1).25 The latter literature suggested a similar deacylation mechanism in which the water hydrogen is attracted by H41 of 3CLpro to let the water oxygen attack the P1-C atom,25 whereas the former literature proposed that the water hydrogen is attracted by the P′-NH2 fragment released in the acylation step.26
The comparison of the FELs (Fig. 2c–f and 6a–d) suggests that the C270R mutation induces the conformational change of the BL2 loop, generating a more hooked structure in the apo protease (see the local minimum of 1.3 Å < protein RMSD < 1.6 Å and 1.0 Å < BL2 loop RMSD < 1.2 Å in Fig. 6a and loop structure in Fig. 6e). The binding of the substrate, on the other hand, drives the BL2 loop to become more open to adopt the substrate peptide (Fig. 6c). And a trivial difference still exists between the BL2 loop structures of the substrate-bound wild-type and C270R mutated PLpro (Fig. 6f). Therefore, the structure and dynamics of the BL2 loop are influenced by the C270R mutation.
W106 displays a conformation diversity among the crystal structures (Fig. S16†), making it possible to form π–π interactions with the neutral H272 or cation–π interactions with the positively charged H272H+ so as to further stabilize the catalytic triad. Such a function could be somehow destroyed by the C270R mutation. As shown in Fig. 8c, the aromatic sidechain of W106 is more likely to approach the imidazole ring of H272 in the apo wild-type than in its C270R mutant in the case of carrying the neutral form of the catalytic triad. And in Fig. 8d, W106 forms steady cation–π interactions with H272H+ (dW106–H272H+, SC–SC < 4.5 Å) in the apo wild-type protease but not in its C270R mutant when the IP state of the catalytic triad is formed. Accordingly, the QM/MM free energy profile shows that the IP state yielded in the C111–H272 proton transfer is 3.5 kcal mol−1 lower and the associated free energy barrier is 1.3 kcal mol−1 lower in the apo wild-type PLpro than in its C270R mutant (Fig. 9a). The comparison of the IP states in the two systems indicates that W106 and H272H+ have their sidechains at a similar orientation (Fig. 8e) whereas the Cα–Cα distance is larger in the C270R mutant (Fig. 8f). Therefore, the failure of the W106–H272H+ cation–π packing in the apo C270R mutant is attributed to the movement of H272 further away from W106 induced by the conformational change of the BL2 loop (Fig. 8e and f).
As the substrate is bound, the ACC group of the substrate is located in the middle between W106 and H272, which can form π–π interactions with W106 and also probably with H272 (Fig. S17a†) in both the wild-type PLpro and C270R mutant. Meanwhile, in the E-I1 intermediates of both complexes, the W106 aromatic sidechain is not in proximity to H272 (Fig. S17b†). The C270R induced impact on H272 in apo PLpro thus cannot be seen in the substrate-bound complex. On the other hand, H272 accepts the proton from C111 in the C111–H272 proton transfer, donates its Nδ-H proton to the substrate P1-N atom in the acylation step, and accepts the proton from the deacylating water in the deacylation step, respectively. As indicated by the respective reaction paths of these steps, the hydrogen binding to H272-Nδ in the C111–H272 proton transfer step (Fig. S18a and d†) and the hydrogen breaking from H272-Nδ-H in the acylation step (Fig. S18b and e†) are involved in the respective transition states whereas the hydrogen bond to H272-Nδ is already formed before the transition state of the deacylation (Fig. S18c and f†). This implies less distinguished importance of H272 in the deacylation step compared to the C111–H272 proton transfer and acylation. Taken together, the C270R mutation induced change mainly impacts the C111–H272 proton transfer step in the apo enzyme but not substrate-involved acylation and deacylation.
Accordingly, the comparison of the free energy profiles associated with the acylation and deacylation reactions shows that the C270R mutant shares very similar features with the wild-type PLpro, and no obvious difference in the free energy barriers can be seen between the wild-type and C270R mutated PLpro (Fig. 9b and c). Meanwhile, the corresponding reaction paths are almost identical for the wild-type (Fig. S18e and f†) and C270R mutated PLpro (Fig. S18h and i†). For instance, in the acylation, the proton transfer from H272H+ to the P1′-N atom precedes the formation of the C111-Sγ–P1-C bond as well as the associated breakage of the P1-C–P1′-N bond. In the deacylation, the proton transfer from the water molecule to H272 precedes the binding of the water oxygen to the P1-C atom and the breakage of the C111-Sγ–P1-C bond, and the latter two events occur concertedly.
The C270 or R270 sidechain almost always stays away from H272 (>5.0 Å) in the MD simulations (Fig. S19†) and the QM/MM MD simulations of proton transfer (Fig. S20†). Therefore, the C270R mutation exerts an inhibitory influence on the proton transfer reaction in an indirect manner. Additionally, the change in the associated free energy profiles should not be the artifact caused by measurement uncertainty. The QM/MM MD simulations with various initial structures were tested that always obtained similar PMF profiles (Fig. S5†). The error bars derived with 100 ps/block averaging in individual umbrella sampling windows were also calculated to evaluate the convergence of QM/MM MD simulations. Fig. S21† shows that for the tested systems, the error bars are quite small compared to the detailed free energy values, suggesting the convergence of the relevant simulations.
It is also noteworthy that the deacylation step in path I involving a proton transfer from H272H+ to C111− was also simulated for the C270R mutant as a control. As shown in Fig. S22,† the deacylation with the regeneration of the neutral form (C111/H272) of the catalytic triad is still highly endergonic with a higher free energy level of EN:P relative to E-I1 (14.4 kcal mol−1), suggesting again that the deacylation of SARS-CoV-2 PLpro regenerates the ion-paired catalytic triad.
(1) |
(2) |
From the steady-state approximation,60Vmax and Km can be represented as:
(3) |
(4) |
Only the ion-paired enzyme is involved in the PLpro proteolysis. Thus, the total concentration of the effective enzyme in eqn (3) should be: [ET,eff] = [EIP] + [EIP:S] + [E-I1] + [EIP:P]. The C270R mutation studied here does not affect kd, the rate constant of the rate-limiting step (deacylation) of SARS-CoV-2 PLpro, but rather increases the free energy barrier and particularly the free energy level of the ion-paired catalytic triad and thus makes it more difficult for the C111–H272 proton transfer to occur. Meanwhile, the C270R mutation also decreases the PLpro-substrate binding interaction strength, making the substrate binding more difficult. Together, the C270R mutation might reduce [ET,eff] in eqn (3) as compared to the wild-type protease, thus displaying an inhibitory effect on the catalytic activity of SARS-CoV-2 PLpro.
One should note that although the enzymatic assay experimentally showed the inhibitory effect of the C270R mutant, the Vmax ratio of the mutant to the wild-type PLpro is not large (0.32-fold).24 Accordingly, if the Vmax is solely influenced by kd (eqn (3)), then the detailed free energy barrier increase should also not be large (∼0.68 kcal mol−1), and accurate measurement should be very difficult using the current simulation methodologies.
It is observed that in comparison to SARS-CoV-2 3CLpro,26,42,61 the tight contact of D286 with H272 facilitating the deprotonation of C111 in PLpro decreases the free energy barrier of the C111–H272 proton transfer and lowers the free energy level of the ion pair (EIP) relative to the neutral form (EN) in apo PLpro. These results suggest that the catalytic triad prefers the state containing the ion pair (C111−/H272H+) in apo SARS-CoV-2 PLpro. Therefore, while the proton transfer mechanism in apo and substrate-bound enzymes is still under debate for SARS-CoV-2 or SARS-CoV 3CLpro,25,26,36–43 the proton transfer mechanism of SARS-CoV-2 PLpro is more definite. Following path II in Fig. 1, the substrate binds to PLpro containing the ion-paired catalytic triad, carries out the acylation and deacylation step by step, and recovers the catalytic triad to the ion pair state for the next catalytic cycle.
The sidechain dynamics of H272 is highly correlated with a series of residues including the BL2 loop (G266–G271) and W106. The C270R mutation alters the conformation of the BL2 loop, enlarges the W106–H272 distance and thus hinders the packing of W106 to H272 in apo protease, impairing the function of H272 as a general acid–base in proton transfer. The binding of the substrate fills the S1′ subsite, which, however, blocks the interaction from W106 to H272, impairing the influence of C270R mutation on H272. As result, the influence of the C270 mutation is mainly displayed in the proton transfer reaction in apo PLpro but not substrate-involved acylation and deacylation. An assumption is ultimately proposed that the C270R mutation impedes the C111–H272 proton transfer and meanwhile weakens the PLpro-substrate binding, decreasing the concentration of the effective enzyme (eqn (3)) and thus exerting an inhibitory effect on the catalytic activity of SARS-CoV-2 PLpro. These results afford key clues for the future development of C270-target covalent inhibitors with greatly improved inhibitory activity by enlarging the difference in Vmax.
The atomic coordinates of apo SARS-CoV-2 PLpro were retrieved from the Protein Data Bank (PDB code 6WRH) with a resolution of 1.6 Å.20 The C111S mutant was recovered, all crystal water was maintained, and the solvated molecules (phosphate, glycerol, and chloride ions) were removed. Meanwhile, the substrate was docked into PLpro through aligning the ISG15 complexed system (PDB code 6YVA)6 to 6WRH and only the RLRGG segment was maintained from the ISG15 protein as the substrate used for the simulation. The protein termini remained freely charged (uncapped) whereas the N- and C-termini of the substrate peptide RLRGG were capped with the acetyl (Ac) group and the 7-amino-4-carbamoylmethylcoumarin (ACC) fluorophore (Fig. S1†), respectively. ACC has often been used as a fluorescent tag in enzymatic assay.
The protonation states of all titratable residues at pH 7.5 were evaluated using Schrodinger suite software. All residues were found in their standard protonation state. After a detailed inspection of the environment surrounding each histidine residue, all histidines were neutral except that His17 was positively charged. His47, His73, His89, His175 and His275 were protonated in the Nδ position, while the remaining His50, His272, and His255 were protonated on Nε. No S–S linkage was detected between Cys residues. All cysteine residues except Cys189, Cys192, Cys224 and Cys226 that form coordinate bonds with Zn2+ were protonated. The C270 was manually replaced with arginine in the mutant system.
Each protein (and substrate) system was solvated in a cubic box filled with a total of ∼18000 water molecules, in which multiple Na+/Cl− ions were added to neutralize the protein charges. The AMBER 18 suite of programs67 was employed for simulations with the underlying force fields of FF14SB force field68 for protein and TIP3P model69 for water molecules. The coordinate bond between Zn2+ and the surrounding cysteines in the zinc fingers subdomain of PLpro was modeled using the MCPB.py program implemented in AMBER 18.70 The ACC group and the non-standard residues in the peptide intermediates (e.g., E-I1, E-I2) were modeled using a generalized AMBER force field (GAFF)71 with restrained electrostatic potential (RESP)72 partial charges fitted with Gaussian 09.73
All GaMD simulations were run at the “dual-boost” level by setting the reference energy to the lower bound, one boost potential being applied to the total potential and the other to the dihedral energetic term. The average and the standard deviation (SD) of the system potential energies were calculated every 250000 steps (0.5 ns). The upper limit of the boost potential SD was set to 6.0 kcal mol−1 for both the dihedral and the total potential energetic terms. The coordinates were saved every 10000 steps.
To sample the existing probability of a water molecule in between H272 and the P1-C atom of the substrate for the deacylation process, a conventional MD simulation was performed lasting 2 μs for each acyl-enzyme complex system.
The reaction pathway was obtained using steered molecular dynamics (SMD)86 in the QM region following specific reaction coordinates (see the detailed description of the reaction coordinates in the “Results and discussion” section). Multiple harmonic force constants within 100 ∼ 500 kcal mol−1 Å−2 were tested to pull the system along the predefined reaction coordinate to see whether the yielded traction pathways are converged or not. Then the detailed free energy profile was calculated, in terms of potentials of mean force (PMF), for every step of the reaction using the Umbrella Sampling (US) approach87 combined with the Weighted Histogram Analysis Method (WHAM).88 Series of QM/MM US simulations were performed adding a constraint along the predefined reaction coordinates with an umbrella force constant of 100 kcal mol−1Å−2. Structures selected from the SMD simulation were used as starting points for the US simulations. The detailed number of QM/MM US windows for all systems is shown in Table 1, making sure the sampled reaction path is overlapped between individual windows. In every window, simulation was performed with a total of 250 ps of equilibration and 250 ps of production at 300 K with a time step of 1 fs.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3sc00166k |
This journal is © The Royal Society of Chemistry 2023 |