Dóra K.
Menyhárd
a,
Gyula
Pálfy
a,
Zoltán
Orgován
b,
István
Vida
a,
György M.
Keserű
*b and
András
Perczel
*a
aLaboratory of Structural Chemistry and Biology, MTA-ELTE Protein Modelling Research Group, Institute of Chemistry, Eötvös Loránd University, Pázmány Péter sétány 1/A, 1117 Budapest, Hungary. E-mail: perczel.andras@ttk.elte.hu
bMedicinal Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok körútja 2, 1117 Budapest, Hungary. E-mail: keseru.gyorgy@ttk.mta.hu
First published on 19th August 2020
Oncogenic RAS proteins, involved in ∼30% of human tumors, are molecular switches of various signal transduction pathways. Here we apply a new protocol for the NMR study of KRAS in its (inactive) GDP- and (activated) GTP-bound form, allowing a comprehensive analysis of the backbone dynamics of its WT-, G12C- and G12D variants. We found that Tyr32 shows opposite mobility with respect to the backbone of its surroundings: it is more flexible in the GDP-bound form while more rigid in GTP-complexes (especially in WT- and G12D-GTP). Using the G12C/Y32F double mutant, we showed that the presence of the hydroxyl group of Tyr32 has a marked effect on the G12C-KRAS-GTP system as well. Molecular dynamics simulations indicate that Tyr32 is linked to the γ-phosphate of GTP in the activated states – an arrangement shown, using QM/MM calculations, to support catalysis. Anchoring Tyr32 to the γ-phosphate contributes to the capture of the catalytic waters participating in the intrinsic hydrolysis of GTP and supports a simultaneous triple proton transfer step (catalytic water → assisting water → Tyr32 → O1G of the γ-phosphate) leading to straightforward product formation. The coupled flip of negatively charged residues of switch I toward the inside of the effector binding pocket potentiates ligand recognition, while positioning of Thr35 to enter the coordination sphere of the Mg2+ widens the pocket. Position 12 mutations do not disturb the capture of Tyr32 by the γ-phosphate, but (partially) displace Gln61, which opens up the catalytic pocket and destabilizes catalytic water molecules thus impairing intrinsic hydrolysis.
The structure of RAS proteins shows three major domains of decreasing sequence similarity, namely the effector-binding lobe (residues 1–85, similarity 100%), the allosteric lobe (residues 86–168, similarity 86%) – jointly constituting the G-domain – and the hypervariable region (HVR, residues 166–188/189, similarity 15%) (Fig. 1). The effector lobe is responsible for the recruitment and activation of effector proteins; however, the molecular mechanism of this process remains largely unknown.8,9 The structure of the sequentially conserved G-domain is similar for all the RAS proteins and contains four main regions that include the phosphate binding loop (P-loop, residues 10–17), switch I (residues 25–40), switch II (residues 57–76) and nucleobase binding loops (residues 116–120 and 145–147).10–12
Structural investigations on RAS activation are notoriously difficult. Despite the large number of available X-ray structures and thorough NMR studies, the molecular details of the highly organized conformational rearrangements at the switch regions still remain to be clarified. Although these changes are responsible for the exposure of the effector binding site, and consequently for oncogenic activation, their elucidation at the atomic level is challenged by multiple factors. On one hand, crystal structures do not provide a clear picture since the conformationally flexible switch I and switch II regions are often shaped and restricted in mobility by crystal contacts, show high B-factors if no such contacts are established (ESI Fig. 1†), or are simply unresolved and missing from electron density maps (ESI Fig. 2 and 3†). Furthermore, data collection for the GTP-bound active form is complicated by the hydrolysis of the nucleotide. To avoid this, small G-protein investigators use GTP analogues in most cases, though it has been shown that they behave somewhat differently from GTP itself.13–15 In a recent study, to avoid this problem, GEF protein (SOS) was added to the native HRAS-GTP NMR sample.16 However, since GEF displaces the switch I region in its entirety (by over 10 Å) and remodels switch II17 – even if it is added to the samples in minimal amounts – the results thus derived might be altered from the RAS-GTP native state. These limitations prevent clarifying what steps constitute the transition to the active form, which features of the activated state are recognized by the downstream partners and the conformational criteria of inactivation – thus, the exact mechanism of the rise and shutdown of the growth signal.
The motivation of our work came from two major sources. First, investigating the molecular mechanism of RAS activation might clarify the structural details of oncogenic signaling that would help to understand the activation of downstream signaling pathways involving the recruitment of effector proteins. More importantly, however, identifying the conformational signal could serve as the basis of inhibitor design – targeting the effector binding conformation directly. Indeed, most of the drug discovery efforts are based on enhancing GTP hydrolysis, influencing nucleotide exchange or inhibiting RAS–effector interactions.6,10,18
We developed a set of conditions that enabled us to work with the native KRAS-GDP/GTP systems, and guaranteed full GDP–GTP exchange by passing through the nucleotide free form. We applied an integrated structural biology approach, combining multidimensional NMR measurements with molecular modeling (MD and QM/MM relying on the pool of the relevant X-ray structures). We chose to study the wild type KRAS protein and two oncogenic mutations (G12C and G12D variants) that affect the intrinsic GTP hydrolysis rates differently while inhibiting GAP-mediated hydrolysis to a similar extent.
Our data revealed that the most significant change evoked by nucleotide exchange is the pivotal flip of Tyr32 (and the adjoining residues e.g. Thr35) of the switch I region. The results explain the effect of oncogenic P-loop mutations on intrinsic hydrolysis rates and effector protein binding capability and highlight the role of binding site waters during the activation process.
NMR measurements were performed at 298 K on a Bruker Avance III 700 MHz spectrometer equipped with a 5 mm Prodigy TCI H&F-C/N-D, z-gradient probe head operating at 700.05 MHz for 1H-, 70.94 MHz for 15N- and 176.03 MHz for 13C-nuclei. Temperature was calibrated using a standard methanol solution.20 All chemical shifts were referenced with respect to the internal 1H-resonance of DSS, and 13C, 15N-chemical shifts were referenced indirectly using the corresponding gyromagnetic ratios according to IUPAC convention.21 Sequence specific assignments of backbone 1H- and 15N-nuclei of GDP-bound forms were used from our previous work.19 In the case of GTP-bound forms, assignments were performed based on the 3D BEST-HNCO, BEST-HNCA, BEST-HN(CO)CA, BEST-HNCACB and CC(CO)NH spectra of the G12C mutant (its backbone and sidechain 13C nucleus assignment was determined as well). These results were transferred to WT and G12D using 3D NOESY-HSQC spectra, using mixing times of 120–150 ms. All the assigned resonances were deposited in the BMRB database with the following entries: 28021 (WT-GTP), 28015 (G12C-GTP), and 28022 (G12D-GTP) 1H, 15N-HSQC spectra of KRAS-G12C and G12C–Y32F mutants were compared in both GDP and GTP-bound forms. All spectra were processed with the Bruker TOPSPIN program and analyzed by using CARA (ETH Zürich).22
Standard backbone 15N-relaxation experiments (T1, T2 and steady state 1H–15N HetNOE) were performed using 15N-labeled WT, G12C and G12D mutants for both GDP- and GTP-bound forms. A typical set of spectra in the 0.1–4.5 s and 0.017–0.678 s range were recorded in 10 points for both T1 and T2 measurements. The spectra were processed using TOPSPIN. The longitudinal (R1) and transverse (R2) relaxation rates were determined by fitting the cross-peak intensities as a function of the delay to a single-exponential decay using NMRFAM-SPARKY software.23 The heteronuclear NOE values were obtained from the ratio of the peak intensities of saturated and unsaturated cross-peaks.
The obtained relaxation results were further analyzed by reduced spectral density mapping analysis24 and extended Lipari–Szabo model-free formalism.25–27 Reduced spectral density mapping analysis results in three different spectral density values characterizing the distribution of slow, medium and fast time scale motions of each residue's backbone N–H vector: J(0) for slow, J(ωN) for medium, and J(0.87ωH) = J(ωh) for fast time scales. Model-free analysis can be performed using an isotropic approach, assuming the molecule is approximately spherical, or using an anisotropic approach (axially anisotropic or fully anisotropic) based on the exact 3D structure of the protein. The analysis provides the global rotational correlation time (τc) that characterizes the tumbling motion of the molecule as a whole and for each residue a generalized order parameter (S2) describing how restricted the motion of its backbone N–H vector is. Further optional per-residue parameters can be obtained depending on which model of 1–5 are best fitted to the data: correlation time for internal motion (τe) characterizing a motion faster than the time scale of global tumbling and the exchange parameter (Rex) describing chemical or conformational exchange processes on a slower time scale than the global tumbling motion, here carried out using FAST-Modelfree software,28 the automated version of Modelfree 4.2 (ref. 29 and 30) using the isotropic approach, were used for Lipari–Szabo model-free analysis with a constant 1.02 Å for N–H bond length and −172 ppm for 15N-chemical shift anisotropy.31 The data for KRAS monomers were derived using the isotropic approach, while for dimeric forms of KRAS-WT, the axially anisotropic approach was used. The results of dynamics measurements and analysis were also deposited in the BMRB database with the previously given entries.
Molecular dynamics (MD) simulations were started from two different crystal structures: 4obe (WT KRAS/GDP) for the GDP- and 3k8y (WT/HRAS-GNP) for the GTP-bound forms.32,33 In the latter case the HRAS variant was selected because no WT KRAS/GTP-analogue structure is currently available that does not contain unresolved segments. Furthermore, in this structure the conformation of both Tyr32 and Gln61 indicates that it is in a truly activated conformation. Also, the authors identified 2 water molecules that they felt were catalytically important,32 and carrying out the simulation from a starting conformation nearly identical to theirs (aside from the HRAS → KRAS mutations and the GppNHp → GTP switch) also allowed us to compare the MD derived water-cluster around the active site to that determined experimentally. KRAS and HRAS share 94% sequence identity in the 1–169 region studied here; the HRAS → KRAS transformation involved a total of 10 changes, none of them in, or near, the switch I–II regions.
All mutations and modifications were introduced into the wild type (WT) structures without any further changes, using Maestro (Schrödinger Suite (Schrödinger Release 2019-3: Maestro, Schrödinger, LLC, New York, NY, 2019)). Maestro was also used for generating the figures shown in the text. Simulations were carried out as implemented in GROMACS,34 using the AMBER-ff99SBildnp* force field. The system was solvated with OPC water molecules35 in dodecahedral boxes with a size allowing 10 Å between any protein atom and the box. The total charge was neutralized and physiological salt concentration (0.15 M) set using Na+ and Cl− ions. Energy minimization of the starting structures was followed by sequential relaxation of constraints on protein atoms in three steps and an additional NVT step (100 ps) to stabilize pressure. Trajectories of 600–1200 ns NPT simulations at 310 K and 1 bar were recorded for further analysis (collecting snapshots at every 4 ps). Simulations satisfactorily reproduced the crystal structures and the experimental B-factors (see ESI Fig. 1†). For illustrating the effect of nucleotide exchange on the switch I region, snapshots were clustered36 based on the conformation of the main-chain atoms of residues 10–48 for the last 300 ns of the trajectories using a 1 Å cutoff. MD and NMR results were compared using the CoNSEnsX+ server.37
QM/MM calculations were carried out for KRAS-GTP complexes using Jaguar.38,39 The B3LYP/LACV3P* method was used in combination with OPLS3-based molecular mechanics.40 The starting system was derived from the mid-structure of the first cluster from the KRAS-GTP MD simulation (based on the full main-chain using a 1 Å cutoff). All waters that reach within 12 Å of any protein atom were kept. The quantum region was composed of the entire GTP, the Mg2+ ion, 4 water molecules (catalytic and assisting water, and 2 waters coordinated with the Mg2+ ion) and the sidechains of Lys16, Set17, Tyr32, Thr35, Gln61 and Lys117, forming a neutral system. The MM region was divided into two parts: residues 11–40, 55–76 and 117; GTP and the Mg2+ ion and all waters within 6 Å of these moved freely during optimization, while all the rest were kept frozen. The hydrolysis reaction was modeled by moving the catalytic water from PGGTP–OWcatalytic-water distance of 3.4 Å to 1.4 Å in 0.1 Å steps. Forward and backward scans were repeated several times to smooth inconsistencies, and since proton transfer lags behind both during the forward and the backward scans, a minimum energy path was chosen from the scan-points of both. The scan-grid was downscaled to 0.02 Å within ± 1 Å of the TS-like structure. The Y32F* model was created by in silico mutation of Tyr32 without any further modification.
The NMR structural characterization of RAS-GTP proteins is hindered by NMR-signal broadening due to intermediate exchange, which, in the most interesting regions, leads to undetectable or unassignable signals.42,43 We also encountered this problem, but have succeeded in detecting and assigning several new resonances within the P-loop and switch I regions of KRAS. Most probably this is because we used – for the first time – the physiological GTP ligand for which the binding site was optimized (without adding extra protein partners) and had a homogeneous sample due to complete nucleotide exchange. We took great care to ascertain that the signals in 1H, 15N-HSQC spectra indeed belong to the GTP-bound KRAS protein (see ESI Fig. 4†). In short: the already assigned19,44 and stable 1H, 15N-HSQC spectra of the GDP-bound form of all three variants were recorded (affording signal set 1). EDTA was added to the samples, and they were in turn diluted using the buffer solution to remove excess GDP. To these systems GTP and Mg2+ were subsequently added. This brought about the appearance of a new set of signals (KRAS-GTP: signal set 2) besides the signals of the KRAS-GDP (signal set 1). The samples were incubated for 3 days and then re-measured, showing a homogeneous state analogous to the original (signal set 1), confirming the hydrolysis of KRAS-bound GTP to GDP via the intrinsic GTPase activity of KRAS. Samples used for NMR dynamics measurements were prepared by first adding EDTA to remove the Mg2+ and the remnant nucleotide and then resupplying the Mg2+ and a great excess of GTP. The 1H, 15N-HSQC spectra then recoded only contained signal set 2 and remained stable for 3–4 days due to the shifted equilibrium of GTP intrinsic hydrolysis and nucleotide autoexchange. Based on the appearance of only two signal sets, the contribution of the nucleotide-free form to the spectra could also be ruled out. This form is expected to undergo significant rearrangement in the switch I region,45,46 which would have differentiated it from both signal set 1 and signal set 2. Using this methodology we could assign, in the case of KRAS-G12C-GTP – for example – hitherto undetected signals of Gly10, Cys12, Gly13, Lys16, Ile21, Val29, Tyr32, Thr35, Ser39, Tyr40, and Arg41 (Fig. 2A), although in some cases the resonance was still too broad to gain information about chemical shifts of the side chains. Signals remaining unassigned were of Val8, Ala11, Gly13, Gln22, Asp30, Glu31, Asp33, Ile36, Glu37, Asp38, Thr87, and Val103 as well as signals of the switch II region (Asp54–Arg73). In total 80.6% of the non-proline residues (133 of 165) were detected and assigned in the 1H, 15N-HSQC spectrum (exceeding that (76.4%) of a recently published study concerning the non-native GppNHp-bound KRAS-G12C).47 Similarly, 80.0% and 78.2% of all non-proline residues were detected and assigned in the 1H, 15N-HSQC spectrum of WT-GTP and G12D-GTP structures, respectively, also the highest fraction hitherto obtained. Resonance assignment is complex as there are two conformational states of the GTP-bound form of KRAS, namely that of state 1 and state 2. The main difference between these two forms is in the conformation of switch I, which is released from the active site in state 1, leading to a GTP-loaded but still inactive conformation.48 Interestingly, the relative population of state 1 and state 2 has recently been shown to be mutation-sensitive, the oncogenic G12D mutation – for example – seems to shift the equilibrium toward the fully activated state 2 form.49,50 The exchange between the two states results in some minor peaks with very low intensity in the 1H, 15N-HSQC spectra which remained unassigned, but their presence confirms the slow exchange between them. In contrast to the GTP-bound forms, there was no severe line broadening in the 1H, 15N-HSQC spectra of GDP-bound states, allowing the assignment of 98–99% of non-proline residues for all of the three variants.19
The triple resonance experiments performed for KRAS-G12C-GTP allowed us to compare the NMR chemical shift-based secondary structure propensities (SSP)51 to those of the GDP-bound form19 (Fig. 2B). This analysis confirmed that the secondary structures of the well-defined regions of GDP- and GTP-bound forms do not differ notably. We carried out detailed studies on the backbone dynamics of the three variants (WT, G12C and G12D) in both GDP- and GTP-bound forms. We investigated the fast backbone dynamics of KRAS by determining the standard R1, R2 and HetNOE relaxation parameters. Surprisingly, position 12 mutations only cause local variations in the dynamics in the presence of either nucleotide. However, when comparing the GDP to the GTP-bound form, we found significant differences (Table 1 and Fig. 3).
KRAS-WT | KRAS-G12C | KRAS-G12D | ||||
---|---|---|---|---|---|---|
+GDP | +GTP | +GDP | +GTP | +GDP | +GTP | |
R 1 (s−1) | 0.97 ± 0.05 | 0.85 ± 0.08 | 0.99 ± 0.06 | 0.85 ± 0.07 | 0.95 ± 0.06 | 0.82 ± 0.07 |
R 2 (s−1) | 13.85 ± 2.00 | 16.94 ± 3.12 | 14.16 ± 2.01 | 16.57 ± 2.63 | 14.83 ± 3.37 | 17.54 ± 2.76 |
HetNOE | 0.79 ± 0.10 | 0.75 ± 0.14 | 0.79 ± 0.10 | 0.77 ± 0.13 | 0.78 ± 0.10 | 0.76 ± 0.13 |
τ c (ns) | 9.95 | 12.33 | 9.97 | 11.97 | 10.44 | 12.59 |
S 2 | 0.82 ± 0.07 | 0.85 ± 0.09 | 0.84 ± 0.08 | 0.84 ± 0.08 | 0.84 ± 0.07 | 0.85 ± 0.08 |
J(ωh) (ps rad−1) | 3.26 ± 1.92 | 3.33 ± 2.29 | 3.30 ± 1.86 | 3.22 ± 2.34 | 3.35 ± 1.85 | 3.11 ± 1.94 |
Fig. 3 Relaxation parameters of the KRAS WT (light green/green), G12C mutant (cyan/blue), and G12D mutant (lilac/purple) in GDP-bound (left) and GTP-bound forms (right). (A) R1, longitudinal relaxation rates, (B) R2, transverse relaxation rates and (C) steady-state heteronuclear 1H–15N NOE data are plotted against the residue sequence numbers. No dynamics data are reported for several residues due to their signal-broadening (in the GTP-bound forms) or signal overlapping. P-loop (light red, residues 10–17), switch I (yellow, residues 25–40) and switch II (light blue, residues 57–76) are shown in colored rectangles (R2/R1 and R2 × R1 values are shown in ESI Fig. 5,† for individual values of the most interesting switch I residues see ESI Table 1†). |
R
1 and R2 values suggest a general trend in the global motions of the molecules: the former significantly decreased, while the latter increased when going from the GDP- to GTP-bound form arising from the increased global correlation time (τc) of GTP-bound forms. Since it is clear from previous crystallographic and NMR results that the overall shape and size of RAS proteins do not change substantially on nucleotide exchange, the significantly different global tumbling of the GDP- and GTP-bound forms could either arise from a drastic difference in viscosity of the samples or via dimerization or oligomerization processes that only involve one of them, or – in case of the GTP-bound state – via contamination by aggregates of the nucleotide-free form that transiently appears during nucleotide exchange. This latter possibility could be excluded by showing that the samples containing the nucleotide-free form are monomeric (see ESI Fig. 6†). A change in viscosity could be suspected, because – to counteract the intrinsic hydrolysis of GTP – the samples of the GTP-bound forms contained approximately 40-fold greater amount of the nucleotide than those of the GDP-bound forms. However, neither increasing the GDP concentration, nor decreasing that of GTP caused significant changes in the detected line-width of the 1H, 15N-HSQC spectra (see ESI Fig. 7†). On the other hand, dimerization/oligomerization of RAS proteins in complex with GTP-analogues has already been described.52,53 Determining whether the GTP-bound samples were truly free from significant levels of dimeric forms was critical for the interpretation of our results, since while oligomers of extended size are not detected by the NMR methodology applied by us, dimerization could lead to spectral changes comparable in magnitude to those caused by nucleotide exchange or mutation. To determine the oligomeric state of our samples, we first carried out size exclusion chromatography. The results indicated that while the GDP-bound state is a homogeneous solution of the monomeric form, the sample of the GTP-bound WT KRAS contained high-molecular-weight oligomers besides the monomer form – although the presence of dimers was not detected (see ESI Fig. 8D†). Accordingly, room temperature incubation of the samples led to the appearance of visible aggregates in the case of the GTP-containing sample, while the GDP-bound form remained stable in solution. Next we compared the spectra of 300 μM and 30 μM solutions of 15N-KRAS-G12C-GTP according to the method of Muratcioglu et al.52 In the case of the diluted sample 1H, 15N-SOFAST-HMQC measurements had to be carried out to reduce the measurement time, which does not allow detailed analysis of the results, but is sufficient for comparison with the standard 1H, 15N-HSQC spectrum. The combined chemical shift difference54 value was calculated for each residue using the following expression,
The obtained relaxation parameters were further analyzed by the Lipari–Szabo model-free analysis and by reduced spectral density mapping (Table 1 and Fig. 3). Elevated τc values of GTP-bound states originate from the dimerization/oligomerization process as discussed previously. The G12D mutant has the highest τc in both GDP- and GTP-bound forms, whilst the lowest value belongs to WT in GDP-bound and G12C in GTP-bound states suggesting differences in the global shape of the WT and the G12C and G12D mutants. No significant difference was found in the average generalized order parameters (S2) between GDP- and GTP-bound forms in any of the three variants. The same regions appear to be flexible which was determined based on HetNOE values. These flexible regions are evidenced by fast internal motions characterized by τe parameters, most characteristic of the WT-GTP complex. For the latter protein, to describe internal motion properly, the involvement of τe is needed for 35 residues, while for G12C and G12D only 20 and 22 residues, respectively. Therefore Gly12 site mutations seem to reduce the fast time scale motions in the ps–ns time range (Fig. 4B). The slow motions in the mutants are focused in the allosteric regions.
Fig. 4 Selected dynamic properties of KRAS WT (light green/green), G12C mutant (cyan/blue), G12D mutant (lilac/purple) in GDP-bound (left) and GTP-bound forms (right): (A) generalized order parameter, S2, (B) correlation times for internal motions, τe, and (C) slow exchange rates, Rex, all of them derived from the Lipari–Szabo model-free formalism. (D) Spectral density values of fast timescale motions according to the reduced spectral density mapping method, J(ωh). P-loop (light red, residues 10–17), switch I (yellow, residues 25–40) and switch II (light blue, residues 57–76) are shown in colored rectangles (for individual values of the most interesting switch I residues see ESI Table 1†). |
While there is no significant difference in the fast motions between the three proteins in GDP-bound states, slow motions in the μs–ms time range (characterized by Rex (Fig. 4C)) differ notably. It is WT-KRAS-GDP that possesses the most residues engaged in slow-time scale motions, namely for 34 residues the motions are characterized by Rex, while in the case of G12C and G12D it is 10 and 21 residues, respectively.
Switch I residues of all the three variants show slow-time scale motions; however in the G12D mutant the values tend to be higher than those in the WT and G12C variants: the largest Rex value measured is also in this region, describing the motion of Glu37 of G12D-KRAS-GDP. In the switch I region, Tyr32 (G12D)/Asp33 (in WT and G12C) show slow time-scale motions. Thr35 participates in slow motions in WT and G12D-KRAS-GDP and is not detected in the G12C variant. The most outstanding slow movement or exchange in the P-loop belongs to Gly13 of the G12C mutant, next to the site of the mutation. The motion of the Gly15 residue can be described by using a slow time scale parameter Rex only in the case of KRAS-G12D-GDP forms (model 3), but not in KRAS-WT-GDP and KRAS-G12C-GDP mutants (model 1 was used in both cases), while the motion of the Ser17 residue can be characterized by an Rex parameter in both KRAS-WT-GDP and KRAS-G12D-GDP forms (model 3 in both), but not in the KRAS-G12C-GDP mutant (model 1). This suggests that slow motions in the P-loop are the most relevant in KRAS-G12D-GDP and the least in KRAS-G12C-GDP; thus in the G12C form Gly13 might be involved in chemical/conformational exchange processes instead of slow motions.
To test the robustness of the results we repeated the Lipari–Szabo model-free analysis using the axial anisotropy approach using dimer-models instead of the monomeric form of KRAS-WT-GDP. We built dimer models using the two arrangements proposed in the work of Muratcioglu et al.:52 the α- and the β-dimer. We obtained very similar results when using either model to those derived by the isotropic approach (ESI Fig. 9†).
Flexible regions were also confirmed by the reduced spectral density mapping. The spectral density parameters referring to the propensity of fast motions (J(ωh)) reinforce the three flexible regions (Gly60–Ser65 in switch II, Ser106–Met111 and Ser122–Arg123) (Fig. 4D). In the GDP-bound forms Tyr32 is highly flexible (J(ωh) values are 9.77, 9.17 and 8.68 ps rad−1, while the averages are 3.26, 3.30 and 3.35 ps rad−1 for the WT and G12C and G12D mutants, respectively). In the GTP-bound state, the flexibility of Tyr32 is similar to or even less than the average flexibility (J(ωh) values are 1.10, 3.33 and 1.12 ps rad−1 while the averages are 3.33, 3.22 and 3.11 ps rad−1 for the WT and G12C and G12D mutants, respectively). Thus, it appears that the dynamics of Tyr32 – detected at its backbone amide NH – notably deviates from those of every other residue of switch I in the GDP-bound forms where its flexibility exceeds that of the surroundings. However, it becomes even more anchored than the neighboring residues (in the case of the WT and G12D variant) in the GTP-bound forms.
To further elaborate the effect of the Tyr32 side chain, and to specifically clarify whether its presence has a marked effect on the structural ensemble of the KRAS-G12C system too, we have designed a double mutant, namely that of G12C/Y32F and compared the relevant chemical shifts (1H, 15N-HSQC spectra) to those of the G12C mutant, both in the GDP- and GTP-bound states (Fig. 5A). In the case of the GDP-bound form significant changes were found mostly in switch I, in the proximity of the mutated Tyr32. In total, 29 of the assigned 164 resonances shifted slightly or even significantly primarily in the vicinity of Tyr32. In contrast, in the GTP-bound form more than half of the signals were shifted (89 of the assigned 133 signals) involving even far-lying sites (Fig. 5B). This is a clear indication of the key role of the –OH group of Tyr32 in the stabilization of the GTP-bound form. A similar conclusion was reached by Buhrman et al. in the case of the WT HRAS, where the Y32F mutation resulted in over a 2-fold decrease in the GTP hydrolysis rate – thus leading to the destabilization of the catalytically competent form. They found that, in the presence of a non-hydrolysable GTP analogue (GppNHp), the switch I region is distant from the active site and F32 points inward the switch I loop, in a conformation reminiscent of the GDP-loaded state.32
Fig. 5 (A) Overlaid 1H, 15N-HSQC spectra of KRAS-G12C-GDP (blue; left) and KRAS-G12C-Y32F-GDP (light red; left) and those of KRAS-G12C-GTP (dark blue; right) and KRAS-G12C-Y32F-GTP (red; right). Assignment is shown for the G12C mutant and only for those resonances which shifted significantly (Δδ > 0.05 ppm). The mutated Y32 → F32 is highlighted. (B) Combined chemical shift differences upon G12C/Y32F mutation along the sequence for GDP and GTP-bound forms. Δδ values are calculated according to Williamson.54 Residues for which Δδ > 0.05 ppm are colored red (GDP-bound form: left; GTP-bound form: right). (C) 3D distribution of the residues affected by the Y32F mutation (shown on the MD derived structures of the KRAS-G12C-GDP (left) and KRAS-G12C-GTP (right)): residues with significant Δδ shifts of the GDP-bound forms are colored light red (left) and those of the GTP-bound forms are colored red (right), those residues for which a signal could not be identified are shown in gray, and residues with negligible chemical shift upon mutation are colored cyan (left) and darker blue (right). The mutated Tyr32 is shown in CPK and the nucleotide is shown in ball-and-stick representation. |
In the GDP-bound RAS enzymes, Tyr32 points away from the nucleotide binding site, interacting with Tyr40 within the loop created by the switch I residues. Experimental information concerning the role of Tyr32 in the formation of the active, GTP-bound state of RAS proteins, however, is scarce due to the intrinsic hydrolysis of the physiological nucleotide, which hinders the detailed analysis of this state. The crystal structure of the HRAS-GTP complex was determined in an elegant study some twenty years ago.58 Using a caged-GTP variant (R-1-(2-nitrophenyl)-ethyl-GTP ester) the authors generated the active form within the crystal lattice by irradiation. However, due to a peculiar artifact this structure does not clarify the active, solution state conformation of Tyr32. In the crystal, each Tyr32 flips from the inner switch I region toward the nucleotide binding site but is captured in a tight H-bond (at 2.7 Å) by the neighboring monomer placed in the vicinity by crystal-packing, specifically by the O1G atom of the γ-phosphate of the neighboring GTP (ESI Fig. 10†). Curiously, this conformation of HRAS is very similar to that assumed in the HRAS-GAP complex,59 thus it was proposed that it might represent a GAP-compatible, less stable, and thus sparsely sampled sub-state of the activated, GTP-loaded form of HRAS.60 Recently, the GTP-bound form of KRAS was also explored in a comprehensive study of WT and mutant variants using both GTP and its various non-hydrolysable analogues.61 However, in these experiments GTP (or its analogue) was simply diffused into the preformed crystals of GDP-loaded KRAS – a methodology that does not guarantee complete nucleotide exchange. Accordingly, the structures reflect more the transition from the GDP- to the GTP-bound form and less the final stabilized activated state. In fact, both switch I and II are destabilized and have poorly localized electron density. The conformation of Tyr32 is seen as unchanged by nucleotide exchange; however a nearly 2-fold increase in its (relative) B-factors indicates that its position, distant from the nucleotide binding site, is far from certain. Reviewing all structures concerning the GTP-bound state (WT and mutant forms in complex with GTP or its analogues, excluding those crystal structures where switch I directly interacts with neighboring monomers) three distinct positions of Tyr32 (and the adjoining switch I region) can be found: (i) it remains secured within the switch I loop interacting primarily with Tyr40 (a GDP-complex-like state), (ii) it is flipped from this spot to the vicinity of the γ-phosphate of GTP, but points into the solvent and (iii) its hydroxyl group forms a direct H-bond to one of the oxygens of the γ-phosphate (ESI Fig. 11†).
Since our NMR results indicated a distinct change in the environment and flexibility of Tyr32 on nucleotide exchange, we considered the latter two possibilities to be plausible. Because nucleotide exchange and the corresponding conformational changes proceed ineffectively without the aid of the exchange factors, we expected this Tyr32-related conformational change to be substantial.
To generate unbiased and directly comparable models for the GDP and GTP-bound forms of WT, G12C and G12D KRAS, we carried out MD simulations starting from two crystal structures, one for each nucleotide-bound form.32,33 For representing the GTP-bound state we chose a structure where Tyr32 is near, but not directly coordinated with the γ-phosphate of GTP (O1GGTP–OHTyr32 distance of 4.3 Å).
The MD results were compared to NMR backbone dynamics data using S2 values and C′, Cα and Cβ chemical shifts (calculated from MD ensembles) for the G12C mutant in both the GDP- and GTP bound states since 13C chemical shifts were determined only for this system. The calculated and the measured data were in good agreement (as can be seen in ESI Fig. 12†).
Similarly to the NMR results, we found little structural consequence of the mutations in the core structure of KRAS in either the GDP- or the GTP-bound states (ESI Fig. 1†). The high flexibility of the switch regions is apparent, especially in the GDP-complexes. The overall effect of nucleotide exchange is a more defined structure – both the overall and especially the local conformational heterogeneity of the switch I region is considerably reduced in the GTP complexes (Table 2 and ESI Fig. 1†).
Full-sequence | Residue 10–48 | |
---|---|---|
WT-GDP | 23 | 170 |
WT-GTP | 7 | 49 |
G12C-GDP | 21 | 91 |
G12C-GTP | 12 | 52 |
G12D-GDP | 21 | 144 |
G12D-GTP | 10 | 85 |
In all three GDP complexes Tyr32 points into the loop created by switch I residues, bound directly or through a water to Asp38 or Tyr40, wedged between Ile21, Val29 and Ile36, blocking the entrance of the effector binding pocket below the plane of the nucleotide (Fig. 6A and B). Its secured binding within the switch I loop might be surprising in light of the HetNOE value indicating that its backbone amide NH assumes a number of different conformations, much more so than its immediate surroundings. The conformational heterogeneity of the switch I region does not reach that of switch II, but there is a variability which is created by the flapping movement of the segment following Tyr32. In fact, the backbone twists slightly, precisely at Tyr32. Since the motion of residues 32–36 involves the nearly rigid-body flapping of the entire, well-defined segment that is immersed into the solvent, the backbone amide NH vectors in this region will not experience a significant change in their chemical environment even if they fluctuate around their average positions. However, the amide H of Tyr32 is close to the α-phosphate of GDP and as it tilts, performing its hinge function, it passes through a broad distribution of NH–O1A/O2A distances between 3 and 7 Å. This produces not only conformationally but electrostatically quite different environments, which might be the reason why the flexibility (fast timescale motions) of the switch I region is detected only at Tyr32 (ESI Fig. 13†).
Fig. 6 The effect of nucleotide exchange on the structure of WT and mutant KRAS proteins in the MD simulations: rearrangement of the effector binding pocket on nucleotide exchange (WT-KRAS shown as an example, mid-structure of the most populated cluster). In the presence of GDP Tyr32 blocks the pocket (A), while in the GTP-bound state (B) the binding site becomes approachable (surfaces colored according to electrostatic potential (red: negative, blue positive)). For an overview of the structures of all three variants in the GDP- and GTP-bound states see ESI Fig. 1.† |
In the GTP-bound states, Tyr32 is flipped from its position within the effector binding site to the opposite side of switch I and binds directly to the O1G atom of the γ-phosphate (with OHTyr32–O1GGTP distances of 2.6 ± 0.1 Å, 2.6 ± 0.1 Å and 2.7 ± 0.4 Å in the case of the WT-, G12C- and G12D-KRAS-GTP complexes) – in accordance with the significant change in its HetNOE and J(ωh) values in the GTP-bound forms in our NMR measurements and the previous findings based on both crystallographic62–65 and theoretical approaches.60,66 This new connection results in the rearrangement of the surrounding residues as well, most significantly those of Glu31, Asp33, Thr35 and Asp38, which flip toward the inner side of the effector binding pocket, creating a significant change – the entrance becomes more spacious and lined by negatively charged groups in the GTP complexes (Fig. 6), potentiating the binding site for the docking of the effector proteins which anchor to RAS through positively charged amino acids.67
Anchoring of Tyr32 to O1G of GTP causes a further significant change in the case of WT-KRAS. This completes the trapping of the catalytic water pair near the γ-phosphate.68 According to the generally accepted mechanism, the catalytic reaction is aided by two waters: a catalytic water serving as the nucleophile and an assisting water that functions as the general base. Following the attack of the catalytic water on the phosphorus of the γ-phosphate, a proton shuffle takes place. The proton released by the catalytic water on binding to the phosphorus is accepted by the assisting water (activated by Gln61), which in turn protonates one of the oxygens of the γ-phosphate, resulting in a doubly protonated H2PO4− and GDP as the products.69–71
In our simulations, in the WT-KRAS-GTP complex with the Tyr32–GTP H-bond intact, the protein matrix creates a binding pocket for the nucleotide and the catalytic waters, sheltering them from the bulk solvent (Fig. 7).
The first water spot (always occupied) places the catalytic water at a distance of 3.3 ± 0.1 Å from the γ-phosphorus atom of GTP. This water primarily interacts with the backbone amides of switch II residues of Gly60 and Gln61. The second water molecule, at 2.8 ± 0.1 Å from the first one, is secured between the O1G oxygen of the γ-phosphate, the OE1 atom of Gln61 and the hydroxyl of Tyr32 forming an H-bond with these centers in 100%, 68.6% and 97.4% of the snapshots, respectively. Thus, this second water is positioned ideally for accepting a proton from the catalytic water and passing it on to the detaching γ-phosphate.
Interestingly, two water molecules in very similar positions can also be seen in the crystal structure of the HRAS-GppNHp complex (PDB: 3k8y)32 that was used for building the starting model for our simulation. This structure presents a conformation stabilized by the capture of an acetate-Ca2+ ion pair at the allosteric ligand binding site (formed between α3, β5 and α4 (see Fig. 1 and 8.)). There is a crystal water at 3.3 and 3.1 Å from the amides of Gly60 and Gln61 and another that is within 2.8 Å of Tyr32–OH, Gln61–OE1 and the O1G atom of the γ-phosphate. As Tyr32 is not bound directly to the nucleotide, the two captured waters are too far (4.2 Å) for interaction. This prompted the authors to propose that the catalytic reaction is initiated by a proton transfer from the catalytic water to the γ-phosphate directly, which disagrees with the QM/MM derived mechanisms of others.67,69,70 Instead, we suggest that both of these water molecules are required for the catalysis. If Tyr32 shifts to the position seen in our simulations and pulls the coordinated water molecule along, then both waters participate in the proton shuffle mechanism of hydrolysis (Fig. 8).
Fig. 8 Comparison of the crystal structure of the HRAS–GppNHp complex32 with the MD derived WT-KRAS-GTP structure (the mid-structure of the most populated cluster of the simulation). The crystal structure is shown in dark gray with the crystal waters shown in CPK, while the calculated conformer is colored in green, with cyan spheres indicating the position of the oxygen atoms of the catalytic and assisting waters. |
In the case of the mutants, the situation is slightly different. Either sidechain present at position 12 clashes with Gln61, which is thus displaced by ∼1 Å (with the PγGTP–CD1Q61 distance increasing from 6.5 ± 0.6 Å to 7.3 ± 0.7 Å in both cases) causing further rearrangement of the 61–67 segment of switch II. This rearrangement opens up the active site pocket toward the solvent (Fig. 7). Both catalytic water positions remain occupied in the GTP-bound forms of both G12C and G12D mutants, albeit in a continual exchange. While in the case of the WT-GTP complex, there are two waters that spend more than 80% of the last 300 ns of the simulation time within the active site pocket, in the case of the G12C–GTP complex the longest fraction of time spent within the pocket by a water molecule is 65.6%, and in the case of G12D–GTP, 43.9%. Retaining the water molecules in the case of the WT–GTP complex results in protein-like B-factors for both the catalytic and the assisting water (ESI Fig. 14†). The difference seen between the two mutants is likely the result of the Cys12 sidechain of G12C-KRAS-GTP H-bonding to the catalytic water in nearly a third of the snapshots (32.6%) providing additional stabilization for it within the pocket. The Asp12 sidechain of the G12D–GTP complex reaches into the solvent (in 59.6% of the snapshots the carboxylate moiety has no protein contacts) without providing such support. Comparing these findings to the intrinsic hydrolysis rates of the three forms (6.8 × 10−4 s−1, 5.0 × 10−4 s−1 and 1.9 × 10−4 s−1 for WT-, G12C- and G12D-KRAS, respectively),33 we can conclude that the distancing of Gln61 from the active center and the destabilization of the water molecules participating in catalysis might be the reason for the loss of catalytic efficiency.
To support the catalytic relevance of the derived activated conformation we carried out QM/MM calculations for the GTP hydrolysis step – considering the presence of Tyr32 in the active site for the first time. Experimentally determined intrinsic hydrolysis rates for WT-HRAS and KRAS enzymes vary between 2.1 × 10−4 s−1 and 6.8 × 10−4 s−1,41,72,73 corresponding to an activation energy of approximately 22 kcal mol−1. Using the MD derived arrangement of the WT-KRAS active site, the activation energy of the catalytic water approaching the phosphorus of the γ-phosphate of GTP – in a simple scan of their distance – afforded a 21.4 kcal mol−1 single barrier at 2.06 Å separation, which also results in the lengthening of the PG–O3B bond (GDP–Pi bond) to 1.98 Å. Further advance of the water prompts a triple proton shuffle, leading to a H2PO4− product state. Tyr32 participates in this step: the proton of the catalytic water is passed on to the assisting water, which in turn protonates Tyr32, prompting donation of its proton to the γ-phosphate in a single step (Fig. 9).
The product thus achieved is quite reminiscent of the PROD2 state of the reaction derived by Grigorenko et al.69 In their study – in the absence of Tyr32 – this state was reached from a transient product (PROD1) by rearrangement of the H-bond motif around the phosphate (requiring negligible activation energy). This suggests that the reaction proceeds quite effectively in the absence of Tyr32 also, but it requires a further rearrangement step to stabilize the product state.
To investigate the role of Tyr32 in supporting catalysis, we created a computational mutant (Y32F*) simply by exchanging the –OH group of the tyrosine to an –H, keeping the rest of the system unchanged. Our very local computational mutation (Y32F*) did not cause a significant change in the reaction profile (with a 22.1 kcal mol−1 barrier) as expected, but it did lead to a less stable product state (ESI Fig. 15†) – supporting the notion that hydrolysis is possible in the absence of Tyr32 as well. Thus, our QM/MM calculations indicate that direct bonding of Tyr32 to the γ-phosphate of GTP is not an “anti-catalytic” arrangement as was previously suggested.55,61,74 Indeed, it does not hinder the attack of the catalytic water and contributes to straightforward product formation. However, it does seem that even though the –OH group of Tyr32 is positioned very similarly to the Arg finger of GAP that provides electrostatic enhancement to the hydrolysis reaction, this is not the case for Tyr32; the significance of the Tyr32–GTP connection lies in creating a local environment that binds and activates the catalytic waters and in facilitating product formation.
Finally, we analyzed the arrangement of binding hot spots of the GTP complex of wild type and mutant KRAS proteins. Although similar analysis has been published by the Mattos group75 their work used FTMAP76 on HRAS structures co-crystallized with the GTP analogue GppNHp. Having structural information on the native, GTP-bound state of KRAS prompted us to investigate the architecture of the available hot spots. Our calculations confirmed the results of the Mattos group identifying multiple hot spots on the protein surface. In addition to the nucleotide binding site FTMAP predicted allosteric sites at the α2–β2 groove close to the effector binding loop and also at the α2–α3 groove (ESI Fig. 16†). In fact, both of these sites were validated by KRAS-inhibitor complexes; the former is used by small fragments that inhibit GEF binding77 and pyrazolopyrimidine-based KRAS inhibitors drastically lower its affinity toward effector protein RAF78 (ESI Fig. 17†), and the latter serves as the binding site of ARS compounds.79 Our study revealed that Tyr32 flips over the nucleotide binding site in GTP-bound activated complexes and gets closer to the ARS site. Since FTMAP predicted here a strong binding hot spot for the oncogenic mutants, covalent inhibitors targeting Tyr32 would represent an alternative strategy for mutant specific KRAS inhibitors. These covalent inhibitors would bind to the validated hot spot located in the α2–α3 groove and label Tyr32 irreversibly, preventing its ligation to the γ-phosphate and, through this, the emergence of the conformational signal prompting the binding of the effector proteins.
NMR measurements allowed a detailed analysis of the changes in structure and dynamics taking place in WT KRAS and its G12C and G12D mutants on nucleotide exchange. The results of HetNOE measurements and data analysis by reduced spectral density mapping singled out Tyr32 as a central contributor to the changes. We found that it is by far the most flexible switch I residue of the GDP-bound forms and the most rigid of the GTP-complexed states of WT and G12D mutant KRASs, while also proving to be crucial for the stability of the G12C–GTP complex – as demonstrated using the G12C–Y32F variant.
Combining our NMR and molecular modeling results, we propose that the capture of Tyr32 by the terminal phosphate of GTP is one of the key steps of RAS activation. It enlarges and switches the polarization of the effector binding loop preparing it for the docking of downstream partners and it contributes to the capture, positioning and activation of the catalytic waters. We found that closing of the active site pocket and securing water molecules is upset to varying degrees by position 12 mutations – in agreement with the experimental findings concerning their reduced intrinsic hydrolysis capacities.
For assuring controllability, RAS proteins were engineered to be “poor” enzymes thus providing the possibility of a wide range of different regulatory functions to oversee emergence of the growth signal. The most basic enzymatic functions of these small GTPases – nucleotide binding and hydrolysis – are carried out very ineffectively on their own and are assisted by exchange factors and GTPase activator proteins, themselves under tight regulation. It was recently shown that the RAS function can also be disrupted in an independent regulatory cycle by phosphorylation of Tyr32 and Tyr64 by impairing RAF binding but also lowering the rate of intrinsic GTP hydrolysis. Catalytic activity is reinstated by dephosphorylation.81 Inhibition of dephosphorylation was shown to suppress cancerous growth evoked by RAS mutations.82–85 In the work of Kano et al.81 the authors point out that phosphorylation also lowers the rate of intrinsic GTP hydrolysis. This finding was an ideal test for our proposed active form; thus we also carried out MD simulations for WT-, G12D- and G12V-pKRAS, phosphorylated at Tyr32, and the WT phosphorylated at both Tyr32 and Tyr64 in the GTP-bound states. The results indicate that in pKRAS-GTP complexes, the connection between pTyr32 and GTP is lost, and the opening of the active site further destabilizes the water network of the active site, which we've shown to be essential for catalysis: there is only a single water captured in the catalytic pocket of the WT protein phosphorylated at Tyr32, and none in the case of the mutants or the doubly phosphorylated WT (which is the physiological state before dephosphorylation by SHP2) (see ESI Fig. 18†). This lends support to our results: WT-KRAS is more effective in the intrinsic hydrolysis of GTP than mutants or phosphorylated forms because its active site captures both catalytically essential water molecules. In our WT-KRAS-GTP model Tyr64 is also quite close to the active site; in fact it plays a role in forming the pocket that captures the active waters – therefore its phosphorylation would also contribute to the destabilizing of the “switched-on” conformation.
Our hot spot analysis on the mutant KRAS proteins revealed that a similar effect might be achieved by the covalent modification of Tyr32. The vicinity of the strong binding hot spot in the GTP-bound structure of the oncogenic mutants might open new opportunities for designing Tyr32 targeting covalent inhibitors. Similarly to its phosphorylation, irreversible binding to Tyr 32 might also stall the GTPase cycle and probably impairs binding to effectors. Our efforts now focus on the discovery of this novel type of KRAS inhibitor with significant therapeutic potential.
BMRB | Biological magnetic resonance data bank |
DSS | Sodium trimethylsilylpropanesulfonate |
GDP | Guanosine diphosphate |
GTP | Guanosine triphosphate |
GppNHp | 5′-Guanylyl-imidodiphosphate |
HetNOE | Heteronuclear Overhauser effect |
HSQC | Heteronuclear single quantum coherence |
BEST | Band-selective excitation short-transient |
MD | Molecular dynamics |
NOESY | Nuclear Overhauser effect spectroscopy |
PDB | Protein data bank |
QM/MM | Quantum mechanics/molecular mechanics simulation |
SSP | Secondary structure propensities |
TOCSY | Total correlation spectroscopy |
WT | Wild type |
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d0sc03441j |
This journal is © The Royal Society of Chemistry 2020 |