Keita
Tateno‡
a,
Takami
Ando‡
a,
Maako
Tabata
a,
Haruka
Sugasawa
a,
Toshifumi
Hayashi
a,
Sangya
Yu
a,
Sayeesh
PM
a,
Kohsuke
Inomata
a,
Tsutomu
Mikawa
ab,
Yutaka
Ito
*a and
Teppei
Ikeya
*a
aDepartment of Chemistry, Graduate School of Science, Tokyo Metropolitan University, 1-1 Minamiosawa, Hachioji, Tokyo 192-0397, Japan. E-mail: tikeya@tmu.ac.jp; ito-yutaka@tmu.ac.jp
bRIKEN Center for Biosystems Dynamics Research, RIKEN, 1-7-22 Suehiro-Cho, Tsurumi-Ku, Yokohama 230-0045, Japan
First published on 12th September 2024
The adaptor protein human GRB2 plays crucial roles in mediating signal transduction from cell membrane receptors to RAS and its downstream proteins by recruiting SOS1. Recent studies have revealed that GRB2 also serves as a scaffold for liquid–liquid phase separation (LLPS) with SOS1 and transmembrane receptors, which is thought to regulate the magnitude of cell signalling pathways. In this study, we employed solution NMR spectroscopy to investigate the interactions of the full-length GRB2 with proline-rich motifs (PRMs) derived from ten potential GRB2-binding sites in SOS1, as well as a peptide from a phosphorylation site of EGFR. Our findings indicate that the binding affinity of the two SH3 domains of GRB2 for PRMs differs by a factor of ten to twenty, with the N-terminal SH3 domain (NSH3) exhibiting a markedly higher affinity. The interactions of PRMs with the SH3 domains affected not only the regions surrounding the PRM binding sites on the SH3 domains but also the linker area connecting the three domains and parts of the SH2 domain. Analysis of the interaction between the phosphorylated EGFR binding site and the SH2 domain revealed chemical shift perturbations in regions distal from the known binding site of SH2. Moreover, we observed that the inter-domain interactions of the two SH3 domains with the SH2 domain of GRB2 are asymmetric. These findings suggest that the local binding of PRMs and phosphorylated EGFR to GRB2 impacts the overall structure of the GRB2 molecule, including domain orientation and dimerisation, which may contribute to LLPS formation.
GRB2 is a key mediator in this upstream process, comprising an Src homology 2 (SH2) domain flanked by N-terminal and C-terminal Src homology 3 (SH3) domains (Fig. S1a†). While the SH2 domain associates with the phosphorylated tyrosine of RTKs, the N-terminal and C-terminal SH3 domains (henceforth referred to as NSH3 and CSH3, respectively) interact with proline-rich motifs (PRMs) of SOS1. SOS1, serving as a guanine-nucleotide-exchange factor (GEF) of RAS and RAC, is also a multi-domain protein composed of six major domains or motifs, a Histone fold (HF), Dbl homology (DH), Pleckstrin homology (PH), RAS exchange motif (REM), cell division cycle 25 (CDC25), and proline-rich (PR) (Fig. S1b†). The PR domain, which associates with the SH3 domains of GRB2 through PRMs, is an intrinsically disordered region (IDR) as an extended random coil-like conformation in solution.
In addition to the role of GRB2 as a carrier of SOS1 to the membrane, recent studies have shown that GRB2 also serves as a major component for liquid–liquid phase separation (LLPS) in the RAS-mediated transduction pathway by acting as a scaffold for other proteins.3,4 Several experiments indicate that the association and dissociation dynamics of LLPS can regulate the magnitude of the signal transduction on the RAS-mediated signal pathways.5 LLPS involving GRB2 was initially discovered in the system with GRB2, SOS1, and linker for activation of T cells (LAT), and has recently been found in a system with GRB2 and EGFR.6 Although the precise molecular mechanisms of phase separation in these systems are not fully understood, it is thought that the three domains of GRB2 work together to bundle the other proteins within these complexes. In the ternary complex composed of GRB2, SOS1 and LAT, it is proposed that GRB2 serves as a bridge between LAT and SOS1, owing to the association of its SH2 domain with phosphorylated tyrosine residues on LAT and two SH3 domains with PRMs on SOS1, respectively (Fig. S1c†). In the binary complex formation of GRB2 and EGFR, the dimerisation of GRB2 is thought to be a key for the connection with EGFR.7 However, sufficient experimental evidence to substantiate these models remains lacking.
Hence, it is crucial to elucidate the interactions of GRB2 with SOS1 at atomic resolution for a detailed understanding of the regulatory mechanisms in the signal transduction pathway. While the 3-dimensional (3D) structure of GRB2 is determined by X-ray crystallography,8 other structural and computational studies, such as small angle X-ray scattering (SAXS),9 molecular dynamics simulation,10 and protein structure predictions via AlphaFold 2 (UniProt Accession ID: P62993), suggest that the relative orientation of the three domains connected with two flexible loops significantly differs from the crystal structure (Fig. S1d†) in aqueous solutions. The domain orientation of GRB2 appears to be dynamic even under the condensed conditions, allowing for flexible interactions with SOS1, LAT or EGFR. Considering that the PR domain of SOS1 is also extraordinarily flexible, an intrinsically disordered region (IDR), the molecular interaction of GRB2 and SOS1 could be highly dynamic, leading to an enormous number of different conformations in the condensed droplet. Optical microscope experiments have shown that these associations and dissociations are also highly dynamic on a time scale of seconds to minutes.3,5,6 Yet, it remains unclear how the microscopic molecular motions of GRB2 and SOS1 on a time scale of μs to ms result in macroscopic droplet formation at a much slower time rate. It is therefore necessary to investigate the 3D structures and dynamics of these proteins in solution at atomic resolution. It still remains a challenge to observe it by generally used structural analysis methods, such as X-ray crystallography and cryo-electron microscopy (cryo-EM). Thus, we employed solution Nuclear Magnetic Resonance (NMR) spectroscopy which is an optimal tool for studying highly dynamical proteins, including multi-domain proteins with flexible linkers and IDRs in aqueous or condensed environments.
Liao et al. recently reported interactions of isolated NSH3 and CSH3 domains of GRB2 with several PR peptides derived from potential PRMs of SOS1 by solution NMR.10 This work proposed that the SOS1 PR domain possesses ten potential binding sites for the SH3 domains of GRB2 (henceforth referred to as S1–S10 from the N-terminus of the SOS1-PR domain; Fig. S1b†). This report also suggested that nine of the ten sites preferentially bind to the NSH3 domain, while one site has a higher affinity for CSH3. However, these data were obtained using the isolated NSH3 and CSH3 domains, not the full-length GRB2. It is assumed that if the two SH3 domains of GRB2 bind to a single binding site in SOS1, competition between the two SH3 domains for the single SOS1 binding site would occur, resulting in different binding affinities and modes compared to independent bindings. Furthermore, the two linkers connecting the SH3 domains are highly flexible, and their relative orientations can significantly change, which may affect their respective interactions with SOS1. In fact, significant differences in binding affinity have been reported between a single domain and a full-length protein,11 highlighting the need for careful investigation of the binding sites of the full-length GRB2 and SOS1 PRMs.
Hence, in this work, we elucidate the molecular recognition mechanisms and binding affinities of the full-length GRB2 with several SOS1 PRMs and an EGFR phosphorylated peptide. Through NMR titration as well as isothermal titration calorimetry (ITC), we carefully identify the binding sites and modes, and their affinities for each SOS1 PRM by considering several theoretical binding models. In addition, docking simulations based on the experimental data provide insights into the interaction mechanisms based on tertiary structures.
2D 1H–15N HSQC spectra of GRB2 in the presence of S4, S5, S9, and S10 PRMs with various protein-PRM concentration ratios are shown in Fig. 1 and S6–S8.† Chemical shift perturbations (CSPs) of the backbone 1H and 15N resonances during the titration with the SOS1 PRMs are plotted against the amino acid sequence (Fig. 2). Fig. 2a presents CSPs at each titration point with different colours from blue to red, demonstrating that, for SOS1 S4, S5, and S9 PRMs, the NSH3 bars predominantly show blue colours, whereas the CSH3 shades are mostly orange or yellow. This indicates that the CSPs in NSH3 occurred at early stages in the titration experiments, whereas those in CSH3 were seen even at its later points. This observation supports the previous report, utilising isolated NSH3 and CSH3 domains, that GRB2 possesses two distinct binding affinities, with NSH3 exhibiting higher affinities for S4, S5, and S9 PRMs compared to CSH3.10Fig. 2b and S9† further illustrate this trend, showing that NSH3 undergoes early CSP changes, while CSH3 shifts occur later in the titration. Additionally, our data using the full-length GRB2 reveals that the CSPs are also observed in some residues of the SH2 domain, occurring at later stages in the titration compared to NSH3, thus aligning or presumably synchronising with the CSP patterns of CSH3 (Fig. 2 and S9†). Moreover, the previous study with the isolated SH3 domains proposed that the binding affinity of SOS1 S10 PRM was higher for CSH3 rather than for NSH3. However, our titration experiments with SOS1 S10 PRM do not show a clear difference in CSPs between NSH3 and CSH3, indicating that the CSP patterns for S10 PRM are clearly different from the other PRMs probably due to its unique properties.
![]() | ||
Fig. 2 Plots of chemical shift perturbation of backbone 1HN and 15N nuclei of GRB2 upon the titration with S4, S5, S9, and S10 PRMs. (a) Bar plots of chemical shift perturbation of backbone 1HN and 15N nuclei of GRB2 upon the titration with S4, S5, S9, and S10 PRMs. The mean shift difference Δδave was calculated as [(Δδ1HN)2 + (Δδ15N/5)2]1/2 where Δδ1HN and Δδ15N are the chemical shift differences (ppm) between GRB2 on its own and in the presence of the PRMs. The bar graphs are colour-coded according to the protein–peptide concentration ratio and are overlaid. The proline residues as well as the residues for which 1HN–15N correlation cross-peaks were not analysed due to signal overlap or other reasons are shown in grey. The secondary structures of GRB2 are also shown. (b) Chemical shift perturbation 3D bar plots with an added axis for titration points to the plots of figure (a). The vertical, horizontal, and height axes represent the residue number, titration point, and chemical shift perturbation, respectively. NSH3, SH2, CSH3, and the linker regions are shown in green, orange, blue, and grey, respectively. (c) Chemical shift perturbation upon the titration with S4, S5, S9, and S10 PRMs represented on the crystal structure of GRB2 (PDB ID: 1GRI). |
Significant CSPs observed upon PRM binding were mapped onto the crystal structure of GRB2 (Fig. 2c). As expected, for all PRMs, notable CSPs were predominantly located within the RT loops of both SH3 domains and the adjacent β-strands (β2, 3, 11, and 12). Meanwhile, as described above, significant CSPs were also identified at residues 120–145 in the SH2 domain, which is not considered to have PRM binding sites. The CSPs of N143 in the SH2 domain were especially pronounced, with the S10 PRM inducing the most significant change (Fig. 1 and S6–S8†). The spatial separation of the SH3 and SH2 domains in the crystal structure (Fig. S1d†) implies interactions beyond the direct binding sites of SH3. Furthermore, substantial CSPs were detected in the linker region connecting SH2 and CSH3 but not in the linker between NSH3 and SH2. Near CSH3 in the linker to SH2, non-linear trajectories, such as curves or U-turn patterns, were observed at residues Q157, T159, and Y160 (Fig. 1 and S6–S8†). Three possible explanations can be considered for this observation: (1) direct but weak interactions of the PRMs with SH2, (2) PRM-binding to the SH3 domains altering their relative domain locations by affecting the linker and the adjacent areas of the SH2 domain, and (3) GRB2 dimerisation influencing the SH2 domain and linker. Considering the unique U-turn shaped perturbations in the linker between SH2 and CSH3, the CSPs synchronised between CSH3 and SH2, as well as the independently dynamic behaviour of CSH3, these potentially indicate a shift in the overall domain orientation of GRB2 upon PRM bindings to the SH3 domains. Subsequently, this domain orientation may also cause GRB2 dimerisation. More interestingly, exclusively S10 shows a linear chemical shift change but not U-turn patterns in this region (residues 157–160), similar to those in the other residues, also suggesting that S10 has a characteristic binding mode from the others. These CSPs in the SH2 domain and the linker regions, other than the SH3 domains, were not previously considered and were revealed for the first time by analysis using full-length GRB2.
Comparing the magnitude of the CSPs in the titration experiments, the overall shift changes were more significant for CSH3 than for NSH3.
PRM | Sequence | Motif | K D (μM) from full-length GRB2a | K D (μM) from isolated SH3sb | ||
---|---|---|---|---|---|---|
K D1 | K D2 | NSH3 | CSH3 | |||
a The dissociation constants were calculated by globally fitting NMR titration data for selected residues (with maximum CSPs over 0.03 ppm) to an independent two-site binding model. The KD values in parentheses for NSH3 were calculated using the single-binding site model. The concentrations of GRB2 used in the NMR measurements ranged from approximately 200 to 270 μM. NA stands for data for which clear signals were not detected. b The dissociation constants were reported in the previous report using isolated NSH3 and CSH3, respectively.10 c S4 PRM used in the previous report has one additional amino acid at both the N- and C-termini, PVPPPVPPRRRP. | ||||||
S1 | LEIEPRNPKP (1014–1023) | PRxPKP | — | — | 730 ± 890 | NA |
S2 | PKPLPRFPKK (1021–1030) | PxPxPRxPKK | — | — | 250 ± 96 | 440 ± 146 |
S3 | APNSPRTPLTPPPAYS(1079–1093) | PxxPRxPxxP | — | — | 280 ± 103 | NA |
S4 | VPPPVPPRRR (1149–1158) | PxxPxR | 38.7 ± 2.5 (72.1 ± 3.7) | 520.5 ± 33.5 | 37 ± 22c | 142 ± 18c |
S5 | DSPPAOPPRQPT (1177–1188) | PxxPxR | 40.9 ± 2.4 (44.9 ± 3.8) | 900.8 ± 34.6 | — | — |
S6 | ESPPLLPPREPV (1209–1220) | PxxPxR | — | — | — | — |
S7 | PSPFTPPPPQTPSP (1253–1266) | PxxPPPPxxP | — | — | 580 ± 462 | NA |
S8 | PSPHGTRRHLPSPP (1264–1277) | RR | — | — | 208 ± 42 | 400 ± 447 |
S9 | IAGPPVPPRQST (1287–1298) | PxxPxR | 28.4 ± 2.9 (35.0 ± 3.8) | 515.9 ± 17.6 | — | — |
S10 | PKLPPKTYKREH (1303–1314) | PxxPxKxxKR | 483.1 ± 45.7 | 579.2 ± 45.3 | 71.5 ± 24.2 | 130.8 ± 54.4 |
The two dissociation constants for PRMs S4, S5, and S9 were quite similar among the three PRMs, with S9 exhibiting the lowest values. In contrast, those for S10 were approximately ten times higher compared to the others. The KD1 values associated with the NSH3 domain for S4, S5, and S9 were significantly lower, by a factor of ten to twenty, than the corresponding KD2 values for CSH3. Notably, these variances are more pronounced than the 1.5 to 4.0-fold differences observed in the previous research using the isolated SH3 domains.10 The profound differences in the KD values between the two domains are also reflected in the CSP data (Fig. 2), where early titration points showed substantial CSPs for NSH3, whereas the perturbations for CSH3 were only observed at later stages of the titration. To verify the significantly smaller KD values of NSH3 for S4, S5, and S9, we also calculated their KDs using the simpler single-site binding model for comparison (Table 1). Although the calculated KDs were slightly higher in all three cases due to the absence of the second KD, substantial differences were not observed between the two models. This considerable difference in binding affinity between the two domains could markedly impact the nature of the multivalent interactions with SOS1. Meanwhile, the two dissociation constants for S10 were nearly equivalent, or the KD1 for NSH3 was slightly lower than KD2 for CSH3. This also contrasts with the previous analysis employing the isolated SH3 domains in which its KD1 was four times larger than its KD2. This ten to twenty-fold difference between the two dissociation constants for S4, S5, and S9 was further corroborated by our ITC analysis of the full-length GRB2 (Table S1†). The ITC analysis was also conducted for the remaining six PRMs. For S1, S3 and S7, the binding affinity to the two SH3 domains was likely so weak that little calorimetric change was observed, preventing the determination of their dissociation constants. For S2 and S8, their dissociation constants for one-side with higher binding affinity could be determined, but other constants with weaker affinity could not be obtained. Nevertheless, a comparison of the dissociation constants obtained from the ITC data suggests that these six PRMs have considerably lower affinities than S4, S5 and S9.
![]() | ||
Fig. 4 Plots of chemical shift perturbation of backbone 1HN and 15N nuclei of GRB2 upon the titration with the EGFR phosphorylated peptide. (a) Bar plots of chemical shift perturbation of backbone 1HN and 15N nuclei of GRB2 upon the titration of the EGFR phosphorylated peptide without (upper) and with (lower) SOS1 S4 PRM. The mean shift difference Δδave was calculated as [(Δδ1HN)2 + (Δδ15N/5)2]1/2 where Δδ1HN and Δδ15N are the chemical shift differences (ppm) between GRB2 on its own and in the presence of the PRMs. The bar graphs are colour-coded according to the protein–peptide concentration ratio and are overlaid. The proline residues as well as the residues for which 1HN–15N correlation cross-peaks were not analysed due to signal overlap or other reasons are shown in grey. The secondary structures of GRB2 are also shown. (b) Chemical shift perturbation upon the titration with the EGFR phosphorylated peptide without (upper) and with (lower) SOS1 S4 PRM represented on the crystal structure of the GRB2 SH2 domain (PDB ID: 1BMB). |
Interestingly, slight CSPs were observed not only in the SH2 domain but also in the two SH3 domains. A comparison of CSPs between NSH3 and CSH3 revealed a more pronounced overall chemical shift change in NSH3. The NSH3 domain is far from SH2 on the crystal structure (Fig. S1d†), and therefore, the influence of NSH3 upon the EGFR-like peptide binding was unexpected. This observation further supports the model described earlier, suggesting that NSH3 and SH2 are in closer contact, while CSH3 exists independently.
The results of the model validation via bootstrap for the three models demonstrated that there were no significant differences in residuals and standard deviations among the models, regardless of whether PRM S4 was bound or not (Fig. S10b†). Consequently, the NMR titration data did not provide correlations between conformational changes and dissociation constants, and it was not determined whether the SH2 domain underwent conformational changes. Therefore, we adopted the dissociation constant from the most straightforward single-binding model. The derived dissociation constants were 4.2 ± 1.6 μM without S4 PRM and 1.8 ± 8.9 μM when S4 PRM was bound, approximately representing a two-fold increase in binding affinity for the S4 PRM-bound state (Table 2). This result is consistent with the Surface Plasmon Resonance (SPR) analysis, indicating a two-fold enhancement in the binding affinity of the phosphorylated EGFR peptide to GRB2 upon S4 PRM binding.7 Our results validate the model that the SOS1 PRM binding promotes the association of phosphorylated EGFR, and it is very interesting that the interaction in the SH3 domains, which is quite distant in the crystal structure (Fig. S1d†), affects the interaction with the EGFR-like peptide in the SH2 domain. Meanwhile, although the tendency of CSPs from the EGFR-like peptide titrations with and without SOS1 S4 PRM was clearly different (Fig. 4), the standard deviation values obtained by the bootstrap method were relatively large. Thus, further experiments will be required to confirm this.
The dissociation constants obtained in our study showed binding affinities that were 6–10 fold weaker than those obtained in the SPR analysis. This difference may be due to the phosphorylated EGFR-like peptide used in the present analysis being the difference of the SQ sequence and about four amino acids shorter than that used in the SPR analysis. Our previous analysis has also shown that differences in a few amino acids before and after the binding motif affect the magnitude of the overall dissociation constant.24
![]() | ||
Fig. 5 Comparisons of residue-specific KDs and docking simulations of S4 and S10 PRMs between NSH3 and CSH3. The top five structures of SOS1 PRM S4 (a) and S10 (b) with the lowest energy from the docking simulations are presented for NSH3 (left) and CSH3 (right), with mapping of residue-specific dissociation constants (KDs) on the crystal structure of the NSH3 (PDBID: 1AZE) and CSH3 domains (PDB ID: 1IO6). The magnitudes of residue-specific KDs are colour-coded both on the structures and aligned amino acid sequences of the SH3 domains. |
As expected, the residue-specific dissociation constants with lower values and the PRM binding sites from the docking simulations were primarily localised to the RT loop and the adjacent β-strand. However, there was a clear difference in the pattern of residue-specific dissociation constants between NSH3 and CSH3. Regions around Y7 and D8 in NSH3, and F167 and D166 in CSH3, demonstrated high affinity for one edge of S4 PRM. Conversely, NSH3 exhibited high affinities at the opposite edge of S4 PRM, particularly at the region comprising D15, E16, W36, K38 and F47, whereas CSH3 showed markedly lower affinities at this site. This variance may account for the 10–20 fold difference in binding affinity for S4, S5, and S9 between NSH3 and CSH3. Regarding S10, the pattern of residue-specific dissociation constants was also considerably different between NSH3 and CSH3. In CSH3, the relatively high dissociation constants are broadly distributed across the surface of its domain without any significantly low values at specific sites. In addition, the top five conformations of S10 PRM predicted by docking simulations on CSH3 clustered into two groups: one associated with the RT loop and its neighbouring β-strand, and the other at the centre of the β-sheet. The dispersion for weak residue-specific dissociation constants over the domain's surface and the non-arbitrary docking conformations suggest a unique binding mode for S10 PRM to CSH3.
Comparison of the NMR spectra of the full-length GRB2 and its individual isolated domains revealed that the spectrum of the isolated CSH3 domain agrees well with that of the full-length GRB2, whereas the spectra of the isolated NSH3 and SH2 showed little similarity to the full-length protein. This inconsistency suggests that the NSH3 and SH2 domains of full-length GRB2 engage in interdomain contacts in solution, while the CSH3 domain exists independently. In the crystal structure (Fig. S1d†), the two SH3 domains are proximal to each other but distant from the SH2 domain, indicating significant differences in domain orientations in the crystal compared to in solution, as predicted by previous studies.9 Although the linker region between NSH3 and SH2 (linker 1) appears as a long random coil in the crystal structure, the properties of the amino acid sequence in the region, which includes several prolines and aromatic residues (MKPHPWFF), suggest significant bulkiness and low flexibility in this region (Fig. S1d†). These amino acid sequence properties also support our NMR data indicating that NSH3 and SH2 are in contact and CSH3 is isolated from the others.
NMR titration experiments with GRB2 and SOS1 PRMs showed significant chemical shift perturbations at the RT-loop as well as at the β-strand located parallel to the loop in both NSH3 and CSH3. These results confirm that this region is the primary binding site for PRMs in the two SH3 domains, as previously shown.10 Meanwhile, chemical shift perturbations were also observed in the linker region and the SH2 domain, indicating that the PRM bindings may induce alterations in the relative domain orientations of the GRB2's three domains and subsequently result in GRB2 dimerisation. Notably, the trajectories of the chemical shift changes in the region (Q157, T159, and Y160), near CSH3 in linker 2, were curved or U-turn shaped unlike the perturbation patterns in other regions, suggesting the presence of multiple-states with different chemical shifts in this area. This region corresponds to the crossing point of linkers 1 and 2 in the crystal structure (Fig. S1d†), where an inhibitory effect on GRB2 dimerisation was observed upon mutation of Tyr160 to glutamate (Y160E).6 This implies that the PRM bindings may impact this crossed structure or the GRB2 dimerisation. Furthermore, the comparison of the NMR spectra between full-length GRB2 and the three isolated domains, as well as the previous NMR relaxation experiment,9 suggests that only the CSH3 domain of GRB2 exhibits larger motion (Fig. 6a). The structure predicted by AlphaFold 2 shows that while NSH3 and SH2 are adjacent to each other, CSH3 is distant from the others with low values of AlphaFold's Model Confidence in the linker 2 connecting SH2 and CSH3. This also supports the model that the CSH3 domain has fewer contacts and isolated motion. Considering the previous reports that GRB2 dimerisation is crucial for LLPS formation by GRB2, SOS1, and EGFR,6 this significant dynamics of CSH3 may facilitate equilibrium between dimeric and monomeric conformations (Fig. 6a) and cause dynamic droplet formation and dissociation.
The calculated dissociation constants revealed that the binding affinities of NSH3 for the PRMs S4, S5 and S9 were 10–20 times higher than those of CSH3. The differences in affinity between NSH3 and CSH3 in the analysis of the full-length GRB2 were considerably larger than the 4- to 5-fold difference obtained by the previous research isolating the individual domains.10 It is conceivable that two binding sites with different binding affinities, high and low ones, coexist and compete in the interaction with a counterpart binning site, making the difference more pronounced. This significant difference between the two experiments indicates that analysis using the full-length GRB2 is necessary for accurate measurement of the protein's affinities.
GRB2 possesses two SH3 domains which are thought to have almost identical functions but very different binding affinities. The analysis of the MD simulations suggests that this difference in binding affinity is influenced not only by amino acid differences at the binding interfaces of NSH3 and CSH3 but also by significant differences in the flexibility and recognition specificity of this region.26 The biological significance of this large difference of their binding affinities must be considered. According to the database OpenCell (https://opencell.czbiohub.org),27 the GRB2 concentration in HEK293 cells is 1.4 μM, whereas the SOS1 concentration is 32 nM. This shows that SOS1 is considerably less abundant than GRB2 in intracellular environments, and the two SH3 domains of a GRB2 molecule are likely to competitively bind to PRM sites in a single SOS1 molecule. Hence, taking into account the currently widely accepted model for the LLPS formation by GRB2-SOS1,3 the previous research for them,5,6 and our results, this interaction process may be mediated by the following mechanism: first, NSH3 of GRB2 binds to one of the PRMs of SOS1 due to its higher affinity. Second, whereas CSH3 of the same GRB2 molecule has more chances to bind to the remaining sites of the PRMs due to their spatial proximity and the single-strand connection, NSH3 of a second GRB2 molecule, containing higher affinity, preferentially interacts with the PRMs over CSH3 of the first GRB2 (Fig. 6b, left). This process leads to the bindings of several GRB2 molecules to a single SOS1 and the formation of GRB2–SOS1 cross-links, which might be a key for the LLPS formation or network structures by GRB2 and SOS1 (Fig. 6b, right). Furthermore, CSH3 is expected to have weak contacts with other domains and be highly mobile, which could help capture remaining free PRMs. These asymmetric physical properties of the two SH3 domains of GRB2, including dynamics and binding affinity, suggest that GRB2 has a mechanism beyond the simple function of binding to SOS1 and recruiting it near the transmembrane, as previously thought.28 These properties may enable GRB2 to form complex multivalent interactions and bridges among molecules, potentially facilitating LLPS formation. In terms of the formation of complex networks, it is also interesting to note that exclusively for the SOS1 S10 PRM, NSH3 and CSH3 exhibit comparable binding affinities and no U-turn shape trajectories of CSPs of residues 157, 159, and 160, indicating that GRB2 can engage in multivalent interactions with different binding modes and different affinities to SOS1. As described in the previous research,6 dimerisation of GRB2 would also contribute to the complex network in LLPS (Fig. 6b, right). Meanwhile, the model we propose here is one of several possible models, and to validate these models and provide a comprehensive understanding of the GRB2–SOS1 PR interaction and their LLPS formation, it will be necessary to conduct future verification through various methods, such as mutation experiments, mathematical modeling, and molecular dynamics simulations. We did not explicitly include the effect of the GRB2 dimerisation in the fitting models for the ITC and NMR titration experiment data, despite the possibility that the equilibrium between the monomeric and dimeric states of GRB2 may shift with the addition of PRMs. This will also become an important point in future research.
In this study, we analysed the interaction of full-length GRB2 with proline-rich motifs (PRMs) in detail, primarily using solution NMR. Solution NMR provides information on binding affinity and dynamics at atomic resolution. We analysed the data in statistical detail, considering several possible models, to extract the maximum amount of information from the NMR data. We believe that the extensive modelling presented here will become increasingly effective in biophysical fields. The models and calculation methods are packaged as a single software, which can be made available upon request. By using a large amount of data and these data analysis methods, we have succeeded in observing local interaction patterns and changes in the relative orientations of the three domains of GRB2 that cannot be seen by ITC and SPR, which have been commonly used for interaction analysis. NMR is particularly useful in the structural analysis of highly flexible proteins such as multidomain proteins, IDPs, and RNAs. Thus, we believe that it will further contribute to the understanding of biological phenomena such as liquid–liquid phase separation, in which these molecules are often involved, at atomic resolution.
GRB2 SH2 domain (54–159) in addition to an N-terminal GST tag, a TEV protease cleavage site, and GG linker was constructed into the expression vector pLEICS02 and over-expressed in the E. coli Rosetta™ 2 (DE3) strain (Merk). Uniformly 13C/15N and 15N-labelled GRB2 SH2 samples were prepared by growing the bacteria at 37 °C in LB media in the same M9 minimal media described above, containing [13C6]-glucose and 15NH4Cl as the sole carbon and nitrogen sources, respectively. The expression was induced by adding IPTG at an OD 600 nm of 0.5. The cells were further incubated for 18 h at 20 °C, and harvested.
S1(LEIEPRNPKP), S2(PKPLPRFPKK), S3(APNSPRTPLTPPPAYS), S4(VPPPVPPRRR), S5(DSPPAIPPRQPT), S6(ESPPLLPPREPV), S7(PSPFTPPPPQTPSP), S8(PSPHGTRRHLPSPP), S9(IAGPPVPPRQST), and S10(PKLPPKTYKREH) were commercially obtained from Eurofins Genomics. EGFR (EpYINSQV) peptides were obtained from Toray Research Center Incorporated.
For purification of GRB2 SH2 domain, the harvested cells were re-suspended in a phosphate buffer [pH 7.2, 1% Triton X100, 1% 2-mercaptoethanol, 0.1 mM p-amidinophenyl methansulfonyl fluoride (p-APMSF; FUJIFILM), 0.1% Halt™ Protenase Inhibitor single-use cocktail EDTA-free (Thermo Scientific)] and lysed by sonication in the presence of hen egg lysozyme (0.1 mg mL−1). The cleared lysate was collected after centrifugation. The supernatant was loaded onto Glutathione Sepharose 4B (Cytiva) pre-equilibrated with buffer D [50 mM sodium phosphate (pH 7.2), 150 mM NaCl]. The column was washed with 6 column volumes of the same buffer, followed by elution with a linear gradient of 0–50 mM glutathione. The fractions containing GRB2 SH2 domain were concentrated using an Amicon Ultra-15 3 kDa (Merck) and then incubated with uTEV3 protease to remove the GST tag in buffer D for approximately 16 h at 25 °C. The cleaved sample was loaded onto a HiLoad 16/60 Superdex 75 column pre-equilibrated with D buffer. The final GRB2 SH2 domain fractions were dissolved in NMR buffer [90% 1H2O/10% 2H2O containing 20 mM Na2HPO4–NaH2PO4, pH 7.2, 0.05% NaN3].
The titration experiments with the SOS1- and EGFR-derived peptides were performed in an NMR buffer [20 mM sodium phosphate, 0.05% (w/v) NaN3] at 25 °C. 15N-labelled GRB2 was successively titrated up to a molar ratio of 1:
12 with the S4, 1
:
36 with S5 and S9, 1
:
24 with S10, and 1
:
2.0 with EGFR-like peptide, respectively. 2D 1H–15N HSQC spectra were measured at each titration point. Chemical shift differences, Δ, were analysed by Δ = ((ΔH)2 + (ΔN/5)2)1/2, where ΔH and ΔN are the differences in ppm for the backbone amide 1H and 15N chemical shifts between the two conditions. 1 ppm corresponds to 600.13 Hz for 1H and 60.81 Hz for 15N.
For the conformational selection model, the KD and Kf between two states were calculated with the equation.
For the induced-fit model, the KD and Kf between free and complex states were computed with the equation.
For the two-site independent binding site model,15 the KD1 and KD2 between P–L and L–P complex states in Fig. 3a were computed with the equation.
For the two-site cooperative binding site model,
KD3 = αKD2, KD3 = αKD22/KD1 |
In all models, the non-linear least-squares fitting was performed by the Levenberg–Marquardt algorithm. The precision of the model parameters was assessed by conducting the bootstrap resampling on the residuals of the regression model. This procedure was repeated 1000 times to generate the bootstrap distribution of the model parameters. The bootstrap confidence intervals of the model parameters were computed with the percentile method, i.e., the 25th and the 975th smallest values of the bootstrap distribution were regarded as the lower and the upper limits of the 95% bootstrap confidence intervals, respectively.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4sc02656j |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2024 |