Dhanusha Yesudhas,
Muhammad Ayaz Anwar and
Sangdun Choi*
Department of Molecular Science and Technology, Ajou University, Suwon, 16499, Korea. E-mail: sangdunchoi@ajou.ac.kr; Fax: +82 31-219-1615; Tel: +82 31-219-2600
First published on 13th March 2019
The efficiency of stem cell transcriptional regulation always depends on the cooperative association and expression of transcription factors (TFs). Among these, Oct4, Sox2, and Nanog play major roles. Their cooperativity is facilitated via direct protein–protein interactions or DNA-mediated interactions, yet the mechanism is not clear. Most biochemical studies have examined Oct4/Sox2 cooperativity, whereas few studies have evaluated how Nanog competes in the connection between these TFs. In this study, using computational models and molecular dynamics simulations, we built a framework representing the DNA-mediated cooperative interaction between Nanog and Sox2 and analyzed the plausible interaction factors experienced by Nanog because of Sox2, its cooperative binding partner. Comparison of a wild-type and mutant Nanog/Sox2 model with the Nanog crystal structure revealed the regulatory structural mechanism between Nanog/Sox2–DNA-mediated cooperative bindings. Along with the transactivation domains interaction, the DNA-mediated allosteric interactions are also necessary for Nanog cooperative binding. DNA-mediated Nanog–Sox2 cooperativity influences the protein conformational changes and a stronger interaction profile was observed for Nanog-Mut (L103E) in comparison with the Nanog-WT complex.
The self-renewal efficiency of embryonic stem cell (ESC) is determined by the Nanog protein level expression. However, how Nanog is regulated at the protein level and the protein partners of Nanog that function to direct self-renewal are largely unclear.3,4 Based on experimental studies, more than 130 proteins (including transcription factors [TF], chromatin modifying complexes, phosphorylation and ubiquitination enzymes, basal transcriptional machinery members, and RNA processing factors)5,6 made a Nanog interactome, and Sox2 was identified as a healthy interacting partner of Nanog. Nanog-bound promoters are co-occupied by the octamer binding protein 4 (Oct4) and SRY-related HMG-box gene 2 (Sox2) proteins.2 Nanog and Sox2 are mainly interacting via their transactivation domain (TAD), whereas modeling and simulating the unstructured TAD domain residues (∼250) of both the proteins is a tedious process.
To date, biochemical characterization of protein–protein interactions in pluripotent cells has been studied extensively for Sox2 and Oct4.6–8 From a biochemical as well as computational perspective, little is known about how Nanog fits into the tight relationship between Oct4 and Sox2.9 Hutchins et al. (2013) described a de novo motif representation for the Nanog–Sox2 complex. They developed a tool for systematically evaluating ChIP-seq data (from mouse ESCs) to identify TF composite motifs and found that the Nanog–Sox2 motifs are in proximity to each other.10 ChIP-seq peaks of the Nanog–Sox2 motif have been observed in Zfp42, Klf5, Ncam1, and Myst4.6 Since there is no crystal structure showing the direct physical interaction of Nanog–Sox2, we modeled the complex based on the motif representation10 described by Hutchins et al. The model system actively included the mutant L122E, which enhances protein stability and DNA-binding affinity.10,11 Comparative study of the wild-type and mutant Nanog–Sox2 model systems against the Nanog crystal structure reveals the cooperative protein–protein and protein–DNA mediated interactions.
Molecular dynamics (MD) simulation is suitable for exploring the mechanism of a protein–protein/protein–DNA interface.12 Therefore, we conducted comparative studies of the hypothesis models (Nanog–Sox2 partnership) and Nanog crystal structure using MD simulation. Our results revealed the structural mechanism and changes of Nanog that are influenced by its cooperative binding partner, Sox2.
The formula for covariance matrix with elements Cij for coordinates i and j is given as
Cij = 〈(Xi − 〈Xi〉)(Xj − 〈Xj〉)〉 | (1) |
The root mean square deviation (RMSD) of the backbone atoms of the Ng-Crystal was constant throughout the simulation, whereas the Ng-WT and Ng-Mut complexes showed deviation in their backbone atoms (Fig. S1†). The superimposed structures of first, last and intermediate snapshots were displaying the fact that in both complexes, Nanog structures were moving towards DNA for better interaction (Fig. 2). The minimum distance between Nanog and Sox2 fluctuated from around 3.3 to 3.5 nm for both the complexes (Fig. S2†). Even though the distance between Sox2 and Nanog was ∼3.4 nm, the hydrogen bond interactions between Nanog and Sox2 were observed to be zero in Ng-WT, whereas in Ng-Mut only one interaction had been observed between Arg76 of Sox2 with Gln135 of Nanog. However the mutant residue E103 does not take part in any of the interaction with Sox2 (Table 1). Table 1 lists the residues that are making both protein–protein and protein–DNA interactions in all three systems. The subtle difference between the number of hydrogen bonds are critical as the energy contribution for individual hydrogen bond can be from −1.5 kcal mol−1 per hydrogen bond that can be translated to a roughly 10-fold difference in probability of two conformations.29,30 Thus, the breakage/formation of a single hydrogen bond has profound effect on protein–DNA stability.
a The protein residues (from Sox2 and Nanog) with the base pairs interactions have been tabulated, and the interacting distances are in Å. The protein–protein interaction marked with italic. | |||||||||
---|---|---|---|---|---|---|---|---|---|
Ng-WT (Sox2–Nanog) | Nil | ||||||||
Ng-Mut (Sox2–Nanog) | ARG76 | NH1 | GLN135 | O | 3.232 Å | ||||
Ng-WT (Sox2–DNA) | Ng-WT (Nanog–DNA) | ||||||||
SER31 | HB1 | DC4 | O2 | 2.542 | SER31 | HB2 | DT5 | O4′ | 2.706 |
LYS35 | HZ1 | DT5 | H5′ | 3.142 | SER34 | HB2 | DT5 | H2′ | 2.403 |
SER31 | HB2 | DC4 | H1′ | 3.34 | LYS35 | HG1 | DT5 | H2′ | 3.468 |
LYS35 | HE2 | DT6 | O1P | 2.316 | GLY38 | CA | DT6 | H4′ | 3.215 |
PHE10 | HZ | DT6 | H | 2.889 | MET11 | HE3 | DT6 | O2 | 3.232 |
MET11 | HE2 | DT6 | H3 | 3.417 | PHE10 | CZ | DT6 | H2′ | 3.035 |
GLY38 | HA2 | DT6 | H2′ | 3.334 | GLY38 | HA2 | DT7 | H5′ | 3.418 |
LYS42 | HZ3 | DT7 | O1P | 3.28 | TRP41 | HD1 | DT7 | H5′ | 2.936 |
MET11 | HE1 | DT7 | O2 | 2.806 | TRP41 | HD1 | DT7 | C4′ | 3.053 |
TRP41 | HD1 | DT7 | C3′ | 3.327 | ASN8 | ND2 | DT7 | O2 | 3.356 |
ASN8 | HD2 | DG8 | C1′ | 3.397 | ARG5 | NH1 | DG8 | N2 | 3.188 |
MET11 | HE2 | DA32 | N3 | 3.447 | TYR72 | HE2 | DC10 | O2 | 2.546 |
MET7 | HG1 | DA32 | H2′ | 3.352 | TYR72 | HD2 | DC10 | O2 | 3.225 |
LYS4 | HZ2 | DA33 | P | 2.359 | PRO74 | HG2 | DC10 | O2 | 3.104 |
ASN30 | H | DA36 | O | 2.88 | ARG76 | HH2 | DT12 | P | 2.423 |
HIS29 | HD2 | DA36 | C1′ | 3.033 | LYS77 | HZ3 | DG29 | H4′ | 3.35 |
HIS29 | HD2 | DA36 | C4′ | 3.296 | PRO74 | HG2 | DG29 | N2 | 3.073 |
MET28 | C | DA36 | H | 3.466 | ARG75 | O | DA30 | C4′ | 3.241 |
Ng–Mut (Sox2–DNA) | Ng–Mut (Nanog–DNA) | ||||||||
SER31 | HB1 | DC4 | H1′ | 3.342 | LYS121 | HZ2 | DG8 | O2P | 1.775 |
LYS35 | HE2 | DT5 | H5′ | 2.273 | GLN125 | HE2 | DG8 | C2′ | 3.041 |
SER31 | CA | DT5 | H1′ | 3.007 | TYR100 | HE2 | DT9 | O1P | 3.277 |
SER34 | HB2 | DT5 | C1′ | 3.277 | ARG128 | HH2 | DT9 | O2P | 1.839 |
LYS35 | HE2 | DT5 | C3′ | 3.293 | GLN125 | NE2 | DT9 | O2P | 2.746 |
GLY38 | HA1 | DT6 | C4′ | 3.448 | ARG128 | HE | DT9 | O2P | 2.201 |
PHE10 | CE2 | DT6 | H1′ | 3.245 | GLN125 | HG2 | DT9 | C5 | 3.44 |
SER34 | HB1 | DT6 | H1′ | 2.76 | MET129 | SD | DC10 | O5′ | 3.373 |
ARG5 | HD2 | DC31 | H4′ | 2.088 | MET129 | HE1 | DC10 | H3′ | 2.63 |
TYR72 | CE1 | DC31 | H4′ | 3.139 | ARG133 | HD2 | DT24 | H5′ | 3.246 |
ARG75 | HH2 | DC31 | O2P | 2.407 | LYS130 | CE | DC25 | O2P | 3.259 |
HIS29 | ND1 | DC37 | H4′ | 3.383 | ARG133 | HD2 | DC25 | O2P | 3.341 |
ASN30 | HB1 | DA36 | O4′ | 3.256 | LYS130 | HZ3 | DC25 | O5′ | 3.367 |
ARG75 | NH2 | DC31 | O5′ | 3.341 | TRP123 | HD1 | DA26 | O2P | 2.875 |
TYR72 | CE1 | DC31 | H5′ | 3.372 | THR81 | H1 | DA26 | H4′ | 2.54 |
TYR72 | HE1 | DC31 | H5′ | 2.318 | GLN119 | NE2 | DA26 | H3′ | 3.437 |
ARG75 | HE | DC31 | P | 2.518 | THR81 | H2 | DA26 | O3′ | 2.847 |
GLN119 | HE2 | DA27 | O1P | 3.305 | |||||
THR122 | HG2 | DA26 | C8 | 3.186 | |||||
ASN126 | HD2 | DA27 | H61 | 3.193 |
Furthermore, the lowest energy structure was taken based on the free energy landscape (FEL) energy values, and the interaction patterns were observed (Fig. S3†). The importance of the mutant residue (L103E) was monitored very carefully by including the simulation of Ng-Crystal–Mut structure also (Fig. S4 and S5†). The radius of gyration shows that the Ng-Crystal–WT complex was observing very high compact conformation compared with the Ng-Crystal–Mut complex; same has been reflected in the RMSF profile also (Fig. S4†). The interaction profile of L103E residue against DNA was observed and it was evident that for Ng-Crystal–Mut and Ng-Crystal–WT complexes alone was experiencing a direct interaction of E103 with the DNA bases at less than 4 Å, whereas the other complexes (Ng-WT and Ng-Mut) didn't observe this specific interaction (Table 1 and S1†). Although the minimum distance between Nanog and DNA was ∼2.8 nm for both the complexes (Fig. S5A†), both Ng-WT and Ng-Mut failed to make direct interaction between E103 and DNA base pair.
The alignment of charged residues from Nanog helix H3 with the plane of the DNA was observed and distinguished throughout the simulation. In the Ng-WT system, all residues (K118, T122, and Q125) surveyed the A15, A16, and T17 bp sequences except M129 residue (Fig. S6†). The M129 residue showed a stronger interaction with A16 bp, and thus its movement to other bp sequences was restricted. In contrast, in the Ng-Mut system, more residues from helix H3 (K118, Q119, K121, T122, Q125, R128, and M129) interacted with its corresponding DNA bps, and their movements towards the DNA bp sequences were higher compared to Ng-WT (Fig. S7†).
Further, we evaluated the cause of such conformational behavior by determining residue-wise level fluctuations along the two principal eigenvectors for the wild-type and mutant systems (Fig. 4). Region-specific displacement of each residue was observed for each of the two principal eigenvectors. The L103E mutation contributed to the fluctuation of the surrounding 96Q, 97R, 98Q, and 99K residues, whereas the L103E residue itself showed little fluctuation. The important residues were observed to interact with DNA in the crystal structure, but some residues, such as K118, Q119, K121, T122, Q124, Q125, R128, and M129, showed large fluctuations in the Ng-Mut system, as shown in Fig. 4. Eigenvector 2 values showed fluctuations of most residues in the Ng-Mut complex (Fig. 4), which may be because of the L103E mutation. This residual fluctuation may account for the divergence in the conformational behavior of both systems.
These scattered conformations visited by the Ng-WT and Ng-Mut proteins were further verified by drawing a porcupine graph (Fig. 5). Both the systems displayed similar as well as distinct contradicting movements with respect to each other. The dominant motions displayed by Nanog in Ng-WT and Ng-Mut were found to be similarly oriented, however, the intensity of motion in Ng-Mut was higher towards the DNA. The residues of Nanog showed more harmonious movements in Ng-WT, whereas, such harmony was reduced in Ng-Mut. Sox2 exhibited the similar motion in both the systems, whereas the movement of Sox2 in Ng-Mut complex is less. The residues in the helix 3 region of Nanog showed less movement towards the DNA, enabling non-specific binding. Thus, the L103E mutation may have influenced the essential motions of the surrounding atoms, facilitating their non-specific binding, thereby increasing the stability. In the absence of Sox2, the residual movement of Nanog showed a complete incoherence as indicated by the arrows pointing away from the DNA. This may suggest that the binding of neighboring protein can drastically influence the Nanog binding and functional activity.
The protein–DNA interface involving the major third helix was significantly altered in the Ng-WT and Ng-Mut systems compared to the Ng-Crystal. Most interactions disappeared because of Sox2 binding. Except for T122, K125, and M129 residues, the other residues failed to maintain its stronger interactions with its DNA. Hence, it could be understood that the binding of Sox2 altered the stability of Nanog binding to its DNA binding site.
Sox2 is known to bend DNA, and examination of protein-induced DNA bending is necessary to gain insight into DNA's structural deformation. As a result of this bending nature, the binding of Sox2 altered the conformation of B-form DNA into the non-standard B-form (or A-form). DNA can be classified based on various parameters such as twist (A = 33; B = 36), rise (A = 2.56A; B = 3.38A), roll (A = 6; B = 0) and slide (A < −0.8; B > −0.8)32,33 (Fig. S8†). The crystal structure without Sox2 showed an average bending angle of approximately 22°, whereas Ng-WT and Ng-Mut possessing Sox2 adjacent to the Nanog binding site showed a bend angle of approximately 50–60°. The bend angle was lower in Ng-WT initially, however, towards the end of simulation, both complexes showed a bend angle of similar range. The binding of Sox2 adjacent to Nanog caused the DNA to be in a nonlinear state. Even though the average bending value in Ng-Mut (∼55) was higher than Ng-WT (∼50), the fluctuation was quite less than the other.
The entropy of the protein–DNA complex was calculated by superposition of all frames using the heavy atoms (non-hydrogen) atoms as a reference. The entropy has been extracted from the covariance matrices using quasi-harmonic approximation after removing rotational and translational movement. The entropy values of all three systems were increasing until it reaches a plateau (Fig. S9†). The configurational entropy values were higher in Ng-WT and Ng-Mut compared to the crystal structure that may be due to the number of atoms involved (Ng-Crystal system lacks Sox2). The specific tight binding of the Nanog in the crystal structure required less configurational subspace, whereas Sox2 binding influenced the Nanog-bound structure by recruiting a larger configurational subspace, resulting in significantly higher entropy values.
Although the binding of Sox2 in both cases (Ng-WT and Ng-Mut) influenced the binding orientation of Nanog with its DNA and sampled less binding energy, the complex remained aligned with its native structure via protein–protein and protein–DNA interactions (Fig. 2, 3 and Table 1). The Ng-Mut complex is maintaining a single hydrogen bond between Nanog and Sox2 (Arg76 of Sox2 with Gln135 of Nanog), whereas the Ng-WT failed to do so (Table 1). In addition to that, Hayashi et al., studied the Nanog L122A mutation which enhance the DNA binding affinity in Oct4 promoter region brings up the fact that the mutation of Lys122 is very much important for Nanog reprogramming.11 Similarly, our mutant complex (L102E) was showing the better affinity with its binding partner as well as DNA (Fig. 5 and Table 1); however, both the complexes are existing with good number of interaction with their DNA (Table 1).
Differential responses of Ng-WT and Ng-Mut because of Sox2 binding may be correlated to the localized protein motions when the systems were analyzed using PCA. The internal motion of Ng-WT was limited to a subspace with fewer dimension compared to Ng-Mut, whereas the internal motion of the crystal structure was negligible (Fig. 3C). The Ng-WT and Ng-Mut systems largely remained in one conformational space indicating lower energy, while the infrequent transition to different space for other conformations, though fewer, but have been observed for these complexes (Fig. 3A and B). The porcupine graph shows that the protein dominant movements in the Ng-WT and Ng-Mut systems were different; the protein residue movements in Ng-Mut were more coherent with lower magnitude, and the movements of Nanog residues were supportive to each other in DNA binding, whereas, Ng-WT showed higher degree of residual movement away from DNA (Fig. 5). The L103E mutation may have influenced the essential motions of the surrounding atoms, facilitating non-specific binding of Nanog and increasing stability (Fig. 4). Similarly, non-specific binding of lactose repressor showed fluctuating residual movements,36,37 suggesting that Sox2 alters the specific binding to become non-specific.
The RMSF of the DNA bases dynamics decreased in the presence of Sox2 protein molecule (Fig. 6). However, the configurational entropy of the Ng-WT and Ng-Mut systems revealed higher energy requirements compared to the Ng-Crystal system (Fig. S9†), suggesting that the Sox2–Nanog bound DNA structure and its transcriptional regulation is achieved through concerted modulation of DNA-mediated interactions.
The largest changes in the conformational entropy of a protein arise from the energetic restraints from the backbone and side chain groups.34 The configuration of Ng-Crystal system is lower due to the absence of Sox2, thus, the entropy of the simulated crystal structure was lower than those of the model systems (Ng-WT and Ng-Mut) (Fig. S9†). Although, it is almost impossible to calculate the absolute entropic values, however, a qualitative observation can be useful to assign the energetic state to each molecule. Ng-Mut has slightly more entropy that allowed the complex to visit more states, and this can be attributed to the addition of a charged residue. As the entropy difference is no more than 1 kcal mol−1 K−1, both systems essentially visited similar meta-stable states.
Cooperative binding of Sox2 forced the system to use more configurational subspace and energy. The helix–turn–helix region of the HMG domain bent the DNA to approximately 50–60° and the bending nature of the Sox2 molecule is necessary for its activity,38 and to provide better stability.14,39 Ng-Crystal showed a constant bending angle, whereas Nanog bound to Sox2 showed an increased bending angle, revealing that the systems require a stable bending angle to stabilize the conformation compared to the DNA-bound structures; additionally, apart from this DNA-mediated protein–protein interaction, the system may require a DNA-independent interaction.6
The Ser-rich region and the transactivation domain of Nanog are unconstrained regions, where modeling and simulating the whole length TAD protein is nearly impossible and may require very long simulation time with supercomputing facilities. We tried to model this domain, however, lack of suitable template and abundance of non-structured region hindered its proper modeling. Therefore, a low confidence protein model might have a spurious effect in simulation and may invalidate the overall conclusion of this study. Moreover, a common way for the DNA binding proteins to interact is through non-DNA binding domains and this has been reported for Nanog–Sox2 interaction as well.6 Both, the Sox2 and Nanog have multiple domains such as HMG, transactivation domain, and Ser-rich motifs for Sox2, and DNA binding domain, transactivation domain, and Trp repeats for Nanog. The experimental evidence suggest that the major interaction between Nanog and Sox2 is governed by the sequences of non-DNA-binding domains (through the transactivation domains). However, the SELEX (Systematic evolution of ligands by exponential enrichment) results suggested that the interaction occurs in a specific spatial relationship of the DNA-binding domains of these proteins.10 Sox2 can interact with Oct4 over the DNA as reported by Merino et al., 2014,40 so the role of DNA-based interaction cannot be ruled out in case of Sox2–Nanog interactions.41 Moreover, the bending of DNA by Sox2 has drastic influence over the ability of transcription factors to induce transcription of target genes.38 Other findings that the bending or conversion to non-standard DNA is desirable and supports the notion that Sox2 not only interacts through its TAD but also facilitate the Nanog transcriptional ability by indirect physical interaction through bending of DNA. As it has already been reported that the DNA-independent interaction is governed by the transactivation domain of Sox2 and tryptophan repeat (WR) domain of Nanog,6 these two proteins may facilitate better and stronger binding via fewer DNA-mediated interactions, which is driven by their DNA-binding domains of the proteins. The overall interactions and the energy profile obtained for Ng-Mut is favoring for its better stability than the Ng-WT, which is correlating with the experimental data. Our MD simulation results explains the interactions between Nanog and Sox2 through their DNA-binding domains, and suggest that, despite of this weaker DNA-independent interaction profile, Nanog–Sox2 cooperativity via DNA-binding domains are also necessary for its better and stable interaction profile.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c8ra10085c |
This journal is © The Royal Society of Chemistry 2019 |