Alfinda Novi Kristanti*ab,
Nanik Siti Aminahab,
Imam Siswantoac,
Yosephine Sri Wulan Manuharabd,
Muhammad Ikhlas Abdjanae,
Andika Pramudya Wardanaae,
Ei Ei Aungaf and
Yoshiaki Takayag
aDepartement of Chemistry, Faculty of Science and Technology, Universitas Airlangga, Surabaya 60115, Indonesia. E-mail: alfinda-n-k@fst.unair.ac.id; nanik-s-a@fst.unair.ac.id
bBiotechnology of Tropical Medicinal Plants Research Group, Universitas Airlangga, Indonesia
cBioinformatic Laboratory, UCoE Research Center for Bio-Molecule Engineering, Universitas Airlangga, Surabaya, Indonesia
dDepartment of Biology, Faculty of Science and Technology, Universitas Airlangga, Surabaya 60115, Indonesia
ePhD Student of Mathematics and Natural Sciences, Faculty of Science and Technology, Universitas Airlangga, Komplek Kampus C UNAIR, Jl. Mulyorejo, Surabaya, 60115, Indonesia
fDepartement of Chemistry, Yadanarbon University, Amarapura Township, Mandalay, Myanmar
gFaculty of Pharmacy, Meijo University, 150 Yagotoyama, Tempaku, Nagoya, 468-8503 Japan
First published on 13th July 2022
The human estrogenic enzyme 17beta-hydroxysteroid dehydrogenase type-1 (HSD17B1) provides biosynthesis regulation of active estrogen in stimulating the development of breast cancer through cell proliferation. The β-sitosterol is classified as a steroid compound and is actually a type of triterpenoid compound that has a similar structure to a steroid. This similarity provides a great opportunity for the inhibitor candidate to bind to the HDS17B1 enzyme because of the template similarity on the active site. Several in silico approaches have been applied in this study to examine the potential of these two inhibitor candidates. Pharmacokinetic studies showed positive results by meeting several drug candidate criteria, such as drug-likeness, bioavailability, and ADMET properties. A combination of molecular docking and MD simulation showed good conformational interaction of the inhibitors and HSD17B1. Prediction of binding free energy (ΔGbind) using the Molecular Mechanics-Generalized Born Surface Area (MM-GBSA) approach shows ΔGbind (kcal mol−1) of C1–HSD17B1: −49.31 ± 0.23 and C2–HSD17B1: −33.54 ± 0.34. Meanwhile, decomposition energy analysis (ΔGresiduebind) suggested several key residues that were also responsible for the interaction with inhibitors, such as C1–HSD17B1 (six residues: Leu96, Leu149, Pro187, Met193, Val225, and Phe226) and C2–HSD17B1 (four residues: Ile14, Gly94, Pro187, and Val188). Hopefully, the obtained results from this research could be considered for the mechanistic inhibition of the HSDS17B1 enzyme at molecular and atomistic levels.
Several research projects are looking for candidate inhibitors that can block the process of estrogen biosynthesis through enzyme HSD17B1 inhibition.4,9,10 There are several inhibitors with known activity (Km) as HSD17B1 inhibitors, such as dihydrotestosterone (DHT): 32 ± 9 μM, and androstandione: 26 ± 9 μM.4 It should be noted that both inhibitors have a similar structure, which is often known as a “mimic structure”. The structural similarity is expected to have a better chance of binding to the targeted protein because of the template similarity on the active site.11 Thus, it is necessary to select a candidate inhibitor based on these considerations. Several natural product compounds can offer this feature, including triterpenoid and steroid groups as secondary metabolic compounds.12,13
Research on natural products as inhibitor candidates shows promising prospects in finding a potential inhibitor of the HSD17B1 enzyme.14 Additionally, natural products have provided good activity in inhibiting breast cancer cells.14,15 Some of them are β-sitosterol and oleanolic acid compounds, which were isolated and characterized from the stem bark of Syzygium aqueum in a previous study.15 The structural similarity of these compounds to dihydrotestosterone (a known HSD17B1 inhibitor) is expected to mean they will interact on the active site of the HSD17B1 enzyme. Structurally, it has –OH and –COOH groups at positions C3 (β-sitosterol) and C3, C28 (oleanolic acid) (Fig. 1). These groups increase the possibility of interacting with water molecules on the enzyme active site.
Additionally, the presence of these groups can allow the formation of hydrogen bond interactions (donor or acceptor) with key residues on the enzyme active site at the molecular level.
Theoretical studies using an in silico approach provide an alternative way of finding candidate compounds that have potential as HSD17B1 inhibitors at the molecular level.16,17 A combination of molecular docking and molecular dynamics (MD) simulation provides a comprehensive structure-based approach to studying the interaction of the inhibitor with the HSD17B1 enzyme.16–18 Structure-based studies can provide a more detailed description of the inhibition mechanism through interactions with amino acid residues on the active site of the targeted protein. Molecular docking provides good calculation efficiency for inhibitor coordination on the enzyme active site.19,20 Meanwhile, the MD simulation offers a comprehensive analysis of the interaction between an inhibitor and the targeted protein during the simulation time.17,21 The evaluation proceeds by several considerations, such as grid-score20 and binding free energy,22 aiming to see the binding affinity of inhibitor to HSD17B1. In addition, an evaluation of toxicity and drug-likeness provides an initial description of the pharmacokinetic properties of each inhibitor.23–25 Several considerations of the in silico approach offered in this research are expected to explain the inhibition mechanism of the HDS17B1 enzyme by β-sitosterol and oleanolic acid at the molecular level.
Grid-score = EvdW + Eele | (1) |
ΔGbind = ΔGligand–HSD17B1 − (ΔGHSD17B1 + ΔGligand) | (2) |
ΔGbind = ΔGgas + ΔGsol − TΔS | (3) |
ΔGbind = ΔEbonded + ΔEvdW + ΔEele + ΔGnonpolarsol + ΔGelesol | (4) |
The ADMET study provides a clear image of a drug candidate's effects on the body. This parameter plays an important role in drug discovery.23–25 Prediction of ADMET properties shows that each compound shows promising potential (Table S2†). This is indicated by the value of Caco-2 permeability > 0.90 and intestinal absorption-human > 30% (+HIA).23 These results showed that the compounds were absorbed into the body very well. The distribution of C1 compound showed that it was able to penetrate the blood–brain barrier (logBB > 0.3) and CNS permeability > −2logPS. It identified that this compound could affect the work of the central nervous system (brain). On the other hand, compounds TES and C2 do not have the potential to penetrate the blood–brain barrier. However, they have the opportunity to penetrate the central nervous system (CNS). On the other hand, the metabolic parameters showed that each compound did not have any effect on the activity of metabolic enzymes (cytochrome isoenzymes), such as CYP1A2, CYP2C19, CYP2C9, CYP2D6, or CYP3A4. Hopefully, the compounds will not interfere with the activity of metabolic enzymes when consumed by the body. Crucial parameters such as toxicity are key parameters in the study of ADMET properties. Interestingly, all compounds belong to the non-toxic category (non-AMES toxicity and non-skin sensitization). However, compound C2 shows a category of hepatotoxicity that can cause damage to liver cells. Therefore, it is necessary to develop and modify C2 compounds for medicinal purposes. The ADMET results indicated that initial data on pharmacokinetic predictions show the compounds have potential worth considering.
The optimized geometry of the candidate inhibitor was docked into the HSD17B1 active site. The results show that each candidate has a grid-score < grid-score reference, i.e., C1: −69.90 kcal mol−1 and C2: −61.46 kcal mol−1 (Fig. 3). This indicates that the inhibitor candidate has good interaction in the gas term.32 In detail, the results of the energy contribution (EvdW + Eele) in kcal mol−1 for each inhibitor were C1: EvdW = −70.25 and Eele = 0.35 and C2: EvdW = −55.99 and Eele = −5.47. A similar energy contribution was also shown by TES–HSD17B1 (Fig. 2E). The interaction energy based on the functional grid scoring (gas term) for each inhibitor showed good binding stability with the targeted protein. This can be identified through a grid score that has a negative value. The grid score (higher negative value) indicated more thermodynamically stable interaction between the inhibitor and the targeted protein. Overall, the contribution of van der Waals energy (EvdW) shows the largest contribution to the interaction between the inhibitor and HSD17B1 compared to electrostatic energy (Eele). Meanwhile, several residues are responsible for the inhibitor–HSD17B1 interaction (Fig. 2F and S1†), such as TES (7 residues: Leu149, Pro187, Tyr218, His221, Val225, Phe259, and Glu282), C1 (11 residues: Val143, Met147, Leu149, Tyr155, Cys185, Pro187, Val188, Tyr218, His221, Phe259, and Val283), and C2 (4 residues: Ser142, Leu149, Tyr155, and Gly186). Details of the overall interactions for each complex are provided in Table S3.† It should be noted that some of the parameters measured in the molecular docking are the initial conformations at the gas term. Therefore, the process of evaluating these results needs to be studied using MD simulation to get more comprehensive results. However, molecular docking is quite helpful in efficiently determining the initial coordinates for each complex.
Fig. 3 The molecular docking result of candidate–HSD17B1: the conformation of each candidate on the enzyme pocket area. |
Energy analysis in each system in the form of Etot, Epot, and Ekin aimed to see the convergence achieved during the simulation time. Notably, the contribution of the Etot parameter shows the contribution of Epot + Ekin.33 The results showed that each system had achieved good convergence. This can be identified by the absence of significant fluctuating changes in each system (Fig. S2†). The average value of the contribution of each energy is shown in Table S4.† A system with convergence can be used to continue the process of analyzing the compactness and stability.
System stability analysis is indicated by the complex RMSD value.21,34 The fact the simulation process reached 100 ns identified that each system has good stability. It is characterized by fairly good fluctuating stability from 15 ns to 100 ns (Fig. 4). Significant fluctuations occurred at the beginning of the simulation (0–200 ps) due to the heating process and increased gradually from 200 ps to 15 ns to reach an equilibrium process. Overall, each system has a stability difference that is not too significant. This is indicated by the average complex RMSD (nm) of TES–HSD17B1: 0.26 ± 0.03, C1–HSD17B1: 0.26 ± 0.03, and C2–HSD17B1: 0.27 ± 0.03. Based on our assumptions, the stability that occurs in the candidate inhibitor system is caused by having a similar structure to the reference ligand. Thus, the positions of the C1 and C2 structures were able to occupy the HSD17B1 enzyme pocket well. However, this assumption will be explained further in the next section by considering the binding affinity for each system.
Fig. 4 Trajectory analysis: the root-mean-square displacement of each complex plotted during the 100 ns of MD simulation. |
The structural compactness is based on the RoG parameter.35 The RoG analysis aims to see whether the structure is stably folded or not.
The calculation was carried out for 100 ns to see the RoG fluctuations shown in Fig. S3.† Changes in RoG fluctuations in each system did not show a significant difference. This is supported by the average values of RoG (nm), TES–HSD17B1: 19.73 ± 0.14, C1–HSD17B1: 19.79 ± 0.09, and C2–HSD17B1: 19.89 ± 0.11. Changes in the values of these fluctuations indicate that the structure of each system is well folded. This is corroborated by the average structure taken per unit time of 20 ns for each system during the simulation time of 100 ns (Fig. S4†). Overall, there was no significant change in structural conformation during the simulation time (0–100 ns) in each system.
The RMSF was calculated to identify the flexibility of the protein over the simulation time (Fig. S5†). The TES–HSD17B1 system showed higher fluctuation compared to the other systems. In summary, the flexibility trend for each system was TES–HSD17B1 > C2–HSD17B1 > C1–HSD17B1 (Table S4†). Higher fluctuation (∼2.00 nm) was shown by residues 130–133, 174–177, 199–201, 204–205, 208–210, 281, and 284–285 (alpha-helix and loop regions). Moreover, twelve key residues (Val143, Met147, Leu149, Pro187, Tyr218, His221, Val225, Phe226, Phe259, Leu262, Met279, and Val283) that affect the main thermodynamic force for binding were also analyzed. The fluctuation changes from the key residues on the active site showed that His221, Met279, and Val283 residues had higher fluctuations compared to other residues (Fig. S6†).
The description of several parameters shows that the dynamic conformation formed during the simulation time had reached a fairly good convergence and stability. Therefore, further analysis was considered in the form of binding affinity, hydrogen bonding, and water accessibility using the last 20 ns (80–100 ns) of the trajectories. This consideration was made based on the efficiency of calculations that are often used for simulation analysis.21,34 It should be noted that this consideration is useful if the complex RMSD has achieved good enough stabilization.
Energy (kcal mol−1) | TES–HSD17B1 | C1–HSD17B1 | C2–HSD17B1 |
---|---|---|---|
ΔEvdW | −40.99 ± 0.27 | −57.84 ± 0.24 | −45.93 ± 0.34 |
ΔEele | −9.36 ± 0.29 | −4.66 ± 0.33 | −15.03 ± 0.41 |
ΔGgas | −50.35 ± 0.33 | −62.51 ± 0.36 | −60.97 ± 0.43 |
ΔGelesol | 23.12 ± 0.24 | 20.52 ± 0.30 | 32.84 ± 0.28 |
ΔGnonpolarsol | −5.05 ± 0.01 | −7.33 ± 0.02 | −5.41 ± 0.02 |
ΔGsol | 18.07 ± 0.24 | 13.19 ± 0.29 | 27.42 ± 0.28 |
ΔGbind | −32.28 ± 0.25 | −49.31 ± 0.23 | −33.54 ± 0.34 |
The energy contributions at C1 and C2 show that ΔGbind is promising as an inhibitor of the HSD17B1 enzyme. It is taken into consideration that the candidate ΔGbind (C1 and C2) is stronger than that of the native ligand ΔGbind as a reference (a higher negative value). Thus, the candidate inhibitors show promising potential to inhibit the HSD17B1 enzyme. It is hoped that candidates with good binding affinity will be able to bind strongly to the active site of the targeted protein.36 The goal is to block the active site of the HSD17B1 enzyme, which is responsible for regulating the supply of active estrogen as fuel for breast cancer cell development.7,8
Analysis of binding free energy (ΔGbind) showed the energy trend (kcal mol−1) of C1–HSD17B1 (−49.31 ± 0.23) < C2–HSD17B1 (−33.54 ± 0.34) < TES–HSD17B1 (−32.28 ± 0.25) based on the MM-GBSA approach. The binding free energy cannot be separated from the effect of the energy contribution in gas (ΔGgas) and solution (ΔGsol) terms. In general, the energy contribution in the gas term (ΔEvdW + ΔEelec) has advantages compared to the solution term (ΔGnonpolarsol + ΔGelesol). This is due to the unfavorable contribution of the polar solvation free energy (ΔGelesol) of the Generalized Born (GB) solvent model of each complex. In previous reports, the dielectric constant strongly influenced the calculation of ΔGelesol and the variable dielectric model showed potential power.28,37 However, the energy contribution of ΔGgas + ΔGsol showed quite promising results for each inhibitor.
Specifically, free energy key residues play a crucial role in observing the inhibitory mechanism of the HSD17B1 enzyme in calculating energy decomposition (ΔGresiduebind). The key residues with a promising free energy contribution were evaluated based on ΔGresiduebind ≤ −1.00 kcal mol−1 criteria.38 Calculation of ΔGresiduebind was performed using 100 snapshots on the last 20 ns of the trajectories via dcomp tools. The results (Fig. 5) showed that there are key residues that have stronger free energy in each complex, including TES–HSD17B1 (five residues: Leu149, Pro187, His221, Ser222, and Val225), C1–HSD17B1 (six residues: Leu96, Leu149, Pro187, Met193, Val225, and Phe226), and C2–HSD17B1 (four residues: Ile14, Gly94, Pro187, and Val188). These results identify that these key residues interact with inhibitors on the HSD17B1 pocket. The key residues Leu149 and Pro187 were highlighted in the inhibition mechanism of the HSD17B1 enzyme. Previous research reported that the Leu149 residue plays a role in steroid discrimination and modulates DHT levels in breast tissue.4 Additionally, the residue of Pro187 is a hydrophobic and aromatic residue that contributes to the main thermodynamic force for the binding mode. Therefore, the interaction evaluation of the inhibitors with key residues plays a crucial role in understanding the inhibition mechanism. Specifically, the energy contribution ΔGresiduebind is affected by the energy contribution in the gas and solution terms, ΔEvdW + ΔGnonpolarsol(GB) and ΔEele + ΔGelesol(GB) provided by Fig. 6. We can see that the energy contribution ΔEvdW + ΔGnonpolarsol(GB) is able to stabilize the free energy of each residue. It is inseparable from the contribution of ΔEvdW energy in binding mode key residues39
Fig. 5 The residual energy decomposition was plotted along with the simulation over the last 20 ns of each complex. |
The H-bond number parameter shows the number of H-bond interactions between ligand and protein. The results of the analysis during the last 20 ns of simulation time showed TES–HSD17B1 had three H-bonds, C1–HSD17B1 had three H-bonds, and C2–HSD17B1 had four H-bonds. The obtained number of H-bonds was used to look at occupation levels. This procedure aimed to see how long the bond was recorded during the simulation time. Surprisingly, the C1-HSD17B1 complex has only two H-bonds with a percentage occupation <10.0%. Meanwhile, the C2-HSD17B1 complex has four H-bonds, with two H-bonds having occupancy percentages >90.0%. Several reports state that the criterion for a strong category of H-bonding is H-bond ≥70%.42,43 This explains that the H-bond interaction which has the highest percentage occupation deserves to be considered in the ligand–protein interaction. In detail, the TES–HSD17B1 interaction involves two H-bonds: (i) O17⋯HN(His221) with 20.8%, 2.97 Å, HBA and (ii) O17–H⋯O(Glu282) with 15.9%, 2.76 Å, HBD. The interaction of the H-bond with these two residues (His221 and Glu282) showed good correlation with the molecular docking results (Fig. 2F) and the results of previous studies.4 Meanwhile, the interactions of the H-bond in C1–HSD17B1 were (i) O3⋯H–O(Tyr155) with 4.45%, 3.08 Å, HBA, and (ii) O3–H⋯O(Glu194) with 2.45%, 2.78 Å, HBD. Additionally, the interactions of the H-bonds in C2–HSD17B1 were (i) O28a⋯HO(Tyr155) with 94.30%, 2.85 Å, HBA, (ii) O28b-H⋯O(Gly186) with 94.05%, 2.93 Å, HBD, (iii) O28b⋯HO(Ser142) with 9.10%, 3.17 Å, HBA, and (iv) O28a⋯HO(Ser142) with 4.50%, 3.12 Å, HBA. In particular, the C2–HSD17B1 complex has two strong H-bond categories with the % occupation of H-bonds ≥70%. As explained, the presence of the –OH and –COOH groups contributes to the H-bond interaction. This further strengthens the existence of this group playing an important role in ligand–protein interactions at the atomistic level.
The water molecule contained in the HSD17B1 active site needs to be studied for its distribution process during the simulation time. This aims to see the opportunities for water molecules to approach the ligand interface. In the previous section, the presence of the –OH and –COOH functional groups played an important role in the interaction with water molecules, in particular, the analysis of the oxygen (O) atoms in each ligand. Therefore, the possibility of water molecules approaching the oxygen atom can be analyzed through the radial distribution function (RDF).38 Analysis of this parameter was undertaken through the integration number on the first minimum value of the oxygen atom in each ligand.43 The TES–HSD17B1 complex showed that O3 atoms were more exposed to water molecules than O17 atoms. This can be seen in the atomic distance of O3 (n(r): 2.15 and g(r): 0.22) at the first peak, which is 0.35 nm. On the other hand, the O17 atom shows almost no hydration peak at all. Meanwhile, complexes C1–HSD17B1 and C2–HSD17B1 showed that water molecules were well distributed around the oxygen atom interface of each inhibitor. Except that O28b on the C2 inhibitor shows rather low access of water molecules at the interface. Meanwhile, the remaining atoms in each inhibitor C1 (O3) and C2 (O3 and O28a) showed promising access to water molecules (Fig. 8). In detail, the parameters measured for each atom are C1 (O3): distance = 0.33 nm, n(r) = 2.54, and g(r) = 0.37; C2 (O3): distance = 0.34 nm, n(r) = 2.51 and g(r) = 0.32; and C2 (O28a): distance = 0.36 nm, n(r) = 1.03 and g(r) = 0.04. The RDF analysis provides information about how the water molecules approach the oxygen atom in each ligand. The nonzero value at the first minimum peak indicates a high level of water transfer in the first solvation shell. These values are observed at TES (O3), C1 (O3), and C2 (O3 and O28a). This statement is supported by the value of n(r) being within ∼2.1–2.5 at TES (O3), C1 (O3), and C2 (O3). In particular, C2 (O28a) has a consistent value, being stably solvated by one molecule of water. This can be seen through the first minimum distance at ∼0.4 nm. As mentioned earlier, the oxygen atoms at TES (O17) and C2 (28b) have low hydration peaks. It can be seen that the first minimum of the two oxygen atoms is zero. This explains that there is no water transfer in the first solvation shell. Additionally, this condition also states that no water molecules are exposed at the interface of both oxygen atoms. Taking together all the findings discussed above, this explains that the presence of –OH and –COOH groups in each ligand can provide greater opportunities for interaction with water molecules. Based on the consideration of several parameters (SASA and RDF) above, the water accessibility of each system shows that the distribution of solvent molecules is quite well exposed.
Footnote |
† Electronic supplementary information (ESI) available. See https://doi.org/10.1039/d2ra03092f |
This journal is © The Royal Society of Chemistry 2022 |