Wataru Murakamia,
Masahiro Yamamotob,
Kazuo Edaa and
Toshiyuki Osakai*a
aDepartment of Chemistry, Graduate School of Science, Kobe University, Nada, Kobe 657-8501, Japan. E-mail: osakai@kobe-u.ac.jp
bDepartment of Chemistry of Functional Molecules, Faculty of Science and Engineering, Konan University, Higashinada, Kobe 658-8501, Japan
First published on 11th June 2014
Recently, a non-Bornian model was successfully applied to evaluate the Gibbs energy of hydration (ΔG°hyd) for spherical ions (mainly inorganic ions). In this model, the long-range, Born-type electrostatic ion–solvent interaction is not explicitly included in the calculation of ΔG°hyd, since its contribution is small, whereas the short-range interaction, including Coulomb, polarization, and charge-transfer interactions, is considered as the dominant factor that determines the ΔG°hyd of ions. The ΔG°hyd scaled by the surface area of an ion can be given by a quadratic function of the surface field strength (E) of the ion. In this study, the non-Bornian model was further applied to organic ions with charged groups. Using the Gaussian 09 program package, the geometries of ions in vacuum were optimized at the B3LYP/6-311++G(2d,p) level, and the partial atomic charges were computed in the Mulliken, Merz-Kollman (MK), natural population analysis (NPA), Hirshfeld, and ChelpG methods. Introducing a new subprogram, we could estimate local electric fields on the ion surface (van der Waals surface or solvent-accessible surface (SAS)). This enabled us to perform regression analyses based on the non-Bornian model, by using the experimental values of ΔG°hyd for 109 ions. When the NPA-SAS combination was chosen, the best regression result was obtained, giving the mean absolute error of 4.3 kcal mol−1. The non-Bornian model would provide a simple and relatively accurate way of determining ΔG°hyd of ions.
In computational methods such as molecular dynamics (MD) simulation10,11 and Monte Carlo (MC) simulation,10,12 the solvent is treated explicitly, i.e. taking into account the detailed structure of solvent molecules. Using these simulations combined with a thermodynamic integration technique13,14 or a free energy perturbation technique,15 ΔG°hyd can be calculated efficiently. On the other hand, a statistical mechanical integral equation method called the reference interaction site model (RISM)16,17 has also been utilized for calculating ΔG°hyd of ions.18–20
Previously, our group21,22 proposed a non-Bornian model for evaluation of the ion-transfer or resolvation energy of ions between two immiscible liquids (i.e., water and nitrobenzene). In the model, the short-range interactions between an ion and its adjacent solvent molecules were taken into account, while the long-range, Born-type electrostatic interaction for the outer bulk solvents was not explicitly included in the calculation. In the recent paper,23 we have successfully employed the non-Bornian model to evaluate ΔG°hyd for spherical ions, including 85 cations (with the charge number, z = 1 to 4) and 53 anions (with z = −1 and −2). In the proposed model,23 the ΔG°hyd scaled by the solvent-accessible surface24,25 (SAS) area of an ion is simply given by a quadratic function of the electric field strength (E) at the SAS. Three coefficients of the quadratic function of E should be associated with the short-range ion–solvent interaction, which includes Coulomb, polarization, and charge-transfer interactions. Although the coefficients could not accurately be determined in a theoretical manner, they were estimated by multivariate regression analyses for the above spherical cations or anions. The obtained regression equations were found to be useful for accurate prediction of ΔG°hyd. The performance of the non-Bornian model was compared with those of other existing models including Born,1 non-local electrostatic,7 and Poisson–Langevin26 models, showing that the non-Bornian model is one of the highest performance models for the prediction of ΔG°hyd for spherical ions.
So far, theoretical studies of ΔG°hyd have been mainly confined to those for the ions that are spherical or may be assumed as such (i.e., mainly for inorganic ions). Recently, however, experimental values of ΔG°hyd for organic ions have been evaluated by using thermochemical cycles that involve pKa values, gas-phase acidities, and ΔG°hyd values of the neutral species and hydrogen ion.27–29 Then, theoretical models for prediction of ΔG°hyd for organic ions have been proposed by previous authors.28–32
In this study, we have extended the non-Bornian model to organic ions with charged groups. In the extended model, a quantum mechanical calculation by Gaussian 09 (ref. 33 and 34) is used to evaluate partial atomic charges located at nuclear positions in an organic ion. Using a newly developed subprogram, local E values have been obtained at arbitrary points on the molecular surface (or SAS) of the ion. Since the E value is regarded as constant at each local point, the non-Bornian approach can be applied to the respective minute surfaces of the organic ion. The developed novel and simple algorithm has been found to be promising for relatively accurate evaluation of ΔG°hyd for organic ions.
Fig. 1 The proposed hydration model for an organic ion. The dashed line shows the SAS. For details, see the text. |
In the non-Bornian approach, the ΔG°hyd of an ion is regarded as the energy required for the formation of a nano-sized ion–solvent interface around the ion. The contribution from the long-range electrostatic interaction of an ion with water molecules in the outer solvation shell is not explicitly included in the theoretical framework. By generalizing the Uhlig formula,35 the ΔG°hyd of an ion is here expressed as
(1) |
γ12 = γ1 + γ2 − W12 | (2) |
γ12 = γ1 + γ2 + NUSR | (3) |
Substituting this equation into eqn (1) yields
(4) |
Referring to the energy decomposition analysis in ab initio self-consistent field molecular orbital studies,37–39 USR can be partitioned into Coulomb (COU), polarization (POL), charge-transfer (CT), and exchange (EX) terms:
USR = UCOU + UPOL + UCT + UEX | (5) |
As described in our previous papers,21–23 COU, POL, and CT terms can be given by the function of E, i.e., UCOU = −μ < cosθ > E, UPOL = −(1/2)αE2, and UCT = −ζ0 − ζ1E − ζ2E2, respectively. Substituting these functions into eqn (5), we obtain
USR = −ζ0 + UEX − μ < cosθ > E − ζ1E − (1/2)αE2 − ζ2E2 | (6) |
Further, substituting eqn (6) into eqn (4), we obtain
(7) |
A = Nζ0 − NUEX − γ1 − γ2 | (8) |
B = Nμ < cosθ > + Nζ1 | (9) |
C = (1/2)Nα + Nζ2 | (10) |
As seen in eqn (9) and (10), coefficient B is due to the COU and CT interactions, while coefficient C is due to the POL and CT interactions.
In order to calculate ΔG°hyd based on eqn (7), the closed surface S0 has been divided into a finite number of minute surface elements (whose area is shown by Si with i being a finite positive integer). Then it has been assumed that the electric field strength at an arbitrary minute surface has a specific value of Ei (defined as the vertical component). If we also assume that coefficients A, B, and C are independent of either Si or Ei, the value of ΔG°hyd in eqn (7) can be obtained as
ΔG°hyd = ∑Si(−A − BEi − CEi2) = −A∑Si − B∑SiEi − C∑SiEi2 | (11) |
As shown in eqn (8)–(10), coefficients A, B, and C are defined in terms of several parameters for short-range ion–solvent interactions. However, it is rather difficult to estimate their theoretical values in an accurate manner. This is because the present non-Bornian model is based on some daring assumptions, e.g., neglect of the interaction between the first and outer solvation layers and of the lateral dipole–dipole interactions among the primary solvent molecules. It is also not easy to estimate the CT-interaction parameters, ζ0, ζ1, and ζ2. Consequently, coefficients A, B, and C have been determined by a regression analysis with the literature data of ΔG°hyd for organic ions.28
Atom | avdW (Å) | Ref. | Atom | avdW (Å) | Ref. |
---|---|---|---|---|---|
H | 1.20 | 40 | Rb | 3.04 | 41 |
He | 1.40 | 40 | Sr | 2.63 | 41 |
Li | 1.82 | 40 | Y | 2.20 | 41 |
Be | 1.38 | 41 | Zr | 1.96 | 41 |
B | 1.20 | 41 | Nb | 1.86 | 41 |
C | 1.70 | 40 | Mo | 1.76 | 41 |
N | 1.55 | 40 | Tc | 1.73 | 41 |
O | 1.52 | 40 | Ru | 1.81 | 41 |
F | 1.47 | 40 | Rh | 1.75 | 41 |
Ne | 1.54 | 40 | Pd | 1.63 | 40 |
Na | 2.27 | 40 | Ag | 1.77 | 40 |
Mg | 1.73 | 40 | Cd | 1.58 | 40 |
Al | 1.75 | 41 | In | 1.93 | 40 |
Si | 2.10 | 40 | Sn | 2.17 | 40 |
P | 1.80 | 40 | Sb | 2.03 | 41 |
S | 1.80 | 40 | Te | 2.06 | 40 |
Cl | 1.75 | 40 | I | 1.98 | 40 |
Ar | 1.88 | 40 | Xe | 2.16 | 40 |
K | 2.75 | 40 | Cs | 3.27 | 41 |
Ca | 2.41 | 41 | Ba | 2.71 | 41 |
Sc | 1.98 | 41 | La | 2.29 | 41 |
Ti | 1.80 | 41 | Hf | 1.94 | 41 |
V | 1.72 | 41 | Ta | 1.87 | 41 |
Cr | 1.67 | 41 | W | 1.77 | 41 |
Mn | 1.66 | 41 | Re | 1.75 | 41 |
Fe | 1.65 | 41 | Os | 1.83 | 41 |
Co | 1.64 | 41 | Ir | 1.77 | 41 |
Ni | 1.63 | 40 | Pt | 1.72 | 40 |
Cu | 1.40 | 40 | Au | 1.86 | 41 |
Zn | 1.39 | 40 | Hg | 1.55 | 40 |
Ga | 1.87 | 40 | Tl | 1.96 | 40 |
Ge | 1.77 | 41 | Pb | 2.02 | 40 |
As | 1.85 | 40 | Bi | 2.17 | 41 |
Se | 1.90 | 40 | Th | 2.20 | 41 |
Br | 1.85 | 40 | U | 1.86 | 41 |
Kr | 2.02 | 40 |
Because electric fields satisfy the superposition principle, the electric field strength at any point on a molecular surface can be obtained from the vector sum of the electric fields created from the point charges of all constituent atoms. This is exemplified in the scheme of Fig. 2, in which the ion is simplified as a two-atom ion. A contribution to the electric field at a minute surface from each point charge (qj, where j indicates a specific atom) can be given by Gauss' law: Eij = qj/(4πε0Rij2) where ε0 and Rij are, respectively, the permittivity of vacuum and the distance from atom j to minute surface i (i.e., Rij = avdW or avdW + Δa). In the present program, the electric field (Ei) on a minute surface is provided as the component vertical to the molecular surface (see again Fig. 2). An execution of the program provides two files in txt and RES formats. The former file describes Cartesian coordinates in three dimensions for all points indicating the minute surfaces and the corresponding Si and Ei values, while the latter one describes the calculation of numerical sums (i.e., ∑Si, ∑SiEi, and ∑SiEi2) appearing in eqn (11). Using these calculation values and the literature values of ΔG°hyd, we have performed regression analysis to obtain coefficients A, B, and C in eqn (11).
Cation | ΔG°hyd (kcal mol−1) | Anion | ΔG°hyd (kcal mol−1) | ||||
---|---|---|---|---|---|---|---|
Exp.a | Non-Bornianb | Diff.c | Exp.a | Non-Bornianb | Diff.c | ||
a Experimental data taken from ref. 28.b Theoretical values obtained from eqn (13) with the NPA charges and the E values at the SAS.c Differences between the experimental and theoretical values. | |||||||
H3O+ | −110.3 | −105.8 | −4.5 | OH− | −104.7 | −107.1 | 2.4 |
CH3OH2+ | −93.0 | −83.1 | −9.9 | HOO− | −97.3 | −91.7 | −5.6 |
CH3CH2OH2+ | −88.4 | −73.5 | −14.9 | O2− | −83.3 | −89.1 | 5.8 |
(CH3)2OH+ | −79.7 | −71.1 | −8.6 | HS− | −72.1 | −88.2 | 16.1 |
(C2H5)2OH+ | −71.5 | −61.8 | −9.7 | HC2− | −76.5 | −81.6 | 5.1 |
(CH3)2COH+ | −77.1 | −67.5 | −9.6 | CN− | −70.2 | −87.7 | 17.5 |
CH3C(OH)C6H5+ | −64.5 | −59.1 | −5.4 | CH3O− | −95.0 | −88.4 | −6.6 |
NH4+ | −85.2 | −97.3 | 12.1 | CH3CH2O− | −90.7 | −80.2 | −10.5 |
CH3NH3+ | −76.4 | −80.4 | 4.0 | CH3CH2CH2O− | −88.3 | −80.4 | −7.9 |
CH3(CH2)2NH3+ | −71.5 | −72.7 | 1.2 | (CH3)2CHO− | −86.3 | −76.1 | −10.2 |
(CH3)2CHNH3+ | −69.6 | −69.1 | −0.5 | CH3CH2CHOCH3− | −84.2 | −75.7 | −8.5 |
(CH3)3CNH3+ | −67.3 | −65.6 | −1.7 | (CH3)3CO− | −82.3 | −74.1 | −8.2 |
c-C6H11NH3+ | −68.7 | −65.7 | −3.0 | H2CCHCH2O− | −86.6 | −78.9 | −7.7 |
H2CCHCH2NH3+ | −72.0 | −71.9 | −0.1 | CH3OCH2CH2O− | −89.4 | −81.7 | −7.7 |
(CH3)2NH2+ | −68.6 | −69.9 | 1.3 | HOCH2CH2O− | −85.3 | −79.7 | −5.6 |
(C2H5)2NH2+ | −63.4 | −60.4 | −3.0 | C6H5CH2O− | −85.1 | −77.1 | −8.0 |
(n-C3H7)2NH2+ | −60.5 | −60.1 | −0.4 | CF3CH2O− | −77.5 | −71.4 | −6.1 |
(H2CCHCH2)2NH2+ | −61.6 | −59.6 | −2.0 | CH(CF3)2O− | −65.5 | −63.0 | −2.5 |
(CH3)3NH+ | −61.1 | −64.0 | 2.9 | CH3OO− | −93.2 | −86.1 | −7.1 |
(C2H5)3NH+ | −54.6 | −56.0 | 1.4 | CH3CH2OO− | −89.2 | −85.2 | −4.0 |
(n-C3H7)3NH+ | −50.9 | −55.9 | 5.0 | HCO2− | −76.2 | −79.1 | 2.9 |
C6H5NH3+ | −72.4 | −70.7 | −1.7 | CH3CO2− | −77.6 | −77.6 | 0.0 |
o-CH3C6H4NH3+ | −70.3 | −68.0 | −2.3 | CH3CH2CO2− | −76.2 | −76.3 | 0.1 |
m-CH3C6H4NH3+ | −69.6 | −70.3 | 0.7 | CH3(CH2)4CO2− | −74.6 | −81.2 | 6.6 |
p-CH3C6H4NH3+ | −69.8 | −70.6 | 0.8 | H2CCHCO2− | −74.0 | −74.9 | 0.9 |
m-NH2C6H4NH3+ | −65.8 | −69.8 | 4.0 | CH3COCO2− | −68.5 | −72.3 | 3.8 |
C6H5NH2CH3+ | −62.6 | −63.6 | 1.0 | CH2ClCO2− | −69.7 | −72.4 | 2.7 |
C6H5NH2CH2CH3+ | −62.2 | −59.7 | −2.5 | CHCl2CO2− | −62.3 | −68.5 | 6.2 |
C6H5NH(CH3)2+ | −57.2 | −60.0 | 2.8 | CF3CO2− | −59.3 | −64.4 | 5.1 |
p-CH3C6H4NH(CH3)2+ | −55.9 | −60.5 | 4.6 | C6H5CO2− | −71.2 | −73.1 | 1.9 |
C6H5NH(CH2CH3)2+ | −54.0 | −56.8 | 2.8 | C6H5O− | −71.9 | −69.4 | −2.5 |
C10H7NH3+ | −67.4 | −70.3 | 2.9 | o-CH3C6H4O− | −70.2 | −67.4 | −2.8 |
C2H4NH2+ | −70.9 | −73.6 | 2.7 | m-CH3C6H4O− | −71.1 | −70.2 | −0.9 |
C3H6NH2+ | −67.7 | −68.1 | 0.4 | p-CH3C6H4O− | −72.0 | −69.9 | −2.1 |
C4H8NH2+ | −66.0 | −65.3 | −0.7 | m-HOC6H4O− | −73.8 | −68.3 | −5.5 |
C5H10NH2+ | −64.2 | −63.1 | −1.1 | p-HOC6H4O− | −77.6 | −69.1 | −8.5 |
C6H12NH2+ | −63.3 | −61.6 | −1.7 | o-NO2C6H4O− | −60.1 | −66.3 | 6.2 |
C4H4NH2+ | −61.4 | −68.1 | 6.7 | m-NO2C6H4O− | −61.9 | −63.8 | 1.9 |
PyridineH+ | −61.1 | −64.8 | 3.7 | p-NO2C6H4O− | −57.8 | −63.1 | 5.3 |
C9H7NH+ | −56.0 | −60.0 | 4.0 | o-ClC6H4O− | −66.1 | −66.5 | 0.4 |
C4H8NHNH2+ | −66.0 | −63.4 | −2.6 | p-ClC6H4O− | −66.0 | −66.7 | 0.7 |
CH3CNH+ | −75.3 | −75.1 | −0.2 | CH2CHO− | −76.5 | −76.4 | −0.1 |
H2NNH3+ | −84.6 | −85.9 | 1.3 | CH3C(O)CH2− | −76.2 | −73.6 | −2.6 |
p-CH3OC6H4NH3+ | −71.2 | −74.4 | 3.2 | CH3CH2C(O)CHCH3− | −73.7 | −67.9 | −5.8 |
p-NO2C6H4NH3+ | −75.9 | −83.9 | 8.0 | NCNH− | −72.2 | −78.5 | 6.3 |
C4H8ONH2+ | −69.6 | −68.6 | −1.0 | CH2CN− | −66.6 | −74.2 | 7.6 |
CH3CONH3+ | −73.9 | −77.6 | 3.7 | C6H5NH− | −62.9 | −66.1 | 3.2 |
C6H5CONH3+ | −67.2 | −70.4 | 3.2 | p-NO2C6H5NH− | −57.4 | −62.3 | 4.9 |
(CH3)2SH+ | −64.5 | −68.3 | 3.8 | (C6H5)2N− | −54.6 | −62.5 | 7.9 |
(CH3)2SOH+ | −67.7 | −70.7 | 3.0 | CH3CONH− | −80.2 | −78.5 | −1.7 |
m-ClC6H4NH3+ | −74.7 | −72.1 | −2.6 | CH2NO2− | −76.5 | −75.9 | −0.6 |
p-ClC6H4NH3+ | −74.1 | −72.3 | −1.8 | CH3S− | −73.8 | −76.0 | 2.2 |
CH3CH2S− | −71.8 | −70.9 | −0.9 | ||||
CH3CH2CH2S− | −70.5 | −71.1 | 0.6 | ||||
C6H5S− | −63.4 | −65.3 | 1.9 | ||||
CH3S(O)CH2− | −67.7 | −74.5 | 6.8 | ||||
CCl3− | −54.1 | −61.6 | 7.5 |
In the above, we describe that coefficients A, B, and C in eqn (8)–(10) are assumed to be independent of Ei, i.e., the magnitude of the electric field strength at minute ion surfaces. However, these coefficients would be different between cations and anions, because the orientation of primary water molecules on the respective ion surfaces may be dependent largely on the sign of Ei (i.e., positive/negative). Fig. 3 exemplifies the distributions of Ei for (a) 4-nitrophenoxide anion and (b) 4-methoxyanilinium cation, which have been calculated with the NPA–SAS combination (vide infra). As seen in (a), the anion shows negative Ei values at all surface positions. On the other hand, as seen in (b), the cation shows positive Ei values at most surface positions, but shows slightly negative values (as indicated by pale blue) at around the oxygen atom of the methoxy group. In general, a cation can partially have anionic surface, and vice versa for an anion. Anyway, since the orientation of water molecules on positive and negative surfaces may be different, we used two different values for coefficients B and C, depending on the sign of Ei, and then rewrote eqn (11) as
(12) |
Multivariate regression analyses were performed on the experimental values28 of ΔG°hyd to determine the coefficients in eqn (12) (i.e., A, B+, C+, B−, and C−). The values of ∑Si, , , , and required for the regression analyses were obtained from the quantum chemical calculation described above. In the present study, we investigated the performance of regression analysis by using all combinations of five different partial atomic charges (Mulliken, MK, NPA, Hirshfeld, and ChelpG) and two different molecular surfaces (vdW surface and SAS). As shown in Fig. 4, we obtained the highest performance when using a combination of NPA charge and SAS. The regression equation obtained is
(13) |
Fig. 5 Scatter plots of the theoretical values of ΔG°hyd obtained using the non-Bornian model (i.e., eqn (13)) against the experimental values of ΔG°hyd for 109 ions (shown in Table 2). The combination of NPA–SAS has been employed. R2 = 0.9943. |
As shown in Fig. 4, generally higher regression performance was achieved in the use of SAS, rather than in the use of vdW surface, particularly when the Mulliken and NPA methods were used to evaluate partial atomic charges. This is in harmony with the previous regression results obtained for spherical ions.23 It is generally considered that the electrostatic interaction, including COU and POL interactions, makes a dominant contribution to ΔG°hyd; the SAS seems to be more suitable for evaluating such electrostatic interactions.
There is no “correct” method for assigning partial atomic charges, because they are not quantum mechanical observable.53 However, the NPA method (with SAS) has achieved the highest performance among the methods tested. This method generally overestimates ionic character, giving larger magnitude of atomic charges than the Mulliken method.54 This may give the NPA method an advantage for evaluating the electrostatic ion–solvent interaction as the main contribution to ΔG°hyd. As also seen in Fig. 4, the Mulliken method gave no good results for the present regression analysis. The Mulliken method, which is the default method in Gaussian 09,33,34 has been widely used for determining partial atomic charges, however its weakness is well known. The method tends to yield unnatural values at large basis sets,55 such as B3LYP/6-311++G(2d,p) employed in this study.
Thus, the MAE values reported by the previous authors are comparable with or somewhat lower than that obtained in this study. However, no rigorous comparison can be made, because the data sets employed in the previous studies are different with each other, though there are many ions in common. Nevertheless, we believe the performance of our non-Bornian model is not so bad, probably comparable with other previous models. In most of the previous models, a number of parameters have been adjusted so as to optimize the agreement with experimental values; some sort of parameters, such as “atomic surface tension”, have been optimized for each atom or atomic group. In our non-Bornian model, however, there are only five adjusting parameters (i.e., A, B+, C+, B−, and C− in eqn (12)), which are common to all atoms or atomic groups. It should be stressed that this very simple model can predict ΔG°hyd for various organic ions with fair accuracy. Using a higher-level quantum chemical calculation, the performance of the non-Bornian model would be further improved. In practice, a higher-level calculation at the M06-2X/6-311++G(2d,p) level slightly improved the result shown in eqn (13) (with MAE = 4.1 kcal mol−1; RMSE = 5.3 kcal mol−1; detailed data not shown), though the calculation time was 1.5 to 3 times longer than the above-shown calculation at the B3LYP/6-311++G(2d,p) level. The use of MP2 and CCSD, which were reported as more reliable than B3LYP,56 might provide a better or worse result than the present analysis.
We would like to add that in recent years, some other models with few adjusting parameters have been proposed for the evaluation of hydration energies of monovalent spherical ions and noble gas atoms57 and neutral and ionic solutes.58 Though no rigorous comparison can be made because of the difference in the solutes to be tested, a comparatively low value of MUE (= 2.4 kcal mol−1) has been reported for ionic solutes.58
Footnotes |
† Electronic supplementary information (ESI) available: The source code (c4ra02422b1.txt) and Readme (c4ra02422b5.txt) of the program for the calculation of Ei and Si and three Visual Basic files (ProgramNonBorn.vbp; ProgramNonBorn.frm; ProgramNonBorn.vbw; being uploaded as c4ra02422b2.txt, c4ra02422b3.txt, and c4ra02422b4.txt, respectively) for the execution of the program. See DOI: 10.1039/c4ra02422b |
‡ It is conventionally accepted that the Born formula describes the Gibbs energy, however it has recently been claimed that the Born formula describes the enthalpy. See ref. 2. |
This journal is © The Royal Society of Chemistry 2014 |