Chidozie Ezeakunnea,
Bipin Lamichhane
b and
Shyam Kattel
*a
aDepartment of Physics, University of Central Florida, Orlando, FL 32816, USA. E-mail: shyam.kattel@ucf.edu
bDepartment of Physics, Florida A&M University, Tallahassee, FL 32307, USA
First published on 14th February 2025
In this study, we used a combination of density functional theory with Hubbard U correction (DFT+U) and machine learning (ML) to accurately predict the band gaps and lattice parameters of metal oxides: TiO2 (rutile and anatase), cubic ZnO, cubic ZnO2, cubic CeO2, and cubic ZrO2. Our results show that including Up values for oxygen 2p orbitals alongside Ud/f for metal 3d or 4f orbitals significantly enhances the accuracy of these predictions. Through extensive DFT+U calculations, we identify optimal (Up, Ud/f) integer pairs that closely reproduce experimentally measured band gaps and lattice parameters for each oxide: (8 eV, 8 eV) for rutile TiO2; (3 eV, 6 eV) for anatase TiO2; (6 eV, 12 eV) for c-ZnO; (10 eV, 10 eV) for c-ZnO2; (9 eV, 5 eV) for c-ZrO2; and (7 eV, 12 eV) for c-CeO2. Our ML analysis showed that simple supervised ML models can closely reproduce these DFT+U results at a fraction of the computational cost and generalize well to related polymorphs. Our approach builds on existing high-throughput DFT+U frameworks by providing fast pre-DFT estimates of structural properties and band gaps. Since this work does not aim to improve the underlying DFT+U method, the ML model shares its limitations. We also note that the reported values of Up strongly depend on the choice of correlated orbitals, and caution is recommended with a different choice of correlated orbitals.
However, determining the appropriate value of U in the DFT+U approach is not trivial and requires an extensive benchmarking of the calculated band gap with the experimental value. In general, the Hubbard U correction is only applied to 3d and 4f orbitals of metals in metal oxides.25,29,30 This is reasonable because of the inability of DFT to treat 3d and 4f valence orbitals strictly as localized orbitals. Interestingly, recent studies have shown that the U parameter for the O 2p orbital of oxygen in metal oxides is beneficial for predicting the lattice parameters and band gap correctly. For example, Thoa et al.31 have shown that for rutile TiO2, an optimal combination of Up (10 eV) and Ud (8 eV) significantly enhances the accuracy of predicted properties, minimizing the deviations in lattice constants and band gap between DFT+U and experimental values. Similarly, Plata et al.29 demonstrated in their study on CeO2 that applying Hubbard U corrections to both Ce 4f and O 2p electrons leads to substantial improvements in the predictions of lattice parameters, band gaps, and formation energies. Gebauer et al.32 extended these insights to ZrO2, where the judicious selection of Up and Ud values resulted in band gap predictions that closely matched experimental results. Additionally, May & Kolpak33 illustrate the importance of incorporating Up values for oxygen 2p orbitals alongside Ud/f for metal orbitals in DFT+U calculations, significantly enhancing the accuracy of predicted crystal structures and band gaps for transition and rare metal oxides such as rutile and anatase TiO2, ZnO, and CeO2.
Even though the DFT+U approach has been widely used to study metal oxides’ bulk and surface properties, a systematic, coherent, and extensive study aiming to unravel the effect of Up and Ud/f parameters on the prediction of lattice parameters and band gap is limited. In this current study, we employed the DFT+U approach, integrating the DFT+U results with machine learning (ML) methods to investigate the influence of Up, Ud/f parameters on the prediction of crystal structure (lattice parameters) and band gap of five commonly used metal oxides system in heterogeneous catalysis community,34 namely, TiO2 (rutile and anatase), c-ZrO2, c-ZnO, c-ZnO2 and c-CeO2. In general, the results show that including Up in addition to Ud/f values in DFT+U calculations yields improved prediction in both lattice parameters and band gap. Furthermore, the results obtained using the ML scheme show that the regression algorithms can be used to accurately predict the band gap of the metal oxides used in this study. Thus, our combined DFT+U+ML study extensively benchmarks Up and Ud/f values on lattice parameters and band gap prediction of the Hubbard U approach on widely used metal oxides.
We leverage a hybrid approach that integrates DFT calculations with an effective Hubbard U correction (DFT+U)21 and simple supervised ML models to predict band gap and lattice parameters for a given Up, Ud/f values for a range of commonly used oxides in heterogeneous catalysis. The DFT+U calculations were carried out using the Vienna Ab initio Simulation Package (VASP) code version 5.4.4 included in MedeA®,47–51 employing the generalized gradient approximation (GGA) with both Perdew–Burke–Ernzerhof (PBE) and revised PBE (rPBE) functionals.52,53 For both PBE and rPBE, we used is the PBE PAW potentials based on the projector-augmented-wave (PAW) method, which is provided by VASP.51,54
The study investigates six primary unique metal oxides with their respective Materials Project ID:55 rutile and anatase titanium dioxide (TiO2; mp-2657 and mp-390), each with its unique tetragonal structure; cubic zinc oxide (c-ZnO; mp-1986); zinc peroxide (c-ZnO2; mp-8484) with a pyrite-like, also cubic configuration; and zirconium dioxide (c-ZrO2; mp-1565) and cerium dioxide (c-CeO2; mp-20194), both in a cubic fluorite structure. We also conducted DFT+U calculations on additional secondary metal oxides (Table S36, ESI†) to evaluate the transferability of the learned feature weights to a closely related metal oxide.
A key aspect of our computational methodology involved applying the Hubbard U parameters – we applied the Ud parameter to the d orbitals of titanium (Ti), zinc (Zn), zirconium (Zr), manganese (Mn), hafnium (Hf), and nickel (Ni). In contrast, the Uf parameter was designated for the f orbitals of cerium (Ce).29 In addition to Ud/f parameters, we uniformly applied the Up parameter to the oxygen (O) p orbitals across all oxide systems in this study.
The computational setup for structure optimization and band structure calculations was consistent across all oxide systems. We made minor adjustments to the KPOINTS, aligning them with similar, converged values found in the Materials Project repository,56,57 for each metal oxide in this study. The energy cutoff for the plane wave basis set (ENCUT) was 520 eV for all metal oxides ensuring compatibility with Materials Project while still being more than double the recommended value referenced in the pseudopotentials. To verify the converged values, we conducted a convergence test for the metal oxide with the smallest KPOINTS utilized (c-ZnO), and the results (Fig. S1 and S2, ESI†) confirm convergence.
Our VASP input parameters included a ‘normal’ precision setting for structure optimizations, with the stress tensor fully optimized (ISIF = 3). We set the electronic minimization parameters with a total energy convergence criterion (EDIFF) of 1.0 × 10−6 and a force convergence criterion (EDIFFG) of 0.01. Additional settings included spin polarization (ISPIN = 2) and the LDA+U parameters (LDAU, LDAUTYPE, LDAUL, LDAUU, LDAUJ). The Up values initially span from 0.00 eV to 10.00 eV in integer steps of 1 eV, and the Ud/f values range from 2.00 eV to 10.00 eV, also in integer steps of 1 eV, with a U of 0.00 eV generally representing a particular case where no U correction was applied to the orbital of interest. The U values remain consistent with the structure optimization calculation, allowing for an extensive evaluation of their impact on the metal oxides’ electronic structures (band gap) and lattice parameters (a, b, c).
To complement the DFT+U analysis, we employed a variety of supervised ML regressors, including linear regression (LR), random forest regression (RFR), gradient boosting regression (GBR), XGBoost regression (XGBR), and Gaussian process regression (GPR). Additionally, we utilized a second-order polynomial regression (PR) model, which includes polynomial combinations of the features to capture the non-linear relationships between the Up and Ud/f parameters and the band gaps and lattice constants for each primary metal oxide in this study. Our preliminary data analysis informed this decision, indicating a non-linear correlation between these U variables and the target band gap and lattice constants.58–60 Model training incorporated a K-fold cross-validation approach (with K = 5) over leave-one-out cross-validation to simulate training on fewer data than obtained (∼80%). This method meticulously divided the dataset, using a different fold as the test set in each iteration, and the remaining folds in each iteration comprised the training set, thus providing a comprehensive assessment of model performance across the entire dataset and mitigating overfitting risks.58–60 Following the initial training and evaluation phase, we conducted a comprehensive retraining of the regressors using the dataset of the initial range of U values per metal oxide. Subsequently, to assess the models’ extrapolation predictive accuracy, we evaluated the newly trained models on ten newly generated random integer pairs of Up and Ud/f values, which extended beyond the initially defined range for U parameters (Table S4, ESI†). Models’ performance was evaluated using commonly used metrics such as mean squared error (MSE), root mean squared error (RMSE), mean absolute error (MAE), and coefficient of determination (R2).61–65
After extensive benchmarking to identify the best-performing regressor for interpolation and extrapolation scenarios in predicting band gap and lattice parameters for each primary metal oxide. We retrained the model a third time, including the extrapolation data, using only this best-performing regressor. We equally extracted the learned model weights/coefficients. This approach aimed to create a more accurate model capable of extending beyond our initial range of U values. Using this newly trained model, we applied Bayesian optimization minimization using a Gaussian process.66,67 We defined the objective loss function to minimize the weighted mean absolute percentage error (WMAPE).68 This loss function normalizes the differences in units and scales between the band gap and lattice constants. We included weights to emphasize the predictions for both material properties (band gap and lattice constants) in this study – (1, 1, 1, 1) to represent equal contributions from both band gap and lattice parameters and (1, 0, 0, 0) to focus solely on the band gap. Using this technique, we identified the optimal combination of Up and Ud/f values to a higher precision. We investigated two constraints as before:
(a) Both Up and Ud/f were allowed to vary, starting from 0.01 eV.
(b) Up was fixed at zero, with only Ud/f varying, starting from 0.01 eV.
The optimal U values were obtained, and DFT+U calculations were performed for validation. For each scenario, we calculated the percentage differences from experimental values. This allowed for a comparative assessment of the impact of fixing Up to zero on the accuracy of the DFT+U band gap and lattice parameters.
Finally, we incorporated the entire dataset to build a comprehensive model capable of predicting the band gap and lattice constants of the metal oxides in this study. These models aim to achieve good agreement with the DFT+U calculated values using fundamental properties of the constituent elements of each metal oxide that are readily accessible (Table S33, ESI†).69 We also extracted the learned weights for the features used in training the models. Additionally, we assessed the applicability of these models in predicting the band gap and lattice constants of closely related metal oxides not considered in this study. A more detailed computational methodology and the data, scripts, and additional resources used in this study are accessible on our GitHub page: https://github.com/dozieeze/DFT-Hub-U-ML-Predictor.
Our methodology and scripts provide a flexible method for researchers to incorporate multiple pre-identified properties of materials beyond those used in this study. Researchers can predict optimal U values that balance the DFT+U errors in predicting the predefined material properties compared to experimental values. Users can assign weights to each property and utilize a few DFT+U calculations with non-converged U values. Consequently, the methodology and scripts are not restricted to the computational configurations or the DFT code used in this study and can be adapted to non-DFT schemes, enhancing their robustness and versatility. It is, however, crucial to recognize that the optimal U values we derive are not ab initio and may be property-specific; caution must be taken in interpreting the physical significance of the derived U parameters.
We integrated the DFT+U approach with the ML method to accurately predict the band gap and lattice parameters of our primary metal oxides – TiO2, ZnO, ZnO2, ZrO2, and CeO2. The DFT+U calculations employed two commonly used exchange and correlation functionals: PBE and rPBE. In the following discussion, we primarily focus on the results obtained with the rPBE method. The results for PBE, which demonstrate a trend like that of the rPBE method, are provided in the ESI.† The results obtained using the rPBE functional were chosen to demonstrate the feasibility of our approach since the rPBE method has been shown to reproduce experimental trends of catalytic activity and selectivity in metal oxide-based catalysts.70 The DFT+U calculations were performed to compute metal oxides’ lattice parameters and band gap with a range of Up and Ud/f values. Subsequently, we developed regression models to predict the target variables using the data obtained from DFT+U calculations. Up and Ud/f represent the features in our simplest models, and band gaps and lattice constants a, b, and c represent the target variables.
Scatter plots were constructed to discuss the DFT+U results. The color of the circles indicates the percentage difference between the DFT+U calculated and experimental values of band gaps. In contrast, the size of the circle represents the percentage difference between the DFT+U calculated and experimental values of the lattice parameters. A darker blue color signifies a negative percentage difference, corresponding to an underestimated band gap (compared to experimental values). Conversely, a darker brown signifies a positive percentage difference, indicating an overestimated band gap (smaller than the experimental values). A white color indicates a zero-percentage difference between the experimental and DFT+U values. We restricted the shade range of brown and blue color to show deviations in the DFT+U calculated band gap value to ±20%; this limitation was imposed to ensure consistency across the plots for the various oxides under investigation and to concentrate primarily on more minor deviations as a deviation more than 20% in the calculated band gap from the experimental band gap is indeed a significant deviation from experiments that it is unlikely to be remedied by DFT+U. For metal oxides with a cubic bulk phase (a = b = c), the size of the circles represents the percentage difference in lattice constant (a) between DFT+U and experiments. For rutile and anatase TiO2 with tetragonal bulk structures (lattice constant a = b ≠ c), the size of the circles represents the percentage difference in lattice constant ratio (c/a) between DFT+U and experiments.
Without Up, our results show that the Ud = 10 eV produces the best result for c/a. The % difference between our DFT+U calculated c/a and the experimental values for Ud = 10 eV is 1.5%. Using the (Up, Ud) integer pairs of (8 eV, 8 eV) and (5 eV, 9 eV) predicted the band gaps close to the experimental values. The % differences between computed c/a, and the experimental values for those pairs are 1.3% and 1.4%, respectively. Considering the above-discussed three (Up, Ud) integer pairs, the (8 eV, 8 eV) pair results in the minimum error in both the band gap and lattice constants. Thus, our extensive DFT+U calculations suggest that the (Up, Ud) integer pair of (8 eV, 8 eV) is a reasonable choice of U values to accurately predict both the lattice parameters and band gap of rutile TiO2.
In the case of anatase TiO2, our standard DFT calculations result in a band gap of 2.486 eV and lattice constants of a = b = 3.829 Å and c = 9.802 Å. The standard DFT predicted band gap (2.486 eV) is smaller than the experimental band gap (∼3.200 eV).14 Simultaneously, the lattice parameters are slightly overestimated compared to experimental values (a = b = 3.785 Å and c = 9.512 Å), consistent with previous DFT results.73–76 Fig. 2 shows the deviation between the DFT+U calculated and experimental band gap values and lattice constants (c/a ratio in this case). Without the Up, among all integer Ud values, Ud = 7 eV produces a band gap of 3.243 eV, close to the experimental value with just a 1.3% difference between the predicted and the experimentally measured band gap. Similar to rutile TiO2, we obtain more accurate band gap predictions when introducing Up in addition to Ud. Fig. 2 shows that the (Up, Ud) integer pairs of (7 eV, 5 eV) and (3 eV, 6 eV) have a calculated band gap of 3.201 eV (% difference of 0.0%; Fig. S17, ESI†) and % difference in c/a of 0.9% and 0.5% respectively. Considering the deviation in both band gap and lattice parameters, our results predicted that the (Up, Ud) integer pair of (3 eV, 6 eV) represents the optimal integer Up and Ud values, which results in a band gap of 3.201 eV and lattice constants a = b = 3.897 Å, c = 9.846 Å.
Fig. 3, 4 and Tables 1, 3 (Fig. S7, S9, S21, S23, and Tables S1, S2, S7, S8, ESI†) showcase the result of our supervised ML models to predict the band gap and lattice constants of rutile and anatase TiO2. An asterisk (*) indicates the best model for the initial range of U values. Although GPR performs best, slightly better than PR in the initial range of U values, PR performs better in new unseen data outside our initial range (extrapolation), suggesting that the PR model can be used to accurately predict the band gap and lattice constant of TiO2, representing a robust and much faster computational approach (compared to traditional DFT+U calculations) to predict the band gap and lattice constants for any combination of Up and Ud.
Oxide | Model | Initial range | Extrapolation | ||||||
---|---|---|---|---|---|---|---|---|---|
MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | ||
Rutile TiO2 | PR | 0.00 | 0.00 | 0.00 | 1.00 | 0.00 | 0.02 | 0.01 | 1.00 |
*GPR | 0.00 | 0.00 | 0.00 | 1.00 | 0.00 | 0.06 | 0.05 | 0.99 | |
LR | 0.00 | 0.03 | 0.02 | 0.99 | 0.02 | 0.13 | 0.09 | 0.94 | |
GBR | 0.00 | 0.03 | 0.02 | 0.99 | 0.15 | 0.38 | 0.31 | 0.55 | |
XGBR | 0.00 | 0.04 | 0.03 | 0.99 | 0.15 | 0.38 | 0.32 | 0.55 | |
RFR | 0.00 | 0.04 | 0.03 | 0.98 | 0.16 | 0.39 | 0.32 | 0.52 |
Using the minimization technique discussed in our methodology to optimize U with a higher precision of ∼0.01 eV, we obtained the results shown in Tables 2 and 4. When weight is applied only to the band gap, we quickly notice that the optimized U values vary greatly depending on the random initial conditions. This suggests that multiple pairs of U values can accurately predict the band gap, albeit with varying deviations in lattice constants from experimental data. The higher the precision, the greater the number of possible pairs. This phenomenon is not observed when Up is fixed at 0.00 eV. Applying equal weight to the band gap and lattice constants introduces constraints, leading to U pairs that minimize deviations from both experimental band gaps and lattice constants. Under this constraint, we observe a much larger converged Up value.
Oxide | Weights [Eg, a, b, c] | Converged Up![]() ![]() |
rPBE Eg (eV) | rPBE [a = b, c] (Å) | % Difference Eg | % Difference [a = b, c] |
---|---|---|---|---|---|---|
Rutile TiO2 | 1, 0, 0, 0 | 0.00![]() ![]() |
3.039 | 4.7499, 3.1108 | 0.3 | 3.4, 5.0 |
1, 0, 0, 0 | 13.20![]() ![]() |
3.038 | 4.6585, 3.0355 | 0.3 | 1.4, 2.5 | |
1, 1, 1, 1 | 20.89![]() ![]() |
3.067 | 4.5909, 2.9806 | 1.2 | −0.0, 0.7 |
Oxide | Model | Initial range | Extrapolation | ||||||
---|---|---|---|---|---|---|---|---|---|
MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | ||
Anatase TiO2 | PR | 0.00 | 0.00 | 0.00 | 1.00 | 0.00 | 0.03 | 0.02 | 1.00 |
*GPR | 0.00 | 0.00 | 0.00 | 1.00 | 0.00 | 0.07 | 0.05 | 0.99 | |
LR | 0.00 | 0.03 | 0.02 | 0.99 | 0.02 | 0.13 | 0.09 | 0.95 | |
GBR | 0.00 | 0.03 | 0.02 | 0.99 | 0.15 | 0.39 | 0.30 | 0.56 | |
XGBR | 0.00 | 0.03 | 0.02 | 0.99 | 0.15 | 0.39 | 0.32 | 0.55 | |
RFR | 0.00 | 0.04 | 0.03 | 0.99 | 0.16 | 0.40 | 0.31 | 0.53 |
Oxide | Weights [Eg, a, b, c] | Converged Up![]() ![]() |
rPBE Eg (eV) | rPBE [a = b, c] (Å) | % Difference Eg | % Difference [a = b, c] |
---|---|---|---|---|---|---|
Anatase TiO2 | 1, 0, 0, 0 | 0.00![]() ![]() |
3.206 | 3.9152, 9.8709 | 0.2 | 3.4, 3.7 |
1, 0, 0, 0 | 11.43![]() ![]() |
3.194 | 3.8350, 9.7753 | 0.2 | 1.3, 2.8 | |
1, 1, 1, 1 | 17.04![]() ![]() |
3.195 | 3.7848, 9.7221 | 0.2 | 0.0, 2.2 |
The ML analysis depicted in Fig. 6 and Table 5 reveals that PR is the superior model for predicting the band gap of c-ZnO using Up and Ud as features. However, it exhibits significantly higher error rates and a poorer fit than rutile and anatase TiO2. The analysis of predictions outside the initial training and test sets suggests that while a quadratic trend may be closely followed within the original range of Up and Ud values, deviations occur with larger Up and Ud values, leading to less favorable predictions in the band gap and lattice constants.
Oxide | Model | Initial range | Extrapolation | ||||||
---|---|---|---|---|---|---|---|---|---|
MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | ||
c-ZnO | PR | 0.00 | 0.02 | 0.01 | 1.00 | 0.75 | 0.87 | 0.41 | 0.63 |
*GPR | 0.00 | 0.01 | 0.00 | 1.00 | 0.80 | 0.90 | 0.45 | 0.60 | |
LR | 0.00 | 0.05 | 0.04 | 0.99 | 1.14 | 1.07 | 0.58 | 0.44 | |
GBR | 0.00 | 0.04 | 0.03 | 0.99 | 2.00 | 1.41 | 0.84 | 0.02 | |
XGBR | 0.00 | 0.05 | 0.04 | 0.99 | 2.00 | 1.41 | 0.86 | 0.02 | |
RFR | 0.00 | 0.06 | 0.05 | 0.98 | 2.02 | 1.42 | 0.86 | 0.01 |
Given the difficulties in accurately predicting the band gap of c-ZnO with the initial range of integer Up and Ud values, which spanned from 0 eV to 10 eV for Up and 2 eV to 10 eV for Ud – we utilized the PR model, to optimize U to integer precision. When setting Up to zero, the (Up, Ud) integer pair of (0 eV, 13 eV) resulted in a lattice constant of 4.259 Å and a band gap of 3.396 eV, showing deviations from experimental results of approximately |8.0|% and 0.8%, respectively. With the inclusion of a non-zero Up, the (6 eV, 12 eV) pair led to a lattice constant of 4.302 Å and a band gap of 3.314 eV (Fig. S36, ESI†), indicating deviations from experimental values of approximately |7.1|% for the lattice constants and |1.7|% for the band gap. Our analysis and calculations suggest that an (Up, Ud) integer pair of (6 eV, 12 eV) is a judicious choice for accurately predicting both the lattice parameters and the band gap of c- ZnO.
Utilizing the U values optimized to ∼0.01 eV obtained from the minimization (Table 6), we get better results in the predictions of the band gap and lattice constants, albeit with much more deviations from experiments than in rutile and anatase TiO2, possibly due to less favorable predictions from the underlining PR model used in the optimization. Larger Up values are also noticed in cases with equal weights.
Oxide | Weights [Eg, a, b, c] | Converged Up![]() ![]() |
rPBE Eg (eV) | rPBE [a = b, c] (Å) | % Difference Eg | % Difference [a = b, c] |
---|---|---|---|---|---|---|
c-ZnO | 1, 0, 0, 0 | 0.00![]() ![]() |
2.878 | 4.2520 | 14.6 | −8.2 |
1, 0, 0, 0 | 6.97![]() ![]() |
3.091 | 4.3426 | 8.3 | −6.2 | |
1, 1, 1, 1 | 25.82![]() ![]() |
3.202 | 4.4250 | 5.0 | −4.4 |
The ML analysis is shown in Fig. 8 and Table 7. PR stands out as the better model, albeit with a less favorable prediction outside our initial range of data, similar to the result in c-ZnO. We refined the band gap prediction for c-ZnO2 employing a technique like the one used for c-ZnO. With Up set to zero, the (Up, Ud) integer pair of (0 eV, 13 eV) provided predictions that closely matched experimental values, yielding a band gap of 3.381 eV and a lattice constant of 4.635 Å. Nevertheless, these results showed deviations from experimental measurements, with |24.9|% for the band gap and |4.9|% for the lattice constant. Based on our calculations, the (Up, Ud) integer pair of (10 eV, 10 eV) emerges as a sensible choice for U, enabling accurate predictions of both the lattice parameters and the band gap of c-ZnO2. Optimizing the U values to a higher precision, we get better DFT+U predictions in line with experiments (Table 8).
Oxide | Model | Initial range | Extrapolation | ||||||
---|---|---|---|---|---|---|---|---|---|
MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | ||
c-ZnO2 | PR | 0.00 | 0.01 | 0.01 | 1.00 | 0.30 | 0.55 | 0.23 | 0.57 |
*GPR | 0.00 | 0.01 | 0.01 | 1.00 | 0.34 | 0.58 | 0.32 | 0.51 | |
LR | 0.00 | 0.04 | 0.03 | 0.99 | 0.37 | 0.61 | 0.36 | 0.47 | |
GBR | 0.00 | 0.04 | 0.03 | 0.99 | 1.03 | 1.01 | 0.78 | −0.48 | |
XGBR | 0.00 | 0.06 | 0.06 | 0.98 | 1.03 | 1.02 | 0.79 | −0.48 | |
RFR | 0.01 | 0.07 | 0.06 | 0.98 | 1.07 | 1.03 | 0.80 | −0.53 |
Oxide | Weights [Eg, a, b, c] | Converged Up![]() ![]() |
rPBE Eg (eV) | rPBE [a = b, c] (Å) | % Difference Eg | % Difference [a = b, c] |
---|---|---|---|---|---|---|
c-ZnO2 | 1, 0, 0, 0 | 0.00![]() ![]() |
5.049 | 4.1441 | 12.2 | −14.9 |
1, 0, 0, 0 | 9.75![]() ![]() |
4.443 | 4.7558 | 1.3 | −2.4 | |
1, 1, 1, 1 | 13.63![]() ![]() |
4.567 | 4.8677 | 1.0 | −0.1 |
The performance of our ML models in accurately predicting the band gap and lattice constants of c-ZrO2 is summarized in Fig. 10, and Table 9 (Fig. S54 and Table S23, ESI†). PR remains the best choice model for predictions. Unlike c-ZnO and c-ZnO2, where more significant Up and Ud values led to prediction challenges, c-ZrO2 maintains predictive accuracy even at higher Ud and Up values, like rutile and anatase TiO2. This improves the accuracy of the optimized high-precision U values to produce results close to experiments, as shown in Table 10.
Oxide | Model | Initial range | Extrapolation | ||||||
---|---|---|---|---|---|---|---|---|---|
MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | ||
c-ZrO2 | PR | 0.00 | 0.02 | 0.02 | 1.00 | 0.01 | 0.08 | 0.05 | 0.97 |
LR | 0.00 | 0.06 | 0.05 | 0.97 | 0.05 | 0.22 | 0.14 | 0.80 | |
*GBR | 0.00 | 0.04 | 0.02 | 0.99 | 0.06 | 0.25 | 0.22 | 0.73 | |
RFR | 0.00 | 0.05 | 0.04 | 0.98 | 0.08 | 0.28 | 0.24 | 0.67 | |
XGBR | 0.00 | 0.05 | 0.03 | 0.98 | 0.08 | 0.28 | 0.25 | 0.67 | |
GPR | 0.00 | 0.02 | 0.01 | 1.00 | 0.11 | 0.33 | 0.25 | 0.54 |
Oxide | Weights [Eg, a, b, c] | Converged Up![]() ![]() |
rPBE Eg (eV) | rPBE [a = b, c] (Å) | % Difference Eg | % Difference [a = b, c] |
---|---|---|---|---|---|---|
c-ZrO2 | 1, 0, 0, 0 | 0.00![]() ![]() |
4.582 | 5.2733 | −0.4 | 3.0 |
1, 0, 0, 0 | 4.45![]() ![]() |
4.592 | 5.2326 | −0.3 | 2.2 | |
1, 1, 1, 1 | 15.80![]() ![]() |
4.617 | 5.1180 | 0.4 | −0.0 |
For the ML analysis, as shown in Table 11, PR remains the preferred prediction model. Employing a strategy like in c-ZnO2/c-ZnO, the (Up, Uf) integer pair of (0 eV, 13 eV), with Up, set to zero, was notable for yielding a band gap of 3.286 eV and a lattice constant of 5.584 Å, exhibiting deviations from experimental values of |2.7|% and |3.2|%, respectively. Additionally, when employing a non-zero Up, the (7 eV, 12 eV) integer pair closely approximated the experimental band gap with a predicted value of 3.209 eV (Fig. S66, ESI†), deviating by only |0.3|% from experimental data alongside a lattice constant deviation of |2.4|% at 5.543 Å. Therefore, the (Up, Uf) integer pair of (7 eV, 12 eV) emerges as the best choice for the U parameters, enabling accurate predictions of the band gap of c-CeO2 with minimal deviations in lattice constants from experiments. The optimized high-precision U values in Table 12 enhance the predictions; however, applying equal weights to both the band gap and lattice constants results in an unusually large Up value, causing the subsequent validating DFT+U calculation to fail in performing structure optimizations (Fig. 12).
Oxide | Model | Initial range | Extrapolation | ||||||
---|---|---|---|---|---|---|---|---|---|
MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | ||
c-CeO2 | *PR | 0.00 | 0.00 | 0.00 | 1.00 | 0.01 | 0.12 | 0.06 | 0.95 |
GPR | 0.00 | 0.01 | 0.00 | 1.00 | 0.02 | 0.15 | 0.09 | 0.93 | |
LR | 0.00 | 0.03 | 0.02 | 0.99 | 0.05 | 0.23 | 0.15 | 0.83 | |
GBR | 0.00 | 0.01 | 0.01 | 1.00 | 0.18 | 0.42 | 0.27 | 0.44 | |
RFR | 0.00 | 0.02 | 0.01 | 0.99 | 0.18 | 0.42 | 0.28 | 0.43 | |
XGBR | 0.00 | 0.02 | 0.01 | 0.99 | 0.18 | 0.42 | 0.28 | 0.43 |
Oxide | Weights [Eg, a, b, c] | Converged Up![]() ![]() |
rPBE Eg (eV) | rPBE [a = b, c] (Å) | % Difference Eg | % Difference [a = b, c] |
---|---|---|---|---|---|---|
c-CeO2 | 1, 0, 0, 0 | 0.00![]() ![]() |
3.205 | 5.5816 | 0.2 | 3.1 |
1, 0, 0, 0 | 7.61![]() ![]() |
3.202 | 5.5521 | 0.1 | 2.6 | |
1, 1, 1, 1 | 31.44![]() ![]() |
N/A | N/A | N/A | N/A |
Oxides | Model | Initial range | Extrapolation | ||||||
---|---|---|---|---|---|---|---|---|---|
MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | ||
All six (6) primary metal oxides in this study | *GPR | 0.00 | 0.01 | 0.00 | 1.00 | 0.21 | 0.46 | 0.22 | 0.83 |
PR | 0.00 | 0.04 | 0.02 | 1.00 | 0.24 | 0.49 | 0.19 | 0.81 | |
LR | 0.02 | 0.14 | 0.10 | 0.98 | 0.43 | 0.65 | 0.37 | 0.66 | |
XGBR | 0.00 | 0.04 | 0.03 | 1.00 | 0.60 | 0.77 | 0.47 | 0.53 | |
RFR | 0.00 | 0.06 | 0.04 | 1.00 | 0.61 | 0.78 | 0.47 | 0.52 | |
GBR | 0.01 | 0.07 | 0.05 | 0.99 | 0.62 | 0.78 | 0.47 | 0.52 |
Expanding the dataset to 38 oxides, we validated the retrained models on three unseen systems: monoclinic-ZrO2 (mp-2858), orthorhombic-TiO2 (mp-1840), and tetragonal-WO3 (mp-2235359). The RFR model performed best (Fig. 14 and Table 14) for band gap predictions, demonstrating strong generalization across varying crystal structures. Using the expanded dataset, the RFR model also predicted both band gaps and lattice parameters (a, b, c) with high accuracy across all 38 metal oxides (Fig. S76–S79, ESI,† Fig. 15 and Table 15).
![]() | ||
Fig. 14 (a)–(c) Top three ML models for band gap prediction of test metal oxides using elemental, structural, and chemical descriptors. |
Test oxides | Model | Evaluation (test) | |||
---|---|---|---|---|---|
MSE (eV2) | RMSE (eV) | MAE (eV) | R2 | ||
m-ZrO2 | RFR | 0.02 | 0.15 | 0.12 | 0.99 |
o-TiO2 | DTR | 0.03 | 0.16 | 0.11 | 0.98 |
t-WO3 | GPR | 0.03 | 0.17 | 0.14 | 0.98 |
![]() | ||
Fig. 15 RFR predictions vs. rPBE calculated bulk properties across all thirty-eight (38) metal oxides: band gap (Eg) and lattice parameters (a, b, c). |
Bulk properties | Data (average) | MSE | RMSE | MAE | R2 |
---|---|---|---|---|---|
Eg (eV) | Train | 0.00 eV2 | 0.03 eV | 0.01 eV | 1.00 |
Test | 0.01 eV2 | 0.07 eV | 0.03 eV | 1.00 | |
a (Å) | Train | 0.00 Å2 | 0.01 Å | 0.00 Å | 1.00 |
Test | 0.00 Å2 | 0.01 Å | 0.00 Å | 1.00 | |
b (Å) | Train | 0.00 Å2 | 0.01 Å | 0.00 Å | 1.00 |
Test | 0.00 Å2 | 0.02 Å | 0.00 Å | 1.00 | |
c (Å) | Train | 0.00 Å2 | 0.01 Å | 0.00 Å | 1.00 |
Test | 0.00 Å2 | 0.02 Å | 0.00 Å | 1.00 |
These results highlight the potential of increasing data diversity to improve model performance, while showcasing the utility of simple supervised machine learning models for property predictions and parameter optimization in metal oxides. Building on this, our findings demonstrate that achieving bandgap and lattice parameters closer to experimental values is possible when using both Up and Ud/f, compared to the treatment of Ud/f alone. Higher values of Ud/f tend to lead to larger lattice parameters that deviate from experimental values, consistent with previous studies.31 However, for most oxides reported here, optimizing to an accurate description of the lattice parameter with U values less than 0.01 eV favors an unphysically high Up value. This trend arises because increasing Up, regardless of the Ud/f value, decreases the lattice parameter, this behavior may reflect context-specific limitations of the orbital truncation scheme used by VASP.85
One possible explanation relates to the choice of correlated orbitals used by the DFT+U scheme in treating oxygen p-orbitals. As noted by Geneste et al.,86 the DFT+U approach implemented in VASP employs a scheme analogous to renormalized truncated atomic orbitals, which can potentially produce spurious results for lattice parameters (and volume) when Up is applied to oxygen p-orbitals, as opposed to Ud/f for the metal's d or f orbitals. Studies27,85–87 suggest that alternative orbital definitions, such as Wannier orbitals, may yield structural properties more aligned with physical expectations. Resolving this potential intrinsic limitation is significant but falls outside the scope of this study; we emphasize that the physically meaningful choice of correlated orbitals – and thus the resulting Hubbard U parameters, particularly Up for O 2p orbitals – is strongly tied to the definition of the correlated orbitals in VASP. Accordingly, these values may not be universally transferrable or physically meaningful within different implementations or codes, and we advise careful consideration of such context-specific factors when applying or interpreting them. Our developed ML framework and scripts are designed to operate independently of any specific DFT+U implementation. However, because of its extensive use and compatibility with the body of existing literature, our data and findings are consistent with VASP. Future research could expand this framework to include different orbital definitions, like Wannier orbitals, to improve physical accuracy while preserving the flexibility and effectiveness shown here.
Oxide | Calculation method | Up (eV) | Ud/f (eV) | a = b (Å) | c (Å) | Eg (eV) | % Deviation from exp. Eg | % Deviation from exp. a = b | % Deviation from exp. c |
---|---|---|---|---|---|---|---|---|---|
Rutile TiO2 | GGA–PBE | 0 | 0 | 4.646 | 2.968 | 1.826 | −39.7 | 1.1 | 0.2 |
GGA–PBE+U | 0 | 10 | 4.708 | 3.089 | 2.974 | −1.9 | 2.5 | 4.3 | |
GGA–PBE+U | 8 | 8 | 4.661 | 3.052 | 3.027 | −0.1 | 1.5 | 3.1 | |
GGA–rPBE | 0 | 0 | 4.689 | 2.983 | 1.838 | −39.3 | 2.1 | 0.7 | |
GGA–rPBE+U | 0 | 10 | 4.746 | 3.106 | 2.972 | −1.9 | 3.3 | 4.9 | |
GGA–rPBE+U | 8 | 8 | 4.699 | 3.069 | 3.037 | 0.2 | 2.3 | 3.6 | |
Anatase TiO2 | GGA–PBE | 0 | 0 | 3.804 | 9.702 | 2.496 | −22.0 | 0.5 | 2.0 |
GGA–PBE+U | 0 | 7 | 3.893 | 9.789 | 3.249 | 1.5 | 2.9 | 2.9 | |
GGA–PBE+U | 3 | 6 | 3.873 | 9.740 | 3.199 | −0.0 | 2.3 | 2.3 | |
GGA–rPBE | 0 | 0 | 3.829 | 9.802 | 2.486 | −22.3 | 1.2 | 3.1 | |
GGA–rPBE+U | 0 | 7 | 3.918 | 9.876 | 3.243 | 1.3 | 3.5 | 3.8 | |
GGA–rPBE+U | 3 | 6 | 3.900 | 9.846 | 3.201 | 0.0 | 2.9 | 3.5 | |
c-ZnO | GGA–PBE | 0 | 0 | 4.629 | 4.629 | 0.615 | −81.8 | 0.0 | 0.0 |
GGA–PBE+U | 0 | 13 | 4.172 | 4.172 | 3.663 | 8.7 | −9.9 | −9.9 | |
GGA–PBE+U | 5 | 12 | 4.243 | 4.243 | 3.374 | 0.1 | −8.4 | −8.4 | |
GGA–rPBE | 0 | 0 | 4.682 | 4.682 | 0.658 | −80.5 | 1.1 | 1.1 | |
GGA–rPBE+U | 0 | 13 | 4.259 | 4.259 | 3.396 | 0.8 | −8.0 | −8.0 | |
GGA–rPBE+U | 6 | 12 | 4.302 | 4.302 | 3.314 | −1.7 | −7.1 | −7.1 | |
c-ZnO2 | GGA–PBE | 0 | 0 | 4.957 | 4.957 | 2.128 | −52.7 | 1.8 | 1.8 |
GGA–PBE+U | 0 | 13 | 4.528 | 4.528 | 3.495 | −22.3 | −7.0 | −7.0 | |
GGA–PBE+U | 10 | 10 | 4.684 | 4.684 | 4.511 | 0.2 | −3.8 | −3.8 | |
GGA–rPBE | 0 | 0 | 5.020 | 5.020 | 2.13 | −52.7 | 3.1 | 3.1 | |
GGA–rPBE+U | 0 | 13 | 4.635 | 4.635 | 3.381 | −24.9 | −4.9 | −4.9 | |
GGA–rPBE+U | 10 | 10 | 4.761 | 4.761 | 4.464 | −0.8 | −2.3 | −2.3 | |
c-ZrO2 | GGA–PBE | 0 | 0 | 5.118 | 5.118 | 3.307 | −28.1 | 0.0 | 0.0 |
GGA–PBE+U | 0 | 9 | 5.249 | 5.249 | 4.589 | −0.2 | 2.5 | 2.5 | |
GGA–PBE+U | 9 | 5 | 5.152 | 5.152 | 4.554 | −1.0 | 0.7 | 0.7 | |
GGA–rPBE | 0 | 0 | 5.152 | 5.152 | 3.295 | −28.4 | 0.7 | 0.7 | |
GGA–rPBE+U | 0 | 9 | 5.282 | 5.282 | 4.638 | 0.8 | 3.2 | 3.2 | |
GGA–rPBE+U | 9 | 5 | 5.184 | 5.184 | 4.589 | −0.2 | 1.3 | 1.3 | |
c-CeO2 | GGA–PBE | 0 | 0 | 5.464 | 5.464 | 1.871 | −41.5 | 1.0 | 1.0 |
GGA–PBE+U | 0 | 12 | 5.543 | 5.543 | 3.172 | −0.9 | 2.4 | 2.4 | |
GGA–PBE+U | 2 | 12 | 5.537 | 5.537 | 3.206 | 0.2 | 2.3 | 2.3 | |
GGA–rPBE | 0 | 0 | 5.506 | 5.506 | 1.816 | −43.3 | 1.8 | 1.8 | |
GGA–rPBE+U | 0 | 13 | 5.584 | 5.584 | 3.286 | 2.7 | 3.2 | 3.2 | |
GGA–rPBE+U | 7 | 12 | 5.543 | 5.543 | 3.209 | 0.3 | 2.4 | 2.4 |
In addition to explicit DFT+U calculations, we used the dataset generated by the DFT+U calculations to train supervised ML models. Of all the ML models, RFR best predicts the band gap and lattice constant of all 38 metal oxides included in this study as a function of U values. The trained model also has the potential to be applied to other closely related metal oxides not explored here. A possible improvement for future work would involve expanding the training data to include a more diverse set of metal oxides and their polymorphs as well as more chemical and structural descriptors, enhancing the models’ ability to predict the properties of new metal oxides not covered in this study. Our ML analysis showed that simple supervised ML models can closely reproduce these DFT+U results at a fraction of the computational cost and generalize well to related polymorphs. Our approach builds on existing high-throughput DFT+U frameworks by providing fast pre-DFT estimates of structural properties and band gaps. Since this work does not aim to improve the underlying DFT+U method, the ML model shares its limitations. We also note that the reported values of Up strongly depend on the choice of correlated orbitals, and caution is recommended with a different choice of correlated orbitals.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4cp03397c |
This journal is © the Owner Societies 2025 |