Sanjith
Krishna
and
Afkham
Mir
*
Chemical Engineering Department, Birla Institute of Technology and Science Pilani, Hyderabad Campus, Jawahar, Nagar, 500078, Telangana, India. E-mail: mirafkham@hyderabad.bits-pilani.ac.in; f20200742@hyderabad.bits-pilani.ac.in
First published on 21st October 2024
In the dynamic realm of energy storage devices, supercapacitors (SCs) stand out as promising options. Among the various materials considered for SC electrodes, layered substances like Ti3C2 have drawn attention due to their outstanding electrochemical qualities, especially their impressive volumetric capacitance. This study focuses on assessing the predictive abilities of three machine learning models: Bayesian ridge regression (BRR), K-nearest neighbors (KNN), and artificial neural network (ANN) in estimating specific capacitance in Ti3C2-based supercapacitors. BRR offered reliable predictions with an R-squared (R2) value of 0.759 and a low root mean square error (RMSE) of 0.074. KNN excelled in predicting supercapacitor performance with an impressive R2 of 0.928 and a minimal RMSE of 0.040. However, the ANN model stood out as it could reveal the significance of various inputs much like the human brain's intricate functioning. It achieved a high R2 of 0.8929 with a low RMSE of 0.0493, demonstrating its proficiency in capturing complex relationships in the dataset. The precise tuning of hyperparameters further enhanced its accuracy. The use of SHAP (SHapley Additive exPlanations) values emphasized cation mobility, and scan rates as key contributing factors. These findings provide a strong foundation for utilizing machine learning to predict specific capacitance in Ti3C2-based supercapacitors. Researchers can benefit from these versatile tools for precise predictions, facilitating systematic supercapacitor design and enhancing our understanding of electrode materials.
Among the diverse array of materials explored for this purpose, one class, namely layered materials, has garnered significant attention, and within it, MXene has emerged as an exceptional contender for supercapacitor electrodes, showcasing exceptional electrochemical properties.3 MXenes, characterized as 2D layered materials with a general formula of Mn+1XnTx (where n ranges from 1 to 3), possess a fundamental framework composed of an early transition metal (M), a metal or non-metal (A), carbon or nitrogen (X), and an end termination typically represented by F, O, OH, or Cl (T).4 Notably, when compared to other 2D materials, MXenes exhibit remarkable traits such as high mechanical flexibility,5 elevated energy density,6 and robust electrochemical performance,7 rendering them particularly suited as electrode materials for SCs.
Among the myriad of MXenes, one that has found significant application in supercapacitor synthesis is Ti3C2, distinguished by its remarkable volumetric capacitance, which can soar to values as impressive as 1500 F cm−3.8 The intricacies of the charge storage mechanism in Ti3C2MXene supercapacitors involve a harmonious interplay between electric double-layer capacitance (EDLC) and pseudo capacitance. The layered structure and expansive surface area of Ti3C2 MXene facilitate the congregation of ions at the interface between electrode and electrolyte, culminating in the formation of an electric double layer9 In addition, the participation of redox reactions at active sites on the MXene's surface contributes to pseudo capacitance, ushering in a process of reversible charge transfer between the electrode and electrolyte.10 This dual mechanism underscores the versatility of Ti3C2 MXene, enabling it to deliver a harmonious synergy of swift charge–discharge11 kinetics and augmented energy storage capacity. Thus, the prowess of Ti3C2 MXene positions it as a material with immense potential for driving high-performance supercapacitors. While earlier research has emphasized similar characteristics, our study takes a fresh approach by using Ti3C2's innate benefits inside a revolutionary machine learning framework. Unlike previous research, which frequently generalize over other MXenes, this study focusses particularly on Ti3C2, using tailored ML techniques that takes into consideration its distinctive characteristics, including specific surface area and unique charge storage mechanisms. This specificity not only improves the models' predictive accuracy but also sheds new light on the optimization of Ti3C2 for energy storage applications. By doing so, this study contributes to a deeper understanding of how Ti3C2's characteristics can be best exploited, setting a new benchmark for future research in the field.
Supercapacitor (SC) performance typically hinges upon a myriad of physical and chemical characteristics inherent to the system, encompassing factors such as specific surface,12 electrode thickness,13 electrolyte concentration,14 cation/anion mobility of the electrolyte,15 scan rate,16 and the working potential window17 to name a few. Wang and his colleagues18 recently conducted a study involving the assessment of charge storage in Ti3C2 using various lithium-containing electrolytes with distinct solvent types, including nitrile, carbonate, and sulfoxide-based solvents. Their findings revealed that the utilization of a carbonate-based solvent notably increases the charge stored in Ti3C2 compared to other solvent systems. Wang concluded that the chemical properties of the electrolyte solvent exert a substantial influence on the molecular and ionic arrangement within Ti3C2, leading to a direct correlation with the overall charge storage capacity. Similarly, it has been established that the adsorption and diffusion of ions within the electrolyte, as well as the process of charge transfer, exert a notable influence on capacitance.19 However, the process of experimentally assessing the electrochemical performance of Ti3C2-based supercapacitors based on these parameters is not only resource-intensive but also remarkably time-consuming. This challenge is compounded by the vast array of available configurations and materials, further emphasizing the complexity of the task at hand.20 In recent years, the combination of machine learning and material science has created new opportunities for the advancement of supercapacitor technology.21,22 However, despite the abundance of experimental data for Ti3C2-based supercapacitors, there has been a surprising dearth of research that use machine learning to forecast their electrochemical performance.23 This study is unique in that it uses machine learning (ML) not only as a supplemental tool, but as a primary component in revealing previously unknown insights into the particular capacitance and other essential features of Ti3C2 supercapacitors. This study establishes a robust predictive framework for high-performance supercapacitors by applying advanced ML models such as artificial neural networks (ANNs) and refining them to the unique characteristics of Ti3C2. This significantly reduces the reliance on trial-and-error experimentation, accelerating the development cycle. This approach represents a significant leap forward in the methodology used to design and optimize Ti3C2-based energy storage devices.
A substantial body of experimental data exists regarding the supercapacitance properties of Ti3C2, yet thus far, no machine learning-based insights have been extracted from this dataset. In previous instances, machine learning methodologies have been effectively harnessed to forecast properties of materials like MXenes, primarily by reducing the computational burden associated with density functional theory (DFT) calculations.24 For instance, research in this domain has demonstrated the potential of machine learning to expedite and enhance predictions of material properties,25 offering a promising avenue for the application of similar techniques to unlock hidden insights within the Ti3C2 supercapacitance dataset.
Likewise, machine learning models have found application in predicting pseudocapacitive properties across various MXene type.26 However, it's worth noting that these models have often overlooked the distinctive factors associated with individual MXenes, such as specific surface area and current density associated specifically with Ti3C2. Table 1, presented below, offers a comprehensive overview of the utilization of machine learning models in predicting properties of MXenes, with particular emphasis on Ti3C2.
S. no. | MXene | Input features | Output parameters | ML models | Ref. |
---|---|---|---|---|---|
1 | 85 MXene | DFT data | Locate band edges | Gaussian process regression | 27 |
2 | 1000 MXenes | Defect properties based on DFT | Ideal defects | Random forest, graph networks | 28 |
3 | 132 MXenes | DFT data | HER catalytic capacity | Random forest, ANN, KRR, SVR | 29 |
4 | 85 MXenes | Atomic empirical radius, atomic van der Waals radius, electronegativity, atomic, s-orbital electron number, first ionization energy. | Stability | Random forest, KNN, Logistic regression, SVM and Gaussian NB | 30 |
5 | 66 MX and 792 MAX | DFT data | Synthesis possibility | Positive and unlabelled learning | 31 |
6 | 450 MXenes | DFT data (electronic properties) | Pseudo capacitance | Crystal graph convolutional neural network | 32 |
7 | Ti3C2 | Experimental values of specific capacitance, cation and anion ion mobility, electrolyte concentration, scan rate, cell configuration, and potential window | Specific capacitance | KNN, ANN, BRR | This work |
In this study, we introduce machine learning (ML) frameworks to develop a predictive model bridging experimental data of specific capacitance, cation and anion ion mobility, electrolyte concentration, scan rate, cell configuration and potential window and electrolyte properties. This framework is pivotal for enabling the methodical design of Ti3C2-based supercapacitors. To scrutinize the impact of these seven crucial parameters, we meticulously curated 84 raw training data points encompassing a range of experimental scenarios from our extensive dataset. Our research scrutinized the predictive capabilities of three distinct ML algorithms (KNN, BRR, and ANN), with the artificial neural network (ANN) helping us quantify the weights of different input parameters. ANN successfully establishes a comprehensive ML model boasting a coefficient of determination (R2) as high as 0.89 and a very low root mean square error (RMSE) of just 0.049. KNN model gave us the best fit with the highest value of R2 as 0.93. These outcomes provide profound guidance for shaping future designs of Ti3C2-based supercapacitors.
One of the primary limitations of this study is the relatively small size of the dataset used for training and testing the machine learning models. The dataset, consisting of 84 raw data points, was curated from approximately 100 recent academic publications. However, the number of papers specifically focused on Ti3C2 supercapacitors is limited. Moreover, only a subset of these papers provided the complete set of experimental data necessary for our selected features, such as specific surface area, cation/anion mobility, and scan rate. This constraint on data availability may limit the generalizability of our findings. While machine learning techniques have proven effective in extracting insights from small datasets,33 future research with more extensive datasets will be crucial for validating and expanding upon the results presented here. Despite this limitation, the findings offer valuable preliminary insights and lay the groundwork for further exploration in the domain of Ti3C2-based supercapacitors.
The unique contributions made by this study are as follows:
• First-ever machine learning study on Ti3C2-based super: this research pioneers the application of machine learning models to predict, from experimental data, the capacitance of Ti3C2-based supercapacitors. Earlier studies have focused broadly on MXenes.34
• Exploration of diverse ML models: the study evaluates the performance of three advanced machine learning models—K-nearest neighbors (KNN), artificial neural networks (ANN), and Bayesian ridge regression (BRR)—to forecast specific capacitance, assessing their accuracy and robustness.
• Novel insights using SHAP analysis: this work reveals hitherto unexamined connections that affect Ti3C2 supercapacitor performance by using SHAP values to highlight important parameters like cation mobility and scan rate.
• Extensive dataset and model optimization: the study includes finely calibrated machine learning models designed for forecasting particular capacitance in Ti3C2 materials, together with a carefully selected dataset based on experimental settings. These techniques provide useful tools for expediting the design of supercapacitors.
The manuscript is structured as follows. Section 2 outlines the methodology used to curate the dataset and develop the machine learning models. In Section 3, the results of the model evaluations are presented, including the predictive performance of the KNN, ANN, and BRR models, along with insights derived from the SHAP analysis. Finally, Section 4 concludes the study by summarizing the key findings and suggesting future directions for research on Ti3C2-based supercapacitors and machine learning-guided design optimization.
We incorporated a range of features rooted in electrode structural properties and chemical attributes. These encompassed the specific surface area, cation and anion mobility, electrolyte concentration, scan rate, current density, cell configuration, potential window, sheet thickness, and the weight percentage of Ti3C2. The inclusion of these features was aimed at comprehensively capturing the intricate nuances and complexities inherent in supercapacitor experimental research. Notably, the specific capacitance (measured in F g−1) served as the target output feature for the Ti3C2-based electrode materials.
The integration of this extensive set of features was imperative to fully encapsulate the wide spectrum of variations observed in experimental work. Our intention to encompass the complete scope of experimental diversity is further underscored by the substantial upper limit of specific capacitance, reaching up to 1850 F g−1, within the dataset used to train our machine learning models.
Inputs | Units | Output | Units | |
---|---|---|---|---|
Electrode characteristics | Specific surface area | m2 g−1 | Specific capacitance | F g−1 |
Sheet thickness | mm | |||
Weight % of Ti3C2 | ||||
Electrolyte properties | Cation mobility | cm2 V−1 s−1 × 104 | ||
Anion mobility | cm2 V−1 s−1 × 104 | |||
Electrolyte concentration | M | |||
Operational parameters | Scan rate | mV s−1 | ||
Cell configurations | ||||
Working potential window | V | |||
Current density | A g−1 |
![]() | (1) |
The colour inside each square in the Pearson correlation displayed in Fig. 2 indicates the degree of the linear correlation between the intersecting features. Positive correlation coefficients indicate that one feature tends to increase or decrease alongside the other, while negative coefficients signify that an increase in one feature corresponds to a decrease in the other and vice versa. The heat map depicted in Fig. 2 employs purple and yellow shades to indicate strong correlations between features, with greenish hue signifies weaker associations. Correlation coefficients near zero suggest a low linear interdependence between the features.35 Notably, a correlation coefficient approaching 1 or −1 with specific capacitance suggests that the feature may be valuable for machine learning models in predicting specific capacitance. As the correlation of a feature with itself is always 1, all diagonal values in the heatmap are inherently 1. The results in Fig. 2 reveal a high correlation, in descending order, between specific capacitance and the following features: specific surface area > anion mobility > electrolyte concentration > sheet thickness > weight percentage of Ti3C2 > current density > potential window > scan rate >cation mobility > cell configuration. Utilizing all 9 features in model development yielded better performance than solely relying on features with high correlation values (those falling between 0.6 and 0.98) with specific capacitance.
![]() | ||
Fig. 2 Heatmap presenting Pearson correlation coefficient values, illustrating the degree of linear association between variables. |
Conversely, there exists a discernible yet weak ascending correlation between specific capacitance and the following factors: cell configuration, cation mobility, scan rate, potential window, and current density. This indicates that the prediction of capacitance cannot rely solely on straightforward linear or inverse associations. The intricate relationships between specific capacitance and these features, as modeled by machine learning algorithms, contribute to more accurate predictions. Fig. 3 depicts the histogram distribution of features utilized in constructing the machine learning models, alongside the mean and standard deviation (STD) values for each feature's distribution. Several notable insights arise from these distribution curves. For instance, the relatively low mean value of 0.69, accompanied by a standard deviation of 0.36, for the potential window, suggests that many surveyed research papers concentrated around a potential window of approximately 0.7. The substantial standard deviation and mean values for scan rate indicate a wide range (1–40000 mV s−1) covered in the surveyed experimental studies, occasionally reaching up to 40 V during GCD tests. Expanding the operational potential window and investigating supercapacitor performance at higher scan rates align with the objectives of supercapacitor design. The specific capacitance stands out for its large standard deviation, indicating the diverse array of supercapacitors encompassed within this study.
Similarly, the relatively high standard deviations of both pore size and pore volume, when contrasted with their low mean values, suggest a diverse set of surface morphologies within the dataset. The specific surface area spans from a minimum value of 22.35 m2 g−1 to a maximum of 98 m2 g−1, with a standard deviation of 41.14. The significant standard deviations and mean values for electrode configuration reflect the dataset's variety. In the interest of enhancing the flexibility of the machine learning models, we categorized different electrolyte compositions as cationic and anionic mobility, showcasing standard deviation values of 14.64 and 10.57, respectively, which are indicative of the wide range of electrolytes employed in the research.
![]() | (2) |
![]() | (3) |
Our work employs RMSE and R2 as evaluation metrics due to their specific advantages for supercapacitor performance predictions. RMSE was chosen for its ability to penalize larger errors more severely, which is critical in applications where large deviations in predicted capacitance can significantly impact design efficiency and performance. By weighting larger errors more heavily, RMSE ensures that models focus on minimizing these critical inaccuracies. R2 was selected to assess the proportion of variance in the data explained by the model. It provides a clear indication of how well input features, such as specific surface area, ion mobility, and scan rate, contribute to predicting the output variable—specific capacitance. This metric gives an intuitive sense of the model's overall predictive power and helps gauge how effectively the model captures the underlying patterns in the data. Together, RMSE and R2 offer complementary insights: RMSE focuses on reducing large errors, while R2 evaluates the overall explanatory power of the model, making them suitable choices for this study.
Notably, we employed a grid search approach to optimize the hyperparameter, denoted as alpha (α).37 Our Bayesian ridge regression (BRR) model exhibited a relatively modest alpha value of 0.001, indicative of mild L2 regularization. This modest regularization suggests that while the model imposes some constraint on the coefficient magnitudes to prevent overfitting, it remains sufficiently flexible to capture the complex relationships inherent in the data.
The significance here is the model's adeptness in closely adhering to the training data, potentially capturing intricate relationships, albeit with the caveat of overfitting in the presence of noisy or multicollinear data. Highlighting the model's predictive competence, our BRR model achieved a commendable R-squared (R2) value of 0.759, accompanied by an impressively low root mean square error (RMSE) of 0.074, substantiating its aptitude for precise predictions of Ti3C2 capacitances. Fig. 4d illustrates the convergence behaviour of the Bayesian regression model applied to predict the specific capacitance of Ti3C2 electrodes during the variational inference process. As shown, the log marginal likelihood stabilizes at approximately 50% following a key isolation step. This stabilization suggests that the model has reached an equilibrium point, where further iterations no longer enhance the model's fit to the data. The constant likelihood indicates that the model has successfully captured the critical relationships within the Ti3C2 dataset, balancing model complexity with explanatory power. This convergence confirms the model's robustness in predicting Ti3C2 capacitance without the risk of overfitting.
The outcomes of the KNN model reveal its efficacy, with a notable R-squared value of 0.928, signifying a high level of accuracy. Accompanying this, the root mean square error (RMSE) registers at 0.040, indicating minimal loss in prediction quality. The optimal value for K (1 to 12) in this context is determined to be 5 (highest R2), underlining the potential for KNN in predicting supercapacitance values with a remarkable degree of precision. Fig. 5a presents the validation curves for the KNN model. The first graph shows the relationship between RMSE and the number of neighbours, where the RMSE reaches its minimum at K = 5, highlighting this value as the optimal choice for minimizing prediction error. Similarly Fig. 5b shows the R-squared versus the number of neighbours, with the R-squared value peaking at K = 5, further validating this as the most effective choice for achieving the highest accuracy. Together, these curves demonstrate that K = 5 is the best choice for balancing model complexity and predictive performance, which makes the KNN model a reliable method for predicting Ti3C2 supercapacitance.
The training process of a neural network involves using the Levenberg–Marquardt backpropagation algorithm to adjust the weights and biases of the neurons as depicted in Fig. 6b. This training was done in Python using the framework TensorFlow. In this specific ANN model, four layers are employed: an input layer with seven neurons, an output layer with a single neuron, and two hidden layers consisting of 64 neurons each (Fig. 6a). The dataset is divided randomly, with 75% used for training the ANN and 25% for validation. The information flows through the network, with weights in the hidden layer neurons being adjusted to reach the desired output. This is done through the use of a nonlinear activation function, ReLu, represented by eqn (4),
![]() | (4) |
To enhance the training process further, the Adam optimizer is used, and hyperparameter optimization is performed through a grid search. This systematic exploration of hyperparameter combinations ensures the network's performance is fine-tuned to its best configuration. Training involves adapting the weights (w) and biases (b) through thousands of iterations. The iterative weight adjustments, informed by the Levenberg–Marquart backpropagation algorithm (shown in Fig. 6b) and the influence of the Adam optimizer, occur over 100 iterations according to eqn (5),
![]() | (5) |
The results of this ANN model are impressive, with a high R-squared value of 0.8929, indicating that it can explain nearly 89.29% of the data's variance (Fig. 4c). Additionally, a low root mean square error (RMSE) of 0.0493 suggests accurate predictions. These results validate the model's accuracy and reliability. The optimal selection of hyperparameters, including a batch size of 16, 100 training epochs, and 64 hidden units, underscores the careful tuning of the model, enhancing its overall effectiveness. Based on the results presented, it is evident that there is a strong correspondence between the actual and predicted specific capacitance values during both the training and test phases. In all the sample cases, we observed consistent and accurate agreement between the actual and predicted values, as depicted in Fig. 7b. This suggests that the ANN model can be effectively utilized to achieve high-speed processing and enhance the accuracy of predictions.
SHAP values were applied to gain insights from the developed ANN model. They were used to calculate the importance of each feature for specific predictions. A baseline value was established by averaging predictions on the test dataset. A positive SHAP value signifies that a feature contributed to an increased prediction compared to the baseline, while a negative SHAP value suggests a contribution to a decreased prediction. To explore the inner workings of the ANN model, SHAP values were computed for all predictions on the test data split. Fig. 7a offers an overview of these SHAP values. Examining the distribution of positive and negative SHAP values for each feature horizontally provides insights into the relationship between a feature's value and its associated SHAP value. Notably, in Fig. 7a, the most significant feature for the ANN predictions were cation mobility, electrolyte concentrations and scan rate. A closer look at the SHAP values gives important insights into the impact of each attribute on the model's predictions. Positive SHAP values for the scan rate, cation ion mobility, and electrolyte concentration imply all have a major role in increasing the model's anticipated specific capacitance. Higher specific capacitance for higher cation ion mobility can also be explained by the fact that higherion mobility makes it possible for ions to move through the electrolyte more quickly and efficiently. This leads to faster cycles of charge and discharge and higher supercapacitance performance. However, the SHAP analysis also reveals a nuanced perspective: beyond a certain threshold, the cation ion mobility can negatively impact the system, as indicated by the negative SHAP values. This negative effect may be attributed to phenomena such as ion overcrowding or excessive diffusion rates, which can lead to non-ideal interactions at the electrode interface. For instance, too rapid ion movement might reduce the time available for ions to effectively interact with the active sites on the electrode, leading to a decrease in capacitance. Additionally, high mobility might also cause an imbalance in the distribution of ions, potentially leading to reduced electrode efficiency and uneven charge storage, thus detracting from overall performance.
Furthermore, other parameters like potential window and cell configuration also show a mix of positive and negative SHAP values, implying that their effect on capacitance is context-dependent, altering with particular combinations of other input characteristics. For example, a positive SHAP for potential window implies a broader potential window will allow for a greater energy storage capacity, enabling the supercapacitor to store and deliver more charge. However, if the potential window is too wide, it may result in the breakdown of the electrolyte or degradation of the electrode material, leading to negative SHAP values. Therefore, the potential window must be carefully controlled to balance the trade-off between maximizing energy storage and maintaining system stability. This nuanced interpretation highlights the ANN model's intricate interconnections and gives a more complete understanding of how different parameters affect the performance of Ti3C2-based supercapacitors.
Fig. 7c displays the mean squared error (MSE) throughout the training process across various epochs. It is noteworthy that the MSE steadily decreases from an initial value of 0.65 to a significantly lower 0.0493, precisely over 100 iterations or epochs. Notably, the MSE exhibits rapid convergence, plummeting from 0.65 to a little over 0 within the first 20 epochs. However, as the training progresses beyond these initial 20 epochs, the reduction in MSE becomes more gradual, further decreasing to 0.0493. It's important to highlight that a smaller MSE implies a closer alignment between the actual and predicted values. These findings underscore the robust capability of the artificial neural network (ANN) model in accurately assessing the relative influences of the seven factors on the specific capacitance property.
• Distance netric: the Euclidean distance was used as the distance metric to identify nearest neighbors.
• Weighting: uniform weighting was applied, meaning each of the k nearest neighbors contributed equally to the prediction.
• Activation function: the rectified linear unit (ReLU) was used as the activation function for the hidden layers to handle non-linear relationships in the data, while a linear activation function was used for the output layer.
• Optimization: the Adam optimizer was used for training, with a learning rate of 0.001. The optimizer parameters, such as beta_1 (0.9), beta_2 (0.999), and epsilon (1 × 10−7), were set to their default values in TensorFlow.
• Training setup: a batch size of 16 was used, and the model was trained for 100 epochs. Early stopping was employed based on validation RMSE to prevent overfitting.
• Hyperparameter tuning: a grid search approach was used to determine the optimal values for the number of neurons in the hidden layers, learning rate, and batch size. Cross-validation with 5-fold splits was conducted to ensure generalizability of the results.
• Convergence criteria: the model training was terminated when the log marginal likelihood stabilized after approximately 50 iterations, indicating convergence.
• Training process: a probabilistic approach was used, meaning the BRR model estimates both the mean and variance of the parameters during training, allowing for uncertainty in the predictions to be quantified. The regularization strength (α) was optimized to prevent overfitting while ensuring flexibility in capturing the relationships within the data.
Footnote |
† Electronic supplementary information (ESI) available: [DETAILS]. See DOI: https://doi.org/10.1039/d4ya00460d |
This journal is © The Royal Society of Chemistry 2024 |