Jamilu Usmana,
Sani I. Abba*bc,
Fahad Jibrin Abdud,
Lukka Thuyavan Yogarathinama,
Abdullah G. Usmane,
Dahiru Lawalaf,
Billel Salhia and
Isam H. Aljundiag
aInterdisciplinary Research Centre for Membranes and Water Security (IRC-MWS), King Fahd University of Petroleum and Minerals, Dhahran, 31261, Saudi Arabia. E-mail: saniisaabba86@gmail.com
bDepartment of Chemical Engineering, Prince Mohammad Bin Fahd University, Al Khobar, 31952, Saudi Arabia
cWater Research Centre, Prince Mohammad Bin Fahd University, Al Khobar, 31952, Saudi Arabia
dSADAIA-KFUPM Joint Research Center for Artificial Intelligence (JRCAI), King Fahd University of Petroleum & Minerals (KFUPM), Dhahran, Saudi Arabia
eNear East University, Operational Research Center in Healthcare, Nicosia, TRNC 10, Mersin, 99138, Turkey
fMechanical Engineering Department, King Fahd University of Petroleum & Minerals, Dhahran, 31261, Saudi Arabia
gChemical Engineering Department, King Fahd University of Petroleum and Minerals, Dhahran, 31261, Saudi Arabia
First published on 1st October 2024
Addressing global freshwater scarcity requires innovative technological solutions, among which desalination through thin-film composite polyamide membranes stands out. The performance of these membranes plays a vital role in desalination, necessitating advanced predictive modeling for optimization. This study harnesses machine learning (ML) algorithms, including support vector machine (SVM), neural networks (NN), linear regression (LR), and multivariate linear regression (MLR), alongside their ensemble techniques to predict and enhance average water flux (AWF) and average salt rejection (ASR) essential metrics of desalination efficiency. To ensure model interpretability and feature importance analysis, SHapley Additive exPlanations (SHAP) were employed, providing both global and local insights into feature contributions. Initially, the individual models were validated, with NN demonstrating superior performance for both AWF and ASR, achieving the lowest mean absolute error (MAE = 0.001) and root mean squared error (RMSE = 0.0111) for AWF and an MAE = 0.0107 and RMSE = 0.0982 for ASR. The accuracy of predictions improved significantly with ensemble models, as evidenced by the near-perfect Nash-Sutcliffe efficiency (NSE) values. Specifically, the NN ensemble (NN-E) and Linear Regression ensemble (LR-E) reached an MAE and RMSE of 0.001 and 0.0111, respectively, for AWF. For ASR, NN-E reduced the MAE to 0.0013 and the RMSE to 0.0089, while LR-E maintained competitive performance with an MAE of 0.0133 and an RMSE of 0.0936. SHAP analysis revealed that features such as MDP and TMC were critical drivers of performance, with MDP showing the most significant positive impact on ASR. These findings demonstrate the dominance of ensemble methods over individual algorithms in predicting key desalination parameters. The enhanced precision in estimating AWF and ASR offered by these neuro-intelligent ensembles, combined with the interpretability provided by SHAP analysis, can lead to significant environmental and operational improvements in membrane performance, optimizing resource usage and minimizing ecological impacts. This study paves the way for integrating intelligent ML ensembles and SHAP-based interpretability into the practical field of membrane technology, marking a step forward toward sustainable and efficient desalination processes.
Interfacial polymerization is the key process in the fabrication of thin-film composite (TFC) polyamide membranes, particularly for RO applications. This process occurs at the interface between two immiscible phases: an aqueous phase containing m-phenylenediamine (MPD) and an organic phase containing trimesoyl chloride (TMC). When the two phases come into contact, MPD diffuses into the organic phase, where it reacts with TMC to form a polyamide layer. The reaction between MPD and TMC is rapid and exothermic, leading to the formation of a highly cross-linked polyamide network.9 This network constitutes the active layer of the TFC membrane, responsible for its desalination performance. This polyamide layer is thin yet robust, providing a high degree of salt rejection while allowing water molecules to pass through. The structure and properties of the resulting membrane are influenced by several factors, including the concentration of the monomers (MPD and TMC), the reaction time, and the curing temperature. These parameters dictate the thickness, roughness, and cross-link density of the polyamide layer, all of which are critical to the membrane's desalination efficiency. Understanding these parameters and their interplay is essential for optimizing the membrane fabrication process and achieving superior desalination performance. This process is influenced by several factors, including monomer concentration, reaction duration, and curing temperature, which collectively impact the membrane's structure and performance.9 The intricate interplay of monomer concentration, reaction time, and curing temperature during interfacial polymerization significantly affects monomer diffusion and reaction rate.9 Such complexity poses challenges to fully comprehending and refining the functionalities of the polyamide layer, especially concerning its capacity to improve water permeability and salt rejection.10 This complexity results in inconsistent outcomes regarding the polyamide layer efficacy, complicating efforts to boost and stabilize its performance predictably. Optimizing the reaction conditions is necessary to achieve the desired characteristics of thin, continuous polyamide chain arrangements and hydrophilic tendency.11 Wet chemistry methods necessitate extensive experimentation, consuming substantial time and resources to optimize the trade-off between water permeability and salt rejection. The emergence of machine learning (ML) algorithms has notably catalyzed advancements in membrane technology, especially in the realm of desalination, by promising significant enhancements in performance across a spectrum of applications.2,12,13
ML algorithms, as a subset of artificial intelligence (AI), harness data-driven insights to unearth patterns that significantly enhance efficiency, predict membrane performance, and refine desalination processes.14–17 This approach allows for the sophisticated analysis of vast datasets, leading to the optimization of both existing and novel desalination methodologies. In the domain of desalination research and application, artificial neural networks (ANN), support vector machines (SVM), decision trees, random forests, gradient boosting machines (GBM), deep learning, and genetic algorithms (GA) are instrumental.18 These ML methodologies are central to functions such as predictive modeling, directing optimization algorithms, forecasting membrane fouling occurrences, facilitating the discovery and engineering of novel materials, and fine-tuning operational parameters.19–21 Li et al. examined the role of nanomaterial optimization in the customization of thin-film nanocomposites through the application of a dual-output neural network (D-ANN).22 Zhang et al., proposed the evaluation of a deep learning neural network (DNN) ML model to assess the performance of nanofiltration (NF) membranes using a sparse dataset.23 Recently, Tayyebi et al., investigated utilizing Shapley Additive explanations (SHAP) within the framework of explainable artificial intelligence (XAI) to analyze the impact of amine monomer selection on the customization of polyamide, aiming to improve desalination application.24 A total of 583 diamines were modeled and the optimized diamine based polyamide TFC membrane surpassed the tradeoff between water permeability and selectivity. Usman et al., studied the Matérn Gaussian Process Regression (MGPR) model to evaluate the effect of chlorine stability on membrane flux and salt rejection.12 The MGPR model accurately predicted with minimal error values for the impact of acyl chloride monomer-modified polyamide membranes on both salt flux and separation efficiency. Mohammed et al., compared the ensemble and non-ensemble ML algorithm to evaluate their effectiveness in predicting the separation efficiency of RO membranes.20 The ensemble XGBoost model proved to be effective, exhibiting superior feature analysis capabilities through the use of SHAP. The influence of sparse datasets related to desalination performance through ML tools remains underexplored. Moreover, studies on the application of ML models to optimize interfacial polymerization parameters for predicting water flux and salt rejection are limited.
While significant advancements have been made in the development and optimization of TFC polyamide membranes for desalination, the integration of ML techniques in predicting and enhancing membrane performance remains underexplored. Most existing studies focus on empirical methods and traditional optimization techniques, leaving a gap in the application of advanced data-driven approaches, particularly in the context of ensemble models. The present research addresses this gap by leveraging ML algorithms to accurately predict and optimize key performance metrics such as average water flux (AWF) (LMH) and average salt rejection (ASR) (%). The study not only demonstrates the superiority of ensemble methods over individual algorithms but also highlights their potential to improve the efficiency and sustainability of desalination processes significantly. The current study aimed to employ ML algorithms, including SVM, neural networks (NN), linear regression (LR), and multivariate linear regression (MLR), to optimize the input variables to enhance both the flux AWF and the efficiency of ASR. Furthermore, the research introduced an efficient data combination strategy for precise prediction within the confines of a small dataset. Subsequently, ensemble techniques were proposed to improve the prediction skill of AWF and ASR using several models. NN tools have demonstrated efficacy in discerning the underlying patterns of polyamide membranes, contributing to enhanced performance in desalination processes. The novel approach of this study lies in its strategic application of multiple ML algorithms to optimize key variables in the interfacial polymerization process of thin-film composite polyamide membranes. By enhancing AWF and ASR, the study contributes to the efficiency of desalination technologies.
A significant innovation introduced in this research is developing an efficient data combination strategy specifically tailored for small datasets. This approach enhances the predictive accuracy and robustness of the ML models, making it a critical advancement for studies where data scarcity often impedes model reliability and performance. Further contributing to the field, the study pioneers the use of ensemble techniques that leverage the strengths of individual ML models to achieve superior predictive skills for AWF and ASR. The effective use of these ensembles, particularly the NN-E, showcases a breakthrough in understanding and optimizing the performance characteristics of polyamide membranes. These ensemble models improved the prediction accuracy and provided insights into the complex dynamics of the desalination process, ultimately leading to membranes with better flux and salt rejection capabilities. Moreover, these advancements push the boundaries of membrane technology and present a scalable approach for enhancing desalination processes, thus offering substantial environmental and operational benefits. This study sets a new benchmark for the application of advanced computational techniques in the field of chemical engineering and membrane science.
The theory behind ensemble learning, particularly with neural networks, advances combining the outputs of multiple models to enhance predictive performance beyond what is achievable by any single model (Fig. 2c). This improvement stems from leveraging the diversity among models achieved through variations in training data subsets, initialization parameters, or architecture to reduce bias and variance in predictions.30 By aggregating individual predictions using methods like averaging, weighted averaging, or voting, ensembles can capture a broader representation of the data distribution, mitigating overfitting and enhancing generalization to unseen data.31–34 Although this approach offers significant benefits in accuracy and robustness, it also requires careful management of the trade-off between computational costs and performance gains, as training and deploying multiple models inherently demand more resources. Nonetheless, the strategic use of neural network ensembles remains a powerful technique for boosting the reliability and efficacy of predictive modeling across diverse applications.33
For instance, R-squared (R2) quantifies the variance explained by the model, serving as a gauge for goodness of fit, whereas the Pearson correlation coefficient (PCC) assesses the linear relationship between observed and predicted values. Mean squared error (MSE) and root mean squared error (RMSE) measure the average of the squared errors and the square root of these averages, respectively, reflecting the magnitude of prediction errors; both are sensitive to outliers, with RMSE being more commonly used due to its units being the same as those of the dependent variable. Mean Absolute Error (MAE) and Mean Absolute Percentage Error (MAPE) provide an understanding of average prediction error in absolute terms and as a percentage, making MAE straightforward to interpret and MAPE particularly useful for relative comparison across different scales. Percent Bias (PBIAS) evaluates the tendency of the predictions to be higher or lower than their actual values, indicating a model bias towards over or underestimation. Collectively, these metrics furnish a comprehensive toolkit for model evaluation, enabling the identification of models that best capture the underlying data patterns while balancing the trade-offs between simplicity, interpretability, and predictive accuracy.
Model | Hyperparameters | Values set | Optimization technique | Rationale for choice |
---|---|---|---|---|
SVM | Kernel type | RBF | Grid search | Chosen for its balance between computational efficiency and accuracy in high-dimensional spaces |
C | 1.0 | Grid search | Adjusted for regularization to avoid overfitting | |
Gamma | 0.1 | Grid search | Optimized for influence of single training examples | |
NN | Number of layers | 3 | Random search | Sufficient depth to capture non-linear patterns in data without overfitting |
Neurons per layer | [64, 32, 16] | Random search | Layer sizes decrease to allow for complex feature extraction followed by fine-tuning | |
Learning rate | 0.01 | Random search | Set to balance between convergence speed and avoidance of local minima | |
Batch size | 32 | Random search | Chosen for computational efficiency and stability during training | |
LR | Regularization | L2 | Manual tuning | L2 regularization to penalize large coefficients, preventing overfitting |
Learning rate | 0.001 | Manual tuning | Small value chosen to ensure gradual learning and stable convergence | |
MLR | Coefficient estimation | Ordinary least squares | — | Default method for minimizing the residual sum of squares |
The results of implementing data-driven algorithms and ensemble techniques demonstrated significant improvements in predicting and optimizing the performance metrics of TFC polyamide membranes used in desalination processes. The dependency matrix from the study highlights that TMC concentration positively influences both target variables, antifouling AWF and antifouling ASR, with correlation values of 0.1642 and 0.1665, respectively. It suggests that higher concentrations of TMC generally improve both the throughput and efficiency of the filtration process. On the other hand, MPD concentration has a mixed impact; it shows a weak positive correlation with AWF (0.1327) (Fig. 3a), indicating a slight increase in water flux, but a moderate negative correlation with ASR (−0.3383), suggesting that higher MPD concentrations might degrade the separation efficiency. Furthermore, RT and CT exhibit contrasting effects on the two target variables. Both variables negatively affect AWF, with correlations of −0.2106 for RT and −0.2522 for CT, implying that longer RT and higher CT reduce water flux through the membrane. However, they show slight positive correlations with ASR (0.1575 for RT and 0.1530 for CT), indicating that these conditions may slightly improve the separation ratio. This differential impact highlights the complexity of membrane operation conditions, where adjustments to process parameters can enhance one aspect of performance at the expense of another (see Fig. 3b).
It is well known that understanding statistical parameters in modeling, such as mean, variance, skewness, and kurtosis, is crucial because it offers insights into data quality, reveals distribution characteristics, and helps in selecting appropriate modelling techniques.39,40 It also guides preprocessing steps like normalization to improve model accuracy and predictability. Furthermore, knowing data distribution aids in choosing robust statistical tests and models, which is especially important when data is skewed or has outliers. Ultimately, this foundational understanding supports informed decision-making in experimental design and process optimization, leading to more effective and accurate outcomes.41 For this purpose, Table 2 presents descriptive statistics for antifouling AWF and ASR, alongside input variables MPD concentration, TMC concentration, RT, and CT. The mean values for all variables are consistent across both target metrics, with MPD and TMC concentrations showing controlled low variability. At the same time, RT and CT exhibit higher variability and extreme values, as indicated by their high kurtosis (11.000) and positive skewness. AWF and ASR differ significantly in their distribution characteristics; AWF has a higher standard deviation (30.360) and sample variance (921.734) compared to ASR (SD: 7.486, Variance: 56.047), along with a positive skewness (1.701) compared to ASR negative skewness (−3.020), suggesting more spread and a tail towards higher values in AWF. The minimum and maximum ranges (AWF: 15.300 LMH to 110.500 LMH, ASR: 72.100% to 97.600%) further underscore the extent of variability, particularly in AWF, which could be sensitive to experimental conditions or specific operational settings that favor certain ranges, indicating the need for careful analysis and interpretation of these measurements in related studies.
AWF | MPD | TMC | RT | CT | AWF |
---|---|---|---|---|---|
Mean | 1.909 | 0.232 | 16.364 | 57.727 | 38.682 |
SD | 0.831 | 0.087 | 4.523 | 9.045 | 30.360 |
Sample variance | 0.691 | 0.008 | 20.455 | 81.818 | 921.734 |
Kurtosis | −1.485 | −1.621 | 11.000 | 11.000 | 2.363 |
Skewness | 0.190 | 0.409 | 3.317 | 3.317 | 1.701 |
Minimum | 1.000 | 0.150 | 15.000 | 55.000 | 15.300 |
Maximum | 3.000 | 0.350 | 30.000 | 85.000 | 110.500 |
ASR | MPD | TMC | RT | CT | ASR |
---|---|---|---|---|---|
Mean | 1.909 | 0.232 | 16.364 | 57.727 | 94.045 |
SD | 0.831 | 0.087 | 4.523 | 9.045 | 7.486 |
Sample variance | 0.691 | 0.008 | 20.455 | 81.818 | 56.047 |
Kurtosis | −1.485 | −1.621 | 11.000 | 11.000 | 9.413 |
Skewness | 0.190 | 0.409 | 3.317 | 3.317 | −3.020 |
Minimum | 1.000 | 0.150 | 15.000 | 55.000 | 72.100 |
Maximum | 3.000 | 0.350 | 30.000 | 85.000 | 97.600 |
R2 | PCC | MSE | MAPE | MAE | PBIAS | RMSE | |
---|---|---|---|---|---|---|---|
AWF (LMH) | |||||||
SVM | 0.8219 | 0.2697 | 32.4305 | 1.0137 | 0.5730 | 0.5644 | 5.6948 |
NN | 1.0000 | 1.0000 | 0.0001 | 0.0039 | 0.0010 | 0.0000 | 0.0111 |
LR | 0.7278 | 0.0596 | 42.3386 | 0.7865 | 0.4870 | 0.3872 | 6.5068 |
MLR | 0.1218 | 0.3489 | 24.0926 | 1.9257 | 0.6379 | 0.0000 | 4.9084 |
ASR (%) | |||||||
SVM | 0.8613 | 0.6734 | 1.6036 | 0.1099 | 0.0823 | −0.0245 | 1.2664 |
NN | 0.9942 | 0.9971 | 0.0096 | 0.0113 | 0.0107 | 0.0000 | 0.0982 |
LR | 0.5449 | 0.7381 | 0.7592 | 0.1413 | 0.1269 | 0.0000 | 0.8713 |
MLR | 0.2410 | 0.4909 | 1.2661 | 0.1655 | 0.1434 | 0.0000 | 1.1252 |
For the predictive approach, NN and MLR achieved a PBIAS of 0 for both AWF and ASR, indicating an unbiased prediction for these crucial desalination parameters. In contrast, SVM exhibited a PBIAS of 0.5644 for AWF, suggesting a slight underestimation, and −0.0245 for ASR, indicating a minor overestimation. LR, with a PBIAS of 0.3872 for AWF, also showed a tendency to underestimate but did not provide a PBIAS value for ASR. While NN demonstrated superior accuracy and lack of bias, ensuring that these findings are not a result of overfitting is vital for their application in desalination process optimization. Although MLR displayed no bias, its higher predictive errors imply it might be less dependable. Any bias or error in the model predictions can significantly impact the environmental outcomes of the desalination process by causing either an overuse or underuse of resources, which can lead to increased energy consumption and an escalated environmental footprint. Therefore, it is imperative for the selected model to not only exhibit minimal predictive error but also to faithfully represent actual operations to ensure that the desalination process is environmentally sustainable.
However, the predictive modelling using AWF and ASF was evaluated using a violin diagram. It is important to note that violin plots are essential for visually analyzing the distribution and relationships of variables such as MPD, TMC, RT, CT, and operating conditions, enabling the identification of influential factors and validation of predictive models in predicting desalination flux and rejection outcomes (see, Fig. 5). In assessing the performance of ML models for predicting AWF and ASR in desalination, the R2 reveals the proportion of variance each model captures from the dependent variable. For AWF, the NN model achieved a perfect R2 score of 1, indicating a model that accounts for all variance in the dataset, followed by the SVM with an R2 of 0.8219, signifying a robust model fit. The LR recorded an R2 of 0.7278, showing a good but lesser fit compared to SVM, while MLR lagged with an R2 of 0.1218, suggesting a poor model fit. When predicting ASR, the NN model remained superior with an R2 = 0.9942, closely approaching a perfect fit. The SVM model also performed well with an R2 = 0.8613, whereas the LR model's fit was moderate with an R2 = 0.5449, and the MLR model again showed a weak fit with an R2 = 0.241. These R2 values are crucial for evaluating the model's predictive quality in desalination, as a high R2 corresponds to more accurate predictions of membrane performance, which is fundamental to the efficiency and sustainability of water treatment processes. The numerical outcomes of AWF using the NN model showed a 21.67% increase over the SVM, a 37.40% increase over LR, and a substantial 721.02% increase over MLR. Similarly, for ASR, the increases were 15.43% over SVM, a significant 82.46% over LR, and a remarkable 312.53% over MLR. These increases underscore the NN model enhanced predictive accuracy for both key performance indicators in the domain of desalination.
Fig. 5 Spread of predictive results based between the observed and simulated AWF (a) and ASR (b) using violin diagram. |
Further understanding of violin plots comparing the predictive performance of various ML models against observed data for AWF and ASR. Violin plots are useful for displaying the distribution of data and its probability density. The left side of the figure shows the results for AWF, while the right side is for ASR. The plots for each model display the range of predicted values, with thicker sections representing a higher density of data points. The black horizontal line inside each violin represents the interquartile range (25–75%), with the black dot indicating the median of the predictions. The range within 1.5 times the interquartile range (IQR) is indicated by the black lines extending from the interquartile range, showing the spread of the majority of the data. For AWF (left plots), the observed data has a narrow interquartile range and a higher median compared to the predictions by the SVM and LR models, which show a wider distribution of responses, indicating variability in their predictions. The NN model has a very tight distribution, closely matching the observed data, suggesting high accuracy and precision. The MLR shows a broad distribution, suggesting lower precision. Similarly, for ASR (right plots), the observed data shows a slightly broader distribution compared to AWF but still maintains a higher median than most predictive models. The SVM and LR models show wide distributions, indicating variability and less precision in predictions. The NN has a narrow distribution for ASR as well, closely aligning with the observed data, which implies accuracy and consistency in its predictions. The MLR again shows a wide distribution. The lower plots are labelled with an E suffix (SVM-E, NN-E, LR-E), which could indicate an ensemble approach. These ensemble models generally show narrower distributions compared to their non-ensemble counterparts, especially for SVM-E and NN-E, suggesting that ensemble improves prediction accuracy and consistency. In general, for both AWF and ASR, the NN model, and potentially its ensemble version, provides the closest match to the observed data, indicating it may be the most reliable for predicting desalination membrane performance. The use of ensembles appears to refine the predictions, potentially leading to more precise and accurate models, which is crucial for designing efficient and environmentally friendly desalination processes. The predictive skills can also be proved using 2-dimensional (2D) Taylor diagram as indicated in Fig. 6.
In Table 4, the ensemble models show high NSE values for the prediction of AWF and ASR, indicative of excellent model performance. Specifically, for AWF, the SVM-E exhibits an NSE of 0.994, suggesting a very close match to the observed data, while both the NN-E and the LR-E achieve a perfect NSE of 1, reflecting predictions that perfectly match the observed measurements. For ASR predictions, SVM-E NSE of 0.8741, although lower than for AWF, still indicates a good predictive match, whereas NN-E again achieves a perfect NSE of 1, and LR-E is nearly perfect at 0.9947. These high NSE values signal the robustness of the ensemble models in capturing the true variance of the observed data, minimizing prediction noise. The exceptional performance of these ensemble models in desalination modeling has a direct environmental impact, as accurate predictions of AWF and ASR are critical for optimizing the desalination process, leading to significant energy and resource savings, more efficient water usage, reduced waste, and a lower environmental footprint for water treatment facilities. This in turn, minimizes the ecological footprint of desalination plants by ensuring they operate at peak performance, reducing waste and the potential for excessive chemical and energy use, which are pivotal considerations in the sustainable management of water resources. In Table 3, the MAPE values for ensemble ML models provide insights into the accuracy of predictions for desalination process parameters. For AWF, the SVM-E shows a MAPE of 0.2227 (22.27%), while the NN-E and the LR-E both demonstrate extraordinarily low MAPE values of 0.004 (0.4%) and 0.0039 (0.39%), respectively. In the case of Average Salt Rejection (ASR), SVM-E has a MAPE of 0.0378 (3.78%), NN-E achieves a MAPE of 0.0013 (0.13%), and LR-E presents a MAPE of 0.014 (1.4%) (see, Fig. 7). These low MAPE values, particularly for NN-E and LR-E, indicate a high level of precision in predictive modelling, which is environmentally beneficial for desalination operations.
R2 | NSE | PCC | MSE | MAPE | MAE | PBIAS | RMSE | |
---|---|---|---|---|---|---|---|---|
AWF (LMH) | ||||||||
SVM-E | 0.9948 | 0.9940 | 1.0000 | 0.1423 | 0.2227 | 0.0597 | −0.0171 | 0.3772 |
NN-E | 1.0000 | 1.0000 | 1.0000 | 0.0001 | 0.0040 | 0.0010 | 0.0000 | 0.0111 |
LR-E | 1.0000 | 1.0000 | 1.0000 | 0.0001 | 0.0039 | 0.0010 | 0.0000 | 0.0111 |
ASR (%) | ||||||||
R2 | NSE | PCC | MSE | MAPE | MAE | PBIAS | RMSE | |
SVM-E | 0.9265 | 0.8741 | 0.9898 | 0.1226 | 0.0378 | 0.0303 | −0.0036 | 0.3501 |
NN-E | 1.0000 | 1.0000 | 1.0000 | 0.0001 | 0.0013 | 0.0013 | 0.0000 | 0.0089 |
LR-E | 0.9947 | 0.9947 | 0.9974 | 0.0088 | 0.0140 | 0.0133 | 0.0000 | 0.0936 |
In Table 4, for AWF predictions, the SVM-E reports an RMSE of 0.3772 and an MAE of 0.0597, while both the NN-E and the LR-E exhibit remarkably low values with an RMSE of 0.0111 and an MAE of 0.001. As for ASR predictions, SVM-E demonstrates an RMSE of 0.3501 and an MAE of 0.0303, which, though moderate, are higher than the values for NN-E and LR-E. NN-E shows an exceptionally low RMSE of 0.0089 and an MAE of 0.0013, suggesting an extremely high accuracy in predictions. LR-E also performs well with an RMSE of 0.0936 and an MAE of 0.0133, indicating a high degree of precision, albeit slightly less than that of NN-E. These RMSE and MAE values emphasize the robustness of NN-E and LR-E in modelling, with NN-E being particularly notable for its precision, which is crucial for optimizing desalination processes, leading to environmental benefits such as reduced energy consumption, and minimized waste. The predictive comparison between Tables 3 and 4 reveals that ensemble models substantially enhance predictive accuracy for desalination processes. In Table 4, individual models, with NN performing the best, achieve an MAE of 0.001 and an RMSE of 0.0111 for AWF, and an MAE of 0.0107 and RMSE of 0.0982 for ASR. However, Table 4 ensemble models outshine these figures, with the NN-E and -E for AWF both yielding an MAE of 0.001 and an RMSE of 0.0111, while the SVM-E records a slightly higher MAE of 0.0597 and RMSE of 0.3772. For ASR, the NN-E impressively lowers the MAE to 0.0013 and the RMSE to 0.0089, and the LR-E follows closely with an MAE of 0.0133 and an RMSE of 0.0936, with the SVM-E improving to an MAE of 0.0303 and an RMSE of 0.3501. These reductions in error metrics underscore the effectiveness of ensemble methods in increasing the precision and reliability of predictive modeling for desalination, leading to environmentally and economically optimized operations through better resource management and reduced waste. Generally, the ensemble models, particularly those integrating NN and LR, exhibited enhanced predictive accuracy for AWF and ASR, outperforming individual algorithm-based models. This superiority is evidenced by lower MAE and RMSE values, alongside near-perfect NSE scores, highlighting the effectiveness of the ensemble approach in capturing complex nonlinear relationships within the data.
Fig. 8 SHAP results for global interpretability using Global bar plot and swarm plot for (a) ASR and (b) AWF. |
For example, if CT reduces the predicted ASR, it might indicate that higher CT values are associated with lower salt rejection efficiency under these conditions. The force plot thus provides a detailed, instance-specific breakdown of how MDP, TMC, and CT contribute to the final prediction, allowing us to see the dynamics between these features in a clear and interpretable way. If MDP shows the strongest positive contribution while TMC and CT have smaller or negative contributions, optimizing MDP could be more critical for enhancing ASR in this specific case. The force SHAP plot offers a granular view of how MDP, TMC, and CT interact to affect the model's prediction of ASR in individual cases, providing insights that can inform targeted improvements in membrane design or operational conditions (Fig. 9). This detailed interpretability is crucial for understanding the complex relationships within the model and ensuring that the predictions are both reliable and actionable.
AI | Artificial intelligence |
ANN | Artificial neural networks |
ASR | Average salt rejection |
AWF | Average water flux |
CT | Curing temperature |
D-ANN | Dual-output neural network |
DNN | Deep learning neural network |
GA | Genetic algorithms |
GBM | Gradient boosting machines |
LMH | Liters per meter square per hour |
LR | Linear regression |
LR-E | Linear regression ensemble |
MAE | Mean absolute error |
MAPE | Mean absolute percentage error |
MGPR | Matern Gaussian process regression |
ML | Machine learning |
MLR | Multivariate linear regression |
MPD | m-Phenylenediamine |
MSE | Mean squared error |
NF | Nanofiltration |
NN | Neural networks |
NN-E | Neural networks ensemble |
NSE | Nash-Sutcliffe efficiency |
PBIAS | Percent bias |
PCC | Pearson correlation coefficient |
R2 | R-squared |
RMSE | Root mean squared error |
RO | Reverse osmosis |
RT | Reaction time |
SD | Standard deviation |
SDG | Sustainable development goal |
SHAP | Shapley additive explanations |
SVM | Support vector machine |
SVM-E | Support vector machine ensemble |
TFC | Thin-film composite |
TMC | Trimesoyl chloride |
XAI | Explainable artificial intelligence |
This journal is © The Royal Society of Chemistry 2024 |