H. Guo*a,
J. Y. Zhaoab and
J. H. Yinc
aCollege of Computer Science and Engineering, Dalian Nationalities University, 18 Liaohe West Road, Dalian Development Zone, Dalian 116600, China. E-mail: guohai@dlnu.edu.cn
bFaculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian 116023, China
cSchool of Applied Science, Harbin University of Science and Technology, Harbin 150080, China
First published on 15th June 2017
As a new insulating material, nanoscale thin dielectric films have been widely used in variable frequency motors, electron devices and other fields. Dielectric loss is a key performance parameter of this material. Currently, studies on the dielectric properties of polymer matrix nanocomposite films are based mostly on experiments that are costly and time-consuming. In this article, an integrated method that combines experiment and ensemble learning is applied. In situ polymerization is employed to prepare 32 polyimide matrix nanocomposite films that have different weight ratios, sizes and thicknesses and that are doped with different inorganic nanoscale particles. The dielectric losses of these 32 prepared films are measured as well. Ten multilayer perceptrons are integrated into a random forest and multilayer perceptron (RF-MLP) model using the random forest (RF) method. As shown in the experimental results, under the 10-fold cross validation, the correlation coefficient, the mean absolute error, the root mean squared error and the root relative squared error of the RF-MLP model are 0.9447, 0.0007, 0.0013 and 32.0972%, respectively. The deviation between the predicted value and the measured value is small. The RF-MLP model has a better prediction performance than other single models, such as linear regression, backpropagation neural network, radial basis function neural network, support vector regression and multilayer perceptron as well as other ensemble learning methods, such as bagging, boosting and RF-decision stump. Therefore, the RF-MLP model is a fast and reliable method applicable to predicting the properties of the new nano-dielectric material and other materials.
ANNs (artificial neural networks) has been extensively applied for predicting film properties. Bahramian Alireza et al. simulated and predicted the growth rate of TiO2 nanostructured thin films by using a feed-forward back-propagation network.13 Ko Young-Don et al. made use of PCA (principle component analysis)-based neural networks to predict the physical and material properties of an HfO2 thin film.14 Nobrega Marcelo Medre et al. predicted the mechanical and barrier properties of biodegradable films by using the SOMs (self-organizing maps) neural network. The deviation between the predicted value and the measured value was not higher than 24%.15 M. Payandehdoost et al. optimized the group method of data handling (GMDH)-type neural networks by means of genetic algorithms to predict present separate polynomial relations for the area-weighted average film cooling heat transfer coefficient.16 Also using a neural network, Piliougine Michel et al. characterized the electrical parameters of several thin-film photovoltaic module technologies.17 Yang Yu-Sen applied a generalized regression neural network (GRNN) for predicting the friction coefficient of deposited Cr1−xAlxC films on high-speed steel substrates via direct current magnetron sputtering systems.18 By taking advantage of the neural network, these researchers did well in predicting film properties. However, there are still problems: (1) only conventional films are used for the predictions, and no studies on predicting properties of new nanocomposite films have been reported; (2) only single films are used for regression prediction, and very few studies have been conducted on predicting the properties of films doped with different particles, and; (3) the accuracy of the single neural network prediction model needs to be further improved.
Polymeric nanocomposites are a new type of dielectric. However, there is no report on using intelligent computing together with neural networks technology to predict its dielectric loss. On an experimental basis, this paper makes use of a RF-optimized MLP ensemble learning model to predict the dielectric loss of the polyimide nanocomposite films. In the following, materials and experimental procedures are introduced first. The construction of the sample library and the prediction model are described next. Then, experimental results are described and the establishment of the RF-MLP model with 10-fold cross validation results is carried out. The 10-fold cross validation are also progressed for discussing the accuracy of the RF-MLP model and the single prediction models, including BP (back-propagation) neural network, RBF (radical basis function) neural network, SMO-SVR (sequential minimal optimization-support vector machine), MLP (multilayer perceptron) neural network and linear regression. Comparisons to other ensemble learning methods are also made. Finally, the paper concludes with a summary of this study.
In this study, 32 polyimide matrix inorganic nanocomposite films with different weight ratios, sizes and thicknesses and doped with different inorganic nanoparticles (BaTiO3, Rutile TO2, SiO2, α-Al2O3 and Al2O3) were prepared. The film samples are listed in Table 1.
Nanoparticles | Particle size (nm) | Doped proportion (wt%) | Film thickness (μm) |
---|---|---|---|
BaTiO3 | 100 | 10, 15, 20, 25, 30, 50, 60, 70 | 25 |
Rutile TiO2 | 35 | 1, 3, 4, 5, 7 | 25 |
SiO2 | 40 | 5, 10, 15, 20, 25 | 25 |
SiO2 | 7 | 10, 15, 20, 25 | 30 |
α-Al2O3 | 30 | 4, 8, 12, 16, 20, 24 | 30 |
Al2O3 | 13 | 15, 20, 25 | 30 |
PI | 0 | 0 | 25 |
A transmission electron microscope (Trcnai G2 F30) was used to observe the micro-structures of the prepared composite film samples at a resolution ratio of 0.205 nm and accelerating voltages of 50–300 kV.
FEI Quanta 200, at an accelerating voltage of 20 keV, was used to observe the surface and the fracture topographies of the composite film samples. The samples were directly observed under an electron microscope without surface treatment.
Fig. 2 Microstructures of nanocomposite films (a) TEM image of the PI/Al2O3 15 wt%; (b) SEM image of the PI/BaTiO3 20 wt%; (c) SEM image of the PI/TiO2 20 wt%; (d) SEM image of the PI/SiO2 15 wt%. |
Sample | Type of nanoparticles | Input | Output | ||||||
---|---|---|---|---|---|---|---|---|---|
X1 | X2 | X3 | X4 | X5 | X6 | X7 | Y | ||
a X1: dielectric constant of the nanoparticle; X2: electrical resistivity of the nanoparticle (Ω m); X3: film thickness (μm); X4: thermal conductivity of the nanoparticle (W cm−1 K−1); X5: doping ratio of the nanoparticle (wt%); X6: size of the nanoparticle (nm); X7: specific surface area of the nanoparticle (m2 g−1), and; Y: dielectric loss of the nanocomposite film. | |||||||||
1 | BaTiO3 | 1400 | 1 × 105 | 25 | 0.50 | 10 | 100 | 12 | 0.00140 |
2 | BaTiO3 | 1400 | 1 × 105 | 25 | 0.50 | 15 | 100 | 12 | 0.00200 |
3 | BaTiO3 | 1400 | 1 × 105 | 25 | 0.50 | 20 | 100 | 12 | 0.00246 |
4 | BaTiO3 | 1400 | 1 × 105 | 25 | 0.50 | 25 | 100 | 12 | 0.00276 |
5 | BaTiO3 | 1400 | 1 × 105 | 25 | 0.50 | 30 | 100 | 12 | 0.00296 |
6 | BaTiO3 | 1400 | 1 × 105 | 25 | 0.50 | 50 | 100 | 12 | 0.00652 |
7 | BaTiO3 | 1400 | 1 × 105 | 25 | 0.50 | 60 | 100 | 12 | 0.01647 |
8 | BaTiO3 | 1400 | 1 × 105 | 25 | 0.50 | 70 | 100 | 12 | 0.01839 |
9 | Rutile TiO2 | 100 | 9 × 107 | 25 | 0.40 | 0 | 35 | 70 | 0.00177 |
10 | Rutile TiO2 | 100 | 9 × 107 | 25 | 0.63 | 1 | 35 | 70 | 0.00165 |
11 | Rutile TiO2 | 100 | 9 × 107 | 25 | 0.63 | 3 | 35 | 70 | 0.00199 |
12 | Rutile TiO2 | 100 | 9 × 107 | 25 | 0.63 | 4 | 35 | 70 | 0.00156 |
13 | Rutile TiO2 | 100 | 9 × 107 | 25 | 0.63 | 5 | 35 | 70 | 0.00278 |
14 | Rutile TiO2 | 100 | 9 × 107 | 25 | 0.63 | 7 | 35 | 70 | 0.00283 |
15 | SiO2 | 1.56 | 1 × 1016 | 25 | 160 | 5 | 40 | 300 | 0.00150 |
16 | SiO2 | 1.56 | 1 × 1016 | 25 | 160 | 10 | 40 | 300 | 0.00200 |
17 | SiO2 | 1.56 | 1 × 1016 | 25 | 160 | 15 | 40 | 300 | 0.00180 |
18 | SiO2 | 1.56 | 1 × 1016 | 25 | 160 | 20 | 40 | 300 | 0.00150 |
19 | SiO2 | 1.56 | 1 × 1016 | 25 | 160 | 25 | 40 | 300 | 0.00160 |
20 | α-Al2O3 | 10 | 1 × 1014 | 30 | 4.10 | 4 | 30 | 25 | 0.00300 |
21 | α-Al2O3 | 10 | 1 × 1014 | 30 | 4.10 | 8 | 30 | 25 | 0.00450 |
22 | α-Al2O3 | 10 | 1 × 1014 | 30 | 4.10 | 12 | 30 | 25 | 0.00500 |
23 | α-Al2O3 | 10 | 1 × 1014 | 30 | 4.10 | 16 | 30 | 25 | 0.00700 |
24 | α-Al2O3 | 10 | 1 × 1014 | 30 | 4.10 | 20 | 30 | 25 | 0.00800 |
25 | α-Al2O3 | 10 | 1 × 1014 | 30 | 4.10 | 24 | 30 | 25 | 0.00850 |
26 | Al2O3 | 8 | 1 × 1014 | 30 | 29.31 | 15 | 13 | 100 | 0.00266 |
27 | Al2O3 | 8 | 1 × 1014 | 30 | 29.31 | 20 | 13 | 100 | 0.00285 |
28 | Al2O3 | 8 | 1 × 1014 | 30 | 29.31 | 25 | 13 | 100 | 0.00305 |
29 | SiO2 | 1.56 | 1 × 1016 | 30 | 160 | 10 | 7 | 350 | 0.00335 |
30 | SiO2 | 1.56 | 1 × 1016 | 30 | 160 | 15 | 7 | 350 | 0.00372 |
31 | SiO2 | 1.56 | 1 × 1016 | 30 | 160 | 20 | 7 | 350 | 0.00413 |
32 | SiO2 | 1.56 | 1 × 1016 | 30 | 160 | 25 | 7 | 350 | 0.00460 |
Artificial neural networks have been widely used for predicting and identifying material properties in recent years.24–26 A multilayer perceptron (MLP) is a feedforward artificial neural network model that maps sets of input data onto a set of appropriate outputs.27 An MLP consists of three parts, which are the input layer, hidden layer and output layer. Fig. 3 is a standard three-layer feedforward MLP where i = 1, 2, …, n, j = 1, 2, …, h, r = 1, 2, …, p, and s = 1, 2, …, m. For each layer, there is a synaptic weight matrix that connects the former layer to the next layer. The definitions of a nonlinear input–output map and a nonlinear diagonal matrix are given as follows:
f(l)(*) = diag[f(1)[*], f(2)[*], …, f(l)[*]] | (1) |
xout1 = f(1)[ν(1)] = f(1)[W(1)x] | (2) |
If eqn (2) is taken as the input of the second layer, then the output of the second layer is as follows:
xout2 = f(2)[ν(2)] = f(2)[W(2)xout1] | (3) |
If eqn (3) is taken as the input of the third layer, then the output of the third layer is as follows:
y = xout3 = f(3)[ν(3)] = f(3)[W(3)xout2] | (4) |
Substituting eqn (2) into (3) and (4), we obtain the final network:
y = f(3)[W(3)f(2)[W(2)f(1)[W(1)x]]] = Ω[x] | (5) |
The RF (random forests) algorithm was proposed by Leo Breiman in 2001.28 The algorithm contains a multitude of decision trees, and random forests are generated from a number of combined optimization decision trees. RF is a fast and efficient machine learning method for classification and regression. It has many advantages. It can effectively prevent over-fittings, quickly process high dimensional data, and maintain high accuracy when the dataset is not complete. By virtue of the idea of random forests, we use an MLP (multilayer perceptron) neural network to replace the decision tree to construct an RF-MLP (random forest and multilayer perceptron) model, as shown in Fig. 4. Calculation steps of the RF-MLP model are as follows:
Step 1: Randomly extracting N samples from the training samples of the nanocomposite films;
Step 2: Randomly extracting K samples from the explanatory variables and using the Gini value to select the variable that leads to the minimal variability of the divided internal subset;
Step 3: Using the variable obtained in step 2 to divide the training set into two subsets;
Step 4: Repeating step 3 and training the MLP by using the divided subsets;
Step 5: Averaging the MLP regression models.
The CC (correlation coefficient) represents how close the linear regression relation between fi (independent variable) and yi (response variable) is. It is directly proportional to prediction accuracy.
MAE (mean absolute error) is the diversity between the predicted value and the measured value. It is inversely proportional to prediction accuracy. MAE can be expressed as follows:
(6) |
RMSE (root mean squared error) is inversely proportional to prediction accuracy. In other words, the smaller the value of RMSE is, the more accurate the predictor will be. It is expressed as follows:
(7) |
The expression of RRSE (root relative squared error) is as follows:
(8) |
It is inversely proportional to prediction accuracy, which means that the smaller the value of RSSE is, the more accurate the predictor will be.
Fig. 6 Variation of MAE and N; (a) N = 2, r changes from 0.003 to 0.06; (b) N = 4, r changes from 0.003 to 0.06; (c) N = 8, r changes from 0.003 to 0.06; (d) N = 6, r changes from 0.003 to 0.06. |
Fig. 7 Variation of RMSE and N; (a) N = 2, r changes from 0.003 to 0.06; (b) N = 4, r changes from 0.003 to 0.06; (c) N = 6, r changes from 0.003 to 0.06; (d) N = 8, r changes from 0.003 to 0.06. |
Attribute selection, also called dimensionality reduction, is a key factor to control the prediction accuracy of the model. Common attribute selection modes include all-attribute, principal component analysis (PCA) and the Haar wavelet transform. In the RF-MLP model, the three modes are experimentally compared. Table 3 lists the comparison results of the prediction accuracy when N = 8 and r = 0.04. As Table 3 shows, the CC value is 0.9447 in the principal component analysis mode, which is higher than those in the other two modes. In the principal component analysis mode, the MAE, RMSE and RRSE are 0.0007, 0.0013 and 32.0972%, respectively, which are all smaller than those in the all-attribute and Haar wavelet transform modes. Fig. 9 shows the comparison between the predicted values and the measured values when the model was set at the optimal parameters (N = 8, r = 0.04, principal component and 10-fold cross validation). The well-fitting curves show that the model is effective to predict the dielectric losses of the nanocomposite films.
Assessment | Normal | PCA | Haar wavelet |
---|---|---|---|
CC | 0.9327 | 0.9447 | 0.9419 |
MAE | 0.0009 | 0.0007 | 0.0008 |
RMSE | 0.0014 | 0.0013 | 0.0013 |
RRSE (%) | 35.2677 | 32.0972 | 32.8783 |
Assessment | RF-MLP | BP | RBF | SMO-SVR | MLP | Linear regression |
---|---|---|---|---|---|---|
CC | 0.9447 | 0.9159 | 0.9120 | 0.7555 | 0.9302 | 0.7699 |
MAE | 0.0007 | 0.0012 | 0.0011 | 0.0015 | 0.0010 | 0.0019 |
RMSE | 0.0013 | 0.0018 | 0.0016 | 0.0029 | 0.0015 | 0.0025 |
RRSE | 32.0972% | 45.8046% | 40.5254% | 71.1128% | 36.5720% | 62.1629% |
Table 5 and Fig. 11 show the results of the comparison between the RF-MLP model and other ensemble learning models, including the RF (random forest)-decision stump, bagging-MLP and AR (Additive Regression)-MLP. As seen in Fig. 11a, the CC value of the RF-MLP model is 0.9447, which is higher than that of the other three models. Fig. 11b, c and d show that the MAE, RMSE and RRSE values of the RF-MLP model are 0.0007, 0.0013 and 32.0972%, respectively, which are all smaller than those of the other three models. Therefore, compared with other ensemble learning models, the RF-MLP model could generate smaller error and better performance in predicting the dielectric losses of the nanocomposite films.
Assessment | RF-MLP | RF-decision stump | Bagging-MLP | Additive regression-MLP |
---|---|---|---|---|
CC | 0.9447 | 0.6054 | 0.8861 | 0.9373 |
MAE | 0.0007 | 0.0021 | 0.0011 | 0.0008 |
RMSE | 0.0013 | 0.0032 | 0.0018 | 0.0015 |
RRSE | 32.0972% | 80.5834% | 45.0949% | 36.2929% |
RF-MLP | GRNN18 | |
---|---|---|
CC | 0.896 | — |
RMSE | 0.053 | 0.36 |
(1) The 10-fold cross validation was used to predict the dielectric losses of 32 polyimide nanocomposite films. For the prediction results, the CC was 0.9447, the MAE was 0.0007, the RMSE was 0.0013, and the RRSE was 32.0972%, which indicate that the RF-MLP model is efficient in predicting such dielectric losses.
(2) The comparison experiment shows that the CC value of the RF-MLP model is higher than that of the linear regression, backpropagation neural network, radial basis function neural network, support vector regression and multilayer perceptron neural network. In addition, compared with the other models, the RF-MLP model has smaller MAE, RMSE and RRSE values and better prediction parameters.
(3) If compared with other ensemble learning methods, such as bagging, boosting and RF-decision stump, the RF-MLP model has a higher CC value but lower MAE, RMSE and RRSE values when used to predict the dielectric losses of polyimide nanocomposite films. This indicates that the RF-MLP model is more accurate for dielectric loss prediction.
(4) The results show that the RF-MLP model is applicable not only to predicting the dielectric loss of the polyimide nanocomposite films but also to predicting the friction coefficient of deposited Cr1−xAlxC films with prediction accuracy than other models. In addition, the RF-MLP model exhibits considerable robustness.
For further studies, we plan to use other ensemble learning methods to predict corona resistance, dielectric constant and thermal weight losses of polyimide matrix nanocomposite films to provide an efficient, fast and reliable method for predicting the dielectric properties of new nano-materials. In this paper, we have only discussed the dielectric losses of polymer-based composites with orientation polarization. In further research, we will focus on the applicability of this model for poly-anion-type materials, such as SiO2 and KH2PO4.
This journal is © The Royal Society of Chemistry 2017 |