Yukinori
Koyama
*a,
Hidekazu
Ikeno
b,
Masamichi
Harada
c,
Shiro
Funahashi
c,
Takashi
Takeda
c and
Naoto
Hirosaki
c
aResearch and Services Division of Materials Data and Integrated System, National Institute for Materials Science, Tsukuba, Ibaraki 305-0044, Japan. E-mail: KOYAMA.Yukinori@nims.go.jp
bDepartment of Materials Science, Graduate School of Engineering, Osaka Metropolitan University, Sakai, Osaka 599-8570, Japan
cResearch Center for Functional Materials, National Institute for Materials Science, Tsukuba, Ibaraki 305-0044, Japan
First published on 29th November 2022
For rapid and efficient development of new phosphors, a suitable method that proposes promising candidates is expected to focus time-consuming trial-and-error experiments. A data-driven approach to discover new phosphor materials with a designed luminescence color is demonstrated in this paper. To screen compounds for a desirable luminescence color, a machine learning model has been developed for predicting emission peak wavelengths from a dataset composed of 129 Eu2+-activated phosphors. General-purpose compositional and structural features are used to represent host compounds of phosphors. Bootstrap aggregation with the gradient boosted regression trees method is adopted to obtain high predictive performance and to avoid overfitting. The predictive performance of the machine learning model is estimated to be 25 nm of mean absolute error (MAE) and 33 nm of root mean squared error (RMSE) by 10-fold cross validation. To discover new green-emitting Eu2+-activated phosphors, twenty candidate compounds have been selected to have predicted emission peak wavelengths of about 500–550 nm from a materials database, and the candidates have been synthesized and characterized by experiments. Three new Eu2+-activated phosphors, Li2Ca4Si4O13:Eu2+, Na2Ca2Si2O7:Eu2+, and SrLaGaO4:Eu2+, successfully show green or blue-green emissions as designed.
Recently, data-driven approaches have been reported for the rapid discovery and development of new phosphors using screening of materials databases, high-throughput density functional theory (DFT) calculations, and machine learning on luminescence properties.5–10 The emission spectrum is one of the most important characteristics of phosphors because it determines their luminescence color. The emission spectrum is often characterized by its peak top and full width at half maximum (FWHM). A relationship among host compounds, the absorption spectrum, and the emission spectrum has been investigated empirically or semi-empirically for Ce3+ and Eu2+-activated phosphors so far.11Ab initio multi-configurational quantum chemical calculations have been performed to quantitatively calculate configuration coordinate diagrams and absorption spectra.12 Constrained DFT calculations have also been conducted to evaluate absorption and emission energies.13,14 However, these theoretical methods require time-consuming calculations at both the ground and excited states. Because of the high computational cost, high-throughput theoretical calculations to screen candidate compounds are not currently feasible.
Machine learning to predict emission spectra instead of the theoretical calculations has been investigated recently.6–8 Sohn and his coworkers reported the pioneering machine-learning study on a relationship among emission peak wavelength, FWHM, and local environments of substitution sites in host lattices,5 and recently reported comprehensive machine learning to predict band gap, excitation energy, and emission energy for Eu2+-activated phosphors.6 Nakano et al. reported machine learning to predict emission peak energy from chemical compositions of the host compounds for Eu2+-activated phosphors.7 The reported prediction accuracy is not directly comparable among the theoretical calculations and the machine learning studies because they used different datasets. But the results suggest that the machine learning models6,7 have comparable prediction accuracy to the DFT calculations.14
Based on the successful machine-learning studies to date, it is expected that new phosphors with desirable luminescence properties will be developed using machine learning. Although several research groups have reported new phosphors by data-driven approaches,9 discovery of new phosphors with a designed luminescence color is still a big challenge. In this paper, we report the discovery of three new green or blue-green emitting phosphors, which a machine-learning model has proposed as green emitting phosphors. First, we developed a machine learning model to predict the emission peak wavelengths of Eu2+-activated phosphors from an in-house phosphor dataset. Next, we explored a materials database and collected candidate host compounds predicted to show green emissions by the machine learning model. Then, we synthesized and characterized the candidates, and finally discovered the three new Eu2+-activated phosphors, Li2Ca4Si4O13:Eu2+, Na2Ca2Si2O7:Eu2+, and SrLaGaO4:Eu2+. The results clearly demonstrate the power of the machine learning on the emission peak wavelength for rapid and efficient development of new phosphors with a designed luminescence color.
Emission peak wavelength is used as a target variable in this study because the emission spectra of phosphors are usually measured and reported in wavelength. The emission peak wavelengths depend on the concentrations of activators and other factors. The conditions in the literature are inconsistent, and the reported values vary more or less. If multiple emission peak wavelengths are reported for a single phosphor material and the reported values differ by more than 30 nm, the phosphor is eliminated. In our opinion, a deviation of 10 nm or more in the emission peak wavelength is conceivable due to the different conditions.
Finally, a dataset composed of 129 Eu2+-activated phosphors was prepared. The distribution and statistics of the emission peak wavelengths are respectively shown in Fig. 1a and Table 1. Constituent elements of the host compounds are summarized in Fig. 1b. Among the constituent elements, sulfur appeared as both a cation (S6+) and an anion (S2−). N, O, F, Cl, Br, and I elements were anions, and the other elements were cations.
Count | 129 |
Mean (nm) | 495 |
Median (nm) | 478 |
Minimum (nm) | 368 |
Maximum (nm) | 681 |
Standard deviation (nm) | 80 |
Mean absolute deviation (nm) | 66 |
As the compositional features, general-purpose features18 were adopted. The general-purpose features were a set of statistics of elemental features to represent various aspects of chemical compositions. Nakano et al. used the same scheme for their machine learning.7 In this study, 22 elemental features and seven statistics, namely, weighted arithmetic mean, weighted geometric mean, weighted harmonic mean, weighted standard deviation, minimum, maximum, and range, were used. The elemental features and the statistics are respectively listed in Tables S1 and S2 in the ESI.† In addition to the elemental features, oxidation states were considered. As oxidation states are both positive and negative values and satisfy charge neutrality, the weighted arithmetic, geometric, and harmonic means were excluded. Instead, the seven statistics of absolute oxidation states were additionally included. As the hosts in this study are all ionic compounds, the statistics of the elemental features and the absolute oxidation states were also evaluated for each of the cations only and the anions only. The compositional features consisted of 487 features.
To represent the local environments of the substitution sites, Park et al. used geometrical and elemental features of activator-anion and activator-cation polyhedra.6 This idea was generalized, inspired by the general-purpose compositional features. The structural features used in this study consisted of three groups of features. The first group was a geometrical aspect of the substitution sites. The numbers of neighboring anions and cations, average distances to their neighboring anions and cations, distortion index,19 and bond valence sum20 were evaluated for individual Ca, Sr, and Ba sites. The neighboring anions were determined using the CrystalNN method.21 The neighboring cations were determined so that they shared neighboring anions with the substitution sites. As some of the host compounds used in this study have multiple substitution sites, the average and standard deviation of each feature among the substitution sites were evaluated and used as features of the host structures. The number of symmetrically inequivalent substitution sites was also included. The second group was analogous to the compositional features but specialized for the local environments of the substitution sites. The seven statistics of the 22 elemental features and the absolute oxidation states were calculated for the neighboring anions and the neighboring cations of individual Ca, Sr, and Ba sites. The average and standard deviation among the substitution sites were used as the features of the hosts. Besides the features of the substitution sites, density and numerical density were added as the third group. The structural features consisted of 659 features.
The features were evaluated using the Pymatgen package22 and a customized version of the XenonPy package.23
The predictive performance of the machine learning models was evaluated by 10-fold cross validation by means of the mean absolute error (MAE), root mean squared error (RMSE), and coefficient of determination (R2). The scores were averaged among the folds. The parameters of the regression models and the numbers of selected features were selected to minimize the average RMSE for the validation data. The parameter search was performed in a manner of Bayesian optimization using the Hyperopt25 and scikit-optimize26 packages with 1000 iterations for each method. Default parameters were used for the regression models used in RFE to reduce the computation time for the parameter search. The pipelines of the machine-learning models and the optimized parameters are summarized in Table S3 in the ESI.†
The products were first characterized using a powder X-ray diffractometer (XRD) (Bruker, D8 ADVANCE, Cu Kα radiation) and a spectrofluorometer (JASCO, FP-8600). The powder XRD analysis indicated that some products were mixtures of the target compounds and impurity phases. As the photoluminescence (PL) spectra of the powder samples are largely influenced by impurity phases with bright luminescence, it was not clear whether the PL spectra of the mixture products were derived from the target compounds or the impurity phases. Therefore, after the first screening using the powder samples, well-crystallized particles were picked up from the products and characterized by single crystal XRD and microspectroscopy in a manner of the single-particle diagnosis approach.4 The single crystal XRD data of the picked particles were collected using a diffractometer (Bruker-AXS, SMART APEX II Ultra) with Mo Kα radiation. The data were integrated and corrected for absorption using SADABS. The crystal structures were solved and refined with SHELX. The PL spectra of the particles were obtained using a spectrometer (Otsuka electronics, MCPD7700) through a microscope (Olympus, BX51M) under 365 nm LED excitation.
Regression method | MAE (nm) | RMSE (nm) | R 2 | |||
---|---|---|---|---|---|---|
Training | Validation | Training | Validation | Training | Validation | |
Ridge | 24 (1) | 29 (7) | 31 (1) | 36 (9) | 0.85 (0.01) | 0.74 (0.13) |
ARD | 25 (1) | 28 (6) | 31 (1) | 34 (9) | 0.84 (0.02) | 0.77 (0.12) |
RF | 10 (0) | 27 (7) | 14 (1) | 35 (11) | 0.97 (0.00) | 0.75 (0.19) |
GB | 0 (0) | 24 (9) | 0 (0) | 31 (12) | 1.00 (0.00) | 0.79 (0.18) |
Bagging of BG | 10 (0) | 25 (7) | 14 (1) | 33 (10) | 0.97 (0.00) | 0.77 (0.17) |
To improve the predictive performance, other regression methods were applied. The ARD regression is a Bayesian linear model with an intrinsic feature selection capability, and this method resulted in a slightly higher prediction accuracy to the validation data compared with the ridge regression. The ridge and ARD models showed relatively large fitting errors to the training data. This indicates that the relationship between the general-purpose features used in this study and the emission peak wavelength is basically nonlinear, although the general-purpose features are numerous and diverse. The small differences in the predictive performance scores between the training and validation data of these linear models imply that the obtained predictive performance almost reached the optimal of linear models.
Nonlinear regression methods were applied to obtain a higher predictive performance. The RF model showed slightly smaller MAE but larger RMSE to the validation data than the ARD model. The GB model showed much smaller MAE and RMSE to the validation data than the ARD and RF models. However, the fitting errors of the GB model to the training data were almost zero, and overfitting was concerned. To dispel the concerns about the overfitting of the GB model, the bagging technique was adopted to the GB regression. The bagging technique is also used in the RF regression and is expected to suppress the overfitting. The bagging of the GB model showed intermediate predictive performance to the validation data between the GB and RF models. The better predictive performance of the bagging of the GB model compared with the RF model is probably due to the higher predictive capacity of the GB regression as a base learner compared with that of the regression trees in the RF model.
The RF, GB, and bagging of GB models showed large prediction errors for some specific compounds in the validation folds. A plausible cause of these large prediction errors is that the phosphor dataset used in this study is not sufficiently large with respect to the diverse phosphor materials. If a host compound is unique in the dataset and is put in the validation data in a fold of the cross validation, the training data does not contain compounds like the unique host, resulting in a large prediction error. Another possible cause of the large prediction errors is the quality of the reported emission peak wavelengths. Some phosphor materials have a deviation of tens of nm or more in the reported emission peak wavelengths. Phosphors with large deviations have been eliminated from the dataset as mentioned in the Methods section, but the data might not be fully curated yet. Further investigation for the large prediction errors is beyond the scope of this study, whereas obtaining a high-quality dataset that covers diverse materials is a big issue in the data-driven materials research.
Emission peak wavelength is used as the target variable in this study, while the energy of the emission peak was used as the target variables in the previous studies.6,7 Note that in principle, correction of intensity is required to convert an emission spectrum from the wavelength to energy and vice versa, and its peak top shifts. For comparison with previous studies, the emission peak wavelengths were simply converted into energy without such intensity correction, and regression on the converted energy was conducted. The bagging of the GB method was used. The prediction accuracy and the plot of the predicted values with respect to the reported ones are shown in Table S4 and Fig. S1 in the ESI.† The present results (0.13 eV MAE, 0.16 eV RMSE) are slightly smaller (better) than those in ref. 7 (0.139 eV MAE, 0.183 eV RMSE), and slightly larger (worse) than ref. 6 (0.020 eV2 MSE corresponding to 0.14 eV RMSE). Only the features derived from the chemical composition were used in ref. 7, whereas features derived from the structure were also considered in ref. 6 and in this study. This would have resulted in the slightly poorer predictive performance in ref. 7. In ref. 6, the data were restricted to phosphors with only a single substitution site and to examples of the critical activator concentrations corresponding to concentrations showing the highest PL intensity. In contrast, some phosphors in the present dataset had multiple substitution sites and the activator concentrations depended on the literature. The restriction in ref. 6 might have suppressed the data variability and reduced the RMSE, but it also limited the coverage of the machine learning model.
The predicted and reported emission peak wavelengths of the additional 21 phosphors are illustrated in Fig. 3, which are overlaid on the cross-validation results (Fig. 2e). MAE and RMSE to the test data were 33 nm and 42 nm, respectively. The distribution of the prediction errors looks comparable with that for the validation data in the cross validation, but the MAE and RMSE were much larger than the values estimated by the cross validation. The test data contained Sr2GeO4:Eu2+, which looked like an outlier. Sr2GeO4:Eu2+ showed the largest prediction error: 515 nm of the prediction versus 620 nm reported in ref. 27. This host compound contains Ge element, which was not in the phosphor dataset as shown in Fig. 1b. MAE and RMSE to the other 20 test data except Sr2GeO4:Eu2+ were respectively 30 nm and 37 nm, which were comparable to the results from the cross validation. These suggest that it is essential to extend the phosphor dataset to cover the diverse phosphor materials for a higher predictive performance over a wide range of candidate compounds.
![]() | ||
Fig. 3 Predicted emission peak wavelengths with respect to reported values for the test data of the additionally collected Eu2+-activated phosphors (green) using the bagging of the gradient boosted regression trees method. The plot is overlaid on the cross-validation results (Fig. 2e). |
Index | Composition | Space group | Predicted wavelength (nm) | Experimental results |
---|---|---|---|---|
1 | Ba2MgGe2O7 |
P![]() |
501 | No luminescence |
2 | Ba2ZnGe2O7 |
P![]() |
500 | No luminescence |
3 | Ca2Ga2GeO7 |
P![]() |
513 | No luminescence |
4 | Ca2GeO4 | P63mc (186) | 518 | No luminescence |
![]() ![]() ![]() ![]() ![]() ![]() |
![]() ![]() ![]() |
|||
5 | Ca2ZnGe2O7 |
P![]() |
510 | Low-purity products |
6 | Ca3Al2Ge3O12 |
Ia![]() |
486 | Eu3+ luminescence |
7 | Ca5Ge3O11 | C2/m (12) | 517 | Eu3+ luminescence |
![]() ![]() ![]() |
![]() ![]() ![]() |
|||
8 | CaGa2O4 |
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
![]() ![]() ![]() |
No luminescence |
P21/c (14) | 515 | |||
9 | K4BaSi3O9 | Ama2 (40) | 520 | Eu3+ luminescence |
10 | K4CaGe3O9 |
Pa![]() |
524 | Eu3+ luminescence |
11 | K4SrSi3O9 |
![]() ![]() ![]() ![]() ![]() ![]() |
![]() ![]() ![]() |
Eu3+ luminescence |
Ama2 (40) | 519 | |||
12 | Li2Ca4Si4O13 |
P![]() |
529 | Eu2+ luminescence, 520 nm |
13 | Na2Ca2Si2O7 | C2/c (15) | 544 | Eu2+ luminescence, 527 nm |
14 | Na2SrSi2O6 |
R![]() |
519 | Eu3+ luminescence |
15 | Na4SrSi3O9 | C2 (5) | 527 | No luminescence |
16 | Sr2Al2GeO7 |
P![]() |
508 | Eu3+ luminescence |
17 | Sr2MgGe2O7 |
P![]() |
494 | No luminescence |
18 | Sr3Ga4O9 |
P![]() |
516 | No luminescence |
19 | SrGeO3 |
![]() ![]() ![]() ![]() ![]() |
![]() ![]() ![]() |
No luminescence |
P![]() |
496 | |||
20 | SrLaGaO4 | I4/mmm (139) | 548 | Eu2+ luminescence, 502 nm |
The target compounds were synthesized with a purity of 70 wt% or more estimated by the powder XRD analysis, except for Ca2ZnGe2O7. Ca2ZnGe2O7 was obtained only as low-purity products, and further characterization was not conducted to it. PL spectra were measured for the powder products of the remaining 19 candidates. No luminescence was observed from the 9 products. Only sharp 4f–4f emission spectra derived from Eu3+ activators were observed from the other 7 products. Finally, emission spectra from the Eu2+ activators were observed from three products, Li2Ca4Si4O13, Na2Ca2Si2O7, and SrLaGaO4. XRD patterns of the powder products, which are shown in Fig. S2 in the ESI,† indicated that the products were mixtures of the target compounds and impurity phases. Because the PL spectra of the powder samples are largely influenced by impurity phases with bright luminescence, it is essential to verify that the observed Eu2+ luminescence derives from the target compounds. Therefore, well-grown particles were picked up from the products of these three candidates and characterized in a manner of the single-particle diagnosis.4 The single crystal XRD confirmed that the crystal structures of the picked particles were identical to Li2Ca4Si4O13,28 Na2Ca2Si2O7,29 and SrLaGaO4,30 respectively. Crystallographic information by the single crystal XRD is listed in the ESI.†Fig. 4 displays photo images and emission spectra of the picked particles of these three phosphors under 365 nm LED excitation.
![]() | ||
Fig. 4 Photo images (upper panels) and emission spectra (lower panels) of particles of Eu-doped (a) Li2Ca4Si4O13, (b) Na2Ca2Si2O7, and (c) SrLaGaO4 under 365 nm LED excitation. |
Li2Ca4Si4O13 was predicted to have an emission peak wavelength of 529 nm, and the peak was observed at 520 nm. Na2Ca2Si2O7 were predicted to have a peak wavelength of 544 nm, and the peak was observed at 527 nm. These two phosphors show green emissions as designed, and the prediction errors were as small as 9 nm and 17 nm, respectively. In both cases, FWHMs were very large, 140 nm or more. There are many possible substitution sites for Eu2+ in both structures. The luminescence properties depend on the substitution sites, and the observed emission spectra are an integration of those from the individual sites. The machine learning prediction worked well even for such complex structures. In Na2Ca2Si2O7, weak Eu3+ luminescence was also observed. Some substitution sites might be suitable for Eu3+ even in the synthesis under a reducing atmosphere.
The Eu-doped SrLaGaO4 particle showed both a blue-green emission derived from Eu2+ activators and a characteristic red emission from Eu3+ activators. As for the Eu2+ luminescence, the predicted emission peak wavelength was 548 nm, whereas the peak was observed at 502 nm with a FWHM of 83 nm. The prediction error, 46 nm, was large, but would be acceptable with respect to the prediction accuracy of the present machine learning model. As Sr and La atoms occupy the same crystallographic site in SrLaGaO4, Eu atoms might occupy this site in a mixed valence of Eu2+ and Eu3+, resulting in the simultaneous Eu2+ and Eu3+ luminescence.
The three new Eu2+-activated phosphors were successfully discovered, but a large part of the candidates failed to show Eu2+ luminescence. The products were annealed in the reducing atmosphere to obtain Eu2+, but the reduction process seemed insufficient for some compounds. The stability of Eu2+/Eu3+ in the host lattices is attributed to the redox potential of the substituted Eu ions and the annealing conditions, whereas the annealing conditions were limited depending on the host compounds to prevent them from melting or decomposing. Even if Eu2+ is stable in the hosts, the luminescence may be quenched if the energy levels of the Eu2+ excited states overlap or are close to the conduction bands of the hosts. These are likely the reasons why many candidates have not exhibited Eu2+ luminescence. At this moment, it is hard to predict the valence and energy level of the substituted Eu ion in the host and to predict appropriate synthesis conditions to obtain Eu2+, prior to synthesis. Prediction of these factors is also important for efficient development of new Eu2+-activated phosphors and is a future task.
Using the machine learning model, new green-emitting Eu2+-activated phosphors were searched from the AtomWork-Adv materials database. Among twenty candidate compounds predicted to have emission peak wavelengths of about 500–550 nm, three new phosphors, namely, Eu-doped Li2Ca4Si4O13, Na2Ca2Si2O7, and SrLaGaO4, were successfully synthesized. Li2Ca4Si4O13:Eu2+ and Na2Ca2Si2O7:Eu2+ showed the Eu2+ luminescence of the green color as designed. Eu-doped SrLaGaO4 showed simultaneous Eu2+ and Eu3+ luminescence, and it shows a blue-green emission derived from the Eu2+ activators. These results clearly demonstrate that the machine learning on the emission peak wavelength is useful for the rapid and efficient development of new Eu2+-activated phosphors with a designed luminescence color.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2ma00881e |
This journal is © The Royal Society of Chemistry 2023 |