Jiaqi
Yang
,
Panayotis
Manganaris
and
Arun
Mannodi-Kanakkithodi
*
School of Materials Engineering, Purdue University, West Lafayette 47907, IN, USA. E-mail: amannodi@purdue.edu
First published on 10th May 2023
Novel halide perovskites with improved stability and optoelectronic properties can be designed via composition engineering at cation and/or anion sites. Data-driven methods, especially involving high-throughput first principles computations and subsequent analysis based on unique materials descriptors, are key to achieving this goal. In this work, we report a density functional theory (DFT) dataset of 495 ABX3 halide perovskite compounds, with monovalent organic or inorganic cations as A, divalent Group 2 or Group 14 elements as B, and I, Br, or Cl as X, and different amounts of mixing applied at each site using the special quasirandom structures (SQS) approach. We perform GGA-PBE calculations on all 495 pseudo-cubic perovskite structures and between 250 and 300 calculations each using the more expensive HSE06 functional, with and without spin–orbit coupling, both including full geometry optimization and static calculations on PBE optimized structures. Lattice parameters, decomposition energy, band gap, and theoretical photovoltaic efficiency derived from computed optical absorption spectra, are determined from each level of theory, and some comparisons are made with collected experimental values. Trends in the data are unraveled in terms of the effects of mixing at different sites, fractions of specific elemental or molecular species present in the compound, and averaged physical properties of species at different sites. We perform screening across the perovskite dataset based on multiple known definitions of stability factors, deviation from cubicity in the optimization cell, and computed stability and optoelectronic properties, leading to a list of promising compositions as well as design principles for achieving multiple desired properties. Our multi-objective, multi-fidelity, computational halide perovskite alloy dataset, one of the most comprehensive to date, is available open-source, and currently being used to train predictive and optimization models for accelerating the design of novel compositions for superior performance across many optoelectronic applications.
Three of the most common ways of tuning the properties of HaPs are described below:
(1) Composition: the most promising HaP compositions for PV absorption explored to date usually contain a mix of MA, FA, and Cs at the A-site, primarily Pb at the B-site with minor fractions of other divalent cations such as Sn and Ge, and I or Br at the X-site often with little Cl. The discovery of novel HaP compositions with attractive properties is on the rise as researchers expand the search into more complex alloys, novel A-site organic molecules, and substitutes for Pb at the B-site from Group IV, Group II, or transition elements.8–11 Mixing at the A site improves the general stability to degradation, while B site and X site mixing can tune and optimize band gaps and optical absorption. The allure of A/B/X-site mixing, even creating high entropy perovskite alloys, is in obtaining starkly different properties compared to pure compositions, possibly eliminating the harmful effect of defects, and improving the long-term stability and consequent optoelectronic performance.
(2) Structure and phases: while the canonical perovskite phase is cubic, many HaPs are most stable in tetragonal, orthorhombic, or hexagonal phases.12 For a given composition and phase, there may exist many local minima on the potential energy surface, typically sampled via rigorous application of evolutionary or minima hopping algorithms, atomic perturbations within larger supercells, or by varying degrees of distortion and rotation in the octahedral networks. Stable or metastable structures thus obtained may show better properties than previously studied ground state structures.13 In addition, HaPs may also manifest as double perovskites or 2D layered perovskites which include large organic spacer ligands, providing another means of tailoring the stability and optoelectronic properties.
(3) Defects: investigation of the electronic structure of crystalline materials is incomplete without consideration of point defects, either native or impurity, which will affect the optoelectronic properties by modifying charge carrier lifetimes, equilibrium conductivity, and resulting trap-limited efficiencies.14,15 Point defects manifest as vacancies, interstitials, or substitutions, and the same defect may behave very differently in different compositions or structures, highlighting the need to include the presence of defects as another variable towards tuning HaP properties.
The chemical design space of HaPs is very much combinatorial and raises challenges for experimentalists to perform effective screening. First principles-based density functional theory (DFT) simulations have been systematically performed to study the optoelectronic properties of HaPs as a function of structure, composition, and defects. The expense of standard DFT computations is reasonable when searching for new promising candidates in such a boundless space. Recently, DFT simulations have been reliably used for modeling structure, heat of formation or decomposition, band gaps, optical absorption spectra, and defect formation energies of a variety of HaPs.2,16 High-throughput DFT (HT-DFT) computations provide the most effective way to screen across a large space of hybrid and inorganic ABX3 halide perovskites. An examination of the HaP-related computational literature reveals that there have been numerous medium (∼102 data points) to large (∼103 or more data points) DFT datasets reported for HaPs, which have been successfully used to screen promising materials with desired stability and formability as well as PV-suitable band gaps, among other properties.12,17–19
A clear limitation of HT-DFT-driven screening is the computational expense of applying a suitably advanced level of theory across a large number of materials. This problem is typically addressed by coupling DFT computations with state-of-the-art machine learning (ML) or artificial intelligence (AI) techniques. Within the area of perovskites, there are many examples in the literature where DFT datasets and suitable atomic/structural/compositional descriptors have been used to train a variety of ML-based predictive and classification models, leading to accelerated prediction of lattice constants, formation energies, band gaps, and other important properties.18,20,21 Such DFT-ML models, once rigorously trained and tested, are deployed for high-throughput screening across massive datasets of unknown perovskites. We recently published a thorough overview of many such efforts applying DFT and/or ML towards halide perovskite discovery.22
In this work, we report a large HT-DFT dataset of 495 chemically distinct, pseudo-cubic, halide perovskite alloys. This dataset builds upon the 229 compounds reported in prior work by Mannodi-Kanakkithodi and Chan,16 adding more types of mixing, better property estimates, and detailed analysis of trends and correlations. The relatively large size of this dataset is intended to provide an initial sampling suitable for a guided search within the HaP alloy space. In this dataset, all perovskite structures are cubic or pseudo-cubic, and the focus is more on investigating the dependence of computed properties on composition, and specifically the type of alloying.
Based on the generated perovskite structures, we perform GGA-PBE calculations and report the computed decomposition energy, band gap, and theoretical PV efficiency. In addition, around 250 to 300 calculations are performed using the HSE06 functional (henceforth referred to as HSE), in three different versions: using full geometry optimization, with and without spin–orbit coupling (SOC), and static calculations on GGA-optimized structures with SOC. The same properties are computed from all three types of HSE computations, which enables the comparison of PBE and multiple HSE estimates with experiments, as well as an understanding of the importance of SOC for certain compositions. Pearson correlation analysis is performed to study the contribution of specific A/B/X species and their known elemental/molecular properties on the DFT computed properties, leading to some useful design rules. We further combine DFT-computed properties with perovskite stability factors such as the octahedral and tolerance factors and determine a deviation from cubicity for all optimized structures, to obtain a list of promising candidates for solar absorption and related optoelectronic applications. We emphasize that this chemically diverse, multi-objective, multi-fidelity dataset of HaP alloys will serve many ML endeavors in the future for prediction and inverse design, be used as the foundation for extended datasets of non-cubic structures and new chemistries, and drive the experimental discovery of novel HaP compositions with targeted properties.
The pure (non-alloyed) possibilities are exhaustively sampled using 5 × 6 × 3 = 90 compounds. Starting from these pure perovskite structures, we perform systematic mixing at the A, B, and X sites. For simulating perovskite alloys, the special quasi-random structures (SQS) method23 is applied to build periodic structures that make the first nearest-neighbor shells as similar to the target random alloy as possible. The SQS can be considered the best possible periodic supercell representing a given mixed-composition perovskite. The distribution of different types of mixing across our dataset is shown in Fig. 1(b). For simplicity, only one type of mixing at a time is considered in this study; that is, mixing is not performed at multiple (A/B/X) sites simultaneously. In total, we performed GGA-PBE computations on 90 pure, 126 A-site mixed, 151 B-site mixed, and 127 X-site mixed HaPs.
Each HaP composition is simulated using a 2 × 2 × 2 supercell, which allows A- and B-site mixing to be performed in discrete 1/8th fractions of the total site occupancy, and X-site mixing in 1/24th fractions, though for simplicity, we restrict X-site mixing to fractions of 3 × (1/24). At these mixing levels, these systems may be referred to as perovskite alloys. Fig. 2 shows the distribution of various types of mixing of the 14 total species at the A, B, and X-sites, across the dataset of 495 compounds. Since mixing is only allowed on one out of the three sites at a time, there is a higher prevalence of the 8/8 fraction for each species. We also find a larger occurrence of the smallest fractions of mixing at A and B sites as compared to intermediate fractions; overall, every type of mixing is represented within the dataset a few times. Using the procedure presented in Fig. 1(c), we calculate the stability and optoelectronic properties for the HaP dataset using both the semilocal GGA-PBE functional and the hybrid HSE06 functionals. Ultimately, we generated a dataset of 495 points at the PBE level, and between 244 and 299 points each at the HSE–PBE + SOC (referring to HSE + SOC on PBE relaxed structures), HSE-relaxed, and HSE-relaxed + SOC levels. The exact constitution of the dataset from different levels of theory is presented in Table 1. The different number of data points from different functionals is a consequence of the number of computations that were completed within the constraints of computing resources and researcher time, but adequate chemical diversity is maintained in each dataset, and as explained later, insights from cheaper functionals can be extended to more expensive theories.
Fig. 2 Distribution of the mixing fractions of various species at the A (a), B (b), and X (c) sites across the PBE dataset of 495 compounds. |
Functional | Number of data points |
---|---|
PBE | 495 |
HSE-rel | 299 |
HSE-rel-SOC | 282 |
HSE–PBE-SOC | 244 |
(1) |
(2) |
The decomposition energy is calculated from 4 different levels of theory, namely PBE (ΔHPBE), HSE-relaxed (ΔHHSE-rel), HSE-relaxed-SOC (ΔHHSE-rel-SOC), and HSE–PBE-SOC (ΔHHSE-PBE-SOC). All decomposition energy values are reported per ABX3 formula unit. Calculating ΔH for X-site mixed compounds involves some additional work because of the multiple choices for AX and BX2 phases; more details are provided in the ESI and in Fig. S1 and S2.†
a(E) = 1 − e−2α(E)L | (3) |
(4) |
(5) |
Functional | Band gap RMSE vs. Exp (eV) |
---|---|
PBE | 0.78 |
HSE-rel | 0.93 |
HSE-rel-SOC | 0.74 |
HSE–PBE-SOC | 0.70 |
Fig. 3 Comparison between the DFT computed and experimentally measured properties of the selected HaPs: (a) cubic lattice constants and (b) band gaps. |
Fig. 3(b) shows that EPBEgap is generally an underestimation compared to experiments, as expected, showing an RMSE of 0.78 eV. The corresponding RMSEs of EHSE-relgap, EHSE-rel-SOCgap, and EHSE-PBE-SOCgap are respectively 0.93 eV, 0.74 eV, and 0.70 eV. We find that on average, HSE–PBE-SOC is the best approach out of the four for reproducing band gaps, but other functionals may be more accurate for certain types of compositions (such as purely inorganic vs. organic–inorganic, Pb-based or Pb-free, etc.), as will be discussed further later in this article. HSE-relaxed band gaps are heavily overestimated and brought down by the inclusion of SOC. It should also be noted that phase information was not always available for certain experimental data points collected from the literature, and non-cubic phases may affect the accuracy of the computational results here. We additionally performed a comparison for a much smaller dataset of 9 compounds known to be cubic from experiments; a plot showing these band gaps is presented in Fig. S4,† and the corresponding RMSE values in Table S1† show that EHSE-PBE-SOCgap has a respectable RMSE against experiments of 0.4 eV. Finally, it should be noted here that the PBE RMSE is not significantly different from the RMSE of HSE–PBE-SOC, which comes from the accidental accuracy of semi-local functionals without SOC for HOIPs.16,40
We find that stable compositions (ΔHPBE < 0 eV) are predominantly occupied by HOIPs, with a fair few pure, B-site mixed, and X-site mixed compounds. A large number of A-site mixed HOIPs as well as a majority of inorganic HaPs occupy the unstable region, indicating that although the presence of organic cations is desirable to prevent perovskite decomposition, mixing at the A-site may not always be beneficial. The band gap shows less clear trends, and as will be explained later, is largely dependent on the type and number of specific B and X-site ions. The region of desirable EPBEgap and ΔHPBE is largely populated by HOIPs with B-site or X-site mixing. Furthermore, Fig. 4(b) shows SLMEPBE plotted against EPBEgap, showing the characteristic relationship that has been explored in past studies.41,42 SLME rises initially as the band gap increases, reaching a peak of ∼25% around EPBEgap = 1.5 eV, and subsequently goes down until it goes to 0 for EPBEgap > ∼3 eV. The largest SLMEPBE values are shown by pure hybrid and B-site mixed compounds, both hybrid and inorganic.
Fig. 5(a) shows the linear correlation between composition descriptors and PBE properties, namely lattice constant, decomposition energy, band gap and SLME. In the heatmap, darker red implies large positive correlation, darker blue implies large negative correlation, and white means there is little or no correlation. A few important relationships immediately stand out from this plot: large ions like Ba and I lead to an increase in the lattice constant, while Cl has the reverse effect. An increase in the fraction of K at the A-site increases ΔHPBE and thus makes the compound more unstable, while increasing the fraction of FA will make it more stable. B-site elements generally have little effect on the stability, but have much larger correlations with EPBEgap and SLMEPBE. While Ca, Sr, and Ba increase EPBEgap and decrease SLMEPBE, Ge and Sn decrease EPBEgap and increase SLMEPBE. Pb shows a large positive correlation with SLMEPBE, which is not surprising given that FA/MA/Cs-based Pb iodide or bromide perovskites are most common in optoelectronic applications. Finally, X-site species show more modest correlation with EPBEgap and SLMEPBE, with Br showing virtually no correlation with the band gap, which is an effect of Br lying between I and Cl in the band gap spectrum. The lower values of correlation between X-site constituents and the band gap and SLME reveal that mixing or complete substitution at the B-site has a more significant effect on the optoelectronic properties. These correlations provide confirmation for some well-known effects and some simple design principles for HaP compositions with targeted properties.
Fig. 5 Pearson coefficients of linear correlation between 4 PBE computed properties and (a) 14 compositional descriptors and (b) 36 elemental property descriptors. |
Next, we calculated the linear correlations between the 36-dimensional elemental property descriptors and the 4 PBE properties, and the results are presented in Fig. 5(b). Once again, it can be seen that the biggest contributors to ΔHPBE are A-site properties: specifically, increasing the ionic radius, ionization energy, or atomic number of A-site species makes the compound more stable, whereas increasing the boiling point, electron affinity, or heat of vaporization makes it less stable. The largest correlations with the lattice constant are from X-site features, with higher electron affinity, ionization energy, or electronegativity of the X-site constituent reducing the lattice constant and all other properties increasing it. When it comes to the band gap and SLME, we once again notice an overwhelming contribution from the B-site species. Increasing the boiling point, electron affinity, ionization energy, or electronegativity of B-site species helps decrease EPBEgap and increase SLMEPBE, explaining why Pb/Sn/Ge are clearly more beneficial in PV applications than Ba/Sr/Ca at the B-site. These correlations help expand our design principles based purely on the fractions of different species and provide an opportunity to train predictive models for various properties.16,22
A visualization of the types of mixing per A/B/X-site species is presented in Fig. S5,† and different properties are plotted against each other for the three types of HSE datasets in Fig. S9–S11.† We find similar distributions to the PBE data, with notable differences coming from HSE band gaps being generally larger and eliminating a lot of the low SLME data points. Very similar ΔH values are obtained for all compositions from the 4 methods, showing that PBE-based stability metrics should be more than reliable. We note here that SLME from the different HSE functionals is obtained using the PBE-computed optical absorption spectrum shifted along the energy axis by the difference between EPBEgap and the corresponding HSE Egap; this is a method that helps us determine a theoretical efficiency from HSE without performing a full optical absorption calculation using HSE. Fig. 6 presents a comparison between the different types of HSE and PBE band gaps, dividing the data in terms of the nature of A-site species: purely organic, purely inorganic, or mixed organic–inorganic. While it is clear that B-site and X-site species are the major contributors to the band gap, we divide the data like this mainly to observe how important HSE vs. PBE geometry optimization is for hybrid vs. inorganic HaPs and the magnitudes of difference between PBE and HSE band gaps and between using and not using SOC. Furthermore, we observe from our dataset that HSE-relaxation might be superfluous, as HSE-relaxed lattice parameters, decomposition energies, and band gaps largely correlate with the corresponding PBE-relaxed values, as shown in Fig. S6–S8.†
It can be seen from the 299 data points in Fig. 6(a) that EHSE-relgap is, on average, 1 eV or more greater than EPBEgap, with larger differences appearing when A-site contains only organic molecules; we attribute this to the larger degree of geometry optimization from HSE in the presence of organic cations than when only inorganic cations are present, leading to larger differences in the band gap. Fig. 6(b) shows EHSE-rel-SOCgap plotted against EHSE-relgap for 282 data points. As expected, SOC brings down Egap for many of the compounds, and keeps Egap the same for many other compounds, confirming that SOC is certainly vital for certain compositions but can be ignored in others, as has been discussed in past studies.16,40,44 Interestingly, we observe that for several purely inorganic HaPs with lower Egap < 3 eV, SOC significantly reduces the gap. Next, we plot in Fig. 6(c)EHSE-PBE-SOCgapvs. EHSE-rel-SOCgap for 244 data points, in an attempt to understand the difference between HSE-relaxed and HSE-on-PBE-relaxed band gaps (with the inclusion of SOC in both). We find virtually identical band gaps from both methods for all pure inorganic HaPs, but large differences when organic cations exist at the A-site, which can once again be explained by the more severe geometry optimization from HSE in the latter. While SLMEPBE peaked at around 25%, the corresponding HSE peaks appear around 16% as a consequence of shifting the optical absorption spectrum by the difference between the PBE and HSE band gaps. For compounds with the highest SLMEPBE values, EPBEgap is around 1 eV, while the corresponding HSE-rel band gaps are higher and the HSE-rel + SOC and HSE–PBE-SOC are lower. These band gap differences take SLMEHSE lower than the PBE peak. This effect holds for all compounds with SLMEPBE > 15%, such that SLMEHSE following the band gap shift always tends to be lower and peaks at 15 to 16%.
Finally, we examine the relationships between HaP composition and various types of band gaps discussed above, by calculating Pearson coefficients of linear correlation. Fig. 7 shows the correlations for five types of properties, namely EPBEgap (PBE Gap), EHSE-relgap (HSE Gap), EHSE-rel-SOCgap (SOC Gap), EHSE-relgap − EPBEgap (Δ(HSE–PBE)), and EHSE-rel-SOCgap − EHSE-relgap (Δ(SOC-HSE)). For the first three quantities, we find virtually identical behavior, and it can be concluded that various A/B/X-site species have the same increasing or decreasing influence on any PBE or HSE Egap. Correlations with the (HSE–PBE) Egap difference values show that certain constituents such as FA, Cs, Sn, or I could have marginal influence, but the differences are largely uniform across the dataset. The effect of SOC is very evident in the correlation analysis for EHSE-rel-SOCgap − EHSE-relgap. While A-site species have no influence here, Pb has by far the highest negative correlation, Ge has a slightly positive correlation, and I has a slightly negative correlation. We conclude that the inclusion of SOC is of utmost importance for Pb-based HaPs, and Egap values will be significantly lower (and more accurate) when using SOC. Fig. S12–S14† present the complete correlation analysis for all properties computed from the three types of HSE functionals and the 50-dimensional descriptors introduced earlier, showing very similar trends as compared to the PBE dataset.
(6) |
(7) |
(8) |
Fig. 8(a) shows ΔHPBE plotted against the average deviation from cubicity (DCavg), calculated using eqn (8). Fig. S15 and S16† show individual plots of ΔHPBE against DC corresponding to , , α, β, and γ. It can be seen that ∼90% of the compounds show a DCavg of <2%, reinforcing confidence in the cubic/pseudo-cubic nature of a majority of the dataset. Around 20 compounds show a DCavg of >5%, and all of them are HOIPs with A-site, B-site, or X-site mixing. The non-symmetricity introduced in the supercell when large organic molecules are mixed with other organic or inorganic cations, and when complex mixing is performed at the B or X sites in the presence of large organic cations, leads to elongation, contraction, or twist along one or more directions. A consequence of the high-throughput nature of our computational work is the inability to visually examine the cubicity of every structure: the current analysis helps reveal some unfavorable deviations in certain compounds, which will be used as one of the factors while determining suitable compositions in terms of perovskite formability, stability, and optoelectronic properties. It should be noted that larger DCavg values tend to correspond to negative ΔHPBE, but despite their stability from DFT, such compounds have a non-perovskite like phase and are thus excluded from current screening and saved for future analysis.
Octahedral factor:
(9) |
Tolerance factor:
(10) |
Bartel7 tolerance factor:
(11) |
The suggested ranges for perovskite formability are o ∈ (0.442–0.895), t ∈ (0.813–1.107), and tB < 4.18. Fig. 8(b)–(d) respectively show t, tB, and o values for the entire dataset plotted against ΔHPBE. It can be seen that there is a roughly inverse relationship between t and ΔHPBE, which is to be expected as larger values of t mean more favorable perovskite formability and should thus also correspond to negative decomposition energies. A small number of compounds lie in the t > 1.1 range and also show negative ΔHPBE; upon closer inspection, we find that they are HOIPs with significantly distorted structures showing large deviation from cubicity. One example of such a compound is FAGeBr2.25Cl0.75, which has t = 1.13 and ΔHPBE = −1.06 eV, but a highly distorted PBE optimized structure with DCb = 16.4%. Fig. 8(c) shows that essentially all HaP compositions with negative ΔHPBE fall within the desirable tB < 4.18 region. Similarly, Fig. 8(d) shows that nearly all compounds with negative ΔHPBE lie in the desirable range of o values, barring a very small number of distorted structures. It should be noted from Fig. 8(b)–(d) that hundreds of compositions that satisfy the formability factor conditions show positive (often very large positive) ΔHPBE values, which means that they will easily decompose to other halide phases. Our observations point to the idea that such factors may be necessary but not sufficient conditions for perovskite formability and stability.
We apply the following screening criteria on the PBE dataset:
1. Formability: o ∈ 0.442–0.895, t ∈ 0.813–1.107, and tB < 4.18.
2. (Pseudo) cubicity: DCb < 10%, DCc < 10%, DCα < 5%, DCβ < 5%, and DCγ < 5%,
3. Thermodynamic stability: ΔHPBE < 0 eV. Negative ΔH is a necessary but not complete metric for preventing ABX3 decomposition to phases AX and BX2; decomposition could occur to other phases, and the effects of kinetics, ion segregation, defects, etc. are ignored in this work.
4. Band gap: EPBEgap ∈ 1–2.5 eV. PV-suitable band gaps lie close to 1.5 eV. We use a wide range here to account for the various inadequacies of the PBE band gap description: it will underestimate gaps of inorganic compounds but either accidently be accurate or slightly overestimate the gaps of HOIPs: this effect has been studied in prior studies.16,40
5. PV efficiency: SLMEPBE > 0.10. This criterion goes hand-in-hand with the band gap requirement, as it can be seen from Fig. 4(b) that the highest SLME values correspond roughly to the band gap range described above.
Fig. 9 shows our five-fold screening process, based on which we obtain 32 candidates (out of 495) that fulfill each requirement. Also shown is a pie chart with the distribution of various types of mixing in the screened list of compounds. It can be seen that a majority of the screened compounds, 19 in total, are B-site mixed, and there are only 6 unalloyed compositions. Fig. 10 further shows the relative distributions of various A-site, B-site, and X-site species. We find that MA followed by FA is by far the most common A-site cation, often occupying all of the A-site (8/8 mixing fraction), followed by Cs and Rb. There are no pure K-based compounds; K, as well as Rb and Cs, occur in small fractions in some of the compounds. Pb and Sn appear in an overwhelming majority of the compounds, with Ge, Ca, Sr, and Ba only occurring in smaller fractions of 3/8 or less. This is consistent with the observation that Pb and Sn, and sometimes Ge, are most beneficial for ideal optoelectronic properties, whereas Ca/Sr/Ba should occur in minor fractions to keep the band gap small. Pb has a high preference for 7/8 and 8/8 occupation, hinting at the difficulty in developing Pb-free perovskites with ideal properties. At the X-site, Br and I without any mixing are most common, and in the 4 compounds with X-site mixing, I, Br, and Cl are found in various fractions.
Fig. 10 Distribution of mixing fractions of various species at the A (a), B (b), and X (c) sites across the list of 32 promising compounds selected from the PBE dataset. |
We performed a similar screening procedure using the HSE–PBE-SOC dataset, which was found to compare best with experiments for the band gap. Applying the very same criteria as shown in Fig. 9 leads to a list of 14 stable and formable compounds with desirable band gaps and SLME, out of which 4 are pure unalloyed compounds, 1 is (purely inorganic) A-site mixed, 8 are B-site mixed, and 1 is X-site mixed. Distributions of the types of mixing fractions of various species in this screened list of compounds are shown in Fig. S17.† Although the HSE–PBE-SOC screened list is much smaller than the PBE screening list due to a smaller overall dataset, some similar trends are found in both screening procedures. B-site mixing is most prevalent, as is high fractions of Pb, and sometimes Sn and Ge, at the B-site. Ca/Sr/Ba prefer mixing in small fractions. Most compounds are MA-based and nearly all of them contain Br or I. We note that our conclusions on the populations of different species and types of mixing in the screened set of compounds may change slightly in the future if a much larger dataset is available, such as via machine learning-based predictions.
The entire screened lists of compounds from PBE and HSE–PBE-SOC are presented in Tables SII and SIII† respectively, along with their chemical formula and (PBE or HSE–PBE-SOC) computed ΔH, Egap, and SLME at 5 μm sample thickness. Interestingly, all the compounds in the HSE–PBE-SOC list appear in the PBE list as well. Three of the best performing compounds are selected and their HSE–PBE-SOC computed electronic band structures, optical absorption spectra, and SLME vs. sample thickness plots are pictured in Fig. 11. These compounds, namely CsPbBr3, CsPbI0.75Br2.25, and MACa0.125Sn0.75Pb0.125I3, show direct band gaps around 1.5 eV and SLME > 15% in their cubic or pseudo-cubic phases.
In our work, the immediate next extension is towards non-cubic perovskite phases. For instance, CsPbBr3 may prefer the orthorhombic phase, while MAPbI3 and MA(Pb–Sn)I3 may assume the tetragonal phase, and this work considers all such compounds only in a cubic or pseudo-cubic rendition. In previous work,16 it was shown that for the same composition, unalloyed or with mixing, changing the phase could modify the band gap by 0.5 eV or more in many cases. Cubic perovskite phases are often not the ground state, and are in many cases very unstable—as indicated by the large positive decomposition energies for many compounds in our dataset. Non-cubic phases are either the most stable, or metastable/competing phases for most of the compositions studied in this work. Currently, we have high-throughput computations ongoing for tetragonal, orthorhombic, and hexagonal phases of several mixed HaPs; the perovskite phase itself can be added as an input to the compositional and elemental descriptors to obtain new correlations. In addition, computations are being performed for further tailoring of properties by accessing polymorphs within each phase, e.g., via octahedral distortion and rotation,45 or via re-optimization of the same composition in larger supercells with slight distortions.46 As an example, plots showing the computed decomposition energy for selected perovskites with varying degrees of octahedral distortion as well as in different prototypical phases are presented in Fig. S18 and S19;† it can be seen that some amount of distortions can keep the perovskite stable, the cubic phase is not always the ground state, and sometimes the range of decomposition energies for a given composition can be quite broad.
We further anticipate significant improvements in DFT predictions of various properties. Our attempt to utilize a few different functionals to benchmark properties against experiments was hindered by a number of factors discussed in the manuscript, including the perovskite phase and lack of additional corrections. Our ongoing computations involve testing the influence of the PBEsol38 and PBE-D3 (ref. 39) functionals, combined with static HSE06 or GW computations with SOC,47 for better optical and electronic properties. Furthermore, the inclusion of new types of elemental or molecular species – such as transition metals (Cd, Zn, Ni, etc.) at the B-site would necessitate the use of specific levels of theory, such as GGA + U.48 The consideration of other important properties, such as defect formation energies and carrier mobilities, would involve testing and deploying multiple functionals as well.
It should be noted again that there might not be one best functional that works for the entire chemical space when considering organic vs. inorganic A-site cations, and Pb/Sn vs. other B-site cations. A likely solution is the use of an ensemble of functionals as well as experimental estimates (which might need to be averaged as well, given the range of values generally reported by different experimental researchers for the same materials) for hundreds of HaP compositions, and training of multi-fidelity ML models.49 Large quantities of low-fidelity data combined with more modest amounts of high-fidelity data can lead to highly accurate predictions of experiment-level property estimates.
In general, ML has a massive role to play here, as has been demonstrated for HaPs in multiple prior studies.16,22 Concurrent manuscripts are planned to report rigorously optimized predictive models for multiple properties and fidelities, based on the datasets and descriptors discussed in this work. Such models can easily be extended to new choices for A/B/X ions such as transition metals,16 as well as other phases, by addition of new dimensions to the descriptors. The inclusion of more general crystalline structure representations as inputs for ML, such as using crystal graphs and graph neural networks,50–52 would be essential for treating same compositions and structures with a variety of distortions or lattice strains. Once composition-based and/or structure-based ML predictive models are rigorously optimized and validated, they could be deployed for the prediction of over thousands of ABX3 compounds available in databases such as the Materials Project53 or Open Quantum Materials Database,54 as well as over millions of hypothetical materials, for prediction, screening, and discovery. Finally, inverse design techniques, such as using the genetic algorithm55 or generative neural networks,56 could be applied upon the DFT-ML surrogate models to drive the efficient discovery of new HaP compositions/structures with multiple desired properties. For instance, our ongoing work involves generating populations of novel HaP compositions using GA while optimizing a fitness function that includes metrics for chemical feasibility, negative ΔH, Egap between 1 and 2 eV, and SLME > 15%; this process can yield thousands of promising compounds beyond the scope of the current work and beyond brute-force enumeration. The dataset and analysis presented in this work serve as a springboard for efforts that are currently underway, to ultimately accelerate the prediction and design of novel perovskites for optoelectronics and to extend such approaches to other material classes and applications.
(1) All Density Functional Theory (DFT) computations are performed using the Vienna Ab initio Simulation Package (VASP), version 6.2. The software can be found at https://www.vasp.at/.
(2) Starting structures for this work, as well as a subset of reported structures and properties, can be found in our previous publication: A. Mannodi-Kanakkithodi and M. K. Y. Chan, Energy Environ. Sci., 2022, 15, 1930–1949, https://doi.org/10.1039/D1EE02971A.
(3) Tabulated data and scripts for extracting all properties of perovskites are available on our Github repo: https://github.com/yjq829/perovskite_dataset.git. The Tabulated data is also provided as an .xlsx file in the supporting documents.
(4) The code for calculating Spectroscopic Limited Maximum Efficiency (SLME) analysis can be found at https://github.com/ldwillia/SL3ME.git, based on the publication https://doi.org/10.1103/PhysRevLett.108.068701.
(5) All raw DFT data, including input and output files, can be found on Materials Data Facility:57 A. Mannodi-Kanakkithodi, M. K. Chan, J. Yang and P. Manganaris, High-Throughput DFT Dataset of Halide Perovskite Alloys, 2022, https://petreldata.net/mdf/detail/abx3_perovs_alloys_v1.1.
Footnote |
† Electronic supplementary information (ESI) available: https://github.com/PanayotisManganaris/manuscript–PIP-data-manifest. See DOI: https://doi.org/10.1039/d3dd00015j |
This journal is © The Royal Society of Chemistry 2023 |