Robert M.
Forrest
* and
A. Lindsay
Greer
Department of Materials Science and Metallurgy, University of Cambridge, UK. E-mail: rmf48@cam.ac.uk
First published on 4th January 2023
The size of composition space means even coarse grid-based searches for interesting alloys are infeasible unless heavily constrained, which requires prior knowledge and reduces the possibility of making novel discoveries. Genetic algorithms provide a practical alternative to brute-force searching, by rapidly homing in on fruitful regions and discarding others. Here, we apply the genetic operators of competition, recombination, and mutation to a population of trial alloy compositions, with the goal of evolving towards candidates with excellent glass-forming ability, as predicted by an ensemble neural-network model. Optimization focuses on the maximum casting diameter of a fully glassy rod, Dmax, the width of the supercooled region, ΔTx, and the price-per-kilogramme, to identify commercially viable novel glass-formers. The genetic algorithm is also applied with specific constraints, to identify novel aluminium-based and copper–zirconium-based glass-forming alloys, and to optimize existing zirconium-based alloys.
The glass-forming ability (GFA) of an alloy composition refers to its ability to resist crystallization during cooling. Two measurable quantities that are directly related to the GFA are Dmax, the maximum castable diameter of a fully glassy rod,5 and the critical cooling rate, the minimum rate at which a liquid must be quenched to avoid crystallization.6 Compositions with higher GFA can be quenched at lower rates and cast to larger diameters, leading more readily to use in commercial applications.
The lack of an atomic lattice and accompanying defects,2 the significant effects of which dominate the properties of crystalline materials,4 results in MGs having interesting properties, including excellent soft-magnetism, corrosion resistance, hardness, yield strength, and resilience.7 MGs typically have thermal expansion properties similar to crystalline materials, lower values of thermal conductivity, and undesirably low ductility and castable diameter.7
There are many potential applications for MGs, ranging from sporting goods8 and smartphone components,9 to precision gears10 and fuel-cell separators,11 among others.12 High hardness observed in many BMGs enables application in scratch-resistant fashion items, such as jewellery and watches.13 In comparison with other biomaterials, the mechanical properties of zirconium-based BMGs are better matched to those of bone,14 and are potentially biocompatible,15 making them promising implant candidates. The application of metallic glasses to functional and structural problems is limited by the difficulty in discovering alloys with both properties meeting the requirements of the design, and high GFA allowing casting to sufficient dimensions.
The wide variety of applications of MGs are merely those that have been imagined so far, for the MG-forming compositions that have been discovered. It is certainly the case that there are vast regions of composition-space not yet probed, containing glass-forming compositions suitable for applications so far unconsidered. Data-driven materials discovery is a promising tool to both explore and exploit the field of metallic glasses.
Due to the size of composition space, illustrated in Table 1 merely via equiatomic alloys, a simple grid-based search (wherein possible alloy compositions are stepped through one by one) is infeasible unless tightly constrained. This is further illustrated by Fig. 1, which presents a contour map of the number of possible alloy compositions given the number of elements in an alloy and the allowed percentage increment, on a logarithmic scale. It is apparent from Fig. 1 that a high-definition study of all possibilities in an 8-element alloy is infeasible computationally, let alone experimentally, as there are on the order of a hundred quadrillion valid combinations of 8 elements with a percentage step of 0.1%. Furthermore, specific isochrones on the map are labelled with the approximate amount of time it would take to compute predictions for that many alloy combinations, assuming a rate of 2000 combinations per second (measured from our model on typical desktop hardware) and serial calculation. Taking a reasonable limit of feasibility to be a month of constant computation, this places rough restrictions on current abilities with brute-force searching to a range between high-definition studies of 4-element alloys and low-definition studies of 8-element alloys. While the computation time may be reduced by more powerful computer hardware, or parallelized searching, the logarithmic nature of Fig. 1 requires exponential acceleration to meaningfully push the boundary of feasibility. Probing the high-definition space of an 8-element alloy would currently take around 3 million years, making this feasible would require a multi-million factor acceleration – something that the approaching failure of Moore's law16 suggests to be unlikely.
Number of constituent elements | Possible equiatomic alloys |
---|---|
1 | 118 |
2 | 6903 |
3 | 266916 |
4 | 7673835 |
It is essential that alternative approaches to materials discovery are used to avoid this problem. While historically trial-and-error has generated most innovations, this strategy cannot continue in the long term, particularly as material complexity increases and serendipity must contend with many degrees of freedom.17Genetic algorithms (GAs) use techniques analogous to processes observed in evolutionary biology to find good solutions to a given problem, with potentially far greater efficiency than a grid search, and are discussed further in Section 2.
In our previous work18 we trained a novel ensemble neural-network (NN) model capable of simultaneously predicting several properties of any given alloy composition, including the probability of glass-formation and the maximum castable diameter of a fully glassy rod, Dmax. To provide background, this model is briefly discussed in Section 1.1.
In the present work, a genetic algorithm is used to iteratively search for compositions that optimize desired properties as predicted by the neural-network model, both with constraints, optimizing existing alloys, and without constraints, freely searching for novel alloys. GAs have been established for some time as a powerful tool for materials science,19,20 in particular being frequently used to optimize the properties of existing materials,21,22 the parameters of processing techniques,23,24 and to design entirely novel materials.25,26 GAs have also been applied in the search for novel glass-forming alloy compositions; Bansal et al.27 applied a NN and GA to identify glassy structures in Cu–Zr alloys with good resistance to shear deformation, Sun et al.28 used a GA to search for energetically favoured packing motifs in glassy Cu–Zr and Al–Sm alloys, and Tripathi et al. applied genetic programming to identify glass-forming ability criteria.29,30
Three main objectives are considered in the present work's search for new glass-forming alloy compositions: maximization of Dmax and ΔTx (=Tx − Tg), and minimization of the cost-per-kilogramme. Dmax is a direct measurement of the GFA of an alloy, while ΔTx measures the width of the supercooled region upon heating, and is related to the stability of the glass.31 Both these quantities may be maximized to return higher GFA, and they are considered simultaneously here to provide two different metrics of GFA.
While a higher inherent GFA would improve the possibility of a glass-forming alloy being reliably cast to sufficient dimensions for use in real-world applications, high-GFA compositions often include expensive elements among their constituents,32 thus raising commercial barriers. For this reason, the price-per-kilogramme is investigated as a minimization objective in this work. The optimization of these three objectives is intended to identify commercially viable BMG-forming alloys, capable of feasibly being incorporated into product designs.
• The liquidus temperature Tl,
• The temperature of the onset of crystallization Tx,
• The glass-transition temperature Tg,
• Classification of GFA as crystalline, GR, or BMG,
• The maximum castable diameter of a fully glassy rod Dmax.
The networks consisted of a number of densely linked hidden layers of neurons, and contained multiple output neurons, allowing a single model to be trained to predict all of the desired properties. The models were trained on a dataset consisting of data on pure elements, and experimental measurements from investigations of glass-forming alloys, compiled from previous publications attempting to model GFA.34,35 Of the compositions in the training dataset, 1700 (25.6%) were crystalline, 3763 (56.7%) were GRs, and 1175 (17.7%) were BMGs. All GRs were approximated to have a Dmax of 0.15 mm, following literature precedent.35
Our previous work concerning the neural-network model for GFA focused on extracting useful theoretical insights into glass formation, by studying the reliance of the model itself on the supplied data inputs. The present work takes an alternative approach, instead probing the outputs of the model by means of a GA.
Gal and Ghahramani36 proposed the use of dropout layers,37 which randomly ‘drop’ neuron connections with the intention of promoting generalisation rather than overfitting, as a route to quantifying uncertainty in standard neural networks. Generally, dropout layers are active only during training, and are disabled when a model is used for inference. If the dropout layers are not disabled, and neuron connections are randomly removed during inference, different predictions may be made by the model for the same input – the variation across many repeated predictions is interpreted as the uncertainty of the model for that specific input. Since our ensemble model contains dropout layers, we are able to quantify the model's uncertainty in all predictions made via the method of Gal and Ghahramani. Each prediction made here is repeated 100 times with dropout layers active, taking the mean as the final value, and the 95% confidence interval as the uncertainty.
Fig. 2 illustrates the general flow of a genetic algorithm, wherein a population of candidate solutions is assessed on their fitness, and acted upon by competition, recombination, and mutation operators, each described in the following sections, until there is convergence on a solution.
There are various methods by which fitness can be defined to enable calculation. For example, if an investigation is searching for glass-forming alloy compositions with large Dmax values, fitness could be defined simply as the value of Dmax. The situation becomes more complex when there are multiple objectives, as the ability of a candidate must be assessed on each criterion to provide an overall judgement that is comparable to judgements made about its peers. The criteria a candidate is being judged on may have wildly different scales, ranges, and interpretations, such that their raw numerical values cannot be simply compared or combined to give a total score.
Pareto ranking allows the comparison of the fitness of any two candidates in different frontiers. To allow ranking of candidates found in the same frontier, and in the interest of improving diversity within the population, algorithms may compare candidates by their average similarity to the entire population of the frontier, favouring those that are more different. One such measure of diversity is the crowding distance, calculated as the distance along a frontier to the neighbours of a candidate, averaged over each direction. Candidates with a shorter crowding distance are in a region of higher population density, and are less preferable than candidates with a longer crowding distance. This measure is used by the popular NGSA-II evolutionary algorithm,41 which itself is used as the template for the algorithm implemented in this work. It is noted that the crowding distance is not useful for measuring the objective fitness of a candidate; rather it is a metric that prevents algorithms entering local minima with tunnel vision.
{(Cu, 75%), (Zr, 15%), (Al, 9.5%), (Ti, 0.5%)}
{(Be, 50%), (U, 50%)}
{(Li, 99.9%), (Fe, 0.1%)}
{(Pd, 100%)}
The initial population of alloy candidates is generated entirely randomly, with no requirement for realism, such that alloys that could never be created may be suggested. It is the subsequent action of the GA that transforms the random population into feasible BMG-formers.
Tournaments are held, rather than simply selecting the elite top 50% of the population, in order to decrease the likelihood of the algorithm becoming trapped in local minima.
(1) |
The intermediate recombination approach is used here, rather than a crossover-based approach, because the genes of a composition have real-valued rather than integer representations, being the value of the atomic percentage of each element.
The recombination rate, the probability that parent compositions undergo recombination rather than simply being copied into the next generation, is set to the commonly used value of 90%.44
During mutation, the genetic code of an alloy composition undergoes one of several possible alterations:
• Add: a new element is added to the composition with a random percentage:
• Remove: an element is removed from the composition:
• Swap: the percentages of two elements in the composition are swapped:
• Adjust: the percentage of an element in the composition is changed by a random amount:
After all kinds of mutation, the percentages in a trial alloy composition are rescaled to ensure they sum to 100%. Any elements with a percentage below a threshold of 0.01% are removed from the composition. The culling of trace elements from compositions is performed to ensure composition percentages remain within the level of fine control typically possible during production – alloys with ≤0.01% of impurities are referred to as “super pure”.46 Limiting the minimum atomic percentage in the candidate alloys decreases the size of the search space, as illustrated by Fig. 1, meaning the algorithm does not waste effort investigating alloys containing element traces that could not be reliably replicated experimentally.
However, due to the stochastic nature of GAs and the lack of a single well-defined solution for the ‘best glass-former’ led to by smooth gradients, the alloy compositions selected are highly sensitive to the initial guesses. Given that these initial guesses are also random, the algorithm will generally return novel suggestions each time it runs, as it explores different areas of the composition-space and converges upon different local minima. This effect becomes more pronounced as the size of the accessible composition-space, i.e. the number of elements allowed in a trial composition, increases. While this can be mitigated by techniques such as simulated annealing, as discussed in Section 2.7, algorithmic investigation can be guided to areas of existing human interest via the application of constraints to the trial compositions. For example, certain elements may be required in every composition, with upper or lower bounds on their percentages. Such constraints also allow the GA to be employed in the optimization of existing alloy compositions, by limiting elemental percentages to be close to the nominal values of a particular alloy.
Restrictions on alloy compositions are applied here through an additional genetic operator, the constraints operator, which acts on every composition in the population. Given a set of rules, the constraints operator modifies any composition which violates a rule, until all rules are satisfied. For instance, if a rule specifies that the element copper must be present in every candidate, with atomic percentage between 40% and 60%, the constraints operator would act in the following way for these example alloy compositions:
An alternative approach to applying constraints, not used here, is to penalise the fitness score of illegal candidates, leading to evolution towards candidates that both have the desired properties and exist within the constrained search-space.47 The constraints operator has the benefit, in comparison to penalisation, of effectively reducing the size of the search-space, as the illegal candidates are inaccessible.48 Conversely, the constraints operator is somewhat more complex to implement than a simple penalisation of candidate fitness.
Given an existing candidate, A, and a new candidate B, with fitnesses fA and fB, SA selects a candidate using the following scheme:50
(2) |
The annealing temperature is reduced over the course of the search by means of a cooling schedule, of which there are a large variety. Cooling quickly may prematurely cull the algorithm's freedom to widely explore the search space, while cooling slowly may inefficiently prolong the computation time required to obtain good solutions.51
In this work, we incorporate SA as an additional stage of the GA, resulting in a hybrid search scheme able to benefit from the exploratory capabilities of SA.52 Here, alloy candidates of the current generation, created by the action of the competition, recombination, and mutation operators, are compared during annealing against randomly generated alloys. We apply the commonly used geometric cooling schedule, which defines the annealing temperature as,51
Ti = T0αi, | (3) |
Convergence in scientific computing generally refers to the state in which no further improvements upon the current best solution can be found. The likelihood that the current best solution is globally optimal is related to the quality of the searching algorithm, and the convergence criterion used to decide whether searching continues or terminates. Here, we define convergence as 20 consecutive generations with less than 1% improvement on each of the target objectives.
The local outlier factor (LOF) of Breunig et al.57 measures the relative isolation of an object in its local neighbourhood, outlying objects being those existing in areas of the space with substantially lower density,
(4) |
(5) |
Reach-dist(x,y) = max[k-distance(y),d(x,y)], | (6) |
While the LOF was originally intended for outlier detection within datasets, it can equivalently be considered as a measure of the ‘novelty’ of a new data-point with respect to an existing dataset.58,59 To ensure that the machine-learning model for GFA is used meaningfully, for interpolation rather than extrapolation, alloy candidates identified by the LOF to be outliers are discarded in this work. In other investigations, where on-the-fly data gathering and model re-training is possible, outlying candidates provide a valuable opportunity to expand the experience of the model and reinforce its understanding.60–62 We welcome the experimental validation of all the BMG-forming alloy candidates identified here by the GA, data on which, positive or negative, would allow updating of the training dataset and re-training of the model.
As the number of dimensions increases, the relative contrast of a distance measure, being the difference between the shortest and longest lengths, tends towards zero.63 Thus, the ability of a distance measure to distinguish objects in high-dimensional spaces also tends towards zero – this phenomenon is known as the ‘curse of dimensionality’.64 The feature-space of the alloy candidates in this work is 39-dimensional,18 and so to mitigate the curse, we apply principal-component analysis (PCA), enabling embedding of samples in a three-dimensional space where distance measures remain meaningful,65 before calculation of the LOF.
Composition | Predicted Dmax (mm) | Local outlier factor |
---|---|---|
Y41Er36Be13Rh10 | 17.2 ± 3.4 | −0.45 |
Zn39Y27Nd17Yb16 | 16.8 ± 3.1 | −0.47 |
Er42Mg25Be18Cu15 | 11.8 ± 2.6 | −0.48 |
Co32Y30Ho21Ba17 | 11.2 ± 1.6 | −0.52 |
Y38Ag36Dy26 | 9.1 ± 1.6 | −0.51 |
Ca40Mg37Cu23 | 5.6 ± 1.0 | −0.48 |
Ho41Cu31Zr15Sc13 | 5.5 ± 0.4 | −0.53 |
Dy44Mn33Li17Be6 | 5.4 ± 1.0 | −0.46 |
La39Tm36Rh25 | 3.8 ± 1.0 | −0.47 |
Zn63Tm37 | 2.9 ± 1.2 | −0.31 |
Yb52Zn48 | 2.2 ± 0.9 | −0.48 |
Al68Yb32 | 0.9 ± 0.3 | −0.47 |
Fig. 5 shows the variation of Dmax within various alloy compositions, illustrating the drastic changes in GFA that can occur across the composition-space of a single alloy, often over relatively small percentage ranges.
P = ∑wipi, | (7) |
(8) |
This approach is the same as that used in our previous work and throughout the literature to calculate approximate properties of alloys based on elemental information.
Pricing data used here are not intended to be highly accurate or recent; indeed due to market fluctuations it would be impossible to avoid immediate obsolescence. Instead, these data merely act as a reasonable proxy for the commercial barriers faced when attempting to manufacture an alloy, such as supply-chain maturity and material availability. The pricing data were acquired from multiple sources reporting on several different years of market activity.68–71 Pricing data for all elements are given in ESI S1.†Table 3 presents a sample of compositions identified by the GA to simultaneously have large predicted Dmax and ΔTx values and low price-per-kilogramme.
Composition | D max (mm) | ΔTx (K) | Price ($ per kg) |
---|---|---|---|
Fe44B22Ce18V16 | 5.14 ± 1.28 | 67.3 ± 7.72 | 54.3 |
B30Mn28Ba27Co15 | 6.44 ± 1.81 | 40.1 ± 4.71 | 5.26 |
Er53Mg30Cu17 | 4.88 ± 0.54 | 55.2 ± 5.15 | 22.7 |
Y38Ti37B25 | 4.67 ± 1.52 | 42.3 ± 3.71 | 23.3 |
Gd49Cu34Mg17 | 4.59 ± 0.78 | 63.1 ± 3.10 | 22.8 |
Er47Ti37B16 | 4.76 ± 0.58 | 46.7 ± 1.74 | 23.3 |
Yb50Zn50 | 2.01 ± 0.89 | 35.5 ± 2.11 | 13.1 |
Er52Zn48 | 1.24 ± 0.73 | 41.6 ± 6.70 | 20.1 |
Mg66Mn34 | 1.47 ± 0.76 | 37.6 ± 4.49 | 2.05 |
The price-per-kilogramme of elements varies over several orders of magnitude, but given the previously mentioned average cost-per-kilogramme of glass-forming alloys of $2310, even the most expensive compositions listed in Table 3, are relatively inexpensive. Some compositions identified by the GA when optimizing multiple objectives also appear in the previous results focusing solely on Dmax. For example, the Yb–Zn composition shown in Table 2 is identified again here, albeit with slightly different elemental percentages. This is to be expected since the single-objective problem is a subset of the multi-objective problem – candidates that perform extremely well on one objective, but poorly on the remainder, can nonetheless Pareto-dominate other candidates by nature of there being no other improvements among the population with respect to that objective.
The intrinsic contribution to an alloy property A of an element i is determined from the composition-weighted average of the property across compositions containing that element from the large set of compositions considered by the GA,
(9) |
Table 4 presents a sample of aluminium-based BMG candidates identified by the GA, while Fig. 6 demonstrates the more general impact on glass-forming ability of including other elements in aluminium-based alloys as calculated by eqn (9). It is observed that, generally, the algorithm attempts to lower the percentage of aluminium in candidate alloys to increase the GFA, and finds particular success alloying Al with Pd, Mn, Zr, Sc, Sr, and the lanthanides. However, elements such as Pd and Sc are extremely expensive, and as such are avoided by the GA despite their contributions to GFA. A sample of high-GFA Al-based alloy candidates identified by the GA, but penalised because of their expense, is given in Table 5.
Composition | D max (mm) | ΔTx (K) | Price ($ per kg) |
---|---|---|---|
Al32Gd32Ho23Be13 | 6.69 ± 3.1 | 60.4 ± 5.0 | 47.3 |
Al41Yb37Er12Te10 | 3.39 ± 0.61 | 60.1 ± 2.75 | 22.6 |
Al51Yb32Cr17 | 4.04 ± 0.43 | 59.4 ± 4.89 | 13.6 |
Al62Yb30Ca8 | 2.13 ± 1.6 | 48.6 ± 3.56 | 12.9 |
Al54Yb40Ba6 | 2.96 ± 1.4 | 47.9 ± 2.68 | 13.1 |
Al62Yb38 | 0.67 ± 0.33 | 48.9 ± 2.78 | 14.0 |
Al50Yb50 | 0.59 ± 0.35 | 49.0 ± 2.74 | 15.0 |
Fig. 6 Contributions to (a) Dmax and (b) ΔTx of elements added to aluminium-based alloys, calculated from the composition-weighted average, defined in eqn (9), for each element across the alloy populations generated by multiple independent runs of the genetic algorithm. |
Composition | D max (mm) | ΔTx (K) | Price ($ per kg) |
---|---|---|---|
Al50Dy22Gd15Ce10Ir3 | 2.61 ± 0.57 | 70.2 ± 2.81 | 4092 |
Al50Nd40Rb7K3 | 4.18 ± 1.87 | 30.8 ± 5.07 | 1186 |
Al50Tm20Nd16Ho10Rb4 | 3.41 ± 0.70 | 62.8 ± 4.40 | 1681 |
Al50Tm29Tb11Dy8Y2 | 2.18 ± 1.52 | 42.6 ± 6.76 | 1711 |
Al60Yb31Gd5Rh4 | 4.23 ± 0.76 | 60.9 ± 4.37 | 7410 |
Al57Yb20Tm20Au3 | 2.39 ± 0.55 | 55.1 ± 4.02 | 4088 |
The lanthanide elements have long been incorporated into BMG-forming alloys, including those that are Al-based.74–80 Aluminium has a relatively large difference in radius compared to most lanthanides, a key metric under Inoue's rules, which frustrates the growth of crystalline phases during cooling.66,81,82 Further, aluminium has large negative mixing enthalpies with the lanthanide elements, promoting mixing rather than phase segregation, another of Inoue's rules.83
Here, we probe the copper–zirconium-based alloy space in a number of ways. Firstly, the GA is constrained to search for alloys such that the percentages of Cu and Zr are the two highest in the candidate compositions. Table 6 contains a sample of alloy candidates identified by the GA under these constraints. It is noted that the well-known region of high-GFA around the Zr50Cu50 composition is identified among the binary alloys. Further, the algorithm identifies beryllium, magnesium, and cerium as important elements to be added to Cu–Zr-based alloys to improve GFA.
Composition | D max (mm) | ΔTx (K) | Price ($ per kg) |
---|---|---|---|
Zr50Cu50 | 0.89 ± 0.26 | 52.4 ± 1.71 | 24.3 |
Zr54Cu46 | 0.80 ± 0.24 | 51.9 ± 1.81 | 25.5 |
Zr53Cu47 | 0.80 ± 0.28 | 52.2 ± 1.55 | 25.2 |
Cu45Zr31Ce24 | 8.26 ± 1.33 | 80.4 ± 3.67 | 15.2 |
Cu49Zr43Al8 | 7.74 ± 0.83 | 72.9 ± 2.70 | 22.7 |
Zr50Cu39Be11 | 5.45 ± 2.20 | 102.5 ± 5.17 | 38.0 |
Zr39Cu38Ce19Be5 | 25.98 ± 3.38 | 99.9 ± 4.99 | 23.1 |
Zr36Cu35Mg15Be14 | 18.34 ± 3.70 | 96.9 ± 8.38 | 40.4 |
Zr49Cu31Be11Dy9 | 10.23 ± 1.97 | 109.0 ± 5.72 | 88.9 |
Zr39Cu38Be11Pb7Zn5 | 9.80 ± 2.95 | 92.2 ± 4.91 | 29.3 |
Zr43Cu38La12Be4Ca3 | 10.40 ± 4.18 | 89.3 ± 5.44 | 24.3 |
In the second line of investigation, constraints are relaxed such that only one of Cu or Zr is required to be the dominant element in candidate compositions, to allow investigations of alloy systems such as Zr–Ti–Cu–Ni–Be.88Fig. 7 demonstrates the average impact on GFA due to the inclusion of each element under these looser constraints, both from the perspective of Cu-dominated and Zr-dominated alloys. Despite dropping the requirement for both Cu and Zr to be present as the dominant components, the GA continues to identify compositions rich in both elements together, implying a high intrinsic GFA associated with the Cu–Zr pairing. However, Fig. 7 also shows a clear difference in the alloying preferences of Cu- and Zr-based BMGs, with each favouring elements from the opposite side of the transition metal series, and Cu exhibiting low GFA when alloyed with certain refractory metals. Both Cu and Zr show high GFA when alloyed with many of the lanthanides, as is also observed with previous evolutions of the GA under different constraints.
Fig. 7 Contributions to Dmax of elements added to (a) copper- and (b) zirconium-based alloys, calculated from the composition-weighted average, defined in eqn (9), for each element across the alloy populations generated by multiple independent runs of the genetic algorithm. |
Alloy | Zr (%) | Ti (%) | Cu (%) | Ni (%) | Al (%) | Resolution (%) | Combinations |
---|---|---|---|---|---|---|---|
LM105 | 47–60 | 2–6 | 12–24 | 0–30 | 5–15 | 0.5 | 181566 |
LM601 | 45–65 | 0–20 | 0–30 | 0–30 | 0–20 | 0.5 | 2932786 |
Alloy candidates found on the Pareto frontier, highlighted in red in Fig. 8, are the optimal set of compositions for the current objectives of maximal Dmax and ΔTx. For each alloy candidate on the frontier, performance on one of these objectives is degraded by considering another alloy candidate. The alloy used in a real-world application with these objectives would be selected from the Pareto frontier, as all other non-frontier compositions are necessarily inferior.
The compositions discovered here to be Pareto-optimal are similar to those found by Ward et al. Importantly, while the referenced work spent “only a few CPU-days of computer time” performing a brute-force scan of the entire composition space, the present work spent on the order of minutes searching with the GA. The total number of combinations of elemental percentages that sum to 100% in the LM105 case is 181566, while the presented execution of the genetic algorithm investigates only 12481 candidates, 6.9% of the composition-space, before converging on solutions. The constraints applied to LM601 are more relaxed, giving a larger space of 2932786 possibilities, of which the GA considers 14684, only 0.5%. This serves to highlight the ability of GAs in general to rapidly identify good solutions to a given problem. In loose analogy, the GA attempts to find and exploit the path of least resistance, in contrast to the brute-force approach. In Fig. 8 multiple ‘clusters’ of solutions are visible, representing different stages of the evolutionary process where the algorithm identifies fruitful regions and explores them for some time, before finding a new region that makes the previous obsolete. Fig. 9 presents similar information on a coarser scale by showing the average percentage of each possible element in all of the alloy candidates at each generation, during evolution of the LM601 alloy. It is shown that the GA quickly increases the amount of Cu, while decreasing the content of the other elements, particularly Al. Following this macroscopic adjustment, the GA enters a period of fine-tuning that is less apparent from the average percentage values, before achieving convergence.
Fig. 9 The average atomic percentage of every allowed element across all candidate alloys in each generation of the genetic algorithm when optimizing the LM601 alloy. |
It is also occasionally observed that the GA may identify the solutions deemed to be optimal early in the evolutionary process, and is unable to improve upon them for the remainder of the investigation. This can be seen for optimization of LM106 in Fig. 8, where a large cluster of candidates from later generations (as indicated by the brighter colouring of the points) exists near the frontier, the result of the GA repeatedly attempting to improve upon the Pareto-dominant alloy compositions. While this inability to improve could be due simply to indeed finding the optimal solutions, it may instead indicate that the GA's application of the recombination and mutation operators creates new candidates further away from the parents than would be desired, such that the local minimum is not efficiently explored before the GA moves elsewhere. If this were true, and the ideal solutions are not being found, it is unlikely that repeated independent executions of the GA would return the same results. Here we observe that the GA, when acting on tightly constrained problems, usually returns the same or extremely similar alloy compositions when repeated, indicating that the optimal solutions are indeed being identified. More loosely constrained problems introduce the complexity of there being multiple disparate groups of alloys that perform equally well, with the alloys returned by the algorithm depending largely on the initial population.
While computationally the rate of consideration of candidate solutions may be fast enough to allow the brute-force method in some situations, such as in the example of the Zr-based alloys considered here with fairly strict constraints, in other cases with more elements or relaxed constraints the combinatorial growth of the composition-space renders brute-forcing infeasible.
Further to the optimization of Dmax and ΔTx, the alloys LM105 and LM601 are also optimized by the GA to minimize price. Fig. 10 demonstrates the Dmax–price plane of the now three-dimensional solution-space. The minimization of price-per-kilogramme is inverted to maximization of kilogrammes-per-dollar for easier conception of the Pareto frontier. The candidate solutions are more densely packed in the region of high kilogrammes-per-dollar and low Dmax, due to the GA treating both objectives equally and inexpensive alloys being more common than those with high Dmax. In both cases, the GA minimizes the amount of zirconium present in the alloys, as this is the most expensive component. The algorithm also substitutes copper for aluminium when attempting to further reduce the price, but does this at the expense of Dmax.
Our ensemble neural-network model18 was trained on a dataset containing examples of crystalline, GR-forming, and BMG-forming alloys. This resulted in a model proficient in distinguishing between these three categories of alloy, but by nature of the dataset it was not presented with any examples of alloys consisting of immiscible elements. Mixing enthalpy data, which are related to the miscibility, were available during training, but the model could only relate these data to the GFA. However, since a large negative value of the mixing enthalpy is a known criterion for glass-formation under Inoue's rules,66 immiscible alloy compositions are likely to be deemed unfavourable by the model.
The Miedema model is applied here to approximate the miscibility of proposed alloy compositions, as a physically grounded check on the output of the GA and machine-learning model. The Miedema model defines the mixing enthalpy in terms of differences in electronegativity and electron density,89
(10) |
Fig. 11 shows the calculated mixing enthalpy for the novel Mn–Yb and Ti–Rh–Tl compositions, which are identified by the GA to be good glass-formers. The mixing enthalpy of Mn–Yb, while negative across the composition-space, is relatively small, indicating little driving force for mixing. In the case of Ti–Rh–Tl, the majority of the composition-space exhibits a positive enthalpy of mixing, other than in the region of low Tl-content in which the GA identifies candidate alloys with large predicted Dmax. Fig. 12 shows the mixing enthalpy throughout the population of alloy candidates identified during an evolution of the GA, demonstrating that the majority of predicted BMG candidates do indeed have a negative mixing enthalpy. However, there remains a small portion of the population with positive mixing enthalpy, which are unlikely to be miscible in reality. This indicates that the alloy compositions identified by the GA, using predictions provided by the neural-network model, may be good glass-formers under the assumption that all combinations of elements can form homogeneous alloys, but other unconsidered factors such as phase-separation could render them difficult to form experimentally.
Fig. 11 Prediction of the Miedema mixing enthalpy for compositions identified by the genetic algorithm to have large predicted Dmax: (a) binary Cu–Zr and (b) ternary Hf–Dy–Cu. |
Fig. 12 Histogram of mixing enthalpies calculated using the Miedema model applied to candidate compositions generated by the genetic algorithm, comparing the entire population to BMGs alone. |
To incorporate this physical reasoning formally into the alloy selection process, the mixing enthalpy is added as a minimization target for further evolutions of the GA. This has the effect of penalising poor miscibility during evolution, filtering out alloy candidates with poor ability to form homogeneous mixtures. Table 8 contains a sample of alloy candidates identified via this process.
Composition | Predicted Dmax (mm) | Mixing enthalpy (kJ mol−1) |
---|---|---|
Sr60Zn40 | 1.70 ± 1.13 | −14.21 |
Cu46Zr54 | 0.71 ± 0.29 | −22.12 |
Yb52Zn48 | 2.19 ± 0.92 | −20.32 |
Lu36Pr34Pd30 | 7.86 ± 2.29 | −64.62 |
Y43Nd33Pd24 | 9.90 ± 2.33 | −58.81 |
Hf41Dy30Cu29 | 5.27 ± 0.99 | −11.95 |
GAs search for optimal solutions to any problem through application on a candidate population of the biologically inspired competition, recombination, and mutation operators, requiring direct assessment of only a small fraction of all the possible solutions. It is hoped that the adoption and further development of the GA methodology will lead to reduced costs of materials discovery and development, and to increased usage of BMGs in commercial applications. While this work considers the pairing of a GA with a model for GFA, future investigations may elect to use models for different material properties, such as strength or corrosion resistance.
Here, we pair a GA with a neural-network model for GFA, to enable rapid searching through composition-space for alloys predicted to be excellent glass formers. This is performed both to optimize existing compositions, and to predict entirely novel glass-forming alloys. We target the optimization of the maximum castable diameter of a fully glassy rod, the width of the supercooled region, and the price-per-kilogramme, with the intention of identifying commercially viable novel bulk metallic glass candidates.
Aluminium-based bulk metallic glass (BMG) forming alloys are of interest due to their potential for high strength and corrosion resistance at low cost. The GA identifies many alloy candidates predicted to be good glass-formers, particularly those alloying aluminium with lanthanides. Similar results are obtained for copper–zirconium-based BMG candidates, of interest for their mechanical properties and low relative cost, indicating an important role played by the lanthanide elements in glass formation. Further, application of the GA to optimization of Zr-based alloys returns similar results to previously published brute-force investigations, but with computation time reduced from days to minutes.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2dd00078d |
This journal is © The Royal Society of Chemistry 2023 |