Open Access Article
This Open Access Article is licensed under a Creative Commons Attribution-Non Commercial 3.0 Unported Licence

Micro-kinetic modeling of temporal analysis of products data using kinetics-informed neural networks

Dingqi Nai , Gabriel S. Gusmão , Zachary A. Kilwein , Fani Boukouvala and Andrew J. Medford *
School of Chemical and Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA. E-mail: ajm@gatech.edu

Received 19th June 2024 , Accepted 7th October 2024

First published on 10th October 2024


Abstract

The temporal analysis of products (TAP) technique produces extensive transient kinetic data sets, but it is challenging to translate the large quantity of raw data into physically interpretable kinetic models, largely due to the computational scaling of existing numerical methods for fitting TAP data. In this work, we utilize kinetics-informed neural networks (KINNs), which are artificial feedforward neural networks designed to solve ordinary differential equations constrained by micro-kinetic models, to model the TAP data. We demonstrate that, under the assumption that all concentrations are known in the thin catalyst zone, KINNs can simultaneously fit the transient data, retrieve the kinetic model parameters, and interpolate unseen pulse behavior for multi-pulse experiments. We further demonstrate that, by modifying the loss function, KINNs maintain these capabilities even when precise thin-zone information is unavailable, as would be the case with real experimental TAP data. We also compare the approach to existing optimization techniques, which reveals improved noise tolerance and performance in extracting kinetic parameters. The KINNs approach offers an efficient alternative for TAP analysis and can assist in interpreting transient kinetics in complex systems over long timescales.


1 Introduction

Catalysis is a key component of the current chemical industry, and is involved in the production processes of more than 80% of the items used in our daily life.1–3 Furthermore, catalysis will play an important role in the development of new chemical processes that improve energy production efficiency, flexible conversion of renewable feedstocks, and control of pollution emissions.4–10 Therefore, the discovery and optimization of catalytic materials is an essential factor in the development of technologies to improve the economic and environmental sustainability of modern society.11,12 As multicomponent solids, the effectiveness of many heterogeneous catalysts depends on their unique properties and operating conditions, such as particle size, chemical composition, and kinetic characteristics.13–16 Slight variations in intrinsic catalyst properties can result in significant changes in macroscopic reactor performance in terms of conversion, yield, and selectivity.17–19 Therefore, a complete understanding of the properties of the catalyst and their relationship to the intrinsic kinetics of the elementary steps is desired to help rationally develop and optimize new catalyst materials.20–22

Over the years, many techniques for obtaining the catalyst properties and intrinsic kinetics of catalyst materials have been continuously developed.23–43 Catalytic experiments can be broadly divided into steady-state studies and transient kinetic studies. Steady-state techniques operate under the assumption that systems are designed to reach a steady state where the rate of change in reactor properties is effectively zero.44,45 Steady-state operation allows simplification of the governing equations, but the experiments are time-consuming, and the low temporal resolution makes steady-state studies best for providing a perspective on the behavior of the catalyst under a single set of operating conditions. Moreover, the kinetic behavior of steady-state data is dominated by the rate-limiting steps,46,47 or the rate-determining states,48 making it difficult to provide sufficient knowledge of each elementary step to predict how the catalyst will behave under a wide range of operating conditions.20,22,49,50

In contrast to steady-state studies, transient kinetic studies function under designated disturbances, such as rapid changes in gas flow rate or temperature.1,51–53 The dynamic response of the system to the induced time-dependent disturbance is then monitored using a mass spectrometer.54 Even if short-lived intermediates are included, the high temporal resolution allows a more comprehensive understanding of each elementary step that controls the process, which is generally unavailable through steady-state studies.20,28,52 In particular, temporal analysis of products (TAP) is a well-studied transient kinetics tool,20,45,53,55 which has attracted more attention in recent years due to the high volume of data and the ability to directly study powdered or supported catalysts.54,56–62 TAP experiments generate rich data sets that provide fundamental quantitative insights into the material properties and transient kinetic responses of complex supported catalysts. However, converting the collected data into physically interpretable kinetic information is still challenging, particularly for multi-pulse datasets that may include hundreds or thousands of individual TAP pulses that gradually alter the state of the catalyst.20

Several steps have been taken to learn kinetic models and parameters from TAP data, including the derivation of the governing partial differential equations (PDEs) and PDE-constrained optimization, but some gaps still need to be filled.54,56,61–74 Most of these tools exhibit a trade-off between scalability and model complexity and struggle to handle large reaction networks (>100 parameters) or multi-pulse TAP data (typically comprising hundreds to thousands of pulses) efficiently. In other words, the extraction of kinetic information is still limited by the “data velocity” and the “data volume” of the “V's” of data science.75–77 To combat this issue, we introduce kinetics-informed neural networks (KINNs), a machine learning-based method, into the field of TAP data analysis. KINNs can fit the existing data while solving and parameterizing a physically interpretable kinetic model.78 The machine learning structure provides high scalability, enabling KINNs to handle multi-pulse data efficiently through a single network, even for reaction networks with many elementary steps. Additionally, the flexible loss function allows for the incorporation of information from different sources, which fully exploits the vast data sets that can be obtained through TAP experiments.

This work demonstrates the theoretical and practical validity of using KINNs to extract kinetic information from TAP data sets. We present three case studies: ideal single-pulse, ideal multi-pulse, and practical multi-pulse. In all cases, simulated synthetic data is used so that the ground truth is known, although the rate constants and noise levels are inspired by an experimental data set to ensure that they are representative.56 In ideal cases, explicit reactor information from the catalyst zone (concentration and net flux) is available from the simulation. In contrast, the practical case mirrors a realistic scenario, where we assume that only fluxes at the reactor outlet are known and utilize data preprocessing in conjunction with the KINN formalism to estimate concentrations in the catalyst zone. The performance of KINNs is compared to the optimization via collocation in the case of the practical multi-pulse data. The results indicate that KINNs are a promising option for analyzing transient kinetics, particularly in the case of large or noisy datasets, and indicate that KINNs provide a scalable and flexible foundation for the development of new techniques in the analysis of TAP and other types of transient kinetic data.

2 Methodology

2.1 Kinetics-informed neural networks

KINNs, developed by Gusmão et al.,78 solve ordinary differential equations (ODEs) utilizing feedforward neural nets (NNs) as a basis function.79 While training to fit concentration data, the NNs are also constrained by the micro-kinetic models (MKMs) that are in the form of ODEs.30,80 At each time t, the rate of change of concentration based on the MKM is given as:
 
ċ = M × k(θ)∘ψ(c) + f,(1)
where image file: d4dd00163j-t1.tif is the stoichiometriy matrix, and ψ is a function that maps concentration image file: d4dd00163j-t2.tif to the corresponding concentration-based power-law kinetic model. image file: d4dd00163j-t3.tif is the temperature (θ) dependent Arrhenius-like rate constant term, and image file: d4dd00163j-t4.tif is the transport term. Here, n represents the number of species, m is the number of elementary reactions, and ∘ represents an element-wise product. A more detailed description of the formulation of KINNs can be found in previous work by Gusmão et al.78. In this study, under the isothermal assumption, we simplify the rate law term in eqn (1) to
 
r(c,k) = M × kψ(c)(2)
where image file: d4dd00163j-t5.tif.

During training, the residuals of the states and derivatives ε are calculated as shown in eqn (3).

 
image file: d4dd00163j-t6.tif(3)
Here t is the input time representation, and ωs is the weights of the connections in the NN. The tilde term represents the target value, and therefore the loss function J is defined as the combination of data loss jdata and MKM loss jmodel using a weighted parameter α, which follows eqn (4).78
 
image file: d4dd00163j-t7.tif(4)
The tuple ω = (ωs,k) combines the NN parameters, or the weights of the connections, ωs and model (kinetic) parameters k. In the forward process, the Adam optimizer minimizes data loss jdata in the least squares sense. During training, the Adam optimizer81 is also used to solve the inverse problem with the same network, minimizing the MKM loss, jmodel.82 It's worth noting that the discretization points for the model residual do not necessarily need to coincide with the data collection points. In this work, for simplicity and computational efficiency, we chose to evaluate the model residuals at the same data collection points. This approach allows for a direct comparison between the model predictions and the experimental data at each point. More detailed discussion on discretization in KINNs can be found in the work of Gusmão et al.78

This loss function grants KINNs significant scalability and flexibility in the sources of information it can handle, since the neural network acts as an interpolator that can smooth noise or fill in missing values. The approach scales well to large reaction networks, since the kinetic parameters enter the optimization in the same way as NN weights, so that the size of the reaction network becomes limiting only if the number of elementary steps approaches the number of parameters in the NN (typically >100). In addition, other information, such as reaction thermodynamics, DFT energies, or transient spectroscopic signals, can, in principle, be included as additional constraints to MKMs by including more residual components in the loss J with proper weighting hyperparameters. The relative focus of the network can be altered between the model and the data by adjusting the hyperparameter α. Unfortunately, results can be sensitive to this choice, and there is currently no standardized method to choose an optimum α; instead, this must be done manually using previous knowledge of the relative importance between the data and the model, cross-validation techniques, or multi-objective optimization to identify an optimal trade-off.78 In the field of power system engineering, some works have employed Lagrangian duality to iteratively adjust the hyperparameter values.83,84 Although these works are not targeted to kinetic modeling, they may still offer useful insights for hyperparameter tuning. Recently, the development of robust KINNs has resolved this issue by using a maximum-likelihood estimation formalism.85 However, the method assumes that all intermediate states are known, so it is not directly applicable to TAP data, although future work will focus on adaptation of robust KINNs to the TAP problem.

2.2 Transport and kinetic model in TAP reactor

A typical TAP system consists of a catalyst zone sandwiched between two inert zones, and the reactor operates under isothermal vacuum conditions. During experiments, a series of nanomolar pulses of reactant/inert gases are injected into the reactor, and a mass spectrometer is placed at the outlet to instantly quantify the outlet flow of gas molecules with a millisecond time resolution. Gas transport occurs in the Knudsen diffusion regime, which provides well-defined transport properties for the reactor.1,20,52,53 In TAP reactors, gas-phase chemistry is generally considered negligible due to the low pressure operation, short residence time, and the Knudsen diffusion regime. Although there are some cases where these factors are used to isolate gas-phase reactions,86,87 in general gas-phase reactions can be avoided in TAP data. A more detailed explanation of the TAP reactor is beyond the scope of this paper and can be found in previous work including Gleaves et al.53 and others.

The one-dimensional model that describes the dynamics, including both diffusion and chemical reaction, inside the TAP reactor is

 
image file: d4dd00163j-t8.tif(5)
where ci stands for the concentration for species i at location x at time t, image file: d4dd00163j-t9.tif is effective Knudsen diffusion coefficient, e is bed voidage, and ri is the reaction rate for species i, which is a function of gas phase species and adspecies concentrations in catalyst zone and intrinsic kinetic parameters.88 In practice, the catalyst zone is often purposefully designed to be much thinner than the inert zones, following the thin-zone TAP reactor (TZTR) proposed by Shekhtman et al.45 The TZTR is a critical concept in TAP design and analysis. It ensures a highly uniform chemical composition and stable catalyst properties throughout the reactive zone and therefore allows for several key simplifications in the analysis: (1) the negligible concentration gradients in the catalyst zone, (2) the approximation of governing PDEs to ODEs for catalyst zone, and (3) the applicability of the Y-procedure to estimate catalyst zone concentrations and net fluxes from the outlet flux data.88 Under these assumptions, the thin catalyst zone can then be described as a diffusional transient CSTR.45,55,89 In this case, the diffusion term in the above governing equation can be approximated to
 
image file: d4dd00163j-t10.tif(6)
where fini and fouti are the flux into and out of the thin zone for species i, and lcat is the catalyst zone length. So, when the explicit flux at each timestamp is available, the spatial dependence in eqn (5) can be eliminated, and the PDE governing equation, eqn (5), can be approximated to an ODE as
 
image file: d4dd00163j-t11.tif(7)
which can be directly mapped to the mathematical form of KINNs in eqn (3).

2.3 Data generation and processing

Carbon monoxide oxidation is a common benchmark reaction in the field of catalysis and TAP experiments,58 and we adopt it as a case study here to support the validity of the KINNs approach. The elementary steps and kinetic parameters are provided in Table 1, which were generated from previous TAPSolver simulations and serve as the ground truth in this study.56 Additional information about TAPSolver can be found in previous work by Yonge et al.56 and scripts to reproduce the data are provided in the ESI. TAPSolver simulations provide concentrations in the catalyst zone as well as the net flux, which can then be directly fed into KINNs — this is referred to as the “ideal case” since the exact concentrations of all species (including adspecies) in the thin zone are assumed to be known. In the ideal case, we omit noise from the data to evaluate the ability of KINNs to recover kinetic parameters. The robustness of KINNs to noise is explicitly demonstrated using the practical multi-pulse data set in Section 3.3 and the performance comparison in Section 3.4.
Table 1 List of elementary steps and kinetic parameters used to generate the synthetic carbon monoxide oxidation data. Dashed value indicates the parameter was not included in simulation
Kinetic parameter Reaction Actual value Units
k 1 CO + * → CO* 15.0 cm3 nmol−1 s−1
k −1 CO* → CO + * 0.70 s−1
k 2 O2 + 2* → 2O* 0.33 cm6 nmol−2 s−1
k −2 2O* → O2 + 2* cm3 nmol−1 s−1
k 3 CO* + O* → CO2 + 2* 0.40 cm3 nmol−1 s−1
k −3 CO2 + 2* → CO* + O* 0.02 cm6 nmol−2 s−1
k 4 CO + O* → CO2 + * 15.2 cm3 nmol−1 s−1
k −4 CO2 + * → CO + O* cm3 nmol−1 s−1


In contrast to simulated data, in real experiments, the catalyst zone concentration and fluxes are not directly measurable, and noise will be present. Therefore, noise was added as a Gaussian error distribution centered around 0 with the width of half the standard deviation of estimated signals. TAP noise was shown to be a combination of Gaussian noise and spectrally localized noise, and Gaussian noise was found to be the dominant noise source in many data sets.90 The noise level is proportional to the intensity of the signal and the noise is heteroskedastic, which means that the proportionality factor varies over different data. However, here we assume a constant proportionality factor as a first approximation. The half of the standard deviation choice was based on empirical observations from real TAP experiments and our comparative tests (Section 3.4), showed that using the full standard deviation often produced noise levels higher than typically observed in actual experiments. Using half the standard deviation provided a realistic noise level while ensuring model convergence across different analysis methods. We also note that the induced noise may affect mass balance, particularly at higher noise levels. While this effect is minimal at the chosen noise level, it becomes more significant as noise increases. However, the KINN approach can mitigate this issue to some extent by regularization through multiple loss terms.

To estimate the catalyst zone information in the realistic scenario, the Y-procedure is applied to approximate thin zone concentrations from the outlet flux data. The Y-procedure was introduced by Yablonsky et al.88. This technique provides a route for extracting the reaction rate and concentration of gas species from the secondary TAP response outlet gas flux, without any prior assumptions about the kinetic or diffusion models.60,88 Yablonsky et al.88 showed that when the thin zone assumption holds, the difference of two fluxes is equal to the estimated reaction rate rY, as

 
fin(t) − fout(t) = rY(t).(8)

The concentration of adspecies is not directly available through the Y-procedure, but the “atomic uptakes”, which are defined as the amount of each elemental species present on the surface, can be added to the loss function in eqn (4) to constrain the concentration of adspecies in the thin zone. Gas uptake U is determined as the integral of each reaction rate with respect to time scaled by stoichiometry as:

 
image file: d4dd00163j-t12.tif(9)
where ν represents the stoichiometric coefficients.61,88,91,92 By using the results of noisy TAPSolver simulations to calculate estimated thin-zone gas concentration, net flux, and surface uptake, the synthetic data become a direct surrogate for experimental data that can be fed into KINNs to derive the intrinsic kinetics. Fig. 1 shows a typical workflow for this analysis.


image file: d4dd00163j-f1.tif
Fig. 1 Workflow of modeling TAP data with KINNs. The raw outlet response is processed to obtain thin zone concentration and flux that are fed into constructed KINN model. The obtained KINN parameters can be used to interpolate single or multi-pulse data, and the kinetic parameters can be used directly in kinetic models or serve as an initial guesses for refinement using PDE tools like TAPSolver.

Before feeding the data into the KINN model, the data was subsampled non-uniformly to have a higher density of data points in the high-derivative region. Specifically, half of the data points were taken from the 0.0 to 0.5 s interval. Additional sampling techniques, such as uniform sampling in log space and Chebyshev sampling, could be considered, but exploring the optimal sampling technique is beyond the scope of this paper. Concentrations and uptake are normalized to a range between 0 and 1 to ensure numerical stability and efficiency within the NN. In the single-pulse case, each species is scaled using its own maximum value, resulting in all series being normalized to 1; while in the multi-pulse cases, each species' concentration is normalized using its maximum value within the entire multi-pulse training dataset, as this approach allows us to visualize the build-up of coverage over multiple pulses. Moreover, instead of using raw timestamps t, we transform the timestamps with a natural logarithm to enhance the resolution of the KINN at short timescales, which often features rapid changes in concentration. This logarithmic transformation ensures that the KINN can adapt more effectively to these early, rapidly evolving dynamics.

2.4 KINNs setup

To model the single-pulse TAP response, we constructed a 2-layer 8 hidden unit KINN (70 adjustable parameters) with a swish activation function,93 where the input is the time representation and the outputs are the scaled thin zone concentrations. During training, α was initially set to 1 × 10−10, which implied that the network would fit the concentrations without taking the MKMs into account. After every 5 epochs, each of which had 1000 iterations, α was increased by a factor of 10 until it reached 1. Then, α could be increased or decreased, and the direction and amount are manually adjusted based on the relative value between jdata and jmodel, and the parity behavior between the predicted and target values. Switching the focus back and forth effectively alters the initial guess of the NN weights for each value of α, and allows KINNs to find NN weights that optimally fit both the MKMs and the concentrations. For α tuning, we rely on the loss value and r2 correlation for both the concentration data and the kinetic model. This approach is chosen because the model residual, as defined in eqn (3), is sensitive to changes in the predicted concentration values. Specifically, adjustments to jdata can negatively impact the fit of the kinetic model by affecting jmodel, even if they result in a lower overall loss value. Monitoring these factors independently provides a route to balance the trade-offs between data fidelity and model accuracy. A more systematic optimization of α is beyond the scope of this work, but more detail can be found in previous work by Gusmão et al.78

Due to the increased size and complexity of multi-pulse dataset, including a wider range of the state space, complex pulse-to-pulse behavior, and additional inputs, we increased the neural network's complexity. For multi-pulse fitting, we constructed a 3-layer KINN with 10 hidden units in each layer, resulting in a total of 226 adjustable parameters. The zeroth moments (m0) of the TAP outlet flux of a given pulse were included as an additional input. The m0 of the outlet flows is equal to the amount of corresponding gas species that exit the reactor during the specific pulse;60,65,94 therefore, using it as an additional input provides the KINN with information about the surface conditions during the specific pulse and indicates the relative position of the current pulse in a series of pulses. To test KINN's ability to interpolate unseen pulses, which is critical to sparsify dense TAP data consisting of hundreds or even thousands of pulses, we used a synthetic 10-pulse CO oxidation data set, where pulses 0, 1, 2, 5, 8 were used for training, and pulses 3, 4, 6, 7, 9 served as testing sets.

The kinetic parameters are treated as part of KINNs' parameter set, allowing their uncertainty to be estimated through the inverse of the Hessian matrix of the loss function as

 
P = HJ−1(k),(10)
where P represents the inverse of Hessian of the loss function, HJ, with respect to the kinetic parameters k. The standard deviation σi of the parameter i can be calculated by taking the square root of the diagonal values using eqn (11)
 
image file: d4dd00163j-t13.tif(11)
based on the parameter's covariance.56,95,96 This is not a rigorous error estimate because the Hessian, and therefore the confidence intervals, depend on the hyperparameters that control the relative weights in the loss function.85 However, the estimate is still useful since it provides a semi-quantitative sensitivity analysis to each kinetic parameter.

3 Results & discussion

To assess the ability of KINNs to extract kinetic parameters from transient kinetics measured with a TAP reactor, we conducted three case studies, including single-pulse CO oxidation with explicit thin zone information (“single-pulse ideal scenario”), 10-pulse CO oxidation with explicit thin zone information (“multi-pulse ideal scenario”), and 10-pulse CO oxidation with noise and fluxes and uptakes estimated using the Y-procedure (“multi-pulse practical scenario”). Finally, we compare the results of the multi-pulse noisy data fitting to the results obtained using collocation, a standard non-linear optimization technique for finding parameters of ODEs from data.

3.1 Single-pulse ideal scenario

Fig. 2(a) and (b) show the performance of the KINN for predicting the concentrations and rates, respectively. The predicted concentrations from the KINN matches the target data for the scaled concentration well, achieving a Mean Absolute Error (MAE) of 1.39 × 10−2, which is equivalent to 4.66 × 10−2 nmol cm−3 in the original scale. A minor discrepancy in the initial concentration of the adspecies could be further refined by incorporating constraints on the initial condition into the loss function.78 However, we omit this step since the goal of this study is to work toward realistic multi-pulse scenarios where initial conditions may not be known exactly. Furthermore, the kinetics based on the estimated concentrations demonstrate a clear parity with the automatic differentiated derivatives, as indicated by an MAE of 4.20 × 10−1 nmol cm−3 s−1 and r2 values of ∼0.99 for both concentrations and rates.
image file: d4dd00163j-f2.tif
Fig. 2 KINN optimization of single-pulse CO oxidation under minimization of cost function J in eqn (4) for (a) concentrations, where circles represent target values and dashed lines represent predicted values, and (b) a parity plot of rates extracted from the KINN model and predicted from the kinetic model.

Table 2 presents the values of the converged kinetic parameters in this case study. The initial guess for all parameters was 1 × 10−5, which is far from the true values to demonstrate the robustness of the KINN and to avoid bias during optimization. The agreement between the converged and true values illustrates the ability of KINNs to extract intrinsic kinetic parameters when explicit concentration and flux are utilized. Fig. 3 compares the ground truth TAP curve and the TAP curve obtained by solving the ODE model, using both the KINN-extracted and ground truth kinetic parameters. For all species except CO2, the solved concentration profiles are nearly indistinguishable. The discrepancy observed in CO2 is mainly due to the approximation of a PDE-based to an ODE-based system, as evidenced by the fact that the two curves derived from ODEs are very similar. A possible reason we did not observe this discrepancy for the reactants is that the reactant gases are present from the beginning of the experiment, and therefore their consumption rates are more straightforward to model accurately. In contrast, CO2, the product, is formed as the experiment progresses, and its formation rate can be influenced by more factors. This complexity likely enlarges the discrepancy in its approximation. Additionally, the slower diffusion rate of CO2 might render the transport term dominant more in its gradient. Consequently, omitting the rigorous transport model may contribute significantly to the deviation in its predicted concentration profile. Despite these discrepancies, the predicted kinetic parameter values are within an order of magnitude of the ground truth parameters, and the MAE for the apparent activation barriers, calculated using the Eyring equation, is 0.019 eV, which is an order of magnitude lower than typical DFT errors. These results indicate that the ODE-based KINNs approach is capable of accurately recovering kinetic parameters from TAP data.

Table 2 List of the kinetic parameters that served as ground truth and KINN's predicted values for single-pulse ideal (Section 3.1), multi-pulse ideal (Section 3.2), and multi-pulse practical (Section 3.3) simulations with the standard deviations. Initial guesses of all parameters in all three cases were set to 1.0 × 10−5
Parameter Actual value Single-pulse ideal Multi-pulse ideal Multi-pulse practical
k 1 15.0 11.5 ± 0.091 11.5 ± 0.120 6.36 ± 0.242
k −1 0.70 0.53 ± 0.048 0.52 ± 0.061 0.69 ± 0.334
k 2 0.33 0.25 ± 0.003 0.25 ± 0.004 0.14 ± 0.008
k 3 0.40 0.37 ± 0.029 0.31 ± 0.041 0.44 ± 0.183
k −3 0.02 0.04 ± 0.010 0.02 ± 0.018 0.02 ± 0.054
k 4 15.2 12.8 ± 1.251 12.1 ± 0.372 5.71 ± 0.521



image file: d4dd00163j-f3.tif
Fig. 3 Ground truth TAP curve (solid line) and TAP curve obtained from solving ODEs with kinetic parameters extracted from the KINN (dash-dotted line) and the ground truth parameters (dotted line).

We note that the uncertainty estimates for parameters extracted from KINNs are based on the inverse Hessian. These estimates are not rigorous, since they assume that errors from the states and derivatives are uncorrelated with a constant variance of α.85 Therefore, the uncertainty should be interpreted as a semi-quantitative measure of relative parameter sensitivity, rather than a quantitative error estimate, which explains the systematic under-estimation of errors in Table 2. A more rigorous error estimation for KINNs is beyond the scope of this work, and can be found in the work by Gusmão and Medford.85

3.2 Multi-pulse ideal scenario

In contrast to single-pulse experiments, where the catalyst state remains unchanged throughout the process, multi-pulse experiments gradually alter the surface catalyst state, leading to different outlet response and concentration profiles from pulse to pulse. In this case, we used a 10-pulse CO oxidation data set generated from 10 consecutive single-pulse experiments as a case study. The initial state of the coverage for each pulse corresponds to the final state of the preceding pulse. In real experimental scenarios, multi-pulse tests may involve hundreds or thousands of pulses, and may have unmeasured gaps between pulses. These factors pose significant computational challenges for data analysis since models must be solved sequentially and an uncontrolled error may be introduced due to gaps in the measurements. Hence, the ability to interpolate between multi-pulse data points and predict the initial concentrations of a given pulse becomes crucial for subsampling and analyzing dense TAP data.

To simulate altered states between pulses and evaluate the pulse subsampling capability, we divide the data set into a “training” set comprising pulses 0, 1, 2, 5, and 8, and a “testing” set comprising pulses 3, 4, 6, 7, and 9. Notably, the purpose of the testing set in this context is not to control for overfitting, as is common in the machine learning field, but rather to evaluate the ability of the model to represent the entire 10-pulse data set using 5 selected pulses.

As shown in Fig. 4, the KINN is able to accurately fit the training data with an MAE of 6.29 × 10−3 and accurately interpolate the testing data with a very similar MAE of 6.92 × 10−3. In this case, we included more initial pulses in the training set to provide the NN with more data on kinetics at low coverages where the responses are more dynamic. As shown for the concentrations of the adspecies in Fig. 4, the KINN successfully captures the CO adsorption/desorption, the accumulation of adsorbed oxygen, and the reduction of available active sites. These results indicate that the KINN is able to accurately model the change in surface states over pulses.


image file: d4dd00163j-f4.tif
Fig. 4 KINN's predicted scaled concentration (dashed line) and target ground truth value (dot) for the training set (pulse 0, 2) and interpolating to the testing set (pulse 3, 6) for gas species (top) and adspecies (bottom).

The values of the kinetic parameters extracted from the ideal multi-pulse data are shown in Table 2. Compared to the results from the single pulse data, the MAE of the extracted kinetic parameters, in the free energy of activation scale, has decreased from 0.019 eV to 0.016 eV. This reduction is also shown in Fig. 5. Although the improvement in model precision is modest, additional testing also shows that inclusion of multiple pulses systematically improves the accuracy of the kinetic parameters (see ESI). Moreover, the decrease in the standard deviation of k4 indicates that multi-pulse fitting yields a more robust estimate of this parameter compared to single-pulse fitting. This improved resolution of this parameter in the case of multi-pulse data is likely due to the Eley–Rideal CO2 formation mechanism, where CO reacts directly with adsorbed oxygen. This reaction is expected to be more sensitive to “state-altering” experiments where the O* coverage increases, increasing the rate and contribution of this particular elementary step.97,98 This result is consistent with the intuition that the multi-pulse data will be sensitive to more kinetic parameters, highlighting the particular importance of multi-pulse fitting for capturing the nuances of reactions involving adsorbed species.


image file: d4dd00163j-f5.tif
Fig. 5 Training (top) and testing (bottom) performance for practical multi-pulse case under minimization of eqn (12).

We note that while the use of m0 as additional inputs for multi-pulse analysis grants KINNs the ability to interpolate unseen pulses' behaviors, this approach is not intended for forward simulation or active learning scenarios where the m0 would not be available a priori. The primary goal of the KINNs approach is to extract rate parameters of a physical kinetic model. Once these parameters are obtained, they can be used to simulate an arbitrary number of pulses, or used in traditional reactor simulations. While it might be possible to adapt KINNs for direct forward predictions of multi-pulse data, in practice, using the extracted rate parameters with conventional kinetic solvers will likely be more efficient and straightforward.

3.3 Multi-pulse practical scenario

In real experiments, measured data contains noise, and it is not possible to directly measure the catalyst zone concentrations and net fluxes. To address this, as mentioned in Section 2.2, the Y-procedure was introduced to estimate the thin zone gas phase concentrations, net fluxes, and atomic uptake. The inclusion of atomic uptake, which restricts the concentrations of unobservable adspecies, requires the modification of the loss function J into eqn (12),
 
image file: d4dd00163j-t14.tif(12)
where β is the weighted parameter for uptake loss juptake, and is found in the similar way as α.

Application of the KINN method to the practical multi-pulse data yields the parity plots shown in Fig. 5. These plots reveal good agreement between the KINN predicted concentrations and kinetics and the target values for training and testing pulses. The results show significant deviation in the concentration fitting due to the noise, but reveal strong agreement for the model fitting. Despite the fact that the concentrations of adspecies are not available through direct measurement, the additional juptake term grants the network knowledge about the surface environment in the form of the total amount of each element that remains on the surface. As a result, penalizing this term with proper weighting parameters can limit the concentrations of adspecies to physically meaningful values, at least in the case of a simple reaction network. These results also highlight the robustness of KINNs to noise. The high correlations in Fig. 5 indicate that the model can accurately predict both concentrations and derivatives despite the presence of noise. This robustness is a key feature of KINNs that results from simultaneous optimization of the interpolating function and the underlying model in a single loss function. We note that as the reaction complexity increases, the uptake constraint is expected to become less effective. Nevertheless, the results still show that it can be used as an effective soft constraint in the optimization problem. It is straightforward to envision that the loss function can be extended to information from different sources, such as thermodynamic constraints or spectroscopic signals, to provide constraints on adspecies concentrations in more complicated models.

Table 2 demonstrates that all kinetic parameters were solved within the correct order of magnitude with an energy MAE of 0.034 eV in this case, although the fitted values show a greater deviation from the ground truth compared to the ideal cases, as shown in Fig. 6. This deviation is expected due to the introduction of noise and the substitution of precise thin zone information with estimated quantities. Consequently, the larger error bars suggest that the KINN exhibits reduced sensitivity to all parameters in the presence of noise. Moreover, the inherent information loss in estimating concentrations and fluxes also contributes to performance degradation. The subsequent analysis indicates that KINN parameter estimates are robust to noise, suggesting that the primary cause of the reduced accuracy is the lack of direct information about the adsorbate concentrations. Despite the reduced accuracy in kinetic parameters compared to ideal cases, the results still provide parameters with energy errors well below what is common in DFT, and the accuracy could be further improved by using the KINN results as initial guesses for more rigorous and computationally demanding PDE-based TAP analysis tools, such as TAPSolver.56


image file: d4dd00163j-f6.tif
Fig. 6 Logarithm parity between the ground truth and KINNs solved kinetic parameter values that presented in Table 2 for (a) single-pulse ideal, (b) multi-pulse ideal, and (c) multi-pulse practical scenarios. The MAE is calculated using the Eyring equation.

3.4 Multi-pulse analysis compared to differential algebraic programming

To highlight the advantages of the KINN approach, we directly compare its performance with an established optimization approach based on differential algebraic equations (DAEs) and collocation. This comparison is performed with Pyomo, a Python-based open-source optimization modeling tool.99–102 The ideal multi-pulse data set was used for the comparison, and noise was added in the same way as described in Section 2.3. In addition to Gaussian noise with a width of half the data standard deviation, we explored two other noise levels: one and two standard deviations. In this case, KINN was applied directly to unsmoothed noisy data, while the DAE collocation approach was tested on both unsmoothed data and data smoothed by the Savitzky–Golay filter.103

Fig. 7 indicates that the KINN approach has improved noise tolerance. As the noise level increases, the concentration fitting of both smoothed and unsmoothed DAE collocation starts to deviate from the ground truth and exhibits discontinuous noisy behavior. The collocation method is much faster, by a factor of ∼100 with the implementation and architectures tested here, although we note that the KINNs approach is not heavily optimized and run times will depend on many factors. However, the collocation approach shows reduced effectiveness at higher noise levels. This is because the discretization of the time domain used for approximating solutions in the collocation method may not adequately capture the random fluctuations caused by noise, resulting in an inaccurate approximation of the true solution, and hence an inaccurate estimation of the derivatives. Moreover, collocation methods typically rely on interpolation and smoothing techniques to estimate the dynamics of the system between discretized points, and these techniques can be sensitive to noise and numerical parameters.104,105 At high noise levels, it also becomes impossible to converge estimates for some parameters with the collocation technique. In contrast, the KINN approach converges all parameters across different noise levels. This is evident from the lower deviation of the KINN-predicted concentrations compared to the DAE results in the presence of noise. The neural network in the KINN acts as a highly flexible built-in interpolator, simultaneously solving the smoothing and fitting problems. Moreover, the optimization of the KINN is constrained and regularized by the kinetic model, which helps prevent overfitting to the noise.106 This robustness to noise is a key advantage of the KINN over traditional collocation methods, as it ensures a more reliable approximation of the underlying kinetics even when noise levels are high or datasets have missing values.


image file: d4dd00163j-f7.tif
Fig. 7 Performance of DAE and the KINN on fitting data with increasing noise level. The vertical dash lines in kinetic parameter parity plots means the specific parameters are not converged. CO in pulse 0 is used as the example species here. The MAEs in an energy scale are stated in parity plots. The complete multi-pulse concentration plots can be found in the ESI.

For the extracted kinetic parameters, the KINN also demonstrates better performance than DAE collocation for this case study. As shown in Fig. 7, at a noise level of 0.5σ, the MAE for free energy of activation extracted by the KINN is 0.018 eV, only slightly higher than the 0.016 eV MAE for the ideal multi-pulse scenario without noise. This outperforms the DAE collocation approach which has an MAE of 0.04 eV regardless of whether or not smoothing is applied. When the noise level increases to 1.0σ, the unsmoothed DAE fails to converge for two parameters, and at a noise level of 2σ, the convergence issue extends to both smoothed and unsmoothed DAE collocation, while the KINN continues to deliver consistent and accurate results comparable to the 0.5σ noise scenario. The values of the fitted parameters for all reactions with each method can be found in the ESI.

There are several factors that account for this notable improvement in robustness. Primarily, KINNs utilize automatic differentiation applied to an interpolated concentration profile, which can help mitigate the impact of noise on parameter estimation. In contrast, traditional collocation methods, while capable of using automatic or analytical derivatives, may be more sensitive to noise in the raw data points used for discretization. However, advanced collocation techniques, such as those used in weak-SINDy107 or integral SINDy formulations,108,109 have developed sophisticated approaches to handle noisy data effectively. The key advantage of KINNs in this context lies in their inherent smoothing effect through the neural network interpolation, which provides an additional layer of noise resistance. This property is crucial because it allows KINNs to derive kinetic parameters that are more resilient to noise, resulting in closer approximation to the ground truth in high-noise scenarios.

We note that the comparison between KINNs and the DAE collocation presented here is not intended to demonstrate the inherent superiority of one method over the other. Rather, our aim is to illustrate how these methods perform under similar circumstances, particularly in handling noisy data. The fundamental differences between the two methods, including their underlying formulations, optimization approaches, and problem size and structure, make a strictly “apples-to-apples” comparison challenging. Therefore, the goal here is not to crown a superior method, but to highlight the strengths and potential applications in the context of analyzing TAP data.

4 Conclusions

In this work, we utilized feedforward NNs as basis functions to describe TAP data and extract the intrinsic kinetics from the approximated ODE MKMs under state-defining (single-pulse) and state-altering (multi-pulse) CO oxidation data sets. The validity of KINNs is demonstrated by their ability to solve the underlying ODEs, extract the rate parameters of kinetic models, and forecast the responses of unobserved pulses in both scenarios. Applying zeroth moments as an additional NN input provides the network with information on the surface environment in multi-pulse datasets and enables the network to locate the pulse in pulse series, making interpolation between pulses achievable.

The machine learning framework grants KINNs scalability to handle multi-pulse data with a single network, and tests indicate that KINNs exhibit improved noise tolerance when compared to the incumbent DAE collocation approach. With these features, dense, noisy data consisting of hundreds or thousands of pulses can be subsampled and analyzed, revealing underlying kinetics that cannot be inspected through state-defining experiments only. Currently, the main downside of KINNs is the relatively high computational cost, but with further optimization and increasing problem complexity, their advantages over the DAE approach are expected to increase. For example, the KINN framework is naturally capable of handling highly nonlinear problems such as temperature-dependent data or coverage-dependent kinetic parameters. Future work will focus on exploring the application of KINNs to these more complex data types, optimizing the KINN implementation, and incorporating the maximum likelihood approach85 for data where not all concentrations are known.

The flexible loss function makes KINNs a solid option for addressing complex heterogeneous catalytic reactions. We show that when the precise thin zone information is unavailable, KINNs can still derive kinetic parameters that accurately reflect the relative rate of each step by including atomic uptake into the loss function. This flexibility allows for incorporating a variety of information, including spectroscopy, DFT, and thermodynamics, which may be used to constrain more complex systems than CO oxidation, suggesting KINNs as a promising framework for treatment of operando and in situ transient kinetic data sets.

While this study demonstrates the potential of KINNs for analyzing TAP reactor data, several limitations and opportunities for future work remain. Our current case studies focused on a relatively simple reaction system where all steps were kinetically relevant. However, real-world catalytic systems often involve more complex mechanisms with large timescale separations, including quasi-equilibrated steps and scenarios with both fast and slow transients. These complexities may challenge parameter identifiability and could test the limits of the KINN approach, particularly in highly stiff systems. Future work will focus on adapting KINNs to handle these more complex scenarios, potentially incorporating techniques for dealing with stiffness and methods for identifying and handling quasi-equilibrated steps. Additionally, extending the KINN methodology to learn activation barriers from multiple isothermal experiments could provide deeper insights into reaction energetics.

Furthermore, while our current implementation provides only semi-quantitative sensitivity analysis, there is potential for KINNs to contribute to mechanism refinement and discovery. Existing work in mechanism identification relies heavily on accurate parameter estimation and uncertainty quantification.110–115 By potentially improving these aspects, KINNs could be used in conjunction with existing algorithms to enhance mechanism determination. Future developments could focus on more rigorous uncertainty analysis within the KINN framework, potentially leading to systematic methods for adding or removing reaction steps. This could significantly contribute to the field of mechanism discovery, especially when applied to more complex systems.

Addressing these challenges will be crucial for broadening the applicability of KINNs to a wider range of catalytic systems and experimental conditions, and we expect that KINNs will lower the multi-pulse TAP data analysis barrier and provide insight into fusing contemporary computational methods with transient catalytic dynamics.

Data availability

The data, TAPSolver scripts, Pyomo scripts, KINNs training scripts, and scripts for data preparation, preprocessing, and figure generation are available at https://github.com/medford-group/kinn-tap, Zenodo DOI: https://doi.org/10.5281/zenodo.13754686.

Author contributions

D. N., G. S. G., and A. J. M. designed the methodology. D. N. and G. S. G. developed the code. Z. A. K. performed the Pyomo calculation. F. B. conceptualized the collocation comparison. A. J. M. conceptualized and supervised the project. D. N. wrote the original draft. All authors contributed to the review and editing of the paper.

Conflicts of interest

The authors declare no conflict of interest.

Acknowledgements

Support for this work was provided by the U.S. Department of Energy (USDOE), Office of Energy Efficiency and Renewable Energy (EERE), Advanced Manufacturing Office Next Generation R&D Projects under contract no. DE-AC07-05ID14517. This research was supported in part through research cyberinfrastructure resources and services provided by the Partnership for an Advanced Computing Environment (PACE) at the Georgia Institute of Technology, Atlanta, Georgia, USA. This research also made use of Idaho National Laboratory's High Performance Computing systems located at the Collaborative Computing Center supported by the Office of Nuclear Energy of the U.S. Department of Energy and the Nuclear Science User Facilities under Contract No. DE-AC07-05ID14517. The authors are grateful to Dr Adam Yonge for the discussion of TAP simulations and transient methods and to Prof. Joseph Scott for suggestions on comparing to existing methods.

References

  1. K. Morgan, N. Maguire, R. Fushimi, J. T. Gleaves, A. Goguet, M. P. Harold, E. V. Kondratenko, U. Menon, Y. Schuurman and G. S. Yablonsky, Catal. Sci. Technol., 2017, 7, 2416–2439 RSC.
  2. S. B. Rasmussen, M. A. Bañares, P. Bazin, J. Due-Hansen, P. Ávila and M. Daturi, Phys. Chem. Chem. Phys., 2012, 14, 2171–2177 RSC.
  3. J. M. Thomas and W. J. Thomas, Principles and Practice of Heterogeneous Catalysis, John Wiley & Sons, 1997 Search PubMed.
  4. P. T. Anastas and M. M. Kirchhoff, Acc. Chem. Res., 2002, 35, 686–694 CrossRef CAS.
  5. R. A. Sheldon, I. W. Arends and U. Hanefeld, Green Chemistry and Catalysis, 2007, pp. 1–433 Search PubMed.
  6. F. Cavani and J. H. Teles, ChemSusChem, 2009, 2, 508–534 CrossRef CAS.
  7. J. Gong and R. Luque, Chem. Soc. Rev., 2014, 43, 7466–7468 RSC.
  8. J. M. Thomas, ChemSusChem, 2014, 7, 1801–1832 CrossRef CAS PubMed.
  9. Z. Guo, B. Liu, Q. Zhang, W. Deng, Y. Wang and Y. Yang, Chem. Soc. Rev., 2014, 43, 3480–3524 RSC.
  10. G. Papanikolaou, G. Centi, S. Perathoner and P. Lanzafame, ACS Catal., 2022, 12, 2861–2876 CrossRef CAS PubMed.
  11. F. X. Felpin and E. Fouquet, ChemSusChem, 2008, 1, 718–724 CrossRef CAS PubMed.
  12. S. Mitchell, R. Qin, N. Zheng and J. Pérez-Ramírez, Nat. Nanotechnol., 2020, 16(2), 129–139 CrossRef PubMed.
  13. I. Chorkendorff and J. W. Niemantsverdriet, Concepts of Modern Catalysis and Kinetics, John Wiley & Sons, 2003 Search PubMed.
  14. R. P. Galhenage, H. Yan, A. S. Ahsen, O. Ozturk and D. A. Chen, J. Phys. Chem. C, 2014, 118, 17773–17786 CrossRef CAS.
  15. J. T. Gleaves, G. Yablonsky, X. Zheng, R. Fushimi and P. L. Mills, J. Mol. Catal. A: Chem., 2010, 315, 108–134 CrossRef CAS.
  16. R. Jinnouchi and R. Asahi, J. Phys. Chem. Lett., 2017, 8, 4279–4283 CrossRef CAS PubMed.
  17. J. Hagen, Industrial Catalysis: A Practical Approach, John Wiley & Sons, 2015 Search PubMed.
  18. K. Morgan, J. Touitou, J. S. Choi, C. Coney, C. Hardacre, J. A. Pihl, C. E. Stere, M. Y. Kim, C. Stewart, A. Goguet and W. P. Partridge, ACS Catal., 2016, 6, 1356–1381 CrossRef CAS.
  19. G. Rothenberg, Catalysis: Concepts and Green Applications, 2008, pp. 1–279 Search PubMed.
  20. J. T. Gleaves, G. S. Yablonskii, P. Phanawadee and Y. Schuurman, Appl. Catal., A, 1997, 160, 55–88 CrossRef CAS.
  21. S. Matera, W. F. Schneider, A. Heyden and A. Savara, ACS Catal., 2019, 9, 6624–6647 CrossRef CAS.
  22. J. Pérez-Ramírez and E. V. Kondratenko, Catal. Today, 2007, 121, 160–169 CrossRef.
  23. H. Eyring, J. Chem. Phys., 2004, 3, 107 CrossRef.
  24. M. G. Evans and M. Polanyi, Trans. Faraday Soc., 1935, 31, 875–894 RSC.
  25. M. Kobayashi and H. Kobayashi, J. Catal., 1972, 27, 100–107 CrossRef CAS.
  26. H. Kobayashi and M. Kobayashi, Catal. Rev., 1974, 10, 139–176 CrossRef CAS.
  27. C. O. Bennett, Catal. Rev., 1976, 13, 121–148 CrossRef CAS.
  28. P. Biloen, J. Mol. Catal., 1983, 21, 17–24 CrossRef CAS.
  29. K. J. Laidler and M. C. King, J. Phys. Chem., 1983, 87, 2657–2664 CrossRef CAS.
  30. J. A. Dumesic, D. F. Rudd, D. F. Rudd, L. M. Aparicio, J. E. Rekoske and A. A. Trevino, The Microkinetics of Heterogeneous Catalysis, Wiley-Vch, 1993 Search PubMed.
  31. W. Kohn, A. D. Becke and R. G. Parr, J. Phys. Chem., 1996, 100, 12974–12980 CrossRef CAS.
  32. C. De Bellefon, S. Caravieilhes, C. Joly-Vuillemin, D. Schweich and A. Berthod, Chem. Eng. Sci., 1998, 53, 71–74 CrossRef CAS.
  33. R. D. Cortright and J. A. Dumesic, Adv. Catal., 2001, 46, 161–264 CAS.
  34. T. Bligaard, J. K. Nørskov, S. Dahl, J. Matthiesen, C. H. Christensen and J. Sehested, J. Catal., 2004, 224, 206–217 CrossRef CAS.
  35. M. Andersson, T. Bligaard, A. Kustov, K. Larsen, J. Greeley, T. Johannessen, C. Christensen and J. Norskov, J. Catal., 2006, 239, 501–506 CrossRef CAS.
  36. J. K. Nørskov, T. Bligaard, J. Rossmeisl and C. H. Christensen, Nat. Chem., 2009, 1(1), 37–46 CrossRef PubMed.
  37. S. Wang, V. Petzold, V. Tripkovic, J. Kleis, J. G. Howalt, E. Skúlason, E. M. Fernández, B. Hvolbæk, G. Jones, A. Toftelund, H. Falsig, M. Björketun, F. Studt, F. Abild-Pedersen, J. Rossmeisl, J. K. Nørskov and T. Bligaard, Phys. Chem. Chem. Phys., 2011, 13, 20760–20765 RSC.
  38. S. Wang, B. Temel, J. Shen, G. Jones, L. C. Grabow, F. Studt, T. Bligaard, F. Abild-Pedersen, C. H. Christensen and J. K. Nørskov, Catal. Lett., 2011, 141, 370–373 CrossRef CAS.
  39. S. Rangarajan, A. Bhan and P. Daoutidis, Comput. Chem. Eng., 2012, 45, 114–123 CrossRef CAS.
  40. M. J. Hoffmann, S. Matera and K. Reuter, Comput. Phys. Commun., 2014, 185, 2138–2150 CrossRef CAS.
  41. A. J. Medford, C. Shi, M. J. Hoffmann, A. C. Lausche, S. R. Fitzgibbon, T. Bligaard and J. K. Nørskov, Catal. Lett., 2015, 145, 794–807 CrossRef CAS.
  42. J. Greeley, Annu. Rev. Chem. Biomol. Eng., 2016, 7, 605–635 CrossRef.
  43. C. F. Goldsmith and R. H. West, J. Phys. Chem. C, 2017, 121, 9970–9981 CrossRef CAS.
  44. B. Jandeleit, H. W. Turner, T. Uno, J. A. Van Beek and W. Henry Weinberg, CATTECH, 1998, 2, 885–990 Search PubMed.
  45. S. O. Shekhtman, G. S. Yablonsky, S. Chen and J. T. Gleaves, Chem. Eng. Sci., 1999, 54, 4371–4378 CrossRef CAS.
  46. S. Nitopi, E. Bertheussen, S. B. Scott, X. Liu, A. K. Engstfeld, S. Horch, B. Seger, I. E. Stephens, K. Chan, C. Hahn, J. K. Nørskov, T. F. Jaramillo and I. Chorkendorff, Chem. Rev., 2019, 119, 7610–7672 CrossRef CAS PubMed.
  47. P. Schlexer Lamoureux, K. T. Winther, J. A. Garrido Torres, V. Streibel, M. Zhao, M. Bajdich, F. Abild-Pedersen and T. Bligaard, ChemCatChem, 2019, 11, 3581–3601 CrossRef CAS.
  48. S. Kozuch and J. M. Martin, ChemPhysChem, 2011, 12, 1413–1418 CrossRef CAS PubMed.
  49. S. Caravieilhes, D. Schweich and C. De Bellefon, Chem. Eng. Sci., 2002, 57, 2697–2705 CrossRef CAS.
  50. F. A. Döppel and M. Votsmeier, Chem. Eng. Sci., 2022, 262, 117964 CrossRef.
  51. M. Olea, M. Tada, Y. Iwasawa, V. Balcaen, I. Sack, G. B. Marin, D. Poelman and H. Poelman, J. Chem. Eng. Jpn., 2009, 42, s219–s225 CrossRef.
  52. G. S. Yablonsky, M. Olea and G. B. Marin, J. Catal., 2003, 216, 120–134 CrossRef CAS.
  53. J. T. Gleaves, J. R. Ebner and T. C. Kuechler, Catal. Rev.: Sci. Eng., 1988, 30, 49–116 CrossRef CAS.
  54. G. S. Yablonsky, E. A. Redekop, D. Constales, J. T. Gleaves and G. B. Marin, Int. J. Chem. Kinet., 2016, 48, 304–317 CrossRef CAS.
  55. D. Constales, G. S. Yablonsky, G. B. Marin and J. T. Gleaves, Chem. Eng. Sci., 2001, 56, 133–149 CrossRef CAS.
  56. A. Yonge, M. R. Kunz, R. Batchu, Z. Fang, T. Issac, R. Fushimi and A. J. Medford, Chem. Eng. J., 2021, 420, 129377 CrossRef CAS.
  57. A. Yonge, G. S. Gusmão, R. Batchu, M. R. Kunz, Z. Fang, R. Fushimi and A. J. Medford, AIChE J., 2022, 68, e17776 CrossRef CAS.
  58. E. A. Redekop, G. S. Yablonsky, D. Constales, P. A. Ramachandran, J. T. Gleaves and G. B. Marin, Chem. Eng. Sci., 2014, 110, 20–30 CrossRef CAS.
  59. E. A. Redekop, H. Poelman, M. Filez, R. K. Ramachandran, J. Dendooven, C. Detavernier, G. B. Marin, U. Olsbye and V. V. Galvita, Faraday Discuss., 2022, 236, 485–509 RSC.
  60. M. R. Kunz, R. Batchu, Y. Wang, Z. Fang, G. Yablonsky, D. Constales, J. Pittman and R. Fushimi, Chem. Eng. J., 2020, 402, 125985 CrossRef CAS.
  61. M. R. Kunz, A. Yonge, Z. Fang, R. Batchu, A. J. Medford, D. Constales, G. Yablonsky and R. Fushimi, Chem. Eng. J., 2021, 420, 129610 CrossRef CAS.
  62. R. Batchu, V. V. Galvita, K. Alexopoulos, T. S. Glazneva, H. Poelman, M. F. Reyniers and G. B. Marin, Catal. Today, 2020, 355, 822–831 CrossRef CAS.
  63. M. Rothaeme and M. Baerns, Ind. Eng. Chem. Res., 1996, 35, 1556–1565 CrossRef.
  64. S. C. Van Der Linde, T. A. Nijhuis, F. H. Dekker, F. Kapteijn and J. A. Moulijn, Appl. Catal., A, 1997, 151, 27–57 CrossRef CAS.
  65. G. S. Yablonskii, S. O. Shekhtman, S. Chen and J. T. Gleaves, Ind. Eng. Chem. Res., 1998, 37, 2193–2202 CrossRef CAS.
  66. J. A. Delgado, T. A. Nijhuis, F. Kapteijn and J. A. Moulijn, Chem. Eng. Sci., 2002, 57, 1835–1847 CrossRef CAS.
  67. E. V. Kondratenko, V. A. Kondratenko, M. Santiago and J. Pérez-Ramírez, J. Catal., 2008, 256, 248–258 CrossRef CAS.
  68. V. Balcaen, I. Sack, M. Olea and G. B. Marin, Appl. Catal., A, 2009, 371, 31–42 CrossRef CAS.
  69. V. Balcaen, H. Poelman, D. Poelman and G. B. Marin, J. Catal., 2011, 283, 75–88 CrossRef CAS.
  70. A. Kumar, X. Zheng, M. P. Harold and V. Balakotaiah, J. Catal., 2011, 279, 12–26 CrossRef CAS.
  71. R. Roelant, PhD thesis, Ghent University, 2011 Search PubMed.
  72. E. A. Redekop, G. S. Yablonsky, V. V. Galvita, D. Constales, R. Fushimi, J. T. Gleaves and G. B. Marin, Ind. Eng. Chem. Res., 2013, 52, 15417–15427 CrossRef CAS.
  73. B. Golman, Educ. Chem. Eng., 2016, 15, 1–18 CrossRef.
  74. C. Reece, PhD thesis, Cardiff University, 2017 Search PubMed.
  75. A. J. Medford, M. R. Kunz, S. M. Ewing, T. Borders and R. Fushimi, ACS Catal., 2018, 8, 7403–7429 CrossRef CAS.
  76. M. K. Saggi and S. Jain, Inf. Process. Manage., 2018, 54, 758–790 CrossRef.
  77. W. Chang and N. Grady, NIST Big Data Interoperability Framework: Volume 2, Big Data Taxonomies, Special Publication (NIST SP), National Institute of Standards and Technology, Gaithersburg, MD, 2019, vol. 2,  DOI:10.6028/NIST.SP.1500-2r2, accessed October 10, 2024.
  78. G. S. Gusmão, A. P. Retnanto, S. C. D. Cunha and A. J. Medford, Catal. Today, 2023, 417, 113701 CrossRef.
  79. A. J. Meade and A. A. Fernandez, Math. Comput. Model., 1994, 19, 1–25 CrossRef.
  80. G. S. Gusmão and P. Christopher, AIChE J., 2015, 61, 188–199 CrossRef.
  81. D. P. Kingma and J. L. Ba, 3rd International Conference on Learning Representations, ICLR 2015 – Conference Track Proceedings, 2014 Search PubMed.
  82. M. Raissi, P. Perdikaris and G. Karniadakis, J. Comput. Phys., 2019, 378, 686–707 CrossRef.
  83. F. Fioretto, T. W. Mak and P. van Hentenryck, AAAI Conference on Artificial Intelligence, 2019, pp. 630–637 Search PubMed.
  84. J. Jalving, M. Eydenberg, L. Blakely, A. Castillo, Z. Kilwein, J. K. Skolfield, F. Boukouvala and C. Laird, Int. J. Electr. Power Energy Syst. Eng., 2024, 157, 109741 Search PubMed.
  85. G. S. Gusmão and A. J. Medford, Comput. Chem. Eng., 2024, 181, 108547 CrossRef.
  86. B. Beck, V. Fleischer, S. Arndt, M. G. Hevia, A. Urakawa, P. Hugo and R. Schomäcker, Catal. Today, 2014, 228, 212–218 CrossRef CAS.
  87. E. P. Mallens, J. H. Hoebink and G. B. Marin, Studies in Surface Science and Catalysis, 1994, vol. 81, pp. 205–210 Search PubMed.
  88. G. Yablonsky, D. Constales, S. Shekhtman and J. Gleaves, Chem. Eng. Sci., 2007, 62, 6754–6767 CrossRef CAS.
  89. D. Constales, G. S. Yablonsky, L. Wang, W. Diao, V. V. Galvita and R. Fushimi, Catal. Today, 2017, 298, 203–208 CrossRef CAS.
  90. R. Roelant, D. Constales, G. S. Yablonsky, R. Van Keer, M. A. Rude and G. B. Marin, Catal. Today, 2007, 121, 269–281 CrossRef CAS.
  91. E. A. Redekop, G. S. Yablonsky, D. Constales, P. A. Ramachandran, C. Pherigo and J. T. Gleaves, Chem. Eng. Sci., 2011, 66, 6441–6452 CrossRef CAS.
  92. M. Ross Kunz, T. Borders, E. Redekop, G. S. Yablonsky, D. Constales, L. Wang and R. Fushimi, Chem. Eng. Sci., 2018, 192, 46–60 CrossRef CAS.
  93. P. Ramachandran, B. Zoph and Q. V. Le Google Brain, 6th International Conference on Learning Representations, ICLR 2018 – Workshop Track Proceedings, 2017 Search PubMed.
  94. M. R. Kunz, A. Yonge, X. He, R. Batchu, Z. Fang, Y. Wang, G. S. Yablonsky, A. J. Medford and R. R. Fushimi, Catal. Today, 2023, 417, 113650 CrossRef CAS.
  95. G. Franceschini and S. Macchietto, Chem. Eng. Sci., 2008, 63, 4846–4872 CrossRef CAS.
  96. N. Zhan and J. R. Kitchin, AIChE J., 2021, e17516 Search PubMed.
  97. M. Persson and B. Jackson, J. Chem. Phys., 1995, 102, 1078–1093 CrossRef CAS.
  98. L. Zhu, C. Hu, J. Chen and B. Jiang, Phys. Chem. Chem. Phys., 2023, 25, 5479–5488 RSC.
  99. W. E. Hart, J.-P. Watson and D. L. Woodruff, Math. Program. Comput., 2011, 3, 219–260 CrossRef.
  100. M. L. Bynum, G. A. Hackebeil, W. E. Hart, C. D. Laird, B. L. Nicholson, J. D. Siirola, J.-P. Watson and D. L. Woodruff, Pyomo — Optimization Modeling in Python, Springer International Publishing, Cham, 2021, vol. 67 Search PubMed.
  101. B. Nicholson, J. D. Siirola, J.-P. Watson, V. M. Zavala and L. T. Biegler, Math. Program. Comput., 2018, 10, 187–223 CrossRef.
  102. L. T. Biegler, Nonlinear Programming: Concepts, Algorithms, and Applications to Chemical Processes, SIAM, 2010 Search PubMed.
  103. A. Savitzky and M. J. Golay, Anal. Chem., 1964, 36, 1627–1639 CrossRef CAS.
  104. P. Craven and G. Wahba, Numer. Math., 1978, 31, 377–403 CrossRef.
  105. I. Knowles and R. J. Renka, Electron. J. Differ. Equ., 2012, 21, 1072–6691 Search PubMed.
  106. T. Kircher, F. A. Döppel and M. Votsmeier, Comput.-Aided Chem. Eng., 2024, 53, 817–822 Search PubMed.
  107. D. A. Messenger and D. M. Bortz, J. Comput. Phys., 2021, 443, 110525 CrossRef PubMed.
  108. S. L. Brunton, J. L. Proctor and J. N. Kutz, Proc. Natl. Acad. Sci. U. S. A., 2016, 113, 3932–3937 CrossRef CAS.
  109. B. M. de Silva, K. Champion, M. Quade, J.-C. Loiseau, J. N. Kutz and S. L. Brunton, J. Open Source Softw., 2020, 5, 2104 CrossRef.
  110. F. Lejarza, E. Koninckx, L. J. Broadbelt and M. Baldea, Chem. Eng. J., 2023, 462, 142089 CrossRef CAS.
  111. S. Wang and M. Baldea, Comput. Chem. Eng., 2014, 64, 138–152 CrossRef CAS.
  112. S. Rangarajan, R. R. Brydon, A. Bhan and P. Daoutidis, Green Chem., 2014, 16, 813–823 RSC.
  113. S. Bhandari, S. Rangarajan and M. Mavrikakis, Acc. Chem. Res., 2020, 53, 1893–1904 CrossRef CAS PubMed.
  114. Z. T. Wilson and N. V. Sahinidis, Comput. Chem. Eng., 2019, 127, 88–98 CrossRef CAS.
  115. B. Kreitz, P. Lott, A. J. Medford, F. Studt, O. Deutschmann, F. Goldsmith and D.-I. B. Kreitz, Angew. Chem., Int. Ed., 2023, e202306514 CAS.

Footnote

Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4dd00163j

This journal is © The Royal Society of Chemistry 2024
Click here to see how this site uses Cookies. View our privacy policy here.