Peng Nia,
Bin Liub and
Ge He*b
aChina University of Petroleum, Beijing 102249, China
bLanzhou Petro of PetroChina Company Limited, Lanzhou 730060, China. E-mail: 877157304@qq.com; hege@scu.edu.cn
First published on 24th August 2021
Rigorous mechanistic models of refining processes are often too complex, which results in long modeling times, low model computational efficiencies, and poor convergence, limiting the application of mechanistic-model-based process optimization and advanced control in complex refining production processes. To address this problem and take advantage of big data technology, this study used case-based reasoning (CBR) for process optimization. The proposed method makes full use of previous process cases and reuses previous process cases to solve production optimization problems. The proposed process optimization method was applied to an actual fluid catalytic cracking maximizing iso-paraffins (MIP) production process for industrial validation. The results showed that the CBR method can be used to obtain optimization results under different optimization objectives, with a solution time not exceeding 1 s. The CBR method based on big data technology proposed in this study provides a feasible solution for fluid catalytic cracking to achieve online process optimization.
Big data technology directly mines patterns from massive production data and retrieves and extracts useful information. The analysis process can reduce the reliance on complex process mechanisms. The application of cutting-edge data analysis technologies, such as big data and artificial intelligence, to the field of unit optimization can further improve the unit control, enhance the operating efficiency of the unit in the optimal operating range, increase the target product yield, improve product quality, reduce the energy consumption and production costs, improve safety, control environmental indicators, improve production efficiency, and increase economic benefits from multiple dimensions. As an alternative to APC and RTO, extracting reliable solutions from historical data sets based on mechanistic models is a feasible approach without any requirements for first-principles models.8
Based on the above discussion and the requirements of intelligent process manufacturing, we propose a data-model-based optimization method for fluid catalytic cracking units in this paper. This method is a general strategy of distributed reasoning based on historical cases, i.e., case-based reasoning (CBR). CBR mimics human reasoning by reusing data and solutions from similar problems in the past to solve new problems. It is an excellent tool for reusing previously acquired experience and is widely used to build design automation or decision support systems,9 such as production scheduling,10 processing,11,12 and fault diagnosis.13 In this paper, we first process the accumulated data sets of industrial refineries and high-fidelity simulation activities to form a case base with a determined structure. We then use a fuzzy matching method to extract cases from the case base that are similar to the new cases. The CBR method proposed in this paper is essentially a variable correlation algorithm, which can intelligently select variables that are strongly correlated with the target variables from a large number of the laboratory information management system (LIMS) and distributed control system (DCS) variables, thereby minimizing the model complexity and allowing the model to exhibit high computational speeds, fast convergence, and strong adaptability while ensuring reliability. Thus, it can be used to guide real-time online optimization.14–16
The rest of this article is organized as follows. Section 2 briefly reviews the method and applications of CBR. The method of CBR in the optimization of the fluid catalytic cracking process is presented specifically in Sections 3. Section 4 focuses on the validation of the effectiveness of the model with cases, and the article is concluded with the main findings.
CBR has been used in many aspects of the refining industry, mainly in chemical process synthesis, design analysis, and fault diagnosis. (1) The following research progress has been made in the application of CBR in chemical process synthesis and design analysis. In 2001, Pajula et al.21 proposed a CBR-based approach for chemical process synthesis and demonstrated the application of CBR in separation systems with cases. Avramenko et al.22 in 2004 used the CBR method as a design support tool for the pre-selection of the packing type for reactive distillation columns. In 2005, Seuranen et al.23 presented a new CBR-based approach for separation process synthesis and selection of single separations. Lopez-Arevalo et al.24 in 2007 proposed an approach for managing the complexity in the redesign/retrofitting of chemical processes. This approach uses model-based reasoning (MBR) to automatically generate alternative representations of an existing chemical process at multiple levels of abstraction. In the overall process, the hierarchical representation leads to sets of equipment and sections organized according to their functions and purposes. In 2009, Robles et al.25 proposed an approach for accelerating the inventive preliminary design for chemical engineering by coupling CBR with the TRIZ (theory of inventive problem solving) theory to achieve an extension of the CBR method from routine design to inventive design. Stephane et al.26 in 2010 attempted to improve the retrieval step of the CBR-based preliminary design of chemical engineering units. (2) The following research progress has been made in the application of CBR in process monitoring and fault diagnosis. Zhao et al.27 integrated CBR and ontology to develop a new learning hazard and operability analysis (HAZOP) expert system to improve the learning capability of the expert system. Zhao et al.13 proposed an improved CBR method to predict the status of the Tennessee Eastman (TE) process. Yan et al.28 proposed a case retrieval method based on a learning pseudo-metric (LPM) to replace the distance measure retrieval method and established a CBR-based fault diagnosis model for the TE process.
Zhang et al.8 applied the CBR method for the first time to optimize the refinery production process. First, the accumulated data sets from the industrial plants as well as high-fidelity simulation activities were processed to form a case base with a determined structure. Fuzzy matching was employed to evaluate the similarity, and an optimization model was established for the parameters of the fuzzy membership function. The application to an industrial fluid catalytic cracking unit was performed as an example for validation.
• Retrieve: With the feed composition of the current case, including wax/residue ratio, residual carbon, sulfur contents, distillation temperature range and the feeding rate, etc., using the distance similarity to calculate the similarities between the operation conditions of the current case with historical cases, and then perform the comparison on four levels. The target is to find the historical cases with the most similar operation conditions.
• Reuse: The solution resulting from the retrieved cases is used as a suggested solution to the target problem. By matching the feed compositions to find the candidates from the case base, and arrange these candidates in descending order regarding the total liquid yield (or the gasoline yield, the coke yield). Then the optimal solution is the first one with the highest total liquid yield (or the highest gasoline yield, the lowest coke yield).
• Revise: If the actual product yield and the total liquid yield under the recommended conditions did not conform to the calculation results in the corresponding case base, then such case base would be modified, then the above operation was repeated to obtain the process optimization results.
• Retain: The process model or the simulation software was used to simulate different operating conditions, and a large number of cases representing various operating conditions were obtained. The generated cases were added to the case base to complete its expansion. On the other hand, due to the process transformation, the cases that did not appear were then eliminated.
The case base should be established to cover all possible problems and variable sets in the application field.29 The variables are divided into feed variables, influencing variables, and product variables. The case model is expressed as follows:
Ck = {(Ik,Pk) → Sk} | (1) |
Ik = (Ik,1,Ik,2, …, Ik,11) = (r, ρ, σ, ξ, Tini, T5%, T10%, T30%, T50%, T70%, Qm) | (2) |
Pk = (Pk,1,Pk,2, …, Pk,m) = (Ydrygas, YLPG, Ygasoline, Ydiesel, Yslurry, Ycoke…) | (3) |
(4) |
Four-level matching of the feed information was conducted based on the established case base using a distributed reasoning algorithm, which is shown by the steps in the rounded box in Fig. 1, as described in detail below:
(1) The cases that meet the upper and lower limits of the wax residue ratio are selected, and the resulting number of cases is denoted as n1. The similarity Sim(I1,Ik,1) between the wax residue ratio I1 of the current feed and the wax residue ratio Ik,1 in the case base is calculated using eqn (4). The first n2(n2 ≤ n1) cases greater than 0.9 are selected from the results ranked in descending order as the first-level case base.
(2) The weighted similarity D1 of the density, residual carbon, and sulfur content between the current feed and the feed in the case base in the first-level case base is calculated, and they are assigned different weights. The first n3(n3 ≤ n2) cases greater than 0.85 are selected according to the D1 results ranked in descending order as the second-level case base. D1 is calculated as follows:
(5) |
(3) The weighted similarity D2 of the boiling range temperatures (including the initial boiling point and the 5%, 10%, 30%, 50%, and 70% boiling range temperatures) between the current feed and the feed in the case base in the second-level case base is calculated, different weights are assigned to boiling range temperatures. The first n4(n4 ≤ n3) cases greater than 0.8 are selected according to the D2 results ranked in descending order as the third-level case base. D2 is calculated as follows:
(6) |
(4) The similarity Sim(I11,Ik,11) of the flow rate between the current feed and the feed in the historical base in the third-level case base is calculated. The first n5(n5 ≤ n4) cases greater than 0.7 are selected according to the results ranked in descending order as the fourth-level case base.
(1) If the maximization of the total liquid yield Fk is used as the objective of the process optimization, the operating condition corresponding to the maximum value of Fk in the fourth-level case base is the optimal operating condition ST:
(7) |
Fk,1 = Pk,2 + Pk,3 + Pk,4 | (8) |
(2) If the maximization of the gasoline yield Fk,2 is used as the objective of the process optimization, the operating condition corresponding to the maximum value of Fk,2 in the fourth-level case base is the optimal operating condition ST:
(9) |
Fk2 = Pk,3 | (10) |
(3) If the minimization of the coke yield Fk,3 is used as the objective of the process optimization, the operating condition corresponding to the minimum value of Fk,3 in the fourth-level case base is the optimal operating condition ST:
(11) |
Fk3 = Pk,6 | (12) |
Case expansion provide enough cases to ensure the reliability of results and broad applicability of model. There are two methods of case expansion: one is industrial historical data (actual data), and the other is the prediction results of the process model or the simulation software. In addition, regarding some hidden parameters such as the mass transfer performance of the unit change slowly with time, so regular updates can ensure the timeliness of the model. There are two methods of case update: one is to update the expansion regularly, and the other is to expand immediately when the process is modified or the operating conditions continue to change significantly. During the case update, the new production data are added to the data set while part of the oldest data are eliminated, and the new data set is used to adapt to the new operating conditions.
The MIP process of a fluid catalytic cracking unit in a refinery in northwest China was modeled for industrial validation. This unit has a production load of 300 million tons per annum, and its calibrated operating conditions were as follows: wax oil feed rate, 250 t h−1; residue oil feed rate, 140 t h−1; catalyst–oil ratio, 6.5; inlet temperature of the second reactor, 502 °C; inlet temperature of the first reactor, 512 °C; outlet temperature of the second reactor, 501 °C; settler top pressure, 0.18 MPa; riser pressure drop, 50 kPa; and regenerator pressure, 0.22 MPa.
The production data from October 2019 to May 2020 were collected, and a case base was created according to the method presented in Section 3.1, including 42 DCS items and 9 LIMS analysis indices. The description of these variables is given in Appendix A, Tables S1 and S2.†
No. | Type | Name | Value | Unit |
---|---|---|---|---|
1 | General properties | Density (20 °C) | 913.5 | kg m−3 |
2 | Residual carbon | 3.1 | Wt% | |
3 | Slag mixing ratio | 0.436 | Dimensionless | |
4 | Boiling range temperature | Initial boiling point | 219 | °C |
5 | 5% distilled temperature | 324 | °C | |
6 | 10% distilled temperature | 348 | °C | |
7 | 30% distilled temperature | 400 | °C | |
8 | 50% distilled temperature | 435 | °C | |
9 | 70% distilled temperature | 489 | °C | |
10 | Element content | Sulfur | 0.35 | Wt% |
In this study, 22 important DCS items were selected as the influencing variables for optimization, and the process optimization calculation was carried out with the selected feed for different optimization objectives, including the maximum gasoline yield, the maximum total liquid yield, and the minimum coke yield, to obtain the results of the influencing variables. Specifically, the product yield distribution of three different working conditions (optimal, suboptimal and third best) under three different optimization objectives is given. See Table 2 for specific data results. The calculated optimal chemical parameters are given with the best gasoline yield as an example, as shown in Table 3.
Optimization objective and value (wt%) | Product | Yield under optimal condition (wt%) | Yield under second optimal condition (wt%) | Yield under third optimal condition (wt%) |
---|---|---|---|---|
Maximization of gasoline yield, 52.50% | Total liquid | 86.01 | 87.07 | 85.49 |
Gasoline | 52.5 | 52.16 | 52.02 | |
Diesel | 16.63 | 17.36 | 16.66 | |
Dry gas | 3.65 | 3.77 | 3.58 | |
LPG | 16.88 | 17.55 | 16.81 | |
Slurry | 3.53 | 3.81 | 3.81 | |
Coke | 6.81 | 5.35 | 7.12 | |
Maximization of total liquid yield, 87.07% | Total liquid | 87.07 | 86.06 | 86.01 |
Gasoline | 52.16 | 51.45 | 52.5 | |
Diesel | 17.36 | 17.06 | 16.63 | |
Dry gas | 3.77 | 3.71 | 3.65 | |
LPG | 17.55 | 17.55 | 16.88 | |
Slurry | 3.81 | 4.05 | 3.53 | |
Coke | 5.35 | 6.18 | 6.81 | |
Minimization of coke yield, 5.35% | Total liquid | 87.07 | 86.06 | 85.94 |
Gasoline | 52.16 | 51.45 | 51.44 | |
Diesel | 17.36 | 17.06 | 17.01 | |
Dry gas | 3.77 | 3.71 | 3.7 | |
LPG | 17.55 | 17.55 | 17.49 | |
Slurry | 3.81 | 4.05 | 4.08 | |
Coke | 5.35 | 6.18 | 6.28 |
No. | Item | Optimal condition | Second optimal condition | Third optimal condition |
---|---|---|---|---|
1 | TI3106B | 506.48 | 503.86 | 506.34 |
2 | TI3106A | 511.82 | 509.48 | 511.8 |
3 | TI3111 | 676.02 | 679.39 | 677.46 |
4 | FIC3105 | 1.53 | 1.49 | 1.52 |
5 | FIC3208 | 168.66 | 176.88 | 170.99 |
6 | FIC3209 | 216.92 | 237.64 | 213.65 |
7 | FIC3109 | 2.25 | 2.26 | 2.25 |
8 | PdI3122 | 69.47 | 68.11 | 69.34 |
9 | PdIC3103 | 52.23 | 54.45 | 48.87 |
10 | DI3102 | 32.47 | 50.11 | 31.23 |
11 | TIC3101 | 496.73 | 495.49 | 496.02 |
12 | TIC3204 | 197.67 | 193.59 | 197.5 |
13 | FIC3111 | 7.5 | 5.01 | 7.5 |
14 | FIC3110 | 3 | 5.5 | 3 |
15 | PI3106 | 0.23 | 0.23 | 0.23 |
16 | TIC3125 | 694.58 | 694.91 | 697.21 |
17 | TI3131A | 669.51 | 671.82 | 669.75 |
18 | TI3126A | 699.97 | 699.98 | 703.64 |
19 | TIC3102 | 691.88 | 693.73 | 700.38 |
20 | PI3110 | 0.3 | 0.31 | 0.3 |
21 | DI3112 | 419.34 | 415.17 | 437.66 |
22 | FIC3122 | 2524.2 | 2629.98 | 2374.71 |
MATLAB 2014b (MathWorks, Inc.) was used to modeling for process optimization. Computer configuration: processor: Intel (R) Xeon (R) Gold 5117 CPU @ 2.00 GHz (dual processor); memory: 32.0 GB; operating system type: 64-bit. The optimization calculation process, including case extraction and case reuse, was controlled to be completed within 1 s.
The benefits were calculated with the following considerations. The energy consumption was based on the relevant Chinese national standard. The energy consumption for processing heavy raw oil should be less than 85 kg of standard oil per ton, and the product with increased yield was multiplied by an energy consumption coefficient of 0.085. Coke was processed by the regenerator to provide energy for the unit at no cost. The maximization of the total liquid yield was the objective for optimization. The yield change and profit calculation of each product are shown in Table 4. The profit change was calculated as follows:
Material flow | Yield change (%) | Average price (yuan per ton) | Average profit (yuan per ton) |
---|---|---|---|
Dry gas | 0.08 | 2000 | 200 |
Liquefied petroleum gas | 0.36 | 3500 | 350 |
Catalytic gasoline | 0.57 | 5000 | 500 |
Catalytic diesel | 0.17 | 4000 | 400 |
Slurry | −0.01 | 2500 | 250 |
Coke | −1.17 | 0 | — |
It is estimated that the use of the optimized system can bring about a profit improvement of approximately 13.52 million yuan per year for a 3 million ton/year fluid catalytic cracking unit.
Although the application of real-time operation and optimal control based on rigorous mechanistic models in complex production processes is still in its infancy, the CBR method used in this paper is advantageous in terms of data models. The fusion of the two technical methods will become an important research topic and direction in the future. In particular, the development of plant-wide process optimization in real time using mechanism-based big data technology may be achieved.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d1ra03228c |
This journal is © The Royal Society of Chemistry 2021 |