Abstract
Given the importance of engine tail flame component concentrations to infrared spectral radiation intensity, an efficient infrared spectral concentration solution model is proposed: the CARS-CNN-GRU model, which combines the competitive adaptive reweighted sampling (CARS) algorithm with a convolutional neural network (CNN)-gated recurrent unit (GRU) deep learning network. The method uses the CARS algorithm to select the key wavelengths carrying the tail flame component concentration information, and then uses the CNN-GRU model to perform long-range dependency analysis on the sequence data and achieve multi-scale feature extraction. Simulation results show that, compared with traditional models, the CARS-CNN-GRU model solves H2O and CO2 concentrations with higher accuracy: its root mean square errors (RMSE) are reduced to 0.0014 and 0.0017, its R² values are 0.999 and 0.998, and its mean absolute errors (MAE) are 0.0011 and 0.0014, respectively. The proposed CARS-CNN-GRU model therefore shows superior performance in solving infrared spectral concentrations. Compared with traditional methods, it offers higher accuracy, stability and reliability, and provides strong support for stealth technology, environmental monitoring and combustion efficiency evaluation in the military and civil aviation fields.
Introduction
The infrared radiation characteristics of the engine tail flame are an important basis for detecting, identifying, warning of and tracking aircraft with infrared detection equipment [1]. As an important infrared radiation source of an aircraft, the radiation spectrum of the engine tail flame depends mainly on its composition and temperature. The tail flame is a high-temperature, high-speed airflow ejected by the aircraft engine; its main components are high-temperature gases such as H2O, CO2, N2 and CO, among which H2O and CO2 are the main active components that generate infrared radiation energy [2]. The combustion of different aviation kerosenes produces different concentrations of H2O and CO2, so solving the concentrations of the engine tail flame components is very important.
With the rapid advancement of computer technology, combining machine learning algorithms with infrared spectroscopy to accurately detect the concentrations of mixed gas components [3] has become a key research direction. Traditional gas detection methods are easily disturbed by external conditions (such as temperature and air pressure changes), resulting in poor stability of measurement results, and they usually have to be combined with other techniques to achieve the detection purpose. At present, infrared spectroscopy is considered one of the most ideal gas detection methods.
Infrared spectroscopy detection technology has attracted great attention from scholars in this field and has been widely used in practice [4]. Fang L et al. [5] used genetic algorithms to identify unknown components in Fourier transform infrared (FTIR) spectroscopy analysis. Evseev V [6] and Bharadwaj SP [7] used the HITEMP and CDSD databases to obtain the infrared spectrum of CO2 gas under high-temperature conditions and discussed the spectral characteristics of CO2 at high temperature. Yu Duanhui [8] used a radial basis function neural network algorithm to identify five gases, including CO. Shao L et al. [9] used FTIR technology to measure air, obtained a large number of spectral samples, and established classical least squares (CLS) and partial least squares (PLS) quantitative models to extract the concentration information of gases such as NH3 and CH4 from the spectra. Zhang L et al. [10] used chaos optimization to estimate the concentrations of formaldehyde and benzene, reducing the prediction errors by 26.03% and 16.4%, respectively.
A convolutional neural network (CNN) can extract features quickly and requires little data preprocessing, so it is well suited to processing spectral data. Cai Y et al. [11] proposed a multi-gas component measurement method that combines a CNN with a long short-term memory (LSTM) network, but interference from the on-site environment can cause significant errors when predicting low gas concentrations.
On this basis, this paper establishes a CARS-CNN-GRU model. The CARS algorithm is used to remove useless variables, and the CNN-GRU model is used to accurately solve the engine tail flame spectral concentrations. CNN has a strong feature extraction capability for one-dimensional spectral data [12], while the GRU network overcomes the gradient vanishing and gradient exploding problems that make ordinary recurrent networks difficult to train, and is better suited to time-series signals [13]. Combining the two is expected to greatly enhance the generalization ability and robustness of the model. In terms of spectral data dimensionality reduction, the CARS algorithm can effectively remove uninformative variables while minimizing the impact of collinear variables on the model, and finally select the variables that are most critical to the solution target [14].
1 Data acquisition and modeling
1.1 Construction of mixed gas dataset
The HITRAN database was used to obtain 100 sets of absorbance spectra of H2O and CO2 mixed gases of different concentrations, which served as the experimental data. The wavelength range was 2-5 μm, and each spectrum contained 3000 discrete data points (interval 0.001 μm). 80% of the data was used as the training set and the remaining 20% as the test set. The training set was used to train the neural network, and the test set was used to evaluate the performance of each gas concentration solution model. Some of the mixed gas spectral data are shown in Figure 1.

Fig. 1 Spectra of some mixed gases.
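For illustration, the following minimal Python sketch performs the 80/20 split described above. The array names `spectra` and `concentrations` are placeholders standing in for the 100 HITRAN spectra and their H2O/CO2 concentration labels, not the actual data pipeline.

```python
import numpy as np

# Placeholders for the 100 HITRAN spectra (3000 points each) and their
# H2O/CO2 concentrations; the real data would be loaded here.
spectra = np.zeros((100, 3000))
concentrations = np.zeros((100, 2))

rng = np.random.default_rng(0)
idx = rng.permutation(100)
train_idx, test_idx = idx[:80], idx[80:]          # 80% training, 20% test
X_train, X_test = spectra[train_idx], spectra[test_idx]
y_train, y_test = concentrations[train_idx], concentrations[test_idx]
```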
1.2 CARS Algorithm
CARS algorithm [15] is a method for variable selection, which is particularly suitable for spectral data analysis. It can effectively select the most informative features from a large number of variables. The core of the CARS algorithm is to determine the optimal variable subset through an iterative process, combined with the strategy of adjusting the weights of variables and gradually eliminating unimportant variables. The following is a simplified version of the CARS algorithm process and its mathematical description.
(1) Initialization process
Assume there are N samples, each of which contains p variables (features). Initially, the weight ωi of each variable is 1, where i = 1, 2, ..., p.
(2) PLS regression
Using PLS regression analysis, the regression model is calculated based on the current weight vector w. PLS is a linear modeling technique that is particularly suitable for high-dimensional data; it seeks latent linear combinations that explain the relationship between the spectral variables and the concentration to be solved.
(3) Weight update
Calculate an importance measure for each variable, such as the absolute value of the variable's coefficient |βi|. The weight of the variable is usually updated using formula (1) :
$\omega_i = \dfrac{|\beta_i|}{\sum_{j=1}^{p} |\beta_j|}, \quad i = 1, 2, \dots, p$ (1)
(4) Competitive Adaptive Sampling
Based on the updated weights, a certain proportion of variables are selected as "winners". These variables will be retained to participate in the next round of iterations. The remaining "loser" variables will be eliminated or their weights will be further reduced until the stopping criterion is met.
(5) Iteration process
The above process is repeated until a termination condition is met, such as the number of iterations reaching a preset upper limit or the weight change falling below a certain critical value.
(6) Final model
Finally, the final model is constructed from the retained variables. These variables are considered to be the most representative and informative features of the dataset.
The CARS algorithm dynamically adjusts sample weights and feature subsets so that important features receive higher weights, thereby improving the effects of feature selection and dimensionality reduction. The algorithm has good robustness and generalization capabilities.
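As a rough illustration of steps (1)-(6), the following simplified Python sketch fits a PLS model on the retained wavelengths, reweights variables by the magnitude of their regression coefficients as in formula (1), keeps an exponentially shrinking set of "winners" each round, and finally returns the subset with the lowest cross-validated RMSE (RMSECV). It omits the Monte Carlo resampling of calibration samples used by the full CARS algorithm, and the decay schedule and hyperparameters are illustrative assumptions, not the settings used in this paper.

```python
import numpy as np
from sklearn.cross_decomposition import PLSRegression
from sklearn.model_selection import cross_val_predict

def cars_select(X, y, n_iter=50, n_components=5, cv=5):
    """Simplified CARS: return the wavelength subset with the lowest RMSECV."""
    n, p = X.shape
    retained = np.arange(p)                              # (1) start with all wavelengths
    history = []
    for i in range(n_iter):
        # (2) PLS regression on the currently retained variables
        k = min(n_components, len(retained))
        pls = PLSRegression(n_components=k).fit(X[:, retained], y)
        # (3) importance = normalised absolute regression coefficients
        beta = np.abs(np.ravel(pls.coef_))
        w = beta / beta.sum()
        # (4) competitive sampling: keep an exponentially shrinking fraction of "winners"
        ratio = (2.0 / p) ** (i / (n_iter - 1))          # decays from 1 to 2/p
        n_keep = min(max(2, int(round(p * ratio))), len(retained))
        retained = retained[np.argsort(w)[::-1][:n_keep]]
        # (5) evaluate the subset with cross-validated RMSE (RMSECV)
        k = min(n_components, len(retained))
        y_cv = cross_val_predict(PLSRegression(n_components=k), X[:, retained], y, cv=cv)
        rmsecv = float(np.sqrt(np.mean((y - np.ravel(y_cv)) ** 2)))
        history.append((rmsecv, retained.copy()))
    # (6) final model: the iteration with the minimum RMSECV
    best_rmsecv, best_vars = min(history, key=lambda t: t[0])
    return best_vars, best_rmsecv
```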
1.3 CNN
The design of CNN is inspired by the working mechanism of the biological visual cortex. By interleaving convolutional layers and pooling layers, its architecture effectively captures and extracts valuable information while reducing the data size. Through continuous exploration and improvement by researchers worldwide, CNNs have been rapidly updated and developed. The network consists of five main parts: an input layer, several convolutional layers, pooling layers, a fully connected layer and an output layer. Its structure is shown in Figure 2.

Fig. 2 CNN structure diagram.
1.4 GRU Network
The GRU network is a member of the recurrent neural network (RNN) family, and its design is an improvement on the LSTM network. The LSTM network effectively alleviates the gradient vanishing and gradient exploding problems that RNNs face when dealing with long-term dependencies by introducing input gate, output gate and forget gate mechanisms [16]. The GRU network is built on this basis, and its core component is also the gated recurrent unit.
As a variant of the LSTM network, the GRU network has a simpler structure. While achieving comparable performance, it therefore trains faster with fewer parameters and reduces the risk of overfitting. Compared with the three-gate mechanism of the LSTM network, the GRU network simplifies the gating functions to only an update gate and a reset gate.
The RNN structure is shown in Figure 3. In the GRU network, each traditional RNN node in the hidden layer is replaced by a GRU; the structure of each unit node is shown in Figure 4.

Fig. 3 RNN structure.

Fig. 4 GRU node model.
In Figure 4, xt represents the input information at the current moment; ht-1 represents the hidden state at the previous moment, which serves as the memory carrier of the neural network and contains the characteristic information of the data received at the previous time steps; ht represents the hidden state passed forward to the next moment; h̃t is the candidate hidden state; rt is the reset gate; and zt is the update gate.
The reset gate determines how to combine new input information with previous memory. This relationship is expressed in mathematical formula form as follows:
$r_t = \sigma(W_r \cdot [h_{t-1}, x_t])$ (2)
where σ is the logistic sigmoid function and Wr is the corresponding weight matrix. After rt is obtained, the candidate hidden state is computed as:
$\tilde{h}_t = \tanh(W_{\tilde{h}} \cdot [r_t \odot h_{t-1}, x_t])$ (3)
From formula (3) , we can see that the larger the rt value is, the more the previous moment is combined with the current moment. If the rt value is 1, it means that the hidden state information of the previous moment will be completely retained; if the rt value is 0, it means that the hidden state information of the previous moment will be completely ignored. Therefore, when processing time series data, the reset gate can effectively identify and capture short-term dependency relationships.
The update gate is used to update the memory, and its mathematical formula is as follows:
$z_t = \sigma(W_z \cdot [h_{t-1}, x_t])$ (4)
The expression for updating memory is as follows:
$h_t = (1 - z_t) \odot h_{t-1} + z_t \odot \tilde{h}_t$ (5)
The final memory ht forgets part of the ht-1 information passed down and adds part of the information from the current node. The closer zt is to 1, the more of the current candidate information is "remembered"; the closer zt is to 0, the more the previous hidden state is retained and the less new information is taken in.
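The following minimal NumPy sketch implements one GRU time step directly from Eqs. (2)-(5). Biases are omitted for brevity, and the weight matrices are assumed to act on the concatenation [ht-1, xt].

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_step(x_t, h_prev, W_r, W_z, W_h):
    """One GRU step following Eqs. (2)-(5); biases omitted."""
    xh = np.concatenate([h_prev, x_t])                            # [h_{t-1}, x_t]
    r_t = sigmoid(W_r @ xh)                                       # Eq. (2): reset gate
    z_t = sigmoid(W_z @ xh)                                       # Eq. (4): update gate
    h_cand = np.tanh(W_h @ np.concatenate([r_t * h_prev, x_t]))   # Eq. (3): candidate state
    h_t = (1.0 - z_t) * h_prev + z_t * h_cand                     # Eq. (5): final memory
    return h_t
```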
1.5 CARS-CNN-GRU model establishment
When solving the spectral concentrations of mixed gases, the CARS algorithm is used to select the characteristic bands, and the CNN-GRU model is applied to the selected data, which can effectively extract and exploit the characteristic information in the spectrum and thereby improve the solution accuracy. This section therefore combines the above algorithm with the model to obtain the CARS-CNN-GRU model and compares it with other algorithms. The modeling process is shown in Figure 5.

Fig. 5 CARS-CNN-GRU network modeling diagram.
The CARS-CNN-GRU model includes an input layer, a CARS algorithm, a convolution layer, a pooling layer, a GRU layer, a fully connected layer, and an output layer. The absorbance spectrum data of the mixed gas in the characteristic band is input, and the concentration is solved using different models.
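For reference, below is a hedged Keras sketch of the layer sequence just described (convolution, pooling, GRU, fully connected output). The filter count, kernel size and GRU width are illustrative assumptions rather than the paper's tuned hyperparameters; the input length of 112 corresponds to the number of CARS-selected wavelengths reported in Section 2.1.

```python
from tensorflow.keras import layers, models

def build_cnn_gru(n_wavelengths=112, n_gases=2):
    """Illustrative CNN-GRU regressor for CARS-selected absorbance spectra."""
    model = models.Sequential([
        layers.Input(shape=(n_wavelengths, 1)),                    # CARS-selected absorbances
        layers.Conv1D(16, 3, padding='same', activation='relu'),   # convolution layer
        layers.MaxPooling1D(pool_size=2),                          # pooling layer
        layers.GRU(32),                                            # GRU layer
        layers.Dense(n_gases),                                     # fully connected output: H2O, CO2
    ])
    model.compile(optimizer='adam', loss='mse')
    return model
```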
1.6 Model Evaluation Metrics
Root mean square error (RMSE), mean absolute error (MAE) and the coefficient of determination R² are introduced to evaluate the error of the mixed gas concentration solution. The expressions of RMSE, MAE and R² are as follows:
$\mathrm{RMSE} = \sqrt{\dfrac{1}{n}\sum_{i=1}^{n}(y_i - \hat{y}_i)^2}$ (6)
$\mathrm{MAE} = \dfrac{1}{n}\sum_{i=1}^{n}|y_i - \hat{y}_i|$ (7)
$R^2 = 1 - \dfrac{\sum_{i=1}^{n}(y_i - \hat{y}_i)^2}{\sum_{i=1}^{n}(y_i - \bar{y})^2}$ (8)
where yi is the true concentration, ŷi is the solved concentration, ȳ is the mean of the true concentrations, and n is the number of test samples.
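These three metrics can be computed directly with NumPy, as in the short sketch below.

```python
import numpy as np

def rmse(y_true, y_pred):
    """Root mean square error, Eq. (6)."""
    return np.sqrt(np.mean((y_true - y_pred) ** 2))

def mae(y_true, y_pred):
    """Mean absolute error, Eq. (7)."""
    return np.mean(np.abs(y_true - y_pred))

def r2(y_true, y_pred):
    """Coefficient of determination, Eq. (8)."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - np.mean(y_true)) ** 2)
    return 1.0 - ss_res / ss_tot
```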
2 Experimental results analysis
2.1 Characteristic wavelength extraction
A 50-round variable selection process (applied to the spectral absorption band of the mixed gas) was performed using the CARS algorithm. As the number of Monte Carlo sampling iterations increases, the number of selected wavelengths gradually decreases and the rate of decrease levels off, showing a trend of wavelength selection from coarse to fine. By comparing the root mean square error of cross-validation (RMSECV) of each iteration, the process continues until the iteration with the minimum RMSECV value is found, thereby determining the optimal wavelength subset.
Figure 6 shows the CARS characteristic band selection process. It can be seen that as the number of iterations increases, the number of selected wavelengths decreases, while the RMSECV curve first fluctuates downward to its lowest point and then gradually rises again. This shows that in the initial stage, eliminating irrelevant wavelength variables reduces the RMSECV value, whereas in the later stage, excessive elimination of relevant variables causes information loss and the RMSECV value increases accordingly. Specifically, when the CARS algorithm is applied to the infrared radiation spectrum of the engine tail flame, the RMSECV reaches its minimum at the 10th iteration, at which point 112 wavelength variables are retained.

Fig. 6 Schematic diagram of CARS characteristic band selection.
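In terms of the simplified cars_select sketch from Section 1.2, this selection step amounts to something like the following usage example; the array names are carried over from the earlier illustrative sketches and are hypothetical.

```python
# Run the simplified CARS sketch on the H2O concentration column and keep
# only the wavelengths of the lowest-RMSECV subset.
best_vars, best_rmsecv = cars_select(X_train, y_train[:, 0], n_iter=50)
X_train_sel = X_train[:, best_vars]
X_test_sel = X_test[:, best_vars]
```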
To verify that the CARS algorithm effectively improves model accuracy, this paper also uses the differential evolution (DE) algorithm and the successive projections algorithm (SPA) to select characteristic bands and compares them with the CARS algorithm.
2.2 Model solution results and analysis
The data after CARS characteristic band selection are input into the trained CNN-GRU gas concentration solution model, and the concentrations of the gas samples in the test set are solved by the neural network. At the same time, five additional methods, CARS-GRU, SPA-GRU, SPA-CNN-GRU, DE-GRU and DE-CNN-GRU, are used to invert the concentrations of the test set data, and the results are compared with those of the CARS-CNN-GRU model. The RMSE and MAE between the solved and true concentrations over all test set samples are used as evaluation indicators of algorithm performance.
The solution results of the CARS-CNN-GRU model for H2O and CO2 gas concentrations are shown in Figure 7(a) and Figure 7(b).

Fig. 7 Solution of mixed gas concentration: (a) H2O; (b) CO2.
To intuitively demonstrate the superiority of the proposed CARS-CNN-GRU model, the RMSE, MAE and R² comparisons are shown in Figure 8.

Fig. 8 Comparison of model error indicators: (a) RMSE; (b) MAE; (c) R².
Table 1 lists the RMSE, MAE and R² of each algorithm for solving H2O and CO2 concentrations. Combining Figure 8 and Table 1, it can be concluded that in terms of RMSE, CARS-CNN-GRU < DE-CNN-GRU < SPA-CNN-GRU < DE-GRU < SPA-GRU < CARS-GRU; in terms of MAE, CARS-CNN-GRU < SPA-CNN-GRU < DE-CNN-GRU < DE-GRU < SPA-GRU < CARS-GRU; and in terms of R², CARS-CNN-GRU > SPA-CNN-GRU > DE-CNN-GRU > DE-GRU > SPA-GRU > CARS-GRU. This shows that the CARS-CNN-GRU model has the smallest solution error and the best agreement between the solved and true values, and that all samples obtain good concentration solutions, demonstrating that the CARS-CNN-GRU model has good generalization ability.
Table 1 Model solution results

The gas concentration solution error of the CARS-CNN-GRU model is lower than that of other hybrid models, indicating that using the CARS algorithm to extract data features and then using the CNN algorithm to deeply extract features can reduce the error and enhance the solution performance of the GRU network.
In general, the CARS-CNN-GRU gas concentration solution model has the smallest RMSE value, the smallest MAE value and the largest R² value. Therefore, CARS-CNN-GRU is the best-performing model for solving gas concentration.
3 Conclusion
In order to solve the concentrations of the mixed gas components in the aircraft tail flame, this paper adds a CNN network to the GRU network to construct a CNN-GRU model; at the same time, the CARS algorithm is used to extract data features, and finally a tail flame infrared spectrum concentration solution method based on the CARS-CNN-GRU model is proposed. Five additional methods, CARS-GRU, SPA-GRU, SPA-CNN-GRU, DE-GRU and DE-CNN-GRU, are used to solve the concentrations of the test set data, and the results are compared with the predictions of the CARS-CNN-GRU model. The simulation results show that the CARS-CNN-GRU model solves infrared spectral concentrations more accurately than the other models: the RMSE values for the H2O and CO2 concentrations are as low as 0.0014 and 0.0017, the R² values are 0.999 and 0.998, and the MAE values are 0.0011 and 0.0014, respectively. Therefore, the model has a good ability to solve tail flame concentrations.
The concentration solution model in this paper is currently only applicable to high-concentration gases. In the future, the solution of trace gases in the tail flame will also be investigated to extend the applicable conditions of the algorithm.