Physical-oriented and machine learning-based emission modeling in a diesel compression ignition engine: Dimensionality reduction and regression

Research output: Contribution to journalArticleResearchpeer review

Authors

  • Aran Mohammad
  • Reza Rezaei
  • Christopher Hayduk
  • Thaddaeus Delebinski
  • Saeid Shahpouri
  • Mahdi Shahbakhti

External Research Organisations

  • IAV GmbH
  • University of Alberta
View graph of relations

Details

Original languageEnglish
Pages (from-to)904-918
Number of pages15
JournalInternational Journal of Engine Research
Volume24
Issue number3
Early online date6 Jan 2022
Publication statusPublished - Mar 2023
Externally publishedYes

Abstract

The development of internal combustion engines is affected by the exhaust gas emissions legislation and the striving to increase performance. This demands for engine-out emission models that can be used for engine optimization for real driving emission controls. The prediction capability of physically and data-driven engine-out emission models is influenced by the system inputs, which are specified by the user and can lead to an improved accuracy with increasing number of inputs. Thereby the occurrence of irrelevant inputs becomes more probable, which have a low functional relation to the emissions and can lead to overfitting. Alternatively, data-driven methods can be used to detect irrelevant and redundant inputs. In this work, thermodynamic states are modeled based on 772 stationary measured test bench data from a commercial vehicle diesel engine. Afterward, 37 measured and modeled variables are led into a data-driven dimensionality reduction. For this purpose, approaches of supervised learning, such as lasso regression and linear support vector machine, and unsupervised learning methods like principal component analysis and factor analysis are applied to select and extract the relevant features. The selected and extracted features are used for regression by the support vector machine and the feedforward neural network to model the NOx, CO, HC, and soot emissions. This enables an evaluation of the modeling accuracy as a result of the dimensionality reduction. Using the methods in this work, the 37 variables are reduced to 25, 22, 11, and 16 inputs for NOx, CO, HC, and soot emission modeling while maintaining the accuracy. The features selected using the lasso algorithm provide more accurate learning of the regression models than the extracted features through principal component analysis and factor analysis. This results in test errors RMSETe for modeling NOx, CO, HC, and soot emissions 19.22 ppm, 6.46 ppm, 1.29 ppm, and 0.06 FSN, respectively.

Keywords

    diesel compression ignition engine, dimensionality reduction, Emission modeling, machine learning, regression

ASJC Scopus subject areas

Cite this

Physical-oriented and machine learning-based emission modeling in a diesel compression ignition engine: Dimensionality reduction and regression. / Mohammad, Aran; Rezaei, Reza; Hayduk, Christopher et al.
In: International Journal of Engine Research, Vol. 24, No. 3, 03.2023, p. 904-918.

Research output: Contribution to journalArticleResearchpeer review

Mohammad A, Rezaei R, Hayduk C, Delebinski T, Shahpouri S, Shahbakhti M. Physical-oriented and machine learning-based emission modeling in a diesel compression ignition engine: Dimensionality reduction and regression. International Journal of Engine Research. 2023 Mar;24(3):904-918. Epub 2022 Jan 6. doi: 10.1177/14680874211070736
Download
@article{e2911e8c322b418ab12db6421d7ab07e,
title = "Physical-oriented and machine learning-based emission modeling in a diesel compression ignition engine: Dimensionality reduction and regression",
abstract = "The development of internal combustion engines is affected by the exhaust gas emissions legislation and the striving to increase performance. This demands for engine-out emission models that can be used for engine optimization for real driving emission controls. The prediction capability of physically and data-driven engine-out emission models is influenced by the system inputs, which are specified by the user and can lead to an improved accuracy with increasing number of inputs. Thereby the occurrence of irrelevant inputs becomes more probable, which have a low functional relation to the emissions and can lead to overfitting. Alternatively, data-driven methods can be used to detect irrelevant and redundant inputs. In this work, thermodynamic states are modeled based on 772 stationary measured test bench data from a commercial vehicle diesel engine. Afterward, 37 measured and modeled variables are led into a data-driven dimensionality reduction. For this purpose, approaches of supervised learning, such as lasso regression and linear support vector machine, and unsupervised learning methods like principal component analysis and factor analysis are applied to select and extract the relevant features. The selected and extracted features are used for regression by the support vector machine and the feedforward neural network to model the NOx, CO, HC, and soot emissions. This enables an evaluation of the modeling accuracy as a result of the dimensionality reduction. Using the methods in this work, the 37 variables are reduced to 25, 22, 11, and 16 inputs for NOx, CO, HC, and soot emission modeling while maintaining the accuracy. The features selected using the lasso algorithm provide more accurate learning of the regression models than the extracted features through principal component analysis and factor analysis. This results in test errors RMSETe for modeling NOx, CO, HC, and soot emissions 19.22 ppm, 6.46 ppm, 1.29 ppm, and 0.06 FSN, respectively.",
keywords = "diesel compression ignition engine, dimensionality reduction, Emission modeling, machine learning, regression",
author = "Aran Mohammad and Reza Rezaei and Christopher Hayduk and Thaddaeus Delebinski and Saeid Shahpouri and Mahdi Shahbakhti",
year = "2023",
month = mar,
doi = "10.1177/14680874211070736",
language = "English",
volume = "24",
pages = "904--918",
journal = "International Journal of Engine Research",
issn = "1468-0874",
publisher = "SAGE Publications Ltd",
number = "3",

}

Download

TY - JOUR

T1 - Physical-oriented and machine learning-based emission modeling in a diesel compression ignition engine: Dimensionality reduction and regression

AU - Mohammad, Aran

AU - Rezaei, Reza

AU - Hayduk, Christopher

AU - Delebinski, Thaddaeus

AU - Shahpouri, Saeid

AU - Shahbakhti, Mahdi

PY - 2023/3

Y1 - 2023/3

N2 - The development of internal combustion engines is affected by the exhaust gas emissions legislation and the striving to increase performance. This demands for engine-out emission models that can be used for engine optimization for real driving emission controls. The prediction capability of physically and data-driven engine-out emission models is influenced by the system inputs, which are specified by the user and can lead to an improved accuracy with increasing number of inputs. Thereby the occurrence of irrelevant inputs becomes more probable, which have a low functional relation to the emissions and can lead to overfitting. Alternatively, data-driven methods can be used to detect irrelevant and redundant inputs. In this work, thermodynamic states are modeled based on 772 stationary measured test bench data from a commercial vehicle diesel engine. Afterward, 37 measured and modeled variables are led into a data-driven dimensionality reduction. For this purpose, approaches of supervised learning, such as lasso regression and linear support vector machine, and unsupervised learning methods like principal component analysis and factor analysis are applied to select and extract the relevant features. The selected and extracted features are used for regression by the support vector machine and the feedforward neural network to model the NOx, CO, HC, and soot emissions. This enables an evaluation of the modeling accuracy as a result of the dimensionality reduction. Using the methods in this work, the 37 variables are reduced to 25, 22, 11, and 16 inputs for NOx, CO, HC, and soot emission modeling while maintaining the accuracy. The features selected using the lasso algorithm provide more accurate learning of the regression models than the extracted features through principal component analysis and factor analysis. This results in test errors RMSETe for modeling NOx, CO, HC, and soot emissions 19.22 ppm, 6.46 ppm, 1.29 ppm, and 0.06 FSN, respectively.

AB - The development of internal combustion engines is affected by the exhaust gas emissions legislation and the striving to increase performance. This demands for engine-out emission models that can be used for engine optimization for real driving emission controls. The prediction capability of physically and data-driven engine-out emission models is influenced by the system inputs, which are specified by the user and can lead to an improved accuracy with increasing number of inputs. Thereby the occurrence of irrelevant inputs becomes more probable, which have a low functional relation to the emissions and can lead to overfitting. Alternatively, data-driven methods can be used to detect irrelevant and redundant inputs. In this work, thermodynamic states are modeled based on 772 stationary measured test bench data from a commercial vehicle diesel engine. Afterward, 37 measured and modeled variables are led into a data-driven dimensionality reduction. For this purpose, approaches of supervised learning, such as lasso regression and linear support vector machine, and unsupervised learning methods like principal component analysis and factor analysis are applied to select and extract the relevant features. The selected and extracted features are used for regression by the support vector machine and the feedforward neural network to model the NOx, CO, HC, and soot emissions. This enables an evaluation of the modeling accuracy as a result of the dimensionality reduction. Using the methods in this work, the 37 variables are reduced to 25, 22, 11, and 16 inputs for NOx, CO, HC, and soot emission modeling while maintaining the accuracy. The features selected using the lasso algorithm provide more accurate learning of the regression models than the extracted features through principal component analysis and factor analysis. This results in test errors RMSETe for modeling NOx, CO, HC, and soot emissions 19.22 ppm, 6.46 ppm, 1.29 ppm, and 0.06 FSN, respectively.

KW - diesel compression ignition engine

KW - dimensionality reduction

KW - Emission modeling

KW - machine learning

KW - regression

UR - http://www.scopus.com/inward/record.url?scp=85122733541&partnerID=8YFLogxK

U2 - 10.1177/14680874211070736

DO - 10.1177/14680874211070736

M3 - Article

AN - SCOPUS:85122733541

VL - 24

SP - 904

EP - 918

JO - International Journal of Engine Research

JF - International Journal of Engine Research

SN - 1468-0874

IS - 3

ER -

By the same author(s)