Skip to main navigation Skip to search Skip to main content

PM2.5 concentration forecasting: Development of integrated multivariate variational mode decomposition with kernel Ridge regression and weighted mean of vectors optimization

  • Hai Tao
  • , Iman Ahmadianfar
  • , Leonardo Goliatt
  • , Syed Shabi Ul Hassan Kazmi
  • , Mohamed A. Yassin
  • , Atheer Y. Oudah
  • , Raad Z. Homod
  • , Hussein Togun
  • , Zaher Mundher Yaseen
  • Nanchang Institute of Science and Technology
  • Qiannan Normal College for Nationalities
  • Behbahan Khatam Alanbia Univ. of Technology
  • Universidade Federal de Juiz de Fora
  • Shantou University
  • King Fahd University of Petroleum and Minerals
  • University of Thi-Qar
  • Al-Ayen University
  • Basra Univirsity of Oil and Gas
  • University of Baghdad

Research output: Contribution to journalArticlepeer-review

15 Scopus citations

Abstract

The accurate prediction of the PM2.5 air quality parameter within industrial and urban settings is the most pressing issue examined by the researchers because it has high health implications. However, accurately forecasting PM2.5 levels is crucial. The traditional machine learning (ML) models are incapable, as the indices fluctuate daily. To successfully manage this problem, a new ML framework is proposed that incorporates various techniques, such as LGBM feature selection (light gradient-boosting machine), MVMD (multivariate variational mode decomposition), KRidge (kernel Ridge regression), and INFO (weighted mean of vectors). The proposed framework is used to estimate PM2.5 pollution at specific stations in China for a one-time prediction. The LGBM feature selection technique is the first step of the pre-processing, which performs to select the most important variables. Next, the MVMD splits the initial signal into intrinsic mode functions (IMFs), accommodating the signal's non-stationary multivariate. Following that, the KRidge approach is applied to each sub-component using the best input feature, and the resulting predictions are summed up to get the PM2.5 levels. To assess the validity of the proposed MVMD-KRidge-INFO model, the categories of gaussian-process-regression (GPR), locally-weighted-linear-regression (LWLR), and multivariate adaptive regression splines (MARS) are analyzed in individual and hybrid moldes. As observed from the research results, MVMD-KRidge-INFO performs optimally at forecasting PM2.5 levels at Huairou and Shunyi, as evidenced by R, RMSE, MAPE, IA, MdAE, and U95% metrics.

Original languageEnglish
Article number102125
JournalAtmospheric Pollution Research
Volume15
Issue number6
DOIs
StatePublished - Jun 2024
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being
  2. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities

Keywords

  • Air quality forecasting
  • INFO algorithm
  • Kernel ridge regression
  • LGBM feature selection
  • MVMD
  • PM

Fingerprint

Dive into the research topics of 'PM2.5 concentration forecasting: Development of integrated multivariate variational mode decomposition with kernel Ridge regression and weighted mean of vectors optimization'. Together they form a unique fingerprint.

Cite this