Arwa S. Sayegh 1, Said Munir2, Turki M. Habeebullah2

  • 1 SETS International, Beirut, Lebanon
  • 2 The Custodian of the Two Holy Mosques Institute for Hajj and Umrah Research, Umm Al Qura University, Makkah, Saudi Arabia

Received: July 31, 2013
Revised: January 2, 2014
Accepted: January 2, 2014
Download Citation: ||https://doi.org/10.4209/aaqr.2013.07.0259 

  • Download: PDF


Cite this article:
Sayegh, A.S., Munir, S. and Habeebullah, T.M. (2014). Comparing the Performance of Statistical Models for Predicting PM10 Concentrations. Aerosol Air Qual. Res. 14: 653-665. https://doi.org/10.4209/aaqr.2013.07.0259


 

ABSTRACT


The ability to accurately model and predict the ambient concentration of Particulate Matter (PM) is essential for effective air quality management and policies development. Various statistical approaches exist for modelling air pollutant levels. In this paper, several approaches including linear, non-linear, and machine learning methods are evaluated for the prediction of urban PM10 concentrations in the City of Makkah, Saudi Arabia. The models employed are Multiple Linear Regression Model (MLRM), Quantile Regression Model (QRM), Generalised Additive Model (GAM), and Boosted Regression Trees1-way (BRT1) and 2-way (BRT2). Several meteorological parameters and chemical species measured during 2012 are used as covariates in the models. Various statistical metrics, including the Mean Bias Error (MBE), Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), the fraction of prediction within a Factor of Two (FACT2), correlation coefficient (R), and Index of Agreement (IA) are calculated to compare the predictive performance of the models. Results show that both MLRM and QRM captured the mean PM10 levels. However, QRM topped the other models in capturing the variations in PM10 concentrations. Based on the values of error indices, QRM showed better performance in predicting hourly PM10 concentrations. Superiority over the other models is explained by the ability of QRM to model the contribution of covariates at different quantiles of the modelled variable (here PM10). In this way QRM provides a better approximation procedure compared to the other modelling approaches, which consider a single central tendency response to a set of independent variables. Numerous recent studies have used these modelling approaches, however this is the first study that compares their performance for predicting PM10 concentrations.


Keywords: Quantile regression model; Performance evaluation; Multiple linear regression; Generalised additive model; Boosted regression trees


Latest Articles

Impact Factor: 2.735

5-Year Impact Factor: 2.827


SCImago Journal & Country Rank

Enter your email below to receive latest published articles in your field.