Machine Learning Approach-based Big Data Imputation Methods for Outdoor Air Quality Forecasting
Online Publishing @ NISCAIR
View Archive InfoField | Value | |
Authentication Code |
dc |
|
Title Statement |
Machine Learning Approach-based Big Data Imputation Methods for Outdoor Air Quality Forecasting |
|
Added Entry - Uncontrolled Name |
Narasimhan, D ; Department of Mathematics, Srinivasa Ramanujan Centre, SASTRA Deemed to be University, Kumbakonam 612 001, Tamil Nadu, India Vanitha, M ; Department of Computer Science and Engineering, Srinivasa Ramanujan Centre, SASTRA Deemed to be University, Kumbakonam 612 001, Tamil Nadu, India |
|
Uncontrolled Index Term |
Air quality, Big data analytics, Classification, Ensemble, Multiple imputation |
|
Summary, etc. |
Missing data from ambient air databases is a typical issue, but it is much worse in small towns or cities. Missing data is a significant concern for environmental epidemiology. These settings have high pollution exposure levels worldwide, and dataset gaps obstruct health investigations that could later affect local and international policies. When a substantial number of observations contain missing values, the standard errors increase due to the smaller sample size, which may significantly affect the final result. Generally, the performance of various missing value imputation algorithms is proportional to the size of the database and the percentage of missing values within it. This paper proposes and demonstrates an ensemble – imputation – classification framework approach to rebuild air quality information using a dataset from Beijing, China, to forecast air quality. Various single and multiple imputation procedures are utilized to fill the missing records. Then ensemble of diverse classifiers is used on the imputed data to find the air pollution level. The recommended model aims to reduce the error rate and improve accuracy. Extensive testing of datasets with actual missing values has revealed that the suggested methodology significantly enhances the air quality forecasting model’s accuracy with multiple imputation and ensemble techniques when compared to other conventional single imputation techniques. |
|
Publication, Distribution, Etc. |
Journal of Scientific & Industrial Research 2023-03-09 20:08:18 |
|
Electronic Location and Access |
application/pdf http://op.niscair.res.in/index.php/JSIR/article/view/71764 |
|
Data Source Entry |
Journal of Scientific & Industrial Research; ##issue.vol## 82, ##issue.no## 03 (2023) |
|
Language Note |
en |
|