To Analyze The Dataset For Air Pollution by Air Quality Prediction Based On Supervised With Classification Machine Learning Approach Abstract
Abstract:
Air pollution is the release of pollutants into the air that are detrimental to human health and to the
planet as a whole. It is among the most dangerous threats humanity has faced, and it damages animals,
crops, forests and more. To address this problem in the transport sector, air quality must be predicted
from pollutant measurements using machine learning techniques; air quality evaluation and prediction has
therefore become an important research area. The aim of this work is to investigate machine learning based
techniques for air quality forecasting and to identify the one that predicts with the best accuracy. The
dataset is analyzed with supervised machine learning techniques (SMLT) to capture several kinds of
information: variable identification, univariate, bivariate and multivariate analysis, and missing value
treatment. Data validation, data cleaning/preparation and data visualization are carried out on the entire
given dataset. Our analysis provides a comprehensive guide to the sensitivity of model parameters with
regard to accuracy in predicting air quality. We propose a machine learning based method that predicts the
Air Quality Index value, selected by comparing the accuracy of supervised classification algorithms.
Additionally, we compare and discuss the performance of various machine learning algorithms on the given
transport traffic department dataset using the classification report and the confusion matrix, and
categorize the data by priority. The effectiveness of the proposed technique is assessed by comparing
accuracy, precision, recall and F1 score.
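The evaluation measures named above (confusion matrix, accuracy, precision, recall and F1 score) can be illustrated with a minimal Python sketch. It is not the project's implementation: the class labels "good", "moderate" and "poor", the sample predictions, and the function names are hypothetical and chosen only for illustration.

```python
from collections import Counter


def confusion_matrix(y_true, y_pred, labels):
    """Return a matrix whose rows are true classes and columns are predicted classes."""
    counts = Counter(zip(y_true, y_pred))
    return [[counts[(t, p)] for p in labels] for t in labels]


def classification_report(y_true, y_pred, labels):
    """Compute overall accuracy plus per-class precision, recall and F1 score."""
    matrix = confusion_matrix(y_true, y_pred, labels)
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(labels)))
    report = {"accuracy": correct / total if total else 0.0}
    for i, label in enumerate(labels):
        tp = matrix[i][i]                          # correctly predicted as this class
        predicted = sum(row[i] for row in matrix)  # column sum: all predictions of this class
        actual = sum(matrix[i])                    # row sum: all true members of this class
        precision = tp / predicted if predicted else 0.0
        recall = tp / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        report[label] = {"precision": precision, "recall": recall, "f1": f1}
    return report


# Hypothetical air quality categories and classifier output (illustration only).
labels = ["good", "moderate", "poor"]
y_true = ["good", "good", "moderate", "poor", "poor", "moderate"]
y_pred = ["good", "moderate", "moderate", "poor", "good", "moderate"]

if __name__ == "__main__":
    for row in confusion_matrix(y_true, y_pred, labels):
        print(row)
    print(classification_report(y_true, y_pred, labels))
```

Running the same report for each candidate classifier on a held-out test split gives one way to compare algorithms on accuracy, precision, recall and F1 score.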