
To Analyze The Dataset For Air Pollution by Air Quality Prediction Based On Supervised With Classification Machine Learning Approach Abstract

The document discusses analyzing a dataset on air pollution using supervised machine learning classification techniques to accurately predict air quality index values. It will analyze the dataset to identify variables, relationships between variables, and handle missing data. Various supervised machine learning algorithms will be compared to identify the best model for air pollution prediction based on accuracy metrics like precision, recall, and F1 score. The goal is to determine the most effective machine learning technique for air quality forecasting.

Uploaded by

Varun Punk
Copyright
© All Rights Reserved

Analyzing an air pollution dataset for air quality prediction using a supervised classification machine learning approach

Abstract:

Air pollution is the release of pollutants into the air that are detrimental to human health and to the
planet as a whole. It is among the most serious threats humanity has faced, and it damages animals, crops,
and forests. To address this problem in the transport sector, air quality must be predicted from pollutant
measurements using machine learning techniques; air quality evaluation and prediction has therefore become
an important research area. The aim of this work is to investigate machine learning based techniques for
air quality forecasting and to identify the one that gives the best prediction accuracy. The dataset will be
analyzed with supervised machine learning techniques (SMLT), covering variable identification, univariate,
bivariate, and multivariate analysis, missing value treatment, data validation, data cleaning and
preparation, and data visualization, all carried out on the entire given dataset. Our analysis provides a
comprehensive guide to the sensitivity of prediction performance to model parameters, measured by
accuracy. We propose a machine learning based method that predicts the Air Quality Index value, selected
by comparing the accuracy of supervised classification algorithms. Additionally, we compare and discuss
the performance of the various algorithms on the given transport traffic department dataset using the
classification report and the confusion matrix, and categorize the data by priority. The effectiveness of
the proposed technique is assessed by accuracy, precision, recall, and F1 score.
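
The evaluation metrics named in the abstract (confusion matrix, accuracy, precision, recall, and F1 score) can be illustrated with a small sketch. This is only an illustration using the Python standard library, not the project's implementation: the three air quality category names, the label lists, and the function names are all made-up placeholders, since the abstract does not describe the dataset's classes or the code used.

```python
from collections import Counter

def confusion_matrix(y_true, y_pred, labels):
    """Rows are true classes, columns are predicted classes."""
    counts = Counter(zip(y_true, y_pred))
    return [[counts[(t, p)] for p in labels] for t in labels]

def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted class equals the true class."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def classification_report(y_true, y_pred, labels):
    """Per-class precision, recall, and F1 derived from the confusion matrix."""
    matrix = confusion_matrix(y_true, y_pred, labels)
    report = {}
    for i, label in enumerate(labels):
        tp = matrix[i][i]                           # predicted i, truly i
        fp = sum(row[i] for row in matrix) - tp     # predicted i, truly other
        fn = sum(matrix[i]) - tp                    # truly i, predicted other
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        report[label] = {"precision": precision, "recall": recall, "f1": f1}
    return report

# Hypothetical air quality categories and predictions (placeholder data).
labels = ["good", "moderate", "poor"]
y_true = ["good", "good", "moderate", "moderate", "poor", "poor", "poor", "good"]
y_pred = ["good", "moderate", "moderate", "moderate", "poor", "good", "poor", "good"]

matrix = confusion_matrix(y_true, y_pred, labels)
report = classification_report(y_true, y_pred, labels)

print("accuracy:", accuracy(y_true, y_pred))
for label, row in zip(labels, matrix):
    print(label, row, report[label])
```

Running the same functions on the predictions of each candidate classifier gives comparable per-class scores. Accuracy alone can hide poor recall on a rare category, which is one reason to report precision, recall, and F1 alongside it.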
Submitted On:

Student 1 Name and Reg No:

Student 2 Name and Reg No:

Project Batch No (student may fill once it is assigned):

Guide Name and Employee ID:

Guide Signature with date:

Guide Comments:

MDD Faculty Name and Signature with date:

MDD Faculty Comments:
