Great Step Data Abstract

1. The team analyzed safety data and evaluated several machine learning models for classification, including Naive Bayes, Decision Tree, and Support Vector Machines (SVMs) with different kernels.
2. The best performing models were Decision Tree with 95.06% accuracy and SVM with a polynomial kernel achieving 92.93% accuracy.
3. To further improve the SVM model, the team tuned hyperparameters like cost, epsilon, degree, and kernel type and found the polynomial kernel with specified values for these hyperparameters achieved the best accuracy.

GREAT STEP – SAFETY DATA ANALYTICS ABSTRACT SUBMISSION

TEAM MATES:

SRI CHANDRA DUDDU – 14AG36001

SRICHANDRA CHILAPPAGARI – 13EC35014

Abstract Submission:

1. We reviewed the predictor variables and dropped ‘Id’ and ‘Phone number’, since
they are unique to each customer and carry no predictive information. This is
also evident from the variable-importance plot produced by the randomForest
package in R.
2. In the same importance plot, ‘Area Code’ is the least important variable, with
< 5% importance.
3. One-hot encoding the categorical variable ‘State’ decreased the accuracy, so we
dropped this variable as well.
4. There are no missing values in the data. We performed stratified sampling with
‘createDataPartition’ and divided the dataset into train and test sets with a
70:30 split.
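The split above was done in R with caret's createDataPartition. As an illustrative sketch only, the same stratified 70:30 split can be written with scikit-learn's train_test_split; the toy frame and the column name ‘churn’ below are assumptions standing in for the real data, not the actual schema.

```python
# Illustrative Python equivalent of caret::createDataPartition (the original
# work was in R). The data frame and column names here are placeholders.
import pandas as pd
from sklearn.model_selection import train_test_split

# Toy frame standing in for the data; the real set has no missing values.
df = pd.DataFrame({
    "account_length": range(100),
    "churn": [False] * 85 + [True] * 15,   # imbalanced classes, as in our data
})

X = df.drop(columns="churn")
y = df["churn"]

# stratify=y preserves the class ratio in both splits, which is what a
# stratified createDataPartition split does.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, train_size=0.7, stratify=y, random_state=42
)
print(len(X_train), len(X_test))  # 70 30
```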

RESULTS-

1. For Naive Bayes:

Reference
Prediction False True
False 1225 116
True 62 96

Accuracy: 88.12 %
Precision: 91.13 %
Recall: 95.2 %
2. For Decision Tree:

Reference
Prediction False True
False 1269 56
True 18 156

Accuracy: 95.06 %
Precision: 95.8 %
Recall: 98.6 %

3. For SVM – radial kernel:

Reference
Prediction False True
False 1275 117
True 12 95

Accuracy: 91.39 %
Precision: 91.6 %
Recall: 99.1 %

4. For SVM – polynomial kernel:

Reference
Prediction False True
False 1280 123
True 7 89

Accuracy: 91.32 %
Precision: 91.2 %
Recall: 99.5 %
5. For SVM – Linear kernel:

Reference
Prediction False True
False 1287 212
True 0 0

Accuracy: 85.85 %
Precision: 85.9 %
Recall: 100 %

6. For SVM – sigmoid kernel:

Reference
Prediction False True
False 1195 190
True 92 22

Accuracy: 81.18 %
Precision: 86.3 %
Recall: 92.9 %
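The accuracy, precision, and recall figures above follow directly from each confusion matrix when ‘False’ is treated as the positive class (predictions in rows, reference in columns). A small sketch that reproduces the Decision Tree numbers:

```python
# Recompute accuracy/precision/recall from a 2x2 confusion matrix, with
# 'False' as the positive class, matching the tables above.

def metrics(tp, fn, fp, tn):
    """tp: predicted False, actually False; fn: predicted True, actually False;
    fp: predicted False, actually True; tn: predicted True, actually True."""
    accuracy = (tp + tn) / (tp + fn + fp + tn)
    precision = tp / (tp + fp)   # of rows predicted False, how many were False
    recall = tp / (tp + fn)      # of actual False cases, how many we caught
    return accuracy, precision, recall

# Decision Tree table: rows (1269, 56) and (18, 156)
acc, prec, rec = metrics(tp=1269, fn=18, fp=56, tn=156)
print(f"{acc:.2%} {prec:.2%} {rec:.2%}")  # 95.06% 95.77% 98.60%
```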

There are no parameters to be tuned in rpart and Naive Bayes. To improve the
accuracy of the support vector classifier, we needed to select the best
parameters for the model. We trained many models over pairs of ϵ (epsilon) and
cost values, and chose the best one based on the root-mean-square error (RMSE).
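The search above was run in R with e1071. A comparable sketch with scikit-learn's GridSearchCV is shown below; the grid values, the synthetic data, and the choice of cross-validated accuracy as the selection score are illustrative assumptions, not the grid or criterion actually used (epsilon belongs to e1071's interface and has no counterpart in sklearn's SVC).

```python
# Illustrative hyperparameter grid search for a polynomial-kernel SVM.
# The original tuning used e1071 in R; the grid below is a placeholder.
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

# Imbalanced toy data standing in for the real training set.
X, y = make_classification(n_samples=300, n_features=10, weights=[0.85],
                           random_state=0)

grid = {
    "C": [1, 4, 16],          # 'cost' in e1071
    "degree": [2, 3],
    "kernel": ["poly"],
}
search = GridSearchCV(SVC(gamma="scale"), grid, cv=3)
search.fit(X, y)
print(search.best_params_)
```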

For SVM:

Best possible accuracy: 92.93%

Gamma: 0.0556
Cost: 16
Epsilon: 0
Degree: 3
Kernel: polynomial
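For reference, a model with the tuned values above could be written in scikit-learn roughly as follows; this is a sketch on synthetic data, not the fit we actually ran in R, and epsilon (an e1071 parameter) has no counterpart in SVC for classification.

```python
# Sketch of the tuned polynomial-kernel SVM in scikit-learn terms
# (the original was fit in R with e1071; e1071's 'cost' maps to C).
from sklearn.datasets import make_classification
from sklearn.svm import SVC

# Synthetic placeholder data; the real fit used our training split.
X, y = make_classification(n_samples=200, n_features=10, random_state=1)

clf = SVC(kernel="poly", degree=3, C=16, gamma=0.0556)
clf.fit(X, y)
print(clf.score(X, y))  # training accuracy on the toy data
```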

In the figure below, dark-blue regions represent the SVM models with lower RMSE
values: the darker the region, the lower the model's RMSE.
