0% found this document useful (0 votes)

20 views9 pages

ML Lab 8

The document outlines a machine learning project focused on predicting mobile phone price ranges using various classification algorithms, including Decision Trees, Random Forests, Support Vector Machines (SVM), and K-Nearest Neighbors (KNN). The SVM model achieved the highest accuracy of 95.75% and an AUC-ROC score of 0.9988, making it the recommended model for mobile price prediction. It also discusses challenges in traditional methods, data preprocessing, model training/testing, and evaluation metrics.

Uploaded by

vanshikasehrawat1085

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views9 pages

ML Lab 8

Uploaded by

vanshikasehrawat1085

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Sugandh 06701192023

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.cluster import KMeans
from imblearn.over_sampling import SMOTE
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder,
StandardScaler,label_binarize
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score, classification_report,
confusion_matrix,roc_auc_score
from sklearn.ensemble import RandomForestClassifier

df=pd.read_csv("train.csv - train.csv.csv")
df.head()

battery_power blue clock_speed dual_sim fc four_g int_memory

m_dep \
0 842 0 2.2 0 1 0 7
0.6
1 1021 1 0.5 1 0 1 53
0.7
2 563 1 0.5 1 2 1 41
0.9
3 615 1 2.5 0 0 0 10
0.8
4 1821 1 1.2 0 13 1 44
0.6

mobile_wt n_cores ... px_height px_width ram sc_h sc_w

talk_time \
0 188 2 ... 20 756 2549 9 7
19
1 136 3 ... 905 1988 2631 17 3
7
2 145 5 ... 1263 1716 2603 11 2
9
3 131 6 ... 1216 1786 2769 16 8
11
4 141 2 ... 1208 1212 1411 8 2
15
three_g touch_screen wifi price_range
0 0 0 1 1
1 1 1 0 2
2 1 1 0 2
3 1 0 0 2
4 1 1 0 1

[5 rows x 21 columns]

df.isnull().sum()

battery_power 0
blue 0
clock_speed 0
dual_sim 0
fc 0
four_g 0
int_memory 0
m_dep 0
mobile_wt 0
n_cores 0
pc 0
px_height 0
px_width 0
ram 0
sc_h 0
sc_w 0
talk_time 0
three_g 0
touch_screen 0
wifi 0
price_range 0
dtype: int64

# Split features and target

X = df.drop("price_range", axis=1) # Independent variables
y = df["price_range"]

X_train, X_test, y_train, y_test = train_test_split(X, y,

test_size=0.2, random_state=42, stratify=y)

dt_model = DecisionTreeClassifier(criterion='entropy', max_depth=5,

random_state=42) # You can use 'entropy' instead of 'gini'
dt_model.fit(X_train, y_train)

y_pred = dt_model.predict(X_test)
y_prob_dt = dt_model.predict_proba(X_test)
print("Accuracy:", accuracy_score(y_test, y_pred))
print("\nConfusion Matrix:\n", confusion_matrix(y_test, y_pred))
print("\nClassification Report:\n", classification_report(y_test,
y_pred))

Accuracy: 0.8725

Confusion Matrix:
[[90 10 0 0]
[ 4 89 7 0]
[ 0 11 76 13]
[ 0 0 6 94]]

Classification Report:
precision recall f1-score support

0 0.96 0.90 0.93 100

1 0.81 0.89 0.85 100
2 0.85 0.76 0.80 100
3 0.88 0.94 0.91 100

accuracy 0.87 400

macro avg 0.87 0.87 0.87 400
weighted avg 0.87 0.87 0.87 400

rf_model = RandomForestClassifier(n_estimators=100, random_state=42,

max_depth=10)
rf_model.fit(X_train, y_train)

RandomForestClassifier(max_depth=10, random_state=42)

dt_model = DecisionTreeClassifier(criterion='gini', max_depth=5,

random_state=42) # You can use 'entropy' instead of 'gini'
dt_model.fit(X_train, y_train)

Accuracy: 0.83

Confusion Matrix:
[[89 11 0 0]
[ 6 82 12 0]
[ 0 17 75 8]
[ 0 0 14 86]]

Classification Report:
precision recall f1-score support

0 0.94 0.89 0.91 100

1 0.75 0.82 0.78 100
2 0.74 0.75 0.75 100
3 0.91 0.86 0.89 100

accuracy 0.83 400

macro avg 0.83 0.83 0.83 400
weighted avg 0.83 0.83 0.83 400

y_pred_rf = rf_model.predict(X_test)
y_prob_rf = rf_model.predict_proba(X_test)

# Accuracy Score
accuracy = accuracy_score(y_test, y_pred_rf)
print(f"Accuracy: {accuracy:.4f}")

# Classification Report
print("Classification Report:\n", classification_report(y_test,
y_pred_rf))

# Confusion Matrix
print("Confusion Matrix:\n", confusion_matrix(y_test, y_pred_rf))

Accuracy: 0.8900
Classification Report:
precision recall f1-score support

0 0.95 0.95 0.95 100

1 0.82 0.84 0.83 100
2 0.84 0.82 0.83 100
3 0.95 0.95 0.95 100

accuracy 0.89 400

macro avg 0.89 0.89 0.89 400
weighted avg 0.89 0.89 0.89 400

Confusion Matrix:
[[95 5 0 0]
[ 5 84 11 0]
[ 0 13 82 5]
[ 0 0 5 95]]

# --- SVM Model with Probability Enabled ---

svm_model = SVC(kernel='rbf', C=1.0, gamma='scale', probability=True)
# Enable predict_proba
svm_model.fit(X_train, y_train)
y_pred_svm = svm_model.predict(X_test)
y_prob_svm = svm_model.predict_proba(X_test) # Now this will work
without error

print("\n--- SVM Results ---")

print("Accuracy:", accuracy_score(y_test, y_pred_svm))
print(confusion_matrix(y_test, y_pred_svm))
print(classification_report(y_test, y_pred_svm))

--- SVM Results ---

Accuracy: 0.9575
[[100 0 0 0]
[ 2 97 1 0]
[ 0 7 89 4]
[ 0 0 3 97]]
precision recall f1-score support

0 0.98 1.00 0.99 100

1 0.93 0.97 0.95 100
2 0.96 0.89 0.92 100
3 0.96 0.97 0.97 100

accuracy 0.96 400

macro avg 0.96 0.96 0.96 400
weighted avg 0.96 0.96 0.96 400

# --- KNN Model ---

knn_model = KNeighborsClassifier(n_neighbors=5)
knn_model.fit(X_train, y_train)
y_pred_knn = knn_model.predict(X_test)
y_prob_knn = knn_model.predict_proba(X_test)
print("\n--- KNN Results ---")
print("Accuracy:", accuracy_score(y_test, y_pred_knn))
print(confusion_matrix(y_test, y_pred_knn))
print(classification_report(y_test, y_pred_knn))

--- KNN Results ---

Accuracy: 0.935
[[99 1 0 0]
[ 2 93 5 0]
[ 0 7 87 6]
[ 0 0 5 95]]
precision recall f1-score support

0 0.98 0.99 0.99 100

1 0.92 0.93 0.93 100
2 0.90 0.87 0.88 100
3 0.94 0.95 0.95 100

accuracy 0.94 400

macro avg 0.93 0.94 0.93 400
weighted avg 0.93 0.94 0.93 400

# Challenges in Traditional Methods:

# Feature Engineering Complexity

# Traditional models like Decision Trees and KNN rely heavily on

manual feature selection.

# Feature importance needs to be analyzed carefully to avoid

irrelevant or redundant features.

# Scalability Issues

# KNN struggles with large datasets due to its computational cost

(O(n^2)) when making predictions.

# Decision Trees may become too deep, leading to overfitting.

# Hyperparameter Sensitivity

# High Computational Cost for Some Models

# SVM and KNN are computationally expensive for large datasets.

# Grid Search for Hyperparameter tuning can be slow without

optimization techniques.

# Model Training and Testing in ML

# Data Preprocessing – Clean and prepare data by handling missing
values, encoding categorical features, and scaling numerical data.

# Train-Test Split – Divide data into training (70-80%) and testing

(20-30%) sets to ensure proper model evaluation.

# Model Training – The model learns patterns from the training data by
adjusting its parameters based on the input features and target
variable.

# Model Testing – The trained model is tested on unseen data (test

set) to evaluate its ability to generalize to new examples.

# Performance Evaluation – The model is assessed using metrics such as

accuracy, precision, recall, F1-score, AUC-ROC, and confusion matrix
to determine its effectiveness.

# Metrics for Evaluating ML Algorithms

# Classification Metrics (for Categorical Targets)
# Accuracy – Measures overall correctness (suitable for balanced
datasets).

# Precision – Measures correctness of positive predictions (useful for

imbalanced datasets).

# Recall (Sensitivity) – Measures how well actual positives are

detected.

# F1-Score – Harmonic mean of precision and recall (best for

imbalanced classes).

# Confusion Matrix – Shows true positives, true negatives, false

positives, and false negatives.

# AUC-ROC (Area Under Curve – Receiver Operating Characteristic) –

Evaluates classification ability across thresholds.

# Binarizing the test set labels only (AFTER splitting)

y_bin_test = label_binarize(y_test, classes=[0, 1, 2, 3])

# --- Accuracy and AUC Calculation ---

models = ['Decision Tree', 'Random Forest', 'SVM', 'KNN']
accuracies = [
accuracy_score(y_test, y_pred),
accuracy_score(y_test, y_pred_rf),
accuracy_score(y_test, y_pred_svm),
accuracy_score(y_test, y_pred_knn)
]

# Multi-class AUC scores using the TEST SET ONLY

auc_scores = [
roc_auc_score(y_bin_test, y_prob_dt, multi_class='ovr'),
roc_auc_score(y_bin_test, y_prob_rf, multi_class='ovr'),
roc_auc_score(y_bin_test, y_prob_svm, multi_class='ovr'),
roc_auc_score(y_bin_test, y_prob_knn, multi_class='ovr')
]

# --- Plotting Accuracy and AUC ---

fig, ax = plt.subplots(1, 2, figsize=(14, 6))

# Accuracy Bar Graph

ax[0].bar(models, accuracies, color='skyblue')
ax[0].set_title('Model Accuracy Comparison')
ax[0].set_ylabel('Accuracy')
ax[0].set_ylim(0, 1)

# AUC Bar Graph

ax[1].bar(models, auc_scores, color='lightgreen')
ax[1].set_title('Model AUC Comparison')
ax[1].set_ylabel('AUC Score')
ax[1].set_ylim(0, 1)

plt.tight_layout()
plt.show()

# --- Print Model Performances ---

print("\n--- Model Performance ---")
for i, model in enumerate(models):
print(f"{model}: Accuracy = {accuracies[i]:.4f}, AUC =
{auc_scores[i]:.4f}")

--- Model Performance ---

Decision Tree: Accuracy = 0.8300, AUC = 0.9459
Random Forest: Accuracy = 0.8900, AUC = 0.9793
SVM: Accuracy = 0.9575, AUC = 0.9988
KNN: Accuracy = 0.9350, AUC = 0.9914

# Recommended Model for Mobile Price Prediction

# Based on your evaluation metrics (accuracy and AUC-ROC), the Support
Vector Machine (SVM) model performs the best with:

# Accuracy: 95.75%

# AUC-ROC: 0.9988

# Best Model: SVM (with RBF Kernel)

# Pros:

# High accuracy and robustness in high-dimensional spaces.

# Works well with non-linear data using the RBF kernel.

# Good generalization with proper hyperparameter tuning.

# Cons:

# Computationally expensive for very large datasets.

Sharp MX M654N MX M754N Service Manual
90% (10)
Sharp MX M654N MX M754N Service Manual
484 pages
Catalogue BRITPARTS PDF
100% (2)
Catalogue BRITPARTS PDF
172 pages
MRP User Exit Enhancement M61X0001
No ratings yet
MRP User Exit Enhancement M61X0001
6 pages
Schedule of Rates Electrical&Mechanical 2021-2022
0% (1)
Schedule of Rates Electrical&Mechanical 2021-2022
504 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
8 pages
Garishav Basra 102103129 2CO5
No ratings yet
Garishav Basra 102103129 2CO5
8 pages
Experiment 7
No ratings yet
Experiment 7
3 pages
ML Lab Programs 2
No ratings yet
ML Lab Programs 2
16 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
7 pages
ML Mini Project
No ratings yet
ML Mini Project
9 pages
6 - 2 - SVMS, - Randon - Forests - and - KNN - Ipynb - Colaboratory
No ratings yet
6 - 2 - SVMS, - Randon - Forests - and - KNN - Ipynb - Colaboratory
4 pages
ML Assignment 4
No ratings yet
ML Assignment 4
7 pages
ML Mini Project
No ratings yet
ML Mini Project
9 pages
Machine Learnin1
100% (1)
Machine Learnin1
41 pages
Ann Experiential Learning
No ratings yet
Ann Experiential Learning
43 pages
Dsbda 10
No ratings yet
Dsbda 10
5 pages
Print Version
No ratings yet
Print Version
29 pages
WINSEM2024-25 CSE3008 ELA AP2024254001161 2025-02-13 Reference-Material-I
No ratings yet
WINSEM2024-25 CSE3008 ELA AP2024254001161 2025-02-13 Reference-Material-I
2 pages
ML 11 Decision Trees
No ratings yet
ML 11 Decision Trees
4 pages
Multi - Class - Scaled - Down - Data - Colaboratory
No ratings yet
Multi - Class - Scaled - Down - Data - Colaboratory
2 pages
Machine Learning Final Report
No ratings yet
Machine Learning Final Report
8 pages
KNN
No ratings yet
KNN
4 pages
Practical No 6
No ratings yet
Practical No 6
3 pages
'Classified Data': Import As Import As Import As Import As
No ratings yet
'Classified Data': Import As Import As Import As Import As
3 pages
Untitled0.ipynb - Colaboratory
No ratings yet
Untitled0.ipynb - Colaboratory
5 pages
ML Functions
No ratings yet
ML Functions
12 pages
ML Using Python Programs
No ratings yet
ML Using Python Programs
12 pages
Fashion MNIST-6
No ratings yet
Fashion MNIST-6
10 pages
Bi 6 New
No ratings yet
Bi 6 New
6 pages
AIML Lab 3 4
No ratings yet
AIML Lab 3 4
5 pages
CCD - Ipynb - Colab
No ratings yet
CCD - Ipynb - Colab
6 pages
Machine Learning
No ratings yet
Machine Learning
3 pages
G 203008076 - 4 - Christhian Quiñonez - Ex1 - 2 A PDF
No ratings yet
G 203008076 - 4 - Christhian Quiñonez - Ex1 - 2 A PDF
20 pages
ML - Labtask5.ipynb - K - Colab
No ratings yet
ML - Labtask5.ipynb - K - Colab
8 pages
ML Keshav
No ratings yet
ML Keshav
23 pages
ML Lab Manual
No ratings yet
ML Lab Manual
6 pages
TASK 8: Deploy Support Vector Machine, Apriori Algorithm: BTCS619-18
No ratings yet
TASK 8: Deploy Support Vector Machine, Apriori Algorithm: BTCS619-18
5 pages
ML Lab3 PGM
No ratings yet
ML Lab3 PGM
3 pages
EX - NO:3: Algorithm
No ratings yet
EX - NO:3: Algorithm
11 pages
Aiml Nts
No ratings yet
Aiml Nts
33 pages
Nilay Debnath CSE 06607735
No ratings yet
Nilay Debnath CSE 06607735
22 pages
Prac7 23bme053
No ratings yet
Prac7 23bme053
2 pages
Practical 10
No ratings yet
Practical 10
4 pages
Import As Import As Import As Import As From Import
No ratings yet
Import As Import As Import As Import As From Import
3 pages
456 ML Lab
No ratings yet
456 ML Lab
7 pages
ML Lab-1
No ratings yet
ML Lab-1
32 pages
Case Study - Classifier
No ratings yet
Case Study - Classifier
5 pages
E.X No.6 Build D: Ecision Trees and Random Forests
No ratings yet
E.X No.6 Build D: Ecision Trees and Random Forests
4 pages
I Avaliação Parcial - 25.0 PTS - Gabarito
No ratings yet
I Avaliação Parcial - 25.0 PTS - Gabarito
9 pages
ML 2 16
No ratings yet
ML 2 16
6 pages
Import As Import As Import As From Import From Import From Import From Import From Import
No ratings yet
Import As Import As Import As From Import From Import From Import From Import From Import
4 pages
DL2.ipynb - Colab
No ratings yet
DL2.ipynb - Colab
3 pages
KNN Practical Debasmita Datta
No ratings yet
KNN Practical Debasmita Datta
6 pages
Implementing KNN Algorithm: Importing Libraries
No ratings yet
Implementing KNN Algorithm: Importing Libraries
6 pages
ML101 Graded Assignment 2.ipynb - Colab
No ratings yet
ML101 Graded Assignment 2.ipynb - Colab
6 pages
Ml-Exp-2 - Jupyter Notebook
No ratings yet
Ml-Exp-2 - Jupyter Notebook
2 pages
Classification - With - Decision - Tree - MarketingData - Jupyter Notebook
No ratings yet
Classification - With - Decision - Tree - MarketingData - Jupyter Notebook
9 pages
Rev Insurance Business Report
No ratings yet
Rev Insurance Business Report
4 pages
Decision Tree
No ratings yet
Decision Tree
9 pages
Week10 - Colab
No ratings yet
Week10 - Colab
3 pages
3c) Cross Validation
No ratings yet
3c) Cross Validation
6 pages
Confusion Matrix
No ratings yet
Confusion Matrix
5 pages
Lec-2 Logic Gates
No ratings yet
Lec-2 Logic Gates
18 pages
Lec 5 Complements
No ratings yet
Lec 5 Complements
42 pages
Indira Gandhi Delhi Technical University for Women Mail - Welcome to Cisco - Your First Day Information - Interns
No ratings yet
Indira Gandhi Delhi Technical University for Women Mail - Welcome to Cisco - Your First Day Information - Interns
1 page
Mid Term Marks
No ratings yet
Mid Term Marks
6 pages
Maintenance Performance and Replacement
No ratings yet
Maintenance Performance and Replacement
10 pages
Daa 1
No ratings yet
Daa 1
25 pages
Lec 1 Introduction
No ratings yet
Lec 1 Introduction
24 pages
Production System
No ratings yet
Production System
25 pages
Maintenance
No ratings yet
Maintenance
38 pages
Process and Product Life Cycle
No ratings yet
Process and Product Life Cycle
11 pages
LeetCode SQL 50 Questions Challenge
No ratings yet
LeetCode SQL 50 Questions Challenge
41 pages
ML Lab 9
No ratings yet
ML Lab 9
2 pages
Unit 3 Notes-1
No ratings yet
Unit 3 Notes-1
39 pages
Biol 120 2025 01 14
No ratings yet
Biol 120 2025 01 14
7 pages
Idea Presentation Format SIH2027College v1
No ratings yet
Idea Presentation Format SIH2027College v1
4 pages
Proposed Date Sheet End - Term - May 2025
No ratings yet
Proposed Date Sheet End - Term - May 2025
9 pages
17 Software Testing - Introduction 2024
No ratings yet
17 Software Testing - Introduction 2024
94 pages
20 Software Maintenance 2024
No ratings yet
20 Software Maintenance 2024
46 pages
Graph Theory New
No ratings yet
Graph Theory New
22 pages
19 Software Economics 2024
No ratings yet
19 Software Economics 2024
58 pages
ML Project 2
No ratings yet
ML Project 2
19 pages
DM All Unit
No ratings yet
DM All Unit
289 pages
Notes - B
No ratings yet
Notes - B
43 pages
Poster
No ratings yet
Poster
1 page
What To Expect - Summer Internship Virtual Assessment Centre
No ratings yet
What To Expect - Summer Internship Virtual Assessment Centre
2 pages
Unit 3 - Decision Making Under Uncertainty in AI
No ratings yet
Unit 3 - Decision Making Under Uncertainty in AI
25 pages
Presentation Template
No ratings yet
Presentation Template
6 pages
Internship Report Format
No ratings yet
Internship Report Format
16 pages
Mse Unit-3
No ratings yet
Mse Unit-3
18 pages
Summer Internship - Presentation Activity - Candidate Instructions 2025
No ratings yet
Summer Internship - Presentation Activity - Candidate Instructions 2025
8 pages
Archimedes Screw Design An Analytical Mo
No ratings yet
Archimedes Screw Design An Analytical Mo
14 pages
GR9277 Solutions
No ratings yet
GR9277 Solutions
126 pages
Catálogo LG
No ratings yet
Catálogo LG
149 pages
STAR CCM Design Manager Spotlight PDF
No ratings yet
STAR CCM Design Manager Spotlight PDF
69 pages
Eclipse Tutorial3
No ratings yet
Eclipse Tutorial3
26 pages
Video Lectures - Assignment Questions - Lecture 2 Signals and Systems - Part I PDF Format
No ratings yet
Video Lectures - Assignment Questions - Lecture 2 Signals and Systems - Part I PDF Format
7 pages
"Minor Research Project": Study The Perception of Indorians Towards Existing Traffic System in Indore City
No ratings yet
"Minor Research Project": Study The Perception of Indorians Towards Existing Traffic System in Indore City
17 pages
Lecture 5 Bisares
0% (1)
Lecture 5 Bisares
6 pages
Key Windows 8 RTM Mak Keys
No ratings yet
Key Windows 8 RTM Mak Keys
3 pages
Planview Enterprise Agile
No ratings yet
Planview Enterprise Agile
2 pages
P P W V: MEEN 310-Thermodynamics II Exam 1 Equation Sheet
No ratings yet
P P W V: MEEN 310-Thermodynamics II Exam 1 Equation Sheet
5 pages
Components of Computer: Hardware Software Hardware
No ratings yet
Components of Computer: Hardware Software Hardware
9 pages
Make of Materials
No ratings yet
Make of Materials
2 pages
Selfstudys Com File
No ratings yet
Selfstudys Com File
5 pages
Annexure-2 Application Form For Installation of Roof-Top Solar PV System Under Net Metering Arrangement
No ratings yet
Annexure-2 Application Form For Installation of Roof-Top Solar PV System Under Net Metering Arrangement
2 pages
Transportation + Assignment Models
No ratings yet
Transportation + Assignment Models
46 pages
Scrum Questions
100% (4)
Scrum Questions
16 pages
50 TOP MEASUREMENT and INSTRUMENTS Objective Questions and Answers
33% (3)
50 TOP MEASUREMENT and INSTRUMENTS Objective Questions and Answers
16 pages
Seismic Requirement of Power Transformer
No ratings yet
Seismic Requirement of Power Transformer
10 pages
4x4x4 LED Cube With Charlieplexing: Food Living Outside Play Technology Workshop
No ratings yet
4x4x4 LED Cube With Charlieplexing: Food Living Outside Play Technology Workshop
14 pages
PB Mastermatrix 106 CSB ES X3
No ratings yet
PB Mastermatrix 106 CSB ES X3
8 pages
CRC Electrode Potentials
No ratings yet
CRC Electrode Potentials
10 pages
Part Lists DC236-286-336
No ratings yet
Part Lists DC236-286-336
91 pages
Themodynamics II
0% (1)
Themodynamics II
3 pages
SG15 B6 英文单页
No ratings yet
SG15 B6 英文单页
2 pages
Bilal CV
No ratings yet
Bilal CV
3 pages

ML Lab 8

Uploaded by

ML Lab 8

Uploaded by

Sugandh 06701192023

battery_power blue clock_speed dual_sim fc four_g int_memory

mobile_wt n_cores ... px_height px_width ram sc_h sc_w

# Split features and target

X_train, X_test, y_train, y_test = train_test_split(X, y,

dt_model = DecisionTreeClassifier(criterion='entropy', max_depth=5,

0 0.96 0.90 0.93 100

accuracy 0.87 400

rf_model = RandomForestClassifier(n_estimators=100, random_state=42,

dt_model = DecisionTreeClassifier(criterion='gini', max_depth=5,

0 0.94 0.89 0.91 100

accuracy 0.83 400

0 0.95 0.95 0.95 100

accuracy 0.89 400

# --- SVM Model with Probability Enabled ---

print("\n--- SVM Results ---")

--- SVM Results ---

0 0.98 1.00 0.99 100

accuracy 0.96 400

# --- KNN Model ---

--- KNN Results ---

0 0.98 0.99 0.99 100

accuracy 0.94 400

# Challenges in Traditional Methods:

# Traditional models like Decision Trees and KNN rely heavily on

# Feature importance needs to be analyzed carefully to avoid

# KNN struggles with large datasets due to its computational cost

# Decision Trees may become too deep, leading to overfitting.

# High Computational Cost for Some Models

# SVM and KNN are computationally expensive for large datasets.

# Grid Search for Hyperparameter tuning can be slow without

# Model Training and Testing in ML

# Train-Test Split – Divide data into training (70-80%) and testing

# Model Testing – The trained model is tested on unseen data (test

# Performance Evaluation – The model is assessed using metrics such as

# Metrics for Evaluating ML Algorithms

# Precision – Measures correctness of positive predictions (useful for

# Recall (Sensitivity) – Measures how well actual positives are

# F1-Score – Harmonic mean of precision and recall (best for

# Confusion Matrix – Shows true positives, true negatives, false

# AUC-ROC (Area Under Curve – Receiver Operating Characteristic) –

# Binarizing the test set labels only (AFTER splitting)

# --- Accuracy and AUC Calculation ---

# Multi-class AUC scores using the TEST SET ONLY

# --- Plotting Accuracy and AUC ---

# Accuracy Bar Graph

# AUC Bar Graph

# --- Print Model Performances ---

--- Model Performance ---

# Recommended Model for Mobile Price Prediction

# Best Model: SVM (with RBF Kernel)

# High accuracy and robustness in high-dimensional spaces.

# Works well with non-linear data using the RBF kernel.

# Good generalization with proper hyperparameter tuning.

# Computationally expensive for very large datasets.

You might also like