0% found this document useful (0 votes)

17 views5 pages

Assignment ML

The document discusses using machine learning models like SVM, Naive Bayes and Random Forest classifiers to predict diseases by analyzing medical data. It loads and preprocesses a disease dataset, trains and tests the models on it, and evaluates their accuracy on test data using metrics like confusion matrix.

Uploaded by

vtu19941

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views5 pages

Assignment ML

Uploaded by

vtu19941

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

SCHOOL OF COMPUTING

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

ASSIGNMENT

Programme Name : B. Tech CSE

Course Code / Course Name : 10211CS120/ Machine Learning
Year / Semester : 2023-2024 / Summer
VTU Number : 19941
Register Number : 21UECM0046
Name : CHENNA ANAND
Slot : S6+L18
Faculty : Dr. T. Kujani

Use Case: DISEASE PREDICTION

Objective: Disease prediction holds immense potential for transforming healthcare.

Its objectives range from enabling early detection of potential health risks through analyzing
factors like genetics and initial symptoms, to improving diagnostic accuracy by leveraging
vast amounts of medical data. Furthermore, disease prediction models can categorize
individuals based on their susceptibility to specific diseases, allowing for targeted
interventions and personalized healthcare plans. This not only empowers preventative
measures but also optimizes resource allocation within healthcare systems, ensuring
preparedness for outbreaks and better patient outcomes. Ultimately, disease prediction paves
the way for a future of personalized medicine, where treatment plans are tailored to each
individual's unique health profile.

Program:
# Importing libraries
import numpy as np
import pandas as pd
from scipy.stats import mode
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.preprocessing import LabelEncoder
from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.svm import SVC
from sklearn.naive_bayes import GaussianNB
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, confusion_matrix

%matplotlib inline
# Reading the train.csv by removing the
# last column since it's an empty column
DATA_PATH = "dataset/Training.csv"
data = pd.read_csv(DATA_PATH).dropna(axis = 1)

# Checking whether the dataset is balanced or not

disease_counts = data["prognosis"].value_counts()
temp_df = pd.DataFrame({
"Disease": disease_counts.index,
"Counts": disease_counts.values
})

plt.figure(figsize = (18,8))
sns.barplot(x = "Disease", y = "Counts", data = temp_df)
plt.xticks(rotation=90)
plt.show()

X = data.iloc[:,:-1]
y = data.iloc[:, -1]
X_train, X_test, y_train, y_test =train_test_split(
X, y, test_size = 0.2, random_state = 24)

print(f"Train: {X_train.shape}, {y_train.shape}")

print(f"Test: {X_test.shape}, {y_test.shape}")

# Training and testing SVM Classifier

svm_model = SVC()
svm_model.fit(X_train, y_train)
preds = svm_model.predict(X_test)

print(f"Accuracy on train data by SVM Classifier\

: {accuracy_score(y_train, svm_model.predict(X_train))*100}")

print(f"Accuracy on test data by SVM Classifier\

# Training and testing Naive Bayes Classifier

nb_model = GaussianNB()
nb_model.fit(X_train, y_train)
preds = nb_model.predict(X_test)
print(f"Accuracy on train data by Naive Bayes Classifier\
: {accuracy_score(y_train, nb_model.predict(X_train))*100}")

print(f"Accuracy on test data by Naive Bayes Classifier\

: {accuracy_score(y_test, preds)*100}")
cf_matrix = confusion_matrix(y_test, preds)
plt.figure(figsize=(12,8))
sns.heatmap(cf_matrix, annot=True)
plt.title("Confusion Matrix for Naive Bayes Classifier on Test Data")
plt.show()
# Training and testing Random Forest Classifier
rf_model = RandomForestClassifier(random_state=18)
rf_model.fit(X_train, y_train)
preds = rf_model.predict(X_test)
print(f"Accuracy on train data by Random Forest Classifier\
: {accuracy_score(y_train, rf_model.predict(X_train))*100}")

print(f"Accuracy on test data by Random Forest Classifier\

: {accuracy_score(y_test, preds)*100}")

cf_matrix = confusion_matrix(y_test, preds)

plt.figure(figsize=(12,8))
sns.heatmap(cf_matrix, annot=True)
plt.title("Confusion Matrix for Random Forest Classifier on Test Data")
plt.show()

Output:

FAFD Questions
90% (10)
FAFD Questions
89 pages
9.structural Behaviour and Design Criteria of Concrete Box-Girder Bridges - JRC
No ratings yet
9.structural Behaviour and Design Criteria of Concrete Box-Girder Bridges - JRC
16 pages
FDP Manual - Petrel Dynamic Modeling PDF
83% (6)
FDP Manual - Petrel Dynamic Modeling PDF
28 pages
Wma14-01-June-2023 Solved
50% (2)
Wma14-01-June-2023 Solved
32 pages
Products Barcodes 2024-04-05T10 38 12.851448Z
No ratings yet
Products Barcodes 2024-04-05T10 38 12.851448Z
16 pages
Firmenliste Katar DT DLD
No ratings yet
Firmenliste Katar DT DLD
1 page
Additional Program
No ratings yet
Additional Program
573 pages
Nand 2 Nor 2
No ratings yet
Nand 2 Nor 2
19 pages
Bell ADT D-Series General Info
100% (1)
Bell ADT D-Series General Info
32 pages
EM 300 G3 Manual 12 2016 EN
100% (1)
EM 300 G3 Manual 12 2016 EN
60 pages
PRJ-Parkinsons Disease Prediction
No ratings yet
PRJ-Parkinsons Disease Prediction
16 pages
SUMMARY
No ratings yet
SUMMARY
16 pages
PythonHeartDisease FirstReview
No ratings yet
PythonHeartDisease FirstReview
20 pages
Assignment 1
No ratings yet
Assignment 1
17 pages
SVM
No ratings yet
SVM
12 pages
Null 1
No ratings yet
Null 1
2 pages
MN
No ratings yet
MN
1 page
Dhyey V Desai Supervised Machine Learning Approaches
No ratings yet
Dhyey V Desai Supervised Machine Learning Approaches
5 pages
PythonHeartDisease FirstReview
No ratings yet
PythonHeartDisease FirstReview
4 pages
ML Lab6
No ratings yet
ML Lab6
4 pages
SVM - Classification - Jupyter Notebook
No ratings yet
SVM - Classification - Jupyter Notebook
2 pages
Heart Dis
No ratings yet
Heart Dis
13 pages
Meaningful Predictive Modeling Week-4 Assignment Cancer Disease Prediction
No ratings yet
Meaningful Predictive Modeling Week-4 Assignment Cancer Disease Prediction
6 pages
Schematic Nrf24l01+Pa+Lna
100% (1)
Schematic Nrf24l01+Pa+Lna
2 pages
Regression - Naive - SVM
No ratings yet
Regression - Naive - SVM
3 pages
Ex 12
No ratings yet
Ex 12
4 pages
ML W8 Merged
No ratings yet
ML W8 Merged
27 pages
Ai in HC - 2
No ratings yet
Ai in HC - 2
9 pages
PDF To Jpeg
No ratings yet
PDF To Jpeg
7 pages
Review
No ratings yet
Review
5 pages
ML0101EN Clas SVM Cancer Py v1
No ratings yet
ML0101EN Clas SVM Cancer Py v1
10 pages
Disease Prediction System
No ratings yet
Disease Prediction System
9 pages
Bacdeaf 23032025 115708 Split 1
No ratings yet
Bacdeaf 23032025 115708 Split 1
37 pages
IEEE Conference Team ATOM
No ratings yet
IEEE Conference Team ATOM
5 pages
Disease Prediction Using ML
No ratings yet
Disease Prediction Using ML
20 pages
Disease Prediction Using Patient Data
No ratings yet
Disease Prediction Using Patient Data
7 pages
The TOEFL ITP Tests at A Glance
No ratings yet
The TOEFL ITP Tests at A Glance
4 pages
Processes 11 01210
No ratings yet
Processes 11 01210
31 pages
Diseasereport
No ratings yet
Diseasereport
18 pages
Unit 4
No ratings yet
Unit 4
15 pages
1 KNN - Jupyter Notebook
No ratings yet
1 KNN - Jupyter Notebook
3 pages
20MIS7043 (LAB 7) .Ipynb Colaboratory
No ratings yet
20MIS7043 (LAB 7) .Ipynb Colaboratory
4 pages
HEART
No ratings yet
HEART
15 pages
20MIS7095 (LAB 7) .Ipynb Colaboratory
No ratings yet
20MIS7095 (LAB 7) .Ipynb Colaboratory
4 pages
MID 039 - CID 1846 - FMI 09: Pantalla Anterior
No ratings yet
MID 039 - CID 1846 - FMI 09: Pantalla Anterior
6 pages
Multi-Disease Prediction With Machine Learning
No ratings yet
Multi-Disease Prediction With Machine Learning
7 pages
DWDM Lab 3
No ratings yet
DWDM Lab 3
10 pages
B24 ML Exp-3
No ratings yet
B24 ML Exp-3
10 pages
Disease Prediction With Android Application: Shagun Patial, Shashwat Agarwal, Shruti Pathak, Prabhat Verma
No ratings yet
Disease Prediction With Android Application: Shagun Patial, Shashwat Agarwal, Shruti Pathak, Prabhat Verma
6 pages
Final Presentation GDP
No ratings yet
Final Presentation GDP
21 pages
ML Model Report
No ratings yet
ML Model Report
8 pages
Synopsis MLD Ps
No ratings yet
Synopsis MLD Ps
25 pages
Project Synopsis - Machine Learning in Disease Prediction
No ratings yet
Project Synopsis - Machine Learning in Disease Prediction
5 pages
Disease Pred Report
No ratings yet
Disease Pred Report
42 pages
Diabeties SVM
No ratings yet
Diabeties SVM
2 pages
Diseaseppt
No ratings yet
Diseaseppt
18 pages
t560 - Engineering Science n2 QP Nov 2015final
No ratings yet
t560 - Engineering Science n2 QP Nov 2015final
12 pages
Team No-7
No ratings yet
Team No-7
12 pages
Disease Prediction Based On Symptoms
No ratings yet
Disease Prediction Based On Symptoms
16 pages
AI ML - Cycle 2 Programs
No ratings yet
AI ML - Cycle 2 Programs
15 pages
A Disease Prediction Model Using Naive Bayes and Keras Based Neural Networks
No ratings yet
A Disease Prediction Model Using Naive Bayes and Keras Based Neural Networks
8 pages
Boo PH 3
No ratings yet
Boo PH 3
11 pages
(IJCST-V13I2P2) :seema Saroj, Sakshi Sahu, Sanjana Patel, Suraj Sahu
No ratings yet
(IJCST-V13I2P2) :seema Saroj, Sakshi Sahu, Sanjana Patel, Suraj Sahu
2 pages
Article Eda
No ratings yet
Article Eda
7 pages
Multiple Diseases
No ratings yet
Multiple Diseases
15 pages
Maxbox - Starter67 Machine Learning
No ratings yet
Maxbox - Starter67 Machine Learning
7 pages
Major
No ratings yet
Major
15 pages
Machine File
No ratings yet
Machine File
27 pages
No 11
No ratings yet
No 11
8 pages
Exploring Social Psychology 8th Edition Myers Full Download
No ratings yet
Exploring Social Psychology 8th Edition Myers Full Download
405 pages
Base Paper
No ratings yet
Base Paper
4 pages
Final Research Paper
No ratings yet
Final Research Paper
5 pages
Activity 4 Worlds Greatest Strategists
No ratings yet
Activity 4 Worlds Greatest Strategists
3 pages
Modernism and Post Modernism in Literature
No ratings yet
Modernism and Post Modernism in Literature
16 pages
Heart Disease Prediction
No ratings yet
Heart Disease Prediction
17 pages
CV
No ratings yet
CV
4 pages
Disease Prediction Synopsis
No ratings yet
Disease Prediction Synopsis
3 pages
Ratanben C Raolji
No ratings yet
Ratanben C Raolji
2 pages
Activity 2 - Qualitative Test For The Presence of Organic Compounds
No ratings yet
Activity 2 - Qualitative Test For The Presence of Organic Compounds
5 pages
GMW 16443 Type 1: Adhesion Performance Requirements For Adhesive Backed Light Trim and Foam
No ratings yet
GMW 16443 Type 1: Adhesion Performance Requirements For Adhesive Backed Light Trim and Foam
10 pages
Sembulingam Physiology 1
No ratings yet
Sembulingam Physiology 1
15 pages
Lesson 200.6 Creating Reports and Dashboards
No ratings yet
Lesson 200.6 Creating Reports and Dashboards
63 pages
Authorization Form Panda Food
No ratings yet
Authorization Form Panda Food
3 pages
Chapter 22
No ratings yet
Chapter 22
54 pages
Rezgui-An Overview of Optical Fibers
No ratings yet
Rezgui-An Overview of Optical Fibers
8 pages
Geometric Annual Presentation
No ratings yet
Geometric Annual Presentation
12 pages
Erick Oliva
No ratings yet
Erick Oliva
6 pages
The Corporation
No ratings yet
The Corporation
4 pages
CYCLOPENTANE
No ratings yet
CYCLOPENTANE
2 pages
EDUCATION DATA MINING FOR PREDICTING STUDENTS’ PERFORMANCE
From Everand
EDUCATION DATA MINING FOR PREDICTING STUDENTS’ PERFORMANCE
Dr. GEETHA N DATA SCIENTIST, BENGALURU
No ratings yet

Assignment ML

Uploaded by

Assignment ML

Uploaded by

SCHOOL OF COMPUTING

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

Programme Name : B. Tech CSE

Use Case: DISEASE PREDICTION

Objective: Disease prediction holds immense potential for transforming healthcare.

# Checking whether the dataset is balanced or not

print(f"Train: {X_train.shape}, {y_train.shape}")

# Training and testing SVM Classifier

print(f"Accuracy on train data by SVM Classifier\

print(f"Accuracy on test data by SVM Classifier\

# Training and testing Naive Bayes Classifier

print(f"Accuracy on test data by Naive Bayes Classifier\

print(f"Accuracy on test data by Random Forest Classifier\

cf_matrix = confusion_matrix(y_test, preds)

You might also like