Female A S Breast Cancer Prediction Model

Uploaded by

sahilkoshriya82325

MODEL 2: Breast Cancer Prediction Using Python

Importing libraries

# importing libraries
import numpy
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns
# reading data from the file
df=pd.read_csv("data.csv")

df.head()


df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 569 entries, 0 to 568
Data columns (total 33 columns):
# Column Non-Null Count Dtype

0 id 569 non-null int64

1 diagnosis 569 non-null object
2 radius_mean 569 non-null float64
3 texture_mean 569 non-null float64
4 perimeter_mean 569 non-null float64
5 area_mean 569 non-null float64
6 smoothness_mean 569 non-null float64
7 compactness_mean 569 non-null float64
8 concavity_mean 569 non-null float64
9 concave points_mean 569 non-null float64
10 symmetry_mean 569 non-null float64
11 fractal_dimension_mean 569 non-null float64
12 radius_se 569 non-null float64
13 texture_se 569 non-null float64
14 perimeter_se 569 non-null float64
15 area_se 569 non-null float64
16 smoothness_se 569 non-null float64
17 compactness_se 569 non-null float64
18 concavity_se 569 non-null float64
19 concave points_se 569 non-null float64
20 symmetry_se 569 non-null float64
21 fractal_dimension_se 569 non-null float64
22 radius_worst 569 non-null float64
23 texture_worst 569 non-null float64
24 perimeter_worst 569 non-null float64
25 area_worst 569 non-null float64
26 smoothness_worst 569 non-null float64
27 compactness_worst 569 non-null float64
28 concavity_worst 569 non-null float64
29 concave points_worst 569 non-null float64
30 symmetry_worst 569 non-null float64
31 fractal_dimension_worst 569 non-null float64
32 Unnamed: 32 0 non-null float64
dtypes: float64(31), int64(1), object(1)
memory usage: 146.8+ KB

# return the size of dataset

df.shape

(569, 33)

# remove columns that contain null values (here the empty 'Unnamed: 32' column)

df=df.dropna(axis=1)

# shape of dataset after removing the null column

df.shape

(569, 32)

# describe the dataset

df.describe()


# get the count of malignant (M) and benign (B) diagnoses

df['diagnosis'].value_counts()

diagnosis
B 357
M 212
Name: count, dtype: int64

sns.countplot(df['diagnosis'],label="count")

<Axes: xlabel='count', ylabel='diagnosis'>

# label encoding (convert the diagnosis values M and B into 1 and 0)
from sklearn.preprocessing import LabelEncoder
labelencoder_Y = LabelEncoder()
df.iloc[:,1]=labelencoder_Y.fit_transform(df.iloc[:,1].values)

df.head()


sns.pairplot(df.iloc[:,1:5],hue="diagnosis")

<seaborn.axisgrid.PairGrid at 0x7af9d51ebac0>
# get the correlation
df.iloc[:,1:32].corr()


# visualize the correlation

plt.figure(figsize=(10,10))
sns.heatmap(df.iloc[:,1:10].corr(),annot=True,fmt=".0%")

<Axes: >
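# split the dataset into independent (X) and dependent (Y) datasets
# note: 2:31 selects columns 2 through 30, so the last column (fractal_dimension_worst) is not used
X=df.iloc[:,2:31].values
Y=df.iloc[:,1].values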

# splitting the data into training and test datasets

from sklearn.model_selection import train_test_split
X_train,X_test,Y_train,Y_test=train_test_split(X,Y,test_size=0.20,random_state=0)
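
As a rough illustration of what an 80/20 hold-out split does, the sketch below shuffles row indices with a fixed seed and sets aside the test fraction. It uses only the standard library; `holdout_split` is a hypothetical helper, and its shuffle order will not match the one `train_test_split` produces for `random_state=0`. Rounding the test size up is consistent with the 114 test rows (out of 569) that appear in the reports further down.

```python
import math
import random

def holdout_split(n_rows, test_size=0.20, seed=0):
    """Return (train_idx, test_idx) for a shuffled hold-out split."""
    # Round the test size up, so 569 rows at 20% give 114 test rows.
    n_test = math.ceil(n_rows * test_size)
    indices = list(range(n_rows))
    # A seeded generator makes the shuffle reproducible between runs.
    random.Random(seed).shuffle(indices)
    return indices[n_test:], indices[:n_test]

train_idx, test_idx = holdout_split(569)
print(len(train_idx), len(test_idx))
```

Every row lands in exactly one of the two index lists, which is the property that keeps the test set unseen during training.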

# feature scaling (fit the scaler on the training data only, then reuse it for the test data)
from sklearn.preprocessing import StandardScaler
sc=StandardScaler()
X_train=sc.fit_transform(X_train)
X_test=sc.transform(X_test)
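
Standardization is usually learned from the training split alone, and the same mean and standard deviation are then reused for the test split, so that no test-set statistics leak into preprocessing. A minimal sketch of that idea in plain Python follows; the helper names and the numbers are made up for illustration and are not taken from `data.csv`.

```python
import statistics

def fit_scaler(column):
    """Learn the mean and population standard deviation from training values."""
    return statistics.fmean(column), statistics.pstdev(column)

def apply_scaler(column, mean, std):
    """Standardize values using statistics learned elsewhere."""
    return [(value - mean) / std for value in column]

train_radius = [10.0, 12.0, 14.0, 16.0]  # made-up training values
test_radius = [13.0, 19.0]               # made-up test values

mean, std = fit_scaler(train_radius)
train_scaled = apply_scaler(train_radius, mean, std)
# The test values reuse the training mean and std; nothing is re-fitted.
test_scaled = apply_scaler(test_radius, mean, std)
print(test_scaled)
```

A test value equal to the training mean maps to 0, while values outside the training range simply land further from 0.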

# models/ Algorithms

def models(X_train,Y_train):
    # logistic regression
    from sklearn.linear_model import LogisticRegression
    log=LogisticRegression(random_state=0)
    log.fit(X_train,Y_train)

    # decision tree
    from sklearn.tree import DecisionTreeClassifier
    tree=DecisionTreeClassifier(random_state=0,criterion="entropy")
    tree.fit(X_train,Y_train)

    # random forest
    from sklearn.ensemble import RandomForestClassifier
    forest=RandomForestClassifier(random_state=0,criterion="entropy",n_estimators=10)
    forest.fit(X_train,Y_train)

    # note: score() is called on the training data, so these are training accuracies
    print('[0]logistic regression accuracy:',log.score(X_train,Y_train))
    print('[1]Decision tree accuracy:',tree.score(X_train,Y_train))
    print('[2]Random forest accuracy:',forest.score(X_train,Y_train))

    return log,tree,forest

model=models(X_train,Y_train)
[0]logistic regression accuracy: 0.9472527472527472
[1]Decision tree accuracy: 1.0
[2]Random forest accuracy: 1.0

/usr/local/lib/python3.10/dist-packages/sklearn/linear_model/_logistic.py:469: ConvergenceWarning: lbfgs failed to converge (status=1):
STOP: TOTAL NO. of ITERATIONS REACHED LIMIT.

Increase the number of iterations (max_iter) or scale the data as shown in:
    https://scikit-learn.org/stable/modules/preprocessing.html
Please also refer to the documentation for alternative solver options:
    https://scikit-learn.org/stable/modules/linear_model.html#logistic-regression
  n_iter_i = _check_optimize_result(

# testing the models/result

from sklearn.metrics import accuracy_score

from sklearn.metrics import classification_report

for i in range(len(model)):
    print("Model",i)
    print(classification_report(Y_test,model[i].predict(X_test)))
    print('Accuracy : ',accuracy_score(Y_test,model[i].predict(X_test)))
Model 0
              precision    recall  f1-score   support

           0       0.97      0.91      0.94        43
           1       0.95      0.99      0.97        71

    accuracy                           0.96       114
   macro avg       0.96      0.95      0.95       114
weighted avg       0.96      0.96      0.96       114

Accuracy : 0.956140350877193
Model 1
              precision    recall  f1-score   support

           0       0.97      0.91      0.94        43
           1       0.95      0.99      0.97        71

    accuracy                           0.96       114
   macro avg       0.96      0.95      0.95       114
weighted avg       0.96      0.96      0.96       114

Accuracy : 0.956140350877193
Model 2
              precision    recall  f1-score   support

           0       0.98      0.93      0.95        43
           1       0.96      0.99      0.97        71

    accuracy                           0.96       114
   macro avg       0.97      0.96      0.96       114
weighted avg       0.97      0.96      0.96       114

Accuracy : 0.9649122807017544
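
To make the report columns concrete, the sketch below computes precision, recall, F1 and accuracy for one class directly from true and predicted labels. It is a plain-Python illustration with made-up labels, not the scikit-learn implementation; `binary_report` is a hypothetical helper name.

```python
def binary_report(y_true, y_pred, positive=1):
    """Compute precision, recall, F1 and accuracy for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)
    return {"precision": precision, "recall": recall,
            "f1": f1, "accuracy": accuracy}

y_true = [1, 1, 1, 0, 0, 1, 0, 1]  # made-up labels
y_pred = [1, 1, 0, 0, 1, 1, 0, 1]  # made-up predictions
report = binary_report(y_true, y_pred)
print(report)
```

Calling the helper with `positive=0` gives the row for the other class; the "support" column in the reports is simply the number of true samples of each class.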

# prediction of random-forest
pred=model[2].predict(X_test)
print('Predicted values:')
print(pred)
print('Actual values:')
print(Y_test)

Predicted values:
[1 0 0 1 1 0 0 0 0 1 1 0 1 0 1 0 1 1 1 0 1 1 0 1 1 1 1 1 1 0 1 1 1 1 1 1 0
 1 0 1 1 0 1 1 1 1 1 1 1 1 0 0 1 1 1 1 1 0 0 1 1 0 0 1 1 1 0 0 1 1 0 0 1 0
 1 1 1 1 1 1 0 1 1 0 0 0 0 0 1 1 1 1 1 1 1 1 0 0 1 0 0 1 0 0 1 1 1 0 1 1 0
 1 1 0]
Actual values:
204 1
70 0
131 0
431 1
540 1
..
486 1
75 0
249 1
238 1
265 0
Length: 114, dtype: int64

from joblib import dump

dump(model[2],"Feamle_Awareness_Breast_Cancer_prediction.joblib")

['Feamle_Awareness_Breast_Cancer_prediction.joblib']
