Decision Tree Code Explanation

The document explains the process of implementing a Decision Tree Classifier using the breast cancer dataset. It covers importing necessary libraries, loading the dataset, splitting the data into training and testing sets, training the model, making predictions, calculating accuracy, testing on a single sample, and visualizing the decision tree. The decision tree operates by asking a series of yes/no questions based on tumor characteristics to classify samples as malignant or benign.

Uploaded by

prajwalcg3

Decision Tree Code Explanation

1. Importing Libraries

python

import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score
from sklearn import tree

What it does: These lines import all the tools we need:

numpy (as np): For working with arrays and numerical data

matplotlib.pyplot (as plt): For creating graphs and visualizations

load_breast_cancer : A built-in dataset about breast cancer cases

train_test_split : Splits data into training and testing portions

DecisionTreeClassifier : The machine learning algorithm we'll use

accuracy_score : Measures how well our model performs

tree : Helps us visualize the decision tree

2. Loading the Dataset

python

data = load_breast_cancer()
X = data.data
y = data.target

What it does:

data = load_breast_cancer() : Loads the breast cancer dataset (569 samples with 30 features each)

X = data.data : Gets the input features (measurements like tumor size, texture, etc.)

y = data.target : Gets the labels (0 = malignant, 1 = benign)
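A quick way to confirm what was loaded is to print the array shape and the label names. The snippet below is a minimal sketch; the variable `class_counts` is a name introduced here for illustration and is not part of the original code.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer

data = load_breast_cancer()
X, y = data.data, data.target

print(X.shape)                 # (number of samples, number of features)
print(data.feature_names[:3])  # first few feature names
print(data.target_names)       # label names, indexed by class id (0, then 1)

# Count how many samples carry each label (illustrative helper variable).
class_counts = {
    str(name): int(count)
    for name, count in zip(data.target_names, np.bincount(y))
}
print(class_counts)
```

Checking the class counts early is useful because an imbalanced dataset makes plain accuracy harder to interpret.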

3. Splitting the Data

python

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

What it does:

Splits the data into training (80%) and testing (20%) sets

Training data: Used to teach the model

Testing data: Used to evaluate how well the model learned

random_state=42 : Ensures we get the same split every time (reproducibility)
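The effect of the split can be checked by printing the resulting shapes. The sketch below also shows one optional variation that the original code does not use: passing `stratify=y`, which asks `train_test_split` to keep the class ratio similar in both parts. The `_s` variable names are introduced here only for illustration.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)

# Same split as in the walkthrough: 80% training, 20% testing.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)
print(X_train.shape, X_test.shape)

# Optional variation (an assumption, not in the original code):
# stratify=y keeps the share of each class similar in both parts.
X_train_s, X_test_s, y_train_s, y_test_s = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y
)
print(f"Benign share, train: {y_train_s.mean():.3f}, test: {y_test_s.mean():.3f}")
```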

4. Creating and Training the Model

python

clf = DecisionTreeClassifier(random_state=42)
clf.fit(X_train, y_train)

What it does:

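clf = DecisionTreeClassifier(random_state=42) : Creates a decision tree classifier object; random_state=42 fixes the randomness used while building the tree, so the result is reproducible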

clf.fit(X_train, y_train) : Trains the model using the training data

The model learns patterns by choosing threshold questions on individual features, such as "Is the tumor radius > 15?", and arranging them into a tree of decisions
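After fitting, the learned tree can be inspected directly. The sketch below reads the depth, the number of leaves, and the question at the root node. It relies on the fitted estimator's `tree_` attribute; `root_feature` and `root_threshold` are names introduced here for illustration.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

data = load_breast_cancer()
X_train, X_test, y_train, y_test = train_test_split(
    data.data, data.target, test_size=0.2, random_state=42
)

clf = DecisionTreeClassifier(random_state=42)
clf.fit(X_train, y_train)

# Overall size of the learned tree.
print("Depth:", clf.get_depth())
print("Leaves:", clf.get_n_leaves())

# Node 0 is the root; tree_.feature holds a column index for each node.
root_feature = data.feature_names[clf.tree_.feature[0]]
root_threshold = clf.tree_.threshold[0]
print(f"Root question: is {root_feature} <= {root_threshold:.3f}?")
```

Note that scikit-learn phrases every question as "feature <= threshold", so the actual root question will likely differ from the illustrative one above.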

5. Making Predictions

python

y_pred = clf.predict(X_test)

What it does:

Uses the trained model to predict outcomes for the test data

y_pred contains the model's guesses (0 or 1) for each test sample
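To get a feel for the predictions, it can help to print a few of them next to the true labels. The sketch below also calls `predict_proba`, which the original code does not use; it returns one column per class with the estimated class probabilities.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)
clf = DecisionTreeClassifier(random_state=42).fit(X_train, y_train)

y_pred = clf.predict(X_test)
print("Predicted:", y_pred[:10])
print("Actual:   ", y_test[:10])

# One row per sample, one column per class (class 0, then class 1).
proba = clf.predict_proba(X_test)
print(proba[:3])
```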

6. Calculating Accuracy

python

accuracy = accuracy_score(y_test, y_pred)

print(f"Model Accuracy: {accuracy * 100:.2f}%")

What it does:

Compares the model's predictions ( y_pred ) with the actual answers ( y_test )

Calculates what percentage the model got right

Prints the accuracy as a percentage (e.g., "Model Accuracy: 93.86%")
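Accuracy is simply the fraction of matching labels, which can be verified by hand. The sketch below recomputes it with NumPy and adds a confusion matrix, which is not part of the original code but shows which kind of mistake the model makes. `manual_accuracy` and `cm` are names introduced here for illustration.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)
clf = DecisionTreeClassifier(random_state=42).fit(X_train, y_train)
y_pred = clf.predict(X_test)

# Accuracy by hand: the share of predictions equal to the true label.
manual_accuracy = np.mean(y_pred == y_test)
accuracy = accuracy_score(y_test, y_pred)
print(f"Manual: {manual_accuracy:.4f}, accuracy_score: {accuracy:.4f}")

# Rows are true classes, columns are predicted classes (0 = malignant, 1 = benign).
cm = confusion_matrix(y_test, y_pred)
print(cm)
```

In a medical setting the two off-diagonal cells are not equally costly, so the confusion matrix is often more informative than a single accuracy figure.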

7. Testing on a Single Sample

python

new_sample = np.array([X_test[0]])
prediction = clf.predict(new_sample)
prediction_class = "Benign" if prediction[0] == 1 else "Malignant"  # predict() returns an array, so take its first element
print(f"Predicted Class for the new sample: {prediction_class}")

What it does:

Takes the first sample from the test set

Makes a prediction for just this one sample

Converts the numerical prediction (0 or 1) to a readable label:
1 = "Benign" (not cancerous)
0 = "Malignant" (cancerous)

Prints the result
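An alternative way to write this step, sketched below, reshapes the sample into a single-row 2D array and looks the label up in `data.target_names` instead of hard-coding the strings. This is a variation for illustration, not the original author's code; `label` is a name introduced here.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

data = load_breast_cancer()
X_train, X_test, y_train, y_test = train_test_split(
    data.data, data.target, test_size=0.2, random_state=42
)
clf = DecisionTreeClassifier(random_state=42).fit(X_train, y_train)

# predict() expects a 2D array: one row per sample, one column per feature.
new_sample = X_test[0].reshape(1, -1)
prediction = clf.predict(new_sample)

# target_names is indexed by class id, so no manual 0/1 mapping is needed.
label = data.target_names[prediction[0]]
print(f"Predicted class for the new sample: {label}")
```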

8. Visualizing the Decision Tree

python

plt.figure(figsize=(12,8))
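# list(...) is used because some scikit-learn releases accept only lists for these arguments
tree.plot_tree(clf, filled=True, feature_names=list(data.feature_names), class_names=list(data.target_names))
plt.title("Decision Tree - Breast Cancer Dataset")
plt.show()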

What it does:

Creates a large figure (12x8 inches)

Draws the decision tree with:

filled=True : Colors the nodes based on the majority class

feature_names : Shows actual feature names instead of numbers

class_names : Shows "malignant" and "benign" instead of 0 and 1

Adds a title and displays the visualization
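When a plot is hard to read or no display is available, the same tree can be printed as indented text. The sketch below uses `export_text` from `sklearn.tree`, which the original code does not use; `max_depth=2` is an arbitrary choice here to keep the output short.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text

data = load_breast_cancer()
X_train, X_test, y_train, y_test = train_test_split(
    data.data, data.target, test_size=0.2, random_state=42
)
clf = DecisionTreeClassifier(random_state=42).fit(X_train, y_train)

# Print only the top levels of the tree as nested "feature <= threshold" rules.
rules = export_text(clf, feature_names=list(data.feature_names), max_depth=2)
print(rules)
```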

How the Decision Tree Works

The decision tree makes predictions by asking a series of yes/no questions about the tumor characteristics. For example:

1. "Is the mean radius ≤ 16.8?"

If yes → go left branch

If no → go right branch

2. Continue asking questions until reaching a final decision (leaf node)

Each path from the root to a leaf represents one classification rule, which makes the model interpretable and easy to understand!
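The question-by-question process described above can be reproduced by walking the fitted tree by hand. The sketch below is one way to do it using the arrays stored in `clf.tree_`; the helper `walk_tree` is a hypothetical function written for this illustration, not part of scikit-learn.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

data = load_breast_cancer()
X_train, X_test, y_train, y_test = train_test_split(
    data.data, data.target, test_size=0.2, random_state=42
)
clf = DecisionTreeClassifier(random_state=42).fit(X_train, y_train)


def walk_tree(clf, sample, feature_names):
    """Follow one sample from the root to a leaf, printing each question."""
    t = clf.tree_
    sample = sample.astype(np.float32)  # scikit-learn compares inputs as float32
    node = 0  # the root is always node 0
    while t.children_left[node] != -1:  # -1 marks a leaf (no children)
        idx = t.feature[node]
        go_left = sample[idx] <= t.threshold[node]
        answer = "yes, go left" if go_left else "no, go right"
        print(f"Is {feature_names[idx]} <= {t.threshold[node]:.3f}? {answer}")
        node = t.children_left[node] if go_left else t.children_right[node]
    # The leaf predicts the class that dominates its training samples.
    return clf.classes_[np.argmax(t.value[node][0])]


manual = walk_tree(clf, X_test[0], data.feature_names)
print("Manual walk:", data.target_names[manual])
print("clf.predict:", data.target_names[clf.predict(X_test[:1])[0]])
```

Both printed labels should agree, since the helper follows the same left/right rule that the classifier applies internally.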
