0% found this document useful (0 votes)

269 views6 pages

Confusion Matrix

This document discusses confusion matrices for evaluating classification models. It introduces confusion matrices, providing examples of binary and multi-class matrices. It then loads and explores the mushroom dataset, creating feature and target datasets. Various classifiers are defined in a dictionary to apply to the data and evaluate with confusion matrices. A function is defined to generate confusion matrices for each classifier in the dictionary when passed feature, target data and the classifier dictionary.

Uploaded by

amir

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

269 views6 pages

Confusion Matrix

Uploaded by

amir

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

   

(../../index.html) (../../pages/about- (../../archive.html) (../../categories/index.html)

me/index.html)

Metrics - Confusion Matrix (.)

 3 years ago (.)  Harun  Source (index.ipynb) 
 Tags: confusion matrix (../../categories/confusion-matrix/)

In a typical data science project we try several models like (logistic regression, SVM, tree-classifiers
etc) on our data.
Then we measure the predicting performance of these models to find the best performing one.
Finally we decide to implement the best performing model.
In this notebook we talk about one of the classification model evaluation tools: Confusion matrix.
They can help us to see deeper how much reliable our models are.
We are going to look at the confusion matrices of a variety of Scikit-Learn models and compare them
using visual diagnostic tools from Yellowbrick in order to select the best model for our data.

In [4]: # Notebook setup

import pandas as pd

from sklearn.model_selection import train_test_split

from sklearn.pipeline import Pipeline

from sklearn.linear_model import LogisticRegression

from sklearn.svm import LinearSVC

from sklearn.ensemble import RandomForestClassifier

from sklearn.linear_model import SGDClassifier

import category_encoders as ce

from yellowbrick.classifier import ConfusionMatrix

import matplotlib.pyplot as plt

import warnings

warnings.filterwarnings("ignore")

%matplotlib inline

Confusion Matrix
Since we know the labels of the test set we can measure how succesfull are the predictions of the
model by comparing the the actual labels and predictions
We can see if our classifier identifies the samples succesfully, or it is "CONFUSED" with another label?
Confusion matrix shows the amount of confusion.
We use confusion matrices to understand which classes are most easily confused
There are two sets of labels in a confusion matrix of binary (2 class) classification:
{POSITIVE, NEGATIVE}- first, the model makes the prediction. It returns the labels 1 (POSITIVE)
or 0 (NEGATIVE).
{TRUE, FALSE}- then the model's prediction is evaluated if the prediction is made correctly
(TRUE) or incorrectly (FALSE) based on the actual known labels.

Tip: If you have difficulty to remember these terms because of the similarity of the terms just insert
the word "PREDICTED" in the middle.
For instance if you are confused with the meaning of "false positive" read it like
"false(ly) PREDICTED positive"

We create a matrix of predictions and actual labels for a binary classification.

One diagonal show the succesful predictions the other diagonal unsuccessful predictions

In multi-class classification ie if the class labels are more than 2 (not 1 or 0, positive or negative), the
confusion matrix looks like something like this

We do not use the terms like "true positive" with the confusion matrix with classes more than 2
The size of the confusion matrix is nxn where n is the number of classes
Different references may use a different convention for axes ie actual and predicted classes can take
place on different axes

Examples of confusion matrices

Lets demonstrate some confusion matrices
We will utilize the tools from yellowbrick library (http://www.scikit-
yb.org/en/latest/api/classifier/confusion_matrix.html). This is a nice library for machine learning
visualization.

Loading and Exploring the Dataset

This tutorial uses a modified version of the mushroom dataset from the UCI Machine Learning
Repository.
Even though these toy datasets are not interesting anymore because of repetitive usage, here our
focus is the classification metrics not other steps of data processing. So we try to get the advantage of
fast implementation of this dataset
Our objective is to predict if a mushroom is poisonous or edible based on its characteristics.
The data include descriptions of hypothetical samples corresponding to 23 species of mushrooms
Each species was identified as definitely edible or poisonous

In [5]: # Url of the dataset

url='https://raw.githubusercontent.com/rebeccabilbro/rebeccabilbro.github.io/master/data/a
garicus-lepiota.txt'

# Column names list

column_names=['class', 'cap-shape', 'cap-surface', 'cap-color']

# Load the data

mushrooms=pd.read_csv(url, header=None, names= column_names)

mushrooms.head(3)

Out[5]:
class cap-shape cap-surface cap-color

0 edible convex smooth yellow

1 edible bell smooth white

2 poisonous convex scaly white

In [6]: mushrooms.info()

RangeIndex: 8123 entries, 0 to 8122

Data columns (total 4 columns):

class 8123 non-null object

cap-shape 8123 non-null object

cap-surface 8123 non-null object

cap-color 8123 non-null object

dtypes: object(4)

memory usage: 253.9+ KB

In [7]: ## Count the unique values in each column

mushrooms.nunique()

Out[7]: class 2

cap-shape 6

cap-surface 4

cap-color 10

dtype: int64

We see that target and feature columns contain different categorical values.
We need to encode them into numerical types in order to fit Sklearn models.
For this purpose, we will utilize Category Encoders (http://contrib.scikit-learn.org/categorical-
encoding/index.html)
library which provides scikit-learn-compatible categorical variable encoders.
All the transformers of Category Encoders can be used in Sklearn pipelines.
Later in a separate post we will analyse the encoders

Target and Features Datasets

In [8]: # Create the features dataset (X) and target dataset (y)

features = ['cap-shape', 'cap-surface', 'cap-color']

target = 'class'

X = mushrooms[features]

y = mushrooms[target]

Classifiers Dictionary
Now, tet's create a dictionary which contains the classifiers we want to use for our classification task
Here we create the dictionary with instantiates of Sklearn estimators without hyperparameter
tuning.
In reality we need to evaluate the performance of tuned classifiers.
In [10]: # Estimators dictionary

# We can add as more classifiers to our dictionary

# This is just a sample

estimators_dct={"Logistic Legression": LogisticRegression(),

"Linear SVC" : LinearSVC(),

"Random Forest": RandomForestClassifier(n_estimators=8),

"SGD Classifier": SGDClassifier(),

confusion_matrices function
Let's define a function to get the confusion matrices of a given dictionary of models (like in the upper
cell) easily without repetion.

Our function will

take X, y datasets and an estimator dictionary
return the confusion matrices produced by the predictions of each model in the dictionary

In [9]: # set up the figure size for the confusion matrices

plt.rcParams['figure.figsize'] = (6, 4)

plt.rcParams['font.size'] = 15

def confusion_matrices(X, y, estimator_dict):

"""

Takes X, y datasets and an estimator dictionary -> returns confusion matrices of the c
lassifiers

"""

# Split the data as train and test

X_train, X_test, y_train, y_test= train_test_split(X, y, test_size=0.2, random_state=1

# Loop over the estimator_dict keys to get the each estimator

for estimator in estimator_dict.keys():

print(estimator)

# In the pipeline we use OneHotEncoder from Category Encoders

model = Pipeline([('encoder', ce.OneHotEncoder()),

('estimator', estimator_dict[estimator])])

# Instantiate the classification model and visualizer

model.fit(X_train, y_train)

# The ConfusionMatrix visualizer takes a model

cm = ConfusionMatrix(model, fontsize=13, cmap='YlOrBr')

# To create the ConfusionMatrix, we need some test data

# Score runs predict() on the data and then

# creates the confusion_matrix from scikit-learn

cm.score(X_test, y_test)

cm.poof()

In [11]: # Call the confusion_matrices function to get the confusion matrices

confusion_matrices(X, y, estimators_dct)

Logistic Legression

Linear SVC

Random Forest

SGD Classifier

ConfusionMatrix from Yellowbrick

takes a fitted scikit-learn classifier and a set of X_test and y_test values and
returns a report showing how each of the test samples predicted classes compare to their actual
classes.
Confusion matrices provide similar information as what is available in a ClassificationReport (we
will talk about it soon), but rather than top-level scores, they provide deeper insight into the classifica-
tion of individual data points.
Creates a heatmap visualization of the sklearn.metrics.confusion_matrix().
We can choose between displaying values as the percent of true (cell value divided by sum of row) or
as direct counts.

Conclusion
Even though we can get deeper insight on predictions of the classifiers by confusion matrices, still it is
not very practical to compare several models performance with each other
Since confusion matrices provides tables of actual and prediction comparision we still need some
more metrics to interpret the result more directly to choose the best model
So we will continue to work on classification metrics like precision, recall, roc, auc etc in the next
posts

Sources:
http://www.scikit-yb.org/en/latest/api/classifier/confusion_matrix.html (http://www.scikit-
yb.org/en/latest/api/classifier/confusion_matrix.html)
http://contrib.scikit-learn.org/categorical-encoding/index.html (http://contrib.scikit-
learn.org/categorical-encoding/index.html)

Previous post (../metrics/) Next post (../gps-data-analysis/)

Comments
Contents © 2019 Harun (mailto:[email protected]) - Powered by Nikola (https://getnikola.com)

CEC453 Machine Learning
No ratings yet
CEC453 Machine Learning
168 pages
Unit 3 Univariate Analysis
No ratings yet
Unit 3 Univariate Analysis
39 pages
MPC of Complex Systems
100% (1)
MPC of Complex Systems
232 pages
ML Lab Experiments
No ratings yet
ML Lab Experiments
116 pages
Applied ML Notes
No ratings yet
Applied ML Notes
123 pages
Empathy Presentation
No ratings yet
Empathy Presentation
19 pages
Phyton
No ratings yet
Phyton
118 pages
Deep Learning With Python Sample
100% (1)
Deep Learning With Python Sample
31 pages
Operation Research Problems Solving in Python: Prepared by Saurav Barua
No ratings yet
Operation Research Problems Solving in Python: Prepared by Saurav Barua
15 pages
Lecture 3 - MDPs and Dynamic Programming
No ratings yet
Lecture 3 - MDPs and Dynamic Programming
66 pages
Ge3171-Python Lab
No ratings yet
Ge3171-Python Lab
82 pages
MiniTab Introduction
100% (1)
MiniTab Introduction
124 pages
The French Fluency Formula - Master The Language
No ratings yet
The French Fluency Formula - Master The Language
211 pages
Monte Carlo Simulation
No ratings yet
Monte Carlo Simulation
22 pages
Class Xi Python
100% (2)
Class Xi Python
138 pages
Com Skills MCQ
75% (4)
Com Skills MCQ
24 pages
MANAGEMENT
100% (1)
MANAGEMENT
15 pages
CH 22 Analytical Decision Making
No ratings yet
CH 22 Analytical Decision Making
26 pages
AI and ML Notes
No ratings yet
AI and ML Notes
17 pages
Module 1 Notes
100% (1)
Module 1 Notes
73 pages
Chapter 2 IA
No ratings yet
Chapter 2 IA
49 pages
Machine Learning Lab Assignment CSE-716: S. M. Shafkat Raihan ID: 16701041 SESSION: 2015-16
No ratings yet
Machine Learning Lab Assignment CSE-716: S. M. Shafkat Raihan ID: 16701041 SESSION: 2015-16
9 pages
A Semi-Detailed Lesson Plan in Science 7
100% (1)
A Semi-Detailed Lesson Plan in Science 7
3 pages
Machine Learning LAB MANUAL
No ratings yet
Machine Learning LAB MANUAL
56 pages
Pandas Handbook
No ratings yet
Pandas Handbook
33 pages
Discriminant Analysis
No ratings yet
Discriminant Analysis
13 pages
Values Education
No ratings yet
Values Education
17 pages
Overfitting and Underfitting in Machine Learning
No ratings yet
Overfitting and Underfitting in Machine Learning
3 pages
Aiml Manual 6th Sem
No ratings yet
Aiml Manual 6th Sem
15 pages
Gold Exp Grammar B2P U7
100% (1)
Gold Exp Grammar B2P U7
7 pages
M5 - Custom Model Building With SQL in BigQuery ML Slides
No ratings yet
M5 - Custom Model Building With SQL in BigQuery ML Slides
32 pages
ML UNIT-IV Notes
100% (1)
ML UNIT-IV Notes
23 pages
Data Mining Project Shivani Pandey
100% (1)
Data Mining Project Shivani Pandey
40 pages
The Next Level of Data Visualization in Python
100% (1)
The Next Level of Data Visualization in Python
17 pages
Handling Missing Value in Decision Tree Algorithm PDF
No ratings yet
Handling Missing Value in Decision Tree Algorithm PDF
6 pages
Statistics Probability
No ratings yet
Statistics Probability
66 pages
Preschool Language Enrichment Plans
No ratings yet
Preschool Language Enrichment Plans
3 pages
Understanding Random Forest
100% (1)
Understanding Random Forest
12 pages
Spa 506 - Intergroup Contact Theory
No ratings yet
Spa 506 - Intergroup Contact Theory
7 pages
Book Review Henry Mintzbergs Strategy Safari
100% (2)
Book Review Henry Mintzbergs Strategy Safari
19 pages
Deep Learning For IoT Big Data and Streaming Analytics A Survey
No ratings yet
Deep Learning For IoT Big Data and Streaming Analytics A Survey
40 pages
Text
No ratings yet
Text
131 pages
Chandigarh Group of Colleges College of Engineering Landran, Mohali
No ratings yet
Chandigarh Group of Colleges College of Engineering Landran, Mohali
47 pages
Karanja Evanson Mwangi Cit Masters Report Libre PDF
No ratings yet
Karanja Evanson Mwangi Cit Masters Report Libre PDF
136 pages
Advanced Simpy
No ratings yet
Advanced Simpy
25 pages
Machine Learning in 10 Pages PDF
No ratings yet
Machine Learning in 10 Pages PDF
10 pages
Decision Tree Learning: - A Learned Decision Tree Can Also Be Re-Represented As A Set of If-Then Rules
No ratings yet
Decision Tree Learning: - A Learned Decision Tree Can Also Be Re-Represented As A Set of If-Then Rules
49 pages
Lec-4-HEURISTIC SEARCH METHODS-1
No ratings yet
Lec-4-HEURISTIC SEARCH METHODS-1
54 pages
ACT - Russ Harris - Act With Love - BESTS
100% (5)
ACT - Russ Harris - Act With Love - BESTS
234 pages
Aguilos Mario Jr. G. LDM Portfolio
No ratings yet
Aguilos Mario Jr. G. LDM Portfolio
22 pages
Rachel Ford: 80 M St. #3 Salt Lake City, UT 84103 (385) 212-9242
No ratings yet
Rachel Ford: 80 M St. #3 Salt Lake City, UT 84103 (385) 212-9242
2 pages
Psy404 Assignment PDF
No ratings yet
Psy404 Assignment PDF
2 pages
SimPy For First Time Users - SimPy v2.2 Documentation
No ratings yet
SimPy For First Time Users - SimPy v2.2 Documentation
15 pages
Simple - Linear - Regression - Ipynb - Colaboratory
No ratings yet
Simple - Linear - Regression - Ipynb - Colaboratory
2 pages
By: Lisa Thomas and Jenny Johnson
No ratings yet
By: Lisa Thomas and Jenny Johnson
30 pages
Special Education Assessment Tools
100% (3)
Special Education Assessment Tools
7 pages
Input Modeling For Simulation
No ratings yet
Input Modeling For Simulation
48 pages
Roadmap To Build A Machine Learning Model
No ratings yet
Roadmap To Build A Machine Learning Model
12 pages
Lab 3 - Linear Regression
No ratings yet
Lab 3 - Linear Regression
15 pages
ML 2
No ratings yet
ML 2
6 pages
Monday Tuesday Wednesday Thursday Friday: Daily Lesson LOG I.Objectives
No ratings yet
Monday Tuesday Wednesday Thursday Friday: Daily Lesson LOG I.Objectives
15 pages
Interdisciplinary Island of Rationality: A Promising Active Learning Strategy
No ratings yet
Interdisciplinary Island of Rationality: A Promising Active Learning Strategy
13 pages
Seminar Report Machine Learning
No ratings yet
Seminar Report Machine Learning
20 pages
Python Data Analysis Visualization
No ratings yet
Python Data Analysis Visualization
34 pages
Honours in Artificial Intelligence and Machine Learning: Board of Studies (Computer Engineering)
No ratings yet
Honours in Artificial Intelligence and Machine Learning: Board of Studies (Computer Engineering)
16 pages
Toolbox Grade 5 Formative Assessment Performing
No ratings yet
Toolbox Grade 5 Formative Assessment Performing
5 pages
Exam Rubrics Final PDF
No ratings yet
Exam Rubrics Final PDF
1 page
Aristotle On What It Means To Be Happy
No ratings yet
Aristotle On What It Means To Be Happy
8 pages
Approaches To The Analysis of Survey Data PDF
No ratings yet
Approaches To The Analysis of Survey Data PDF
28 pages
ML Interview Cheat Sheet
No ratings yet
ML Interview Cheat Sheet
9 pages
POL BigDataStatisticsJune2014
No ratings yet
POL BigDataStatisticsJune2014
27 pages
Multimedia Mining Presentation
No ratings yet
Multimedia Mining Presentation
18 pages
Main Complementary: Page Text Book/ Activity Book: 4
No ratings yet
Main Complementary: Page Text Book/ Activity Book: 4
2 pages
First Term Exam Second Year Literary Classes
No ratings yet
First Term Exam Second Year Literary Classes
2 pages
I. The Types of Machine Learning
No ratings yet
I. The Types of Machine Learning
8 pages
Monte Carlo Simulation: Assignment 1
No ratings yet
Monte Carlo Simulation: Assignment 1
13 pages
Year 12 Modern History Assessment Task
No ratings yet
Year 12 Modern History Assessment Task
11 pages
Bakhtin and The Guru Granth Sahib: TH TH
No ratings yet
Bakhtin and The Guru Granth Sahib: TH TH
5 pages
Cheatsheet Midterms 2 - 3
No ratings yet
Cheatsheet Midterms 2 - 3
2 pages
Running Head: Psychology in My Life 1
No ratings yet
Running Head: Psychology in My Life 1
6 pages
ML - Expectation-Maximization Algorithm
No ratings yet
ML - Expectation-Maximization Algorithm
3 pages
Salary Prediction LinearRegression
100% (1)
Salary Prediction LinearRegression
7 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
7 pages
Pastsimple Usedto Would
No ratings yet
Pastsimple Usedto Would
2 pages
Passive Voice: by Group 1: Luthfika Adityo Gindo Putra Nugroho Primatia Palwani Munawaroh
No ratings yet
Passive Voice: by Group 1: Luthfika Adityo Gindo Putra Nugroho Primatia Palwani Munawaroh
8 pages
1st Quarter Exam Oral Comm
No ratings yet
1st Quarter Exam Oral Comm
5 pages
Mastering Parallel Programming with R
From Everand
Mastering Parallel Programming with R
Simon R. Chapple
No ratings yet
Equity of Cybersecurity in the Education System: High Schools, Undergraduate, Graduate and Post-Graduate Studies.
From Everand
Equity of Cybersecurity in the Education System: High Schools, Undergraduate, Graduate and Post-Graduate Studies.
Joseph O. Esin
No ratings yet
(Excerpts From) Investigating Performance: Design and Outcomes With Xapi
From Everand
(Excerpts From) Investigating Performance: Design and Outcomes With Xapi
Janet Laane Effron
No ratings yet
Connectivity Prediction in Mobile Ad Hoc Networks for Real-Time Control
From Everand
Connectivity Prediction in Mobile Ad Hoc Networks for Real-Time Control
Sebastian Thelen
5/5 (1)

Confusion Matrix

Uploaded by

Confusion Matrix

Uploaded by

   

(../../index.html) (../../pages/about- (../../archive.html) (../../categories/index.html)

Metrics - Confusion Matrix (.)

In [4]: # Notebook setup

from sklearn.model_selection import train_test_split

from sklearn.pipeline import Pipeline

from sklearn.linear_model import LogisticRegression

from sklearn.svm import LinearSVC

from sklearn.ensemble import RandomForestClassifier

from sklearn.linear_model import SGDClassifier

from yellowbrick.classifier import ConfusionMatrix

import matplotlib.pyplot as plt

We create a matrix of predictions and actual labels for a binary classification.

Examples of confusion matrices

Loading and Exploring the Dataset

In [5]: # Url of the dataset

# Column names list

column_names=['class', 'cap-shape', 'cap-surface', 'cap-color']

# Load the data

mushrooms=pd.read_csv(url, header=None, names= column_names)

0 edible convex smooth yellow

1 edible bell smooth white

2 poisonous convex scaly white

RangeIndex: 8123 entries, 0 to 8122

Data columns (total 4 columns):

class 8123 non-null object

cap-shape 8123 non-null object

cap-surface 8123 non-null object

cap-color 8123 non-null object

memory usage: 253.9+ KB

In [7]: ## Count the unique values in each column

Target and Features Datasets

features = ['cap-shape', 'cap-surface', 'cap-color']

# We can add as more classifiers to our dictionary

# This is just a sample

estimators_dct={"Logistic Legression": LogisticRegression(),

"Linear SVC" : LinearSVC(),

"Random Forest": RandomForestClassifier(n_estimators=8),

"SGD Classifier": SGDClassifier(),

Our function will

In [9]: # set up the figure size for the confusion matrices

def confusion_matrices(X, y, estimator_dict):

# Split the data as train and test

X_train, X_test, y_train, y_test= train_test_split(X, y, test_size=0.2, random_state=1

# Loop over the estimator_dict keys to get the each estimator

for estimator in estimator_dict.keys():

# In the pipeline we use OneHotEncoder from Category Encoders

model = Pipeline([('encoder', ce.OneHotEncoder()),

# Instantiate the classification model and visualizer

# The ConfusionMatrix visualizer takes a model

cm = ConfusionMatrix(model, fontsize=13, cmap='YlOrBr')

# To create the ConfusionMatrix, we need some test data

# Score runs predict() on the data and then

# creates the confusion_matrix from scikit-learn

In [11]: # Call the confusion_matrices function to get the confusion matrices

ConfusionMatrix from Yellowbrick

Previous post (../metrics/) Next post (../gps-data-analysis/)

You might also like