Classification Algorithm: Supervised Learning Technique
[Diagram: labeled data is fed to the classification algorithm during training; the learned model is then applied to labeled data during testing.]
True Positives (TP): The classifier correctly predicts the subject as positive for the given class. For example, in Cat vs. Dog classification, a cat is predicted as Cat.
True Negatives (TN): The classifier correctly predicts the subject as negative for the given class. For example, in Cat vs. Dog classification, a dog is predicted as Not Cat.
False Positives (FP): The classifier incorrectly predicts the subject as positive for the given class. For example, in Cat vs. Dog classification, a dog is predicted as Cat. This is also known as a "Type I" error.
False Negatives (FN): The classifier incorrectly predicts the subject as negative for the given class. For example, in Cat vs. Dog classification, a cat is predicted as Not Cat. This is also known as a "Type II" error.
Evaluating a Classification model (Cont)
Confusion Matrix:
Example
We have built a new spam filter and want to evaluate how good it is. Given below is the confusion matrix for the spam filter over 100 e-mails.

                          Actual
                     spam    Not spam
Predicted  spam       10         2
       Not spam       15        73
a) What does the number 15 represent here? Put it in plain English.
b) Calculate the precision of the spam filter. What is the interpretation of this value of precision, i.e., how would you explain it to someone who doesn't know how precision is calculated but still uses e-mail and receives spam?
c) Calculate the recall of the spam filter. What is the interpretation of this value of recall, i.e., how would you explain it to someone who doesn't know how recall is calculated but still uses e-mail and receives spam?
d) You can see that the precision of this spam filter is very good but the recall is not so good. What does it mean to have high precision and low recall? (Hint: think about how you interpret precision and recall and apply that to the context of a spam filter.) What might be the possible reason for these results?
e) What does it mean to have high recall and low precision for a spam filter? Which of the two do you think is better: high precision with low recall, or high recall with low precision?
f) What is the overall accuracy of the spam filter? What do you mean when you say the spam filter has this value of accuracy?
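The quantities asked for in parts (b), (c), and (f) can be sketched in a few lines of Python. This is a minimal sketch, assuming the matrix above is read with "spam" as the positive class: 10 spam e-mails correctly flagged (TP), 2 legitimate e-mails wrongly flagged (FP), 15 spam e-mails that slipped through (FN), and 73 legitimate e-mails correctly delivered (TN) — the reading consistent with part (d)'s premise of high precision and lower recall.

```python
# Counts read from the confusion matrix above, "spam" = positive class.
tp = 10  # spam correctly flagged as spam
fp = 2   # legitimate mail wrongly flagged as spam
fn = 15  # spam that slipped through to the inbox
tn = 73  # legitimate mail correctly delivered

precision = tp / (tp + fp)                   # of everything flagged, how much was really spam
recall    = tp / (tp + fn)                   # of all real spam, how much was caught
accuracy  = (tp + tn) / (tp + fp + fn + tn)  # fraction of all e-mails handled correctly

print(f"precision = {precision:.3f}")
print(f"recall    = {recall:.3f}")
print(f"accuracy  = {accuracy:.3f}")
```

Changing which cell counts as FP vs. FN (i.e., transposing the matrix) swaps the precision and recall values, which is why part (a) asks you to pin down in plain English what a single cell means before computing anything.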
Evaluating a Classification model (Cont)
Log Loss or Cross-Entropy Loss:
• It is used for evaluating the performance of a classifier whose output is a probability value between 0 and 1.
• For a good binary classification model, the log loss should be near 0.
• The log loss increases as the predicted probability deviates from the actual label.
• A lower log loss indicates a higher accuracy of the model.
For binary classification, the cross-entropy can be calculated as:

$$\mathrm{LogLoss} = -\frac{1}{N}\sum_{i=1}^{N}\left[y_i \log(p_i) + (1 - y_i)\log(1 - p_i)\right]$$

where $y_i$ is the actual label (0 or 1) and $p_i$ is the predicted probability of the positive class.
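The bullets above can be checked numerically. This is a small sketch of binary cross-entropy (the function name and the example probabilities are illustrative, not from the slides); note how a confident wrong prediction drives the loss up sharply, while confident correct predictions keep it near 0.

```python
import math

def log_loss(y_true, p_pred, eps=1e-15):
    """Binary cross-entropy averaged over the samples.
    Predictions are clipped away from 0 and 1 to avoid log(0)."""
    total = 0.0
    for y, p in zip(y_true, p_pred):
        p = min(max(p, eps), 1 - eps)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / len(y_true)

# Confident, correct predictions -> loss near 0.
print(log_loss([1, 0, 1], [0.95, 0.05, 0.90]))
# One confident wrong prediction (true 1 predicted 0.1) -> loss jumps.
print(log_loss([1, 0, 1], [0.95, 0.05, 0.10]))
```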
Evaluating a Classification model (Cont)
AUC-ROC curve:
• ROC stands for Receiver Operating Characteristic curve, and AUC stands for Area Under the Curve.
• It is a graph that shows the performance of a classification model at different thresholds.
• The AUC-ROC curve can also be used to visualize the performance of a multi-class classification model.
• The ROC curve plots TPR against FPR, with the TPR (True Positive Rate) on the Y-axis and the FPR (False Positive Rate) on the X-axis.
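A point on the ROC curve is just the (FPR, TPR) pair obtained at one decision threshold. A minimal sketch, with illustrative scores and labels (not from the slides):

```python
def roc_points(scores, labels, thresholds):
    """(FPR, TPR) pairs for a list of decision thresholds.
    scores are predicted probabilities; labels are 0/1."""
    pos = sum(labels)
    neg = len(labels) - pos
    points = []
    for t in thresholds:
        tp = sum(1 for s, y in zip(scores, labels) if s >= t and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= t and y == 0)
        points.append((fp / neg, tp / pos))  # FPR on the x-axis, TPR on the y-axis
    return points

scores = [0.9, 0.8, 0.6, 0.4, 0.3, 0.1]
labels = [1,   1,   0,   1,   0,   0]
for fpr, tpr in roc_points(scores, labels, [0.2, 0.5, 0.7]):
    print(f"FPR={fpr:.2f}  TPR={tpr:.2f}")
```

Sweeping the threshold from 1 down to 0 traces the curve from (0, 0) to (1, 1); the area under that trace is the AUC.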
Biological Neuron
Artificial Neuron
A neuron is a mathematical function modeled on the working of biological neurons.
• It is an elementary unit in an artificial neural network.
• One or more inputs are separately weighted.
• The inputs are summed and passed through a nonlinear function to produce the output.
• Every neuron holds an internal state called the activation signal.
• Each connection link carries information about the input signal.
• Every neuron is connected to other neurons via connection links.
A typical activation function works as follows:

$$X = \sum_{i=1}^{n} w_i x_i, \qquad Y = \begin{cases} +1 & \text{for } X > t \\ 0 & \text{for } X \le t \end{cases}$$

Each node $i$ has a weight $w_i$ associated with it. The input to node $i$ is $x_i$, and $t$ is the threshold.
So if the weighted sum of the inputs to the neuron is above the threshold, then the neuron fires.
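The weighted-sum-and-threshold rule above is a few lines of code. A minimal sketch (the weights and threshold are illustrative values chosen so the neuron computes logical AND, not values from the slides):

```python
def neuron(inputs, weights, threshold):
    """Weighted sum followed by a hard-threshold activation:
    fires (returns 1) only when sum(w_i * x_i) exceeds the threshold t."""
    x = sum(w * xi for w, xi in zip(weights, inputs))
    return 1 if x > threshold else 0

# With weights (0.5, 0.5) and threshold 0.7, the neuron fires
# only when both binary inputs are 1 -- i.e., it computes AND.
for a in (0, 1):
    for b in (0, 1):
        print(a, b, "->", neuron([a, b], [0.5, 0.5], 0.7))
```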
Activation Functions
Perceptrons
A perceptron is a single neuron that classifies a set of inputs into one of
two categories (usually 1 or -1).
If the inputs are in the form of a grid, a perceptron can be used to
recognize visual images of shapes.
The perceptron usually uses a step function, which returns 1 if the
weighted sum of inputs exceeds a threshold, and 0 otherwise.
How do we train?
[Diagram: a fully connected network with inputs 𝒙, a hidden layer 𝒉 of 4 neurons with activation functions, and outputs 𝒚.]
For a network with 3 inputs, a hidden layer of 4 neurons, and 2 outputs:
• 4 + 2 = 6 neurons (not counting inputs)
• [3 x 4] + [4 x 2] = 20 weights
• 4 + 2 = 6 biases
• 26 learnable parameters
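The parameter count above generalizes to any stack of fully connected layers: each layer after the input contributes fan_in × fan_out weights plus fan_out biases. A small sketch (the function name is illustrative):

```python
def count_parameters(layer_sizes):
    """Learnable parameters of a fully connected network given layer
    widths [inputs, hidden..., outputs]: each consecutive pair (a, b)
    contributes a*b weights, and every non-input layer adds one bias
    per neuron."""
    weights = sum(a * b for a, b in zip(layer_sizes, layer_sizes[1:]))
    biases = sum(layer_sizes[1:])
    return weights + biases

# 3 inputs, one hidden layer of 4 neurons, 2 outputs:
# 3*4 + 4*2 = 20 weights, 4 + 2 = 6 biases.
print(count_parameters([3, 4, 2]))
```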
The Chain Rule
The Chain Rule is a technique for differentiating composite functions.
Composite functions are made up of layers of functions inside of functions.
Steps to apply the chain rule:
1. Identify the inner and outer functions.
2. Differentiate the outer function, leaving the inner function alone.
3. Differentiate the inner function.
Chain Rule: One Independent Variable
Theorem 13.7 Chain Rule: Two Independent Variables
Chain Rule Example
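As a worked illustration of the three steps above (the slide's own example is not reproduced here, so the function $\sin(x^2)$ is an assumed example), take the composite $f(x) = \sin(x^2)$ with outer function $\sin(u)$ and inner function $u = x^2$:

```latex
% Step 1: outer function \sin(u), inner function u = x^2.
% Step 2: differentiate the outer, leaving the inner alone: \cos(x^2).
% Step 3: differentiate the inner: 2x.
\frac{d}{dx}\sin(x^2) = \cos(x^2)\cdot 2x = 2x\cos(x^2)
```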
Training
1. Sample labeled data (batch)
2. Forward it through the network, get predictions
3. Back-propagate the errors
4. Update the network weights
$$z_j = f(z\_in_j)$$

and sends this signal to all units in the layer above (the output units).
Training Algorithm (Cont)
Step 5: Each output unit ($Y_k$, $k = 1, \dots, m$) sums its weighted input signals,

$$y\_in_k = w_{0k} + \sum_{j=1}^{p} z_j w_{jk}$$

and applies its activation function to compute its output signal,

$$y_k = f(y\_in_k).$$
Backpropagation of error:
Step 6: Each output unit ($Y_k$, $k = 1, \dots, m$) receives a target pattern corresponding to the input training pattern and computes its error information term,

$$\delta_k = (t_k - y_k)\, f'(y\_in_k),$$

calculates its weight correction term (used to update $w_{jk}$ later),

$$\Delta w_{jk} = \alpha\, \delta_k\, z_j,$$

calculates its bias correction term (used to update $w_{0k}$ later),

$$\Delta w_{0k} = \alpha\, \delta_k,$$

and sends $\delta_k$ to units in the layer below.
Training Algorithm (Cont)
Step 7: Each hidden unit ($Z_j$, $j = 1, \dots, p$) sums its delta inputs from the units in the layer above,

$$\delta\_in_j = \sum_{k=1}^{m} \delta_k w_{jk}$$
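Steps 5 through 7 can be traced numerically for one output unit. This is a minimal sketch, assuming a logistic activation $f(x) = 1/(1+e^{-x})$ (so $f'(x) = f(x)(1-f(x))$); the hidden activations, weights, target, and learning rate are illustrative values, not from the slides.

```python
import math

f = lambda x: 1.0 / (1.0 + math.exp(-x))  # logistic activation

z = [0.6, 0.9]          # hidden activations z_j (p = 2 hidden units)
w = [0.3, -0.2]         # weights w_jk from hidden unit j to the single output
w0 = 0.1                # output bias w_0k
t, alpha = 1.0, 0.5     # target value and learning rate

# Step 5: the output unit sums its weighted inputs and applies f.
y_in = w0 + sum(zj * wj for zj, wj in zip(z, w))
y = f(y_in)

# Step 6: error term and correction terms.
delta = (t - y) * y * (1 - y)           # delta_k = (t_k - y_k) f'(y_in_k)
dw = [alpha * delta * zj for zj in z]   # Delta w_jk = alpha * delta_k * z_j
dw0 = alpha * delta                     # Delta w_0k = alpha * delta_k

# Step 7: each hidden unit sums its delta input from the layer above.
delta_in = [delta * wj for wj in w]     # delta_in_j = sum_k delta_k w_jk

print("y =", y)
print("weight corrections:", dw, "bias correction:", dw0)
print("hidden delta inputs:", delta_in)
```

Because the target (1.0) exceeds the output, delta is positive and both weight corrections push the output upward on the next pass.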