Lecture7C Classification

The document provides an overview of binary classification, focusing on algorithms such as Support Vector Machines (SVM), K Nearest Neighbor (KNN), and Artificial Neural Networks (ANN). It discusses evaluation metrics for binary classifiers, the concept of hyperplanes, and the optimization techniques used in SVM to maximize the margin while minimizing misclassification. It also introduces the soft-margin SVM and the kernel trick for handling non-linearly separable data.

Classification

§ Outline:
1. Introduction
2. K Nearest Neighbor (KNN)
3. Artificial Neural Network (ANN)
4. Support Vector Machine (SVM)
Binary classifier

§ A supervised learning algorithm
§ Categorizes new observations (data points) into one of two predefined classes

§ Applications:
Common binary classification algorithms

§ Support Vector Machines

§ Naïve Bayes

§ K Nearest Neighbor

§ Decision Trees

§ Logistic Regression

§ Artificial Neural Networks


Binary classification evaluation

§ TP, TN, FP, FN
§ Accuracy
§ Recall
§ Precision
§ F1-score
§ ROC
§ AUC
§ …
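A minimal sketch of these metrics with scikit-learn, assuming small hypothetical toy arrays of true labels, predicted labels, and scores:

# A minimal sketch of binary classification metrics, assuming scikit-learn
# and small hypothetical toy label arrays.
import numpy as np
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, confusion_matrix, roc_auc_score)

y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])    # ground-truth labels
y_pred = np.array([1, 0, 1, 0, 0, 1, 1, 0])    # predicted labels
y_score = np.array([0.9, 0.2, 0.8, 0.4, 0.1, 0.6, 0.7, 0.3])  # scores for ROC/AUC

# Confusion matrix layout for binary labels: [[TN, FP], [FN, TP]]
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print("TP, TN, FP, FN:", tp, tn, fp, fn)
print("Accuracy :", accuracy_score(y_true, y_pred))   # (TP+TN) / all
print("Precision:", precision_score(y_true, y_pred))  # TP / (TP+FP)
print("Recall   :", recall_score(y_true, y_pred))     # TP / (TP+FN)
print("F1-score :", f1_score(y_true, y_pred))         # harmonic mean of P and R
print("AUC      :", roc_auc_score(y_true, y_score))   # area under the ROC curve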
Example of binary classification

§ Red fruit vs. green fruit: 1D data
(figure credit: StatQuest with Josh Starmer)

Example of binary classification (cont)

§ A threshold separates red fruit from green fruit on the 1D axis
§ A threshold placed right next to one cluster is not good: new data near the boundary gets misclassified


How to find a better threshold?

§ Focus on the edges of each class/cluster
§ Use the midpoint between the two edges as the threshold
§ The distance from the threshold to each edge is the margin

This is the Maximal Margin Classifier!
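A minimal sketch of this idea on 1D data, assuming two hypothetical NumPy arrays of fruit positions:

# A minimal sketch of a 1D maximal margin classifier: the threshold is the
# midpoint between the closest edges of the two clusters. Toy data assumed.
import numpy as np

red = np.array([1.0, 1.5, 2.0, 2.8])    # red fruit positions (left cluster)
green = np.array([6.2, 7.0, 7.5, 8.0])  # green fruit positions (right cluster)

edge_red, edge_green = red.max(), green.min()  # edges of each cluster
threshold = (edge_red + edge_green) / 2        # midpoint = maximal margin threshold
margin = (edge_green - edge_red) / 2           # distance from threshold to each edge
print(f"threshold = {threshold}, margin = {margin}")

new_fruit = 5.9
print("green" if new_fruit > threshold else "red")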
Is maximal margin classifier good?

§ With an outlier in the training data:
→ the maximal margin classifier ends up super close to the green cluster
→ the maximal margin classifier is super sensitive to outliers in the training data
Misclassification

§ Choose a threshold that allows misclassification: ignore the outlier!
§ The resulting margin, which allows misclassification, is a soft margin
→ This is a Soft Margin Classifier
How to choose a good soft margin?

§ Choosing this one? Or choosing this one?
§ Use cross validation → count how many misclassification errors and how many observations fall within each candidate soft margin

Soft margin classifier = support vector classifier

§ A support vector classifier can handle outliers


Support vector classifier

§ 1D data: the classifier is a single point (threshold); the observations on the edges of the margin are the support vectors
§ 2D data: the classifier is a line
§ 3D data: the classifier is a plane
Support vector classifier

§ n-D data (n ≥ 4): the support vector classifier is a hyperplane
§ The term "hyperplane" is usually used when we can't draw the boundary
§ A support vector classifier allows misclassification → it can handle overlapping classes
Overlapping classification

§ In case of lots of overlap (e.g., cured and uncured patients interleaved along the measurement axis), support vector classifiers don't perform well → solution???


Solution

Support Vector Machines (SVM)
SVM: a visual explanation

How to split the data in the best possible way?
(figure credit: Alice Zhao)
SVM: a visual explanation (cont)

§ Not a good split
§ The best split: the one with the largest margin

Margin

§ Margin = the distance between the hyperplane and the closest data point from either class
Margin (cont)
What is a hyperplane?

§ In an n-D space, a hyperplane is a flat affine (n−1)-D subspace
§ In an SVM, the hyperplane is the decision boundary that separates the two classes:
  w^T x + b = 0   (w: vector, b: scalar)
§ The optimal hyperplane is a hyperplane that:
Ø Separates the classes as well as possible
Ø Maximizes the margin
Ø Minimizes the misclassification


Example of hyperplane

(figure: a separating hyperplane with the support vectors highlighted)
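A minimal sketch, assuming NumPy and hypothetical values for w and b, of how a hyperplane w^T x + b = 0 acts as a decision boundary: the sign of w^T x + b gives the side of the hyperplane, and |w^T x + b| / ||w|| gives the perpendicular distance to it.

# A minimal sketch of classifying points with a hyperplane w^T x + b = 0.
# The values of w and b here are hypothetical, not learned from data.
import numpy as np

w = np.array([2.0, -1.0])   # normal vector of the hyperplane
b = -3.0                    # offset scalar

X = np.array([[3.0, 1.0],   # some query points
              [0.0, 0.0],
              [1.0, 2.0]])

scores = X @ w + b                              # f(x) = w^T x + b
labels = np.sign(scores)                        # +1 / -1 = side of the hyperplane
distances = np.abs(scores) / np.linalg.norm(w)  # perpendicular distance
for x, s, d in zip(X, labels, distances):
    print(f"x={x}, class={int(s):+d}, distance={d:.3f}")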
How to find the optimal hyperplane?

§ Finding the best hyperplane = maximizing the margin
→ a constrained optimization problem
→ solved using the Lagrange multipliers technique
How to find the optimal hyperplane? (cont)

§ The hyperplane equation: f(x) = w^T x + b = 0
§ SVM aims to maximize the margin 2 / ||w||
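A minimal sketch, assuming scikit-learn and toy data, that fits a linear SVM and reads the margin 2 / ||w|| back from the learned weights (SVC exposes coef_ for a linear kernel):

# A minimal sketch: fit a (nearly) hard-margin linear SVM on toy data and
# compute the margin width 2 / ||w|| from the learned weight vector.
import numpy as np
from sklearn.svm import SVC

X = np.array([[1.0, 2.0], [2.0, 1.0], [2.0, 3.0],   # class -1
              [6.0, 5.0], [7.0, 7.0], [8.0, 6.0]])  # class +1
y = np.array([-1, -1, -1, 1, 1, 1])

clf = SVC(kernel="linear", C=1e6)  # very large C approximates a hard margin
clf.fit(X, y)

w = clf.coef_[0]        # weight vector of the separating hyperplane
b = clf.intercept_[0]   # bias term
print("margin width = 2/||w|| =", 2 / np.linalg.norm(w))
print("support vectors:\n", clf.support_vectors_)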
How to find the optimal hyperplane? (cont)

§ Given training data {(x_i, y_i)}_{i=1}^{N} (linearly separable),
  x_i = n-D feature vector, y_i ∈ {−1, +1} = label
§ The SVM optimization is:
  min_{w,b} (1/2) ||w||^2
  subject to: y_i (w^T x_i + b) ≥ 1, i = 1, …, N
Quadratic programming solving

§ Given training data {(x_i, y_i)}_{i=1}^{N},
  x_i = n-D feature vector, y_i ∈ {−1, +1} = label
§ The SVM optimization is:
  min_{w,b} (1/2) ||w||^2
  subject to: y_i (w^T x_i + b) ≥ 1, i = 1, …, N
§ The objective function is quadratic and convex; the constraints are linear
§ This is a convex quadratic programming problem
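Since the problem is a convex QP, an off-the-shelf solver can handle it directly. A minimal sketch using the cvxopt package on toy data: via Lagrange multipliers the hard-margin problem becomes the standard dual in α, from which w and b are recovered from the support vectors (the dual form itself is standard material, assumed here rather than derived in the slides).

# A minimal sketch of solving the hard-margin SVM dual as a convex QP
# with cvxopt. Toy linearly separable 2D data assumed.
import numpy as np
from cvxopt import matrix, solvers

X = np.array([[1.0, 2.0], [2.0, 3.0], [3.0, 3.0],
              [6.0, 5.0], [7.0, 8.0], [8.0, 8.0]])
y = np.array([-1.0, -1.0, -1.0, 1.0, 1.0, 1.0])
n = len(y)

# Dual problem: min (1/2) a^T P a - 1^T a  s.t.  a_i >= 0,  y^T a = 0,
# where P_ij = y_i y_j (x_i . x_j)
K = X @ X.T                     # linear-kernel Gram matrix
P = matrix(np.outer(y, y) * K)
q = matrix(-np.ones(n))
G = matrix(-np.eye(n))          # encodes a_i >= 0 as -a_i <= 0
h = matrix(np.zeros(n))
A = matrix(y.reshape(1, -1))
b = matrix(0.0)

solvers.options['show_progress'] = False
alpha = np.ravel(solvers.qp(P, q, G, h, A, b)['x'])

sv = alpha > 1e-6                              # support vectors have alpha > 0
w = ((alpha * y)[:, None] * X).sum(axis=0)     # w = sum_i alpha_i y_i x_i
b_hat = np.mean(y[sv] - X[sv] @ w)             # b from y_i (w.x_i + b) = 1
print("w =", w, "b =", b_hat, "margin =", 2 / np.linalg.norm(w))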


How to make SVM more powerful?

§ When data is not linearly separable and/or has some outliers:
Ø Soft-margin SVM
Ø Kernel trick
Ø Mixed (both combined)
Soft-margin SVM

§ We allow some misclassifications by introducing slack variables ξ_i ≥ 0 and a parameter C:
  min_{w,b,ξ} (1/2) ||w||^2 + C Σ_{i=1}^{N} ξ_i
  subject to: y_i (w^T x_i + b) ≥ 1 − ξ_i, ξ_i ≥ 0, i = 1, …, N
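A minimal sketch, assuming scikit-learn: SVC's C parameter plays the role of the C above, and the toy data below includes one deliberately overlapping point that the slack variables absorb (this also previews the effect of C discussed next):

# A minimal sketch of a soft-margin linear SVM: C trades margin width
# against slack. Toy data with one overlapping point assumed.
import numpy as np
from sklearn.svm import SVC

X = np.array([[1.0, 2.0], [2.0, 1.0], [6.5, 5.5],   # last -1 point overlaps class +1
              [6.0, 5.0], [7.0, 7.0], [8.0, 6.0]])
y = np.array([-1, -1, -1, 1, 1, 1])

for C in (0.1, 1.0, 100.0):
    clf = SVC(kernel="linear", C=C).fit(X, y)
    w = clf.coef_[0]
    print(f"C={C:6}: margin = {2 / np.linalg.norm(w):.3f}, "
          f"train accuracy = {clf.score(X, y):.2f}")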
The role of C parameter

§ C controls the trade-off between:

Ø Maximizing the margin

Ø Minimizing the classification errors

§ Effects of C:

Ø Large C: small margin, few misclassification errors

Ø Small C: large margin, more tolerance to errors

§ How to choose a good value of C?

Ø Cross validation
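A minimal sketch, assuming scikit-learn, of choosing C by cross validation with GridSearchCV (the dataset and the candidate grid of C values are arbitrary choices):

# A minimal sketch of picking C by cross validation.
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, n_features=5, random_state=0)

grid = GridSearchCV(SVC(kernel="linear"),
                    param_grid={"C": [0.01, 0.1, 1, 10, 100]},
                    cv=5)               # 5-fold cross validation
grid.fit(X, y)
print("best C:", grid.best_params_["C"],
      " cv accuracy:", round(grid.best_score_, 3))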
Kernel trick

§ When data is not linearly separable:
Ø Map the data to a higher-dimensional feature space where it becomes separable
§ Effect on the decision boundary: a linear hyperplane in the feature space corresponds to a non-linear decision boundary in the original space
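A minimal sketch, assuming scikit-learn, of the kernel trick on data a linear SVM cannot separate (concentric circles); the RBF kernel implicitly performs the higher-dimensional mapping:

# A minimal sketch of the kernel trick: concentric circles are not linearly
# separable, but an RBF-kernel SVM separates them easily.
from sklearn.datasets import make_circles
from sklearn.svm import SVC

X, y = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)

linear = SVC(kernel="linear").fit(X, y)
rbf = SVC(kernel="rbf").fit(X, y)       # implicit high-dimensional mapping
print("linear kernel accuracy:", linear.score(X, y))  # poor: data not separable
print("RBF kernel accuracy   :", rbf.score(X, y))     # near 1.0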
Summary

Data Type                       | Use                          | Notes
--------------------------------|------------------------------|-------------------------------------
Linearly separable + noise-free | Linear SVM, hard margin      | Perfect separation, no noise
Nearly separable + noise        | Linear SVM, soft margin (C)  | C controls error tolerance
Non-linearly separable          | Kernel SVM + soft margin (C) | Use RBF/poly kernel for flexibility