
Support Vector Machine for Classification
Instructor: Seunghoon Hong
Recap: image representation

Recap: non-parametric approach for classification
[Figure: training images (cat, dog) compared against a test image]
1. Compute the distance between feature vectors
2. Find the nearest neighbor in the training data
3. Use the nearest neighbor's label
Recap: parametric approach for classification
● The nearest neighbor algorithm is a specific instantiation of a non-parametric model for classification.
● As an alternative, we can parameterize a decision function and learn its parameters from the training data (see the sketch below).

[Figure: nearest neighbor (non-parametric model) vs. a linear model with learned parameters (parametric model)]
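For intuition, here is a minimal NumPy sketch of the two approaches; the data, shapes, and variable names are illustrative and not taken from the slides.

```python
import numpy as np

# Toy data: 100 "training images" as feature vectors, plus one test image.
X_train = np.random.randn(100, 4096)
y_train = np.random.randint(0, 2, size=100)         # 0 = dog, 1 = cat (illustrative)
x_test = np.random.randn(4096)

# Non-parametric (nearest neighbor): the "model" is the whole training set.
dists = np.linalg.norm(X_train - x_test, axis=1)    # 1. distance to every training sample
nn_label = y_train[np.argmin(dists)]                # 2-3. take the nearest neighbor's label

# Parametric (linear model): the model is just the parameters W, b learned from
# the training data; the training set itself is no longer needed at test time.
W = np.random.randn(2, 4096)                        # placeholder values; learned in practice
b = np.random.randn(2)
linear_label = np.argmax(W @ x_test + b)
```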
Today’s agenda
● Support Vector Machine (SVM)
Example: separable 2D data

[Figure: 2D scatter of positive and negative samples]
Example: determining a good classifier

[Figure: several linear decision boundaries, each separating the positive and negative samples]

All decision boundaries lead to perfect classification.
→ which boundary is better?
Example: determining a good classifier

Issues? The decision boundary is awkwardly close to the negative samples
→ it may not generalize well to unseen examples.
Support Vector Machine (SVM)
● Max-margin classifier: maximizing the margin of the classifier will lead to better generalization.

[Figure: positive and negative samples with the max-margin decision boundary and its margin]
Support Vector Machine (SVM)
● Let’s assume that we have a set of linearly separable data
Support Vector Machine (SVM)
● Our decision rule for a sample xᵢ with label yᵢ ∈ {+1, −1}:
  w·xᵢ + b ≥ +1  if  yᵢ = +1
  w·xᵢ + b ≤ −1  if  yᵢ = −1

  Note that we can generalize these rules to an arbitrarily large constant (i.e., C instead of 1), but we set the constant to 1 for mathematical convenience.
Support Vector Machine (SVM)
● Our decision rule: yᵢ(w·xᵢ + b) ≥ 1
● For the samples closest to the decision boundary (i.e., the support vectors), the constraint is tight:
  yᵢ(w·xᵢ + b) = 1
Support Vector Machine (SVM)
● Let x⁺ and x⁻ be a positive and a negative support vector.
● Quantifying the margin: project the difference between the positive and negative support vectors onto the unit normal of the decision plane,
  margin = (x⁺ − x⁻) · w / ||w||
Support Vector Machine (SVM)
● Quantifying the margin: by the definition of support vectors, w·x⁺ + b = +1 and w·x⁻ + b = −1, so
  margin = (x⁺ − x⁻) · w / ||w|| = ((1 − b) − (−1 − b)) / ||w|| = 2 / ||w||
Support Vector Machine (SVM)
● Problem of maximizing the margin:
  max 2 / ||w||  subject to  yᵢ(w·xᵢ + b) ≥ 1 for all i
● Learning objective (max-margin classifier; SVM):
  min ½ ||w||²  subject to  yᵢ(w·xᵢ + b) ≥ 1 for all i
Support Vector Machine (SVM)
● Lagrangian formulation
○ Incorporate the constraints using Lagrange multipliers αᵢ ≥ 0:
  L(w, b, α) = ½ ||w||² − Σᵢ αᵢ [yᵢ(w·xᵢ + b) − 1]

What conditions should the optimal w, b satisfy?
Support Vector Machine (SVM)
● From the optimality condition ∂L/∂w = 0:
  w = Σᵢ αᵢ yᵢ xᵢ
  → the (optimal) decision boundary is computed by a linear combination of the data!
Support Vector Machine (SVM)
● From the optimality condition ∂L/∂b = 0:
  Σᵢ αᵢ yᵢ = 0
Support Vector Machine (SVM)
● From the optimality conditions: w = Σᵢ αᵢ yᵢ xᵢ and Σᵢ αᵢ yᵢ = 0.

Rewrite the original objective by exploiting these conditions:
  L = ½ ||w||² − Σᵢ αᵢ [yᵢ(w·xᵢ + b) − 1]
    = Σᵢ αᵢ − ½ Σᵢ Σⱼ αᵢ αⱼ yᵢ yⱼ (xᵢ·xⱼ)
Support Vector Machine (SVM)
● Dual form of the SVM objective:
  max over α of  Σᵢ αᵢ − ½ Σᵢ Σⱼ αᵢ αⱼ yᵢ yⱼ (xᵢ·xⱼ)  subject to  αᵢ ≥ 0 and Σᵢ αᵢ yᵢ = 0

● This is a convex quadratic program (quadratic objective, linear constraints).
● We can find the solution using any Quadratic Programming (QP) solver, as sketched below.
● The obtained solution is the global optimum (no local optima), so the optimality of the solution is always guaranteed.
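Since the dual is a small quadratic program, it can be handed to a generic QP solver. Below is a minimal sketch for the hard-margin, linearly separable case using the cvxopt package (an assumption; any QP solver works), with an illustrative function name and support-vector threshold.

```python
import numpy as np
from cvxopt import matrix, solvers

def fit_linear_svm_dual(X, y):
    """X: (n, d) features; y: (n,) labels in {-1, +1}, assumed linearly separable."""
    n = X.shape[0]
    y = y.astype(float)
    K = X @ X.T                              # Gram matrix of inner products x_i . x_j
    P = matrix(np.outer(y, y) * K)           # quadratic term  y_i y_j (x_i . x_j)
    q = matrix(-np.ones(n))                  # maximize sum(alpha)  ->  minimize -sum(alpha)
    G = matrix(-np.eye(n))                   # -alpha_i <= 0,  i.e.  alpha_i >= 0
    h = matrix(np.zeros(n))
    A = matrix(y.reshape(1, -1))             # equality constraint  sum_i alpha_i y_i = 0
    b = matrix(0.0)
    alpha = np.ravel(solvers.qp(P, q, G, h, A, b)["x"])

    w = (alpha * y) @ X                      # w = sum_i alpha_i y_i x_i
    sv = alpha > 1e-6                        # support vectors: alpha_i > 0
    bias = np.mean(y[sv] - X[sv] @ w)        # average over support vectors for stability
    return w, bias, alpha
```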
Support Vector Machine (SVM)
● Parameters for the max-margin hyperplane
○ Weight coefficient: w = Σᵢ αᵢ yᵢ xᵢ
  ■ Any data point with αᵢ = 0 will not contribute to w.
  ■ It turns out that only the support vectors have αᵢ > 0.
○ Bias parameter: b = yᵢ − w·xᵢ for any support vector xᵢ
  ■ This follows from the fact that yᵢ(w·xᵢ + b) = 1 for all support vectors.
  ■ We usually take the average over all support vectors for numerical stability.
Support Vector Machine (SVM)
● Testing: classify a new sample x by the sign of the decision function,
  f(x) = sign(w·x + b) = sign(Σᵢ αᵢ yᵢ (xᵢ·x) + b),
  where the sum effectively runs over the support vectors only (see the sketch below).
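As a quick check of this view of testing, the following scikit-learn sketch (library availability assumed; data is synthetic) recomputes the decision value from the fitted model's support vectors and the stored products αᵢyᵢ (dual_coef_), and compares it with decision_function.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.datasets import make_blobs

X, y = make_blobs(n_samples=60, centers=2, random_state=0)
clf = SVC(kernel="linear", C=1e6).fit(X, y)     # very large C approximates the hard margin

x_test = X[:1]
# dual_coef_ holds alpha_i * y_i for each support vector
manual = (clf.dual_coef_ @ (clf.support_vectors_ @ x_test.T) + clf.intercept_).item()
print(np.isclose(manual, clf.decision_function(x_test)[0]))   # True
print(clf.predict(x_test))                                    # predicted class label
```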
Linear SVM, Non-separable case
● Soft margin
○ Introduce slack variables ξᵢ ≥ 0.
○ Allow a training example to lie within the margin or even on the wrong side of the linear separator.

● New objective function with slack variables:
  min ½ ||w||² + C Σᵢ ξᵢ  subject to  yᵢ(w·xᵢ + b) ≥ 1 − ξᵢ and ξᵢ ≥ 0,
  where C weighs the margin against the slack penalty (see the sketch below).
Non-linear SVM
● So far, we have assumed that our data is (almost) linearly separable.
● What if our data is not linearly separable?

Non-linear SVM
● We want to map the data from the original input space to some higher-dimensional space where separating the training data with a linear classifier is much easier.

[Figure: projection of the data to a higher-dimensional space, followed by a linear SVM]
Non-linear SVM

How do we design the mapping φ such that it maps the data to a linearly separable space? (See the toy example below.)
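As a toy illustration (NumPy and scikit-learn assumed; the data and the hand-picked feature map are for illustration only), 1-D data that is not linearly separable becomes separable after lifting it with φ(x) = (x, x²):

```python
import numpy as np
from sklearn.svm import SVC

# Class 0 lies near the origin, class 1 lies far from it: no single threshold on x separates them.
x = np.concatenate([np.random.uniform(-1, 1, 50),
                    np.random.uniform(2, 3, 25),
                    np.random.uniform(-3, -2, 25)])
y = np.array([0] * 50 + [1] * 50)

X_lifted = np.stack([x, x ** 2], axis=1)        # phi(x) = (x, x^2)
clf = SVC(kernel="linear").fit(X_lifted, y)
print(clf.score(X_lifted, y))                   # 1.0: separable in the lifted space
```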
Kernel method in SVM
● Kernel trick: define a kernel as a dot product between the mapped features,
  k(xᵢ, xⱼ) = φ(xᵢ)·φ(xⱼ)
● The SVM decision function can then be written as:
  f(x) = sign(Σᵢ αᵢ yᵢ k(xᵢ, x) + b)
● This allows us to just define the kernel k without knowing the explicit form of the mapping function φ!
Widely-used kernels
● Linear kernel: k(x, y) = x·y
● Polynomial kernel: k(x, y) = (x·y + c)^d
● Gaussian (Radial Basis Function, RBF) kernel: k(x, y) = exp(−||x − y||² / 2σ²)
● Histogram intersection kernel: k(x, y) = Σᵢ min(xᵢ, yᵢ)
● And many others... (see the sketches below)
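For concreteness, the kernels above can be written as plain functions of two feature vectors; this NumPy sketch uses common parameterizations (the names c, d, and sigma and their default values are illustrative and vary across libraries).

```python
import numpy as np

def linear_kernel(x, y):
    return x @ y                                              # <x, y>

def polynomial_kernel(x, y, c=1.0, d=3):
    return (x @ y + c) ** d                                   # (<x, y> + c)^d

def rbf_kernel(x, y, sigma=1.0):
    return np.exp(-np.sum((x - y) ** 2) / (2 * sigma ** 2))   # exp(-||x - y||^2 / 2 sigma^2)

def histogram_intersection_kernel(x, y):
    return np.sum(np.minimum(x, y))                           # sum_i min(x_i, y_i), for histogram features
```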
Non-linear SVM
● Optimizing the SVM objective with a kernel: the dual depends on the data only through inner products!

❏ Dual form of linear SVM:
  max over α of  Σᵢ αᵢ − ½ Σᵢ Σⱼ αᵢ αⱼ yᵢ yⱼ (xᵢ·xⱼ)
❏ SVM with kernel: replace the inner product with the kernel,
  max over α of  Σᵢ αᵢ − ½ Σᵢ Σⱼ αᵢ αⱼ yᵢ yⱼ k(xᵢ, xⱼ)

● The same optimization techniques can be used to solve the kernel SVM.
● We only need to know the kernel k; we don't have to know the mapping φ explicitly (see the sketch below).
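A scikit-learn sketch (library assumed; the data, kernel width, and helper name are illustrative) that makes this point operational: the SVM is trained on a precomputed Gram matrix, so only kernel values are ever supplied and the mapping φ never appears.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.datasets import make_moons

X, y = make_moons(n_samples=200, noise=0.1, random_state=0)   # not linearly separable

def rbf_gram(A, B, sigma=0.5):
    # k(a, b) = exp(-||a - b||^2 / (2 sigma^2)) for every pair of rows of A and B
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * sigma ** 2))

clf = SVC(kernel="precomputed").fit(rbf_gram(X, X), y)
print(clf.score(rbf_gram(X, X), y))    # at test time, pass k(test points, training points)
```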
Non-linear SVM
● Linear vs. non-linear SVM
Hyper-parameter tuning
● Hyper-parameters
○ The weight C on the sum of slack variables in the soft-margin SVM
○ Kernel parameters (e.g., the polynomial degree d or the RBF width σ)

● How do we select appropriate values for the hyper-parameters?
Cross-validation
● A naïve approach
○ Select hyper-parameter values that minimize the training error.
○ OVERFITTING!

● A better approach: cross-validation (see the sketch below)
1. Divide the training dataset into K parts.
2. Set aside one of the parts for validation.
3. Learn SVMs with the remaining K−1 parts, varying the hyper-parameters.
4. Evaluate the errors of the learned models on the validation set.
5. Repeat steps 2–4 and compute the mean error per hyper-parameter set.
6. Select the hyper-parameter set with the lowest mean error.
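A compact sketch of this procedure with scikit-learn (assumed installed), tuning the slack weight C and the RBF parameter gamma by 5-fold cross-validation; the synthetic data and grid values are illustrative.

```python
from sklearn.svm import SVC
from sklearn.model_selection import GridSearchCV
from sklearn.datasets import make_moons

X_train, y_train = make_moons(n_samples=300, noise=0.2, random_state=0)

param_grid = {"C": [0.1, 1, 10, 100], "gamma": [0.01, 0.1, 1, 10]}
search = GridSearchCV(SVC(kernel="rbf"), param_grid, cv=5)   # 5-fold cross-validation
search.fit(X_train, y_train)

print(search.best_params_)   # hyper-parameter set with the lowest mean validation error
print(search.best_score_)    # its mean validation accuracy
```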
Multi-class SVM
● One-versus-all
○ Training: learn an SVM for each class vs. all the others.
○ Testing: apply each SVM to the test example and assign the class of the SVM that returns the highest decision value.

● One-versus-one
○ Training: learn an SVM for each pair of classes.
○ Testing: each learned SVM "votes" for a class to assign to the test example.

Both strategies are sketched below.
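The wrappers in scikit-learn (assumed installed) implement both strategies directly; the toy data and the choice of LinearSVC as the base binary classifier are illustrative.

```python
from sklearn.svm import LinearSVC
from sklearn.multiclass import OneVsRestClassifier, OneVsOneClassifier
from sklearn.datasets import make_blobs

X, y = make_blobs(n_samples=150, centers=3, random_state=0)   # 3-class toy problem

ovr = OneVsRestClassifier(LinearSVC()).fit(X, y)   # one SVM per class vs. the rest
ovo = OneVsOneClassifier(LinearSVC()).fit(X, y)    # one SVM per pair of classes

print(len(ovr.estimators_), len(ovo.estimators_))  # 3 binary SVMs vs. 3 = C(3,2) pairwise SVMs
print(ovr.predict(X[:5]), ovo.predict(X[:5]))
```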
SVM Resources
● References
○ C. Cortes and V. Vapnik, Support-vector networks, Machine Learning 20(3): 273–297, 1995.
○ N. Cristianini and J. Shawe-Taylor, An Introduction to Support Vector Machines and Other Kernel-based Learning Methods, Cambridge University Press, Cambridge, 2000.
○ B. Schölkopf and A. Smola, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond, MIT Press, 2002.

● Libraries and software packages


○ LIBSVM: http://www.csie.ntu.edu.tw/~cjlin/libsvm/
○ LIBLINEAR: http://www.csie.ntu.edu.tw/~cjlin/liblinear/
○ SVM light: http://svmlight.joachims.org/
Questions?
Review: image classification pipeline

[Figure: image → feature extractor → classifier]
Next
● Introduction to neural networks
