
Machine Learning and Network Analysis
MA4207
Looking back at ML Algorithms
Earlier methods (pre-1980s)
◼ Linear decision boundary
◼ Based on theoretical properties
1980-90
◼ Neural networks, decision trees, ensemble learning
◼ Almost no theoretical basis
Post-1990
◼ Nonlinear functions based on computational theory
◼ Theoretical properties
Main Ideas of current ML Algorithms

◼ Efficient separability using non-linear regions
◼ Use of kernel functions
◼ Dot-product-based similarity measures (see the kernel sketch below)
◼ Quadratic optimization for a global minimum
◼ New methods use more optimization and less greedy search
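To make the dot-product idea concrete, here is a minimal numpy sketch (my own illustration, not from the slides): a linear kernel is just the Gram matrix of pairwise dot products, and an RBF kernel is a common nonlinear similarity built from the same quantities.

```python
import numpy as np

def linear_kernel(X, Z):
    """Gram matrix of pairwise dot products: K[i, j] = x_i . z_j."""
    return X @ Z.T

def rbf_kernel(X, Z, gamma=1.0):
    """RBF (Gaussian) kernel: K[i, j] = exp(-gamma * ||x_i - z_j||^2)."""
    sq_dists = (
        np.sum(X**2, axis=1)[:, None]
        + np.sum(Z**2, axis=1)[None, :]
        - 2.0 * X @ Z.T
    )
    return np.exp(-gamma * sq_dists)

X = np.array([[0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])
print(linear_kernel(X, X))  # dot-product similarities
print(rbf_kernel(X, X))     # nonlinear similarities in an implicit feature space
```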
Support Vector Machine

◼ Optimal hyperplane for linearly separable patterns
◼ Use of kernel functions for a non-linear decision plane
◼ Maximum-margin classifier
◼ Extendable to multi-class problems (a usage sketch follows below)
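As a quick usage sketch (my own, assuming scikit-learn is available; the slides do not prescribe a library), a linear SVM can be fit in a few lines. With a large C it approximates the hard-margin classifier discussed here, and its support vectors are exposed directly.

```python
import numpy as np
from sklearn.svm import SVC

# Toy linearly separable data: two clusters in 2-D.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(loc=[-2, -2], size=(20, 2)),
               rng.normal(loc=[+2, +2], size=(20, 2))])
y = np.array([-1] * 20 + [+1] * 20)

# Large C approximates the hard-margin SVM discussed in these slides.
clf = SVC(kernel="linear", C=1e6).fit(X, y)

print("w =", clf.coef_[0], " b =", clf.intercept_[0])
print("support vectors:\n", clf.support_vectors_)   # the 'difficult' boundary points
print("predictions:", clf.predict([[0.5, 0.5], [-1.0, -3.0]]))
```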
Support Vectors
◼ Support vectors are the data points that lie closest to the decision surface (hyperplane)
◼ They are the data points most difficult to classify
◼ They are instrumental in determining the optimum location of the decision surface
◼ It can be shown that the optimal hyperplane stems from the function class with the lowest "capacity" (number of independent features/parameters)
Maximum Margin Decision Plane
◼ A "skinny" decision plane (small margin) is able to adopt many orientations
◼ A "fat" one (large margin) has limited flexibility
◼ Larger margin ⇒ lower capacity
◼ If the margin is sufficiently large, the complexity of the function will be low even if the dimensionality is very high!
Linearly separable data, binary classification
◼ In general there are lots of possible solutions (an infinite number!)
◼ The Support Vector Machine finds an optimal solution
Linear SVM
◼ SVMs maximize the margin around the separating hyperplane.
◼ The decision function is fully specified by a (usually very small) subset of the training samples: the support vectors.
◼ Training reduces to a quadratic programming problem that is easy to solve by standard methods.
Linear SVM
◼ Find a, b, c such that
  ax + by ≥ c for red points and
  ax + by ≤ c (or < c) for green points.
◼ There are lots of possible solutions for a, b, c.
◼ Some methods find a separating hyperplane, but not the optimal one (e.g., a neural net); see the comparison sketch below.
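To illustrate the last bullet, the following sketch (my own, assuming scikit-learn) fits both a perceptron and a linear SVM on the same separable data and compares their geometric margins $\min_i y_i(w \cdot x_i + b)/\|w\|$; both separate the data, but the SVM's margin is typically the larger one.

```python
import numpy as np
from sklearn.linear_model import Perceptron
from sklearn.svm import SVC

rng = np.random.default_rng(1)
X = np.vstack([rng.normal([-2, -2], 0.5, (30, 2)),
               rng.normal([+2, +2], 0.5, (30, 2))])
y = np.array([-1] * 30 + [+1] * 30)

def geometric_margin(w, b, X, y):
    """Smallest signed distance of any point to the hyperplane w.x + b = 0."""
    return np.min(y * (X @ w + b)) / np.linalg.norm(w)

perc = Perceptron().fit(X, y)
svm = SVC(kernel="linear", C=1e6).fit(X, y)   # large C ~ hard margin

print("perceptron margin:", geometric_margin(perc.coef_[0], perc.intercept_[0], X, y))
print("SVM margin:       ", geometric_margin(svm.coef_[0], svm.intercept_[0], X, y))
```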
Which points should influence optimality?
◼ All points?
◼ Only the "difficult" points close to the decision boundary (the support vectors)
◼ The support vectors decide the location of the hyperplane
◼ Optimization techniques are used to find the optimum hyperplane
Linear SVM
Support vectors: input vectors that just touch the boundary of the margin (the "street"). In the original figure there are three of them, v1, v2, v3 (or, rather, the 'tips' of the vectors), lying on the planes $w^T x + b = +1$ or $w^T x + b = -1$; d denotes half of the street 'width'.
Linear SVM
Define the hyperplanes H such that:
$w \cdot x_i + b \ge +1$ when $y_i = +1$
$w \cdot x_i + b \le -1$ when $y_i = -1$
H1 and H2 are the planes:
H1: $w \cdot x_i + b = +1$
H2: $w \cdot x_i + b = -1$
The points on the planes H1 and H2 are the tips of the support vectors.
The plane H0 is the median in between, where $w \cdot x_i + b = 0$.
$d_+$ = the shortest distance to the closest positive point
$d_-$ = the shortest distance to the closest negative point
The margin (gutter) of a separating hyperplane is $d_+ + d_-$.
Linear SVM
The classifier should separate the data with the biggest margin.

The distance between a point $x$ and the plane $(w, b)$ is
$$ d(x; w, b) = \frac{|w \cdot x + b|}{\|w\|} $$

The optimal hyperplane has infinitely many equivalent parameterizations (any rescaling of $(w, b)$ describes the same plane); choose the scaling such that the discriminant function becomes 1 for the training examples closest to the boundary:
$$ |w \cdot x + b| = 1 \quad \text{for the closest points (the canonical hyperplane)} $$

The distance between H1 and H0 is then $1/\|w\|$, and the margin becomes
$$ d_+ + d_- = \frac{2}{\|w\|} $$
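A small numpy sketch (my own illustration) of these formulas, using a hypothetical canonically scaled hyperplane: the distance of each point is $|w \cdot x + b|/\|w\|$ and the margin is $2/\|w\|$.

```python
import numpy as np

def distance_to_hyperplane(X, w, b):
    """Unsigned distance of each row of X to the plane w.x + b = 0."""
    return np.abs(X @ w + b) / np.linalg.norm(w)

# A hypothetical canonical hyperplane: |w.x + b| = 1 at the closest points.
w = np.array([1.0, 1.0])
b = -1.0
X = np.array([[1.0, 1.0],    # w.x + b = +1  -> lies on H1
              [0.0, 0.0],    # w.x + b = -1  -> lies on H2
              [2.0, 2.0]])   # an interior point of the +1 class

print(distance_to_hyperplane(X, w, b))              # [0.707..., 0.707..., 2.121...]
print("margin = 2/||w|| =", 2 / np.linalg.norm(w))  # 1.414...
```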


Linear SVM
In order to maximize the margin, we thus need to minimize $\|w\|$, with the condition that there are no data points between H1 and H2:
$w^T x_i + b \ge +1$ for points with $y_i = +1$
$w^T x_i + b \le -1$ for points with $y_i = -1$
$\Rightarrow \; y_i(w^T x_i + b) \ge 1$ for all $i$

Constrained optimization problem:
$$ \min_{w,b} \; J(w) = \tfrac{1}{2}\|w\|^2 \quad \text{subject to} \quad y_i(w^T x_i + b) \ge 1, \; i = 1, \dots, N $$

$J(w)$ is a quadratic function whose surface is a paraboloid with a single global minimum (an improvement over neural networks, which can get stuck in local minima).
It is solved using classical Lagrangian optimization; a direct-solver sketch follows below.
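As promised above, here is a minimal sketch (my own, assuming scipy and a toy separable dataset) that feeds this constrained quadratic program to a general-purpose solver; real SVM implementations use specialized QP or dual solvers instead.

```python
import numpy as np
from scipy.optimize import minimize

# Toy separable data in 2-D.
X = np.array([[2.0, 2.0], [2.5, 1.5], [3.0, 3.0],
              [0.0, 0.0], [-0.5, 1.0], [0.5, -1.0]])
y = np.array([+1, +1, +1, -1, -1, -1])

# Decision variables z = (w1, w2, b).
def objective(z):
    w = z[:2]
    return 0.5 * w @ w                      # J(w) = 1/2 ||w||^2

def margin_constraints(z):
    w, b = z[:2], z[2]
    return y * (X @ w + b) - 1.0            # must be >= 0: y_i(w.x_i + b) >= 1

res = minimize(objective,
               x0=np.zeros(3),
               method="SLSQP",
               constraints=[{"type": "ineq", "fun": margin_constraints}])

w, b = res.x[:2], res.x[2]
print("w =", w, " b =", b, " margin =", 2 / np.linalg.norm(w))
```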
Kuhn-Tucker (KKT) Theorem

At the optimum, the multipliers satisfy $\alpha_i \ge 0$ together with the complementary slackness condition
$$ \alpha_i \left[ y_i(w^T x_i + b) - 1 \right] = 0 \quad \forall i $$
so for active constraints $\alpha_i \ge 0$ (and may be positive), while for inactive constraints $\alpha_i = 0$.

The KKT conditions allow us to identify the training examples that define the largest-margin hyperplane.
Linear SVM
Constrained minimization of $J(w)$ is solved by introducing the Lagrangian
$$ L_P(w, b, \alpha) = \tfrac{1}{2}\|w\|^2 - \sum_{i=1}^{N} \alpha_i \left[ y_i(w^T x_i + b) - 1 \right], \qquad \alpha_i \ge 0 $$

This yields an unconstrained optimization problem that is solved by:
• minimizing $L_P$ w.r.t. the primal variables $w$ and $b$, and
• maximizing $L_P$ w.r.t. the dual variables $\alpha_i \ge 0$ (the Lagrange multipliers).
Thus, the optimum is defined by a saddle point.
This is known as the Lagrangian primal problem.
Linear SVM
◼ To simplify the primal problem, we eliminate the primal variables $(w, b)$: differentiating $L_P(w, b, \alpha)$ with respect to $w$ and $b$ and setting the derivatives to zero yields
$$ \frac{\partial L_P}{\partial w} = 0 \;\Rightarrow\; w = \sum_{i} \alpha_i y_i x_i, \qquad \frac{\partial L_P}{\partial b} = 0 \;\Rightarrow\; \sum_{i} \alpha_i y_i = 0 $$
◼ Expansion of $L_P$ yields
$$ L_P = \tfrac{1}{2} w^T w - \sum_i \alpha_i y_i w^T x_i - b \sum_i \alpha_i y_i + \sum_i \alpha_i $$
◼ Using the optimality condition $\partial L_P/\partial w = 0$, the first term in $L_P$ can be expressed as
$$ \tfrac{1}{2} w^T w = \tfrac{1}{2} \sum_i \sum_j \alpha_i \alpha_j y_i y_j \, x_i^T x_j $$
◼ The second term in $L_P$ can be expressed in the same way:
$$ \sum_i \alpha_i y_i w^T x_i = \sum_i \sum_j \alpha_i \alpha_j y_i y_j \, x_i^T x_j $$
◼ The third term in $L_P$ is zero by virtue of the optimality condition $\partial L_P/\partial b = 0$.
Linear SVM
Merging all the expressions together we get
$$ L_D(\alpha) = \sum_i \alpha_i - \tfrac{1}{2} \sum_i \sum_j \alpha_i \alpha_j y_i y_j \, x_i^T x_j $$
subject to the constraints
$$ \sum_i \alpha_i y_i = 0, \qquad \alpha_i \ge 0 \;\; \forall i $$
Lagrangian dual problem ($L_P \Rightarrow L_D$)
◼ This transforms the problem of finding a saddle point of $L_P(w, b, \alpha)$ into the easier one of maximizing $L_D(\alpha)$.
◼ $L_D(\alpha)$ depends only on the Lagrange multipliers $\alpha$, not on $(w, b)$.
◼ The primal problem scales with dimensionality ($w$ has one coefficient for each dimension), whereas the dual problem scales with the amount of training data (there is one Lagrange multiplier per example).
◼ In $L_D(\alpha)$, the training data appears only through dot products $x_i^T x_j$.
◼ This property can be cleverly exploited to perform the classification in a higher (even infinite) dimensional space; a dual-solver sketch follows below.
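As referenced above, here is a minimal sketch (my own, assuming scipy and a toy dataset) that maximizes $L_D(\alpha)$ numerically and then recovers $w$ and $b$; note that the matrix of dot products $x_i^T x_j$ is the only place the data enters, so it could be replaced by any kernel matrix.

```python
import numpy as np
from scipy.optimize import minimize

# Toy linearly separable data.
X = np.array([[2.0, 2.0], [2.5, 1.5], [3.0, 3.0],
              [0.0, 0.0], [-0.5, 1.0], [0.5, -1.0]])
y = np.array([+1.0, +1.0, +1.0, -1.0, -1.0, -1.0])
N = len(y)

G = (y[:, None] * X) @ (y[:, None] * X).T     # G[i, j] = y_i y_j x_i.x_j (dot products only)

def neg_dual(alpha):                          # minimize -L_D(alpha)
    return 0.5 * alpha @ G @ alpha - alpha.sum()

res = minimize(neg_dual,
               x0=np.zeros(N),
               method="SLSQP",
               bounds=[(0.0, None)] * N,                              # alpha_i >= 0
               constraints=[{"type": "eq", "fun": lambda a: a @ y}])  # sum_i alpha_i y_i = 0

alpha = res.x
sv = alpha > 1e-6                              # support vectors: alpha_i > 0
w = (alpha * y) @ X                            # w = sum_i alpha_i y_i x_i
b = np.mean(y[sv] - X[sv] @ w)                 # from the KKT condition on the support vectors

print("alpha =", np.round(alpha, 4))
print("support vectors:\n", X[sv])
print("w =", w, " b =", b)
```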
Linear SVM
The KKT complementary condition states that, for every point in the training set, the following equality must hold:
$$ \alpha_i \left[ y_i(w^T x_i + b) - 1 \right] = 0 \quad \forall i $$
Therefore, for every example, either $\alpha_i = 0$ or $y_i(w^T x_i + b) - 1 = 0$ must hold.

Those points for which $\alpha_i > 0$ must then lie on one of the two hyperplanes that define the largest margin (the term $y_i(w^T x_i + b) - 1$ becomes zero only on these hyperplanes). These points are known as the support vectors. All the other points must have $\alpha_i = 0$.

The support vectors contribute to defining the optimal hyperplane:
$$ w = \sum_{i \in SV} \alpha_i y_i x_i $$

The bias term $b$ is found from the KKT complementary condition on the support vectors (e.g., $b = y_s - w^T x_s$ for any support vector $x_s$). Hence the complete dataset could be replaced by only the support vectors, and the separating hyperplane would be the same (see the check below).
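A quick empirical check of that last claim (my own sketch, assuming scikit-learn, separable data, and a large C to approximate the hard margin): refitting on the support vectors alone should reproduce essentially the same hyperplane.

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(2)
X = np.vstack([rng.normal([-2, -2], 0.5, (50, 2)),
               rng.normal([+2, +2], 0.5, (50, 2))])
y = np.array([-1] * 50 + [+1] * 50)

full = SVC(kernel="linear", C=1e6).fit(X, y)

# Keep only the support vectors and refit on them alone.
sv_idx = full.support_
reduced = SVC(kernel="linear", C=1e6).fit(X[sv_idx], y[sv_idx])

print("w (full)    :", full.coef_[0],    " b:", full.intercept_[0])
print("w (SVs only):", reduced.coef_[0], " b:", reduced.intercept_[0])
```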
Non-Linear SVM

[The remaining four slides are figures only: they illustrate how a kernel function maps the data into a higher-dimensional feature space in which a linear separating hyperplane can be found.]
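As a closing illustration of the non-linear case (my own sketch, assuming scikit-learn; the gamma value is an arbitrary choice), an RBF-kernel SVM separates data that no hyperplane in the original input space can:

```python
from sklearn.datasets import make_circles
from sklearn.svm import SVC

# Concentric circles: not linearly separable in the original 2-D space.
X, y = make_circles(n_samples=200, noise=0.05, factor=0.4, random_state=0)

linear = SVC(kernel="linear").fit(X, y)
rbf = SVC(kernel="rbf", gamma=2.0).fit(X, y)

print("training accuracy, linear kernel:", linear.score(X, y))  # roughly chance level
print("training accuracy, RBF kernel:   ", rbf.score(X, y))     # close to 1.0
```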
