Support Vector Machine (SVM) & Kernel Functions

Reference [1], Chapter 7, Section 7.3
Reference [3], Chapter 6, page 292

[1]: Flach, P. (2012). Machine Learning: The Art and Science of Algorithms that Make Sense of Data. Cambridge University Press.
[3]: Bishop, C. M. (2006). Pattern Recognition and Machine Learning. New York: Springer-Verlag.
Support Vector Machine (SVM)
The Support Vector Machine (SVM) is a supervised learning algorithm mostly used for classification, but it can also be used for regression.
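
For orientation, here is a minimal sketch of both uses. It assumes the scikit-learn library (not referenced in the slides) and hypothetical toy data; it is an illustration, not part of the lecture material.

```python
# Minimal sketch (assumes scikit-learn): the same SVM family provides
# SVC for classification and SVR for regression.
from sklearn.svm import SVC, SVR

X = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0], [3.0, 3.0]]  # toy feature vectors
y_class = [0, 0, 1, 1]                                 # class labels
y_reg = [0.1, 0.9, 2.1, 2.9]                           # continuous targets

clf = SVC(kernel="linear").fit(X, y_class)  # classification
reg = SVR(kernel="linear").fit(X, y_reg)    # regression

print(clf.predict([[2.5, 2.5]]))  # predicted class, e.g. [1]
print(reg.predict([[2.5, 2.5]]))  # predicted continuous value
```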

Idea behind SVM

SVM is based on the idea of finding a hyperplane that best separates the data points into different classes.
SVM: Some Important Points
The main idea is that, based on the labeled training data, the algorithm tries to find the optimal hyperplane which can be used to classify new data points.

In two dimensions the hyperplane is a simple line.

Usually a learning algorithm tries to learn the most common characteristics of a class (what differentiates one class from another), and the classification is based on those representative characteristics (so classification is based on the differences between classes). The SVM works the other way around.

It finds the most similar examples between the classes. Those will be the support vectors.
Example
As an example, let's consider two classes, apples and lemons.

Other algorithms will learn the most evident, most representative characteristics of apples and lemons: apples are green and rounded, while lemons are yellow and elliptic.

In contrast, SVM will search for apples that are very similar to lemons, for example apples which are yellow and have an elliptic form. This will be a support vector. The other support vector will be a lemon that is similar to an apple (green and rounded).

So other algorithms learn the differences, while SVM learns the similarities.


If we visualize the example above in 2D, we will have something like this:
As we go from left to right, all the examples will be classified as apples until we reach the yellow apple. From this point on, the confidence that a new example is an apple drops, while the confidence for the lemon class increases. When the lemon-class confidence becomes greater than the apple-class confidence, new examples will be classified as lemons (somewhere between the yellow apple and the green lemon).

Based on these support vectors, the algorithm tries to find the best hyperplane that
separates the classes.

In 2D the hyperplane is a line, so it would look like this:


The boundary can also be drawn like this:

As you can see, we have an infinite number of possibilities for drawing the decision boundary.

So how can we find the optimal one?
Finding the Optimal Hyperplane
Intuitively, the best line is the one that is farthest from both the apple and the lemon examples, i.e., the one with the largest margin.

To obtain the optimal solution, we have to maximize the margin on both sides (if we have multiple classes, then we have to maximize it considering each of the classes).
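
This intuition can be checked numerically. The sketch below is a hypothetical toy example using only NumPy: for a candidate line w·x + b = 0, the margin is the smallest value of y_i(w·x_i + b)/||w|| over the training points, and the best candidate is the one with the largest such value.

```python
# Hypothetical sketch: compare candidate separating lines by their margin.
import numpy as np

X = np.array([[1.0, 2.0], [2.0, 3.0],      # class +1 ("apples")
              [6.0, 1.0], [7.0, 2.0]])     # class -1 ("lemons")
y = np.array([1, 1, -1, -1])

def margin(w, b):
    """Smallest signed distance of any training point to the line w.x + b = 0."""
    return np.min(y * (X @ w + b) / np.linalg.norm(w))

candidates = {
    "line A": (np.array([-1.0, 0.0]), 3.0),   # vertical line x1 = 3
    "line B": (np.array([-1.0, 0.0]), 4.0),   # vertical line x1 = 4
    "line C": (np.array([-1.0, 0.5]), 2.5),   # tilted line
}
for name, (w, b) in candidates.items():
    print(name, "margin =", round(margin(w, b), 3))
# The candidate with the largest (positive) margin separates the classes best.
```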
Use Cases: Finding the Optimal Hyperplane
Identifying the Right Hyperplane (Scenario I)
Here, we have three hyperplanes (A, B, and C).

Now, identify the right hyperplane to classify the stars and circles.

A rule of thumb for identifying the right hyperplane: "Select the hyperplane which segregates the two classes better."

In this scenario, hyperplane B does this job best.
Identifying the Right Hyperplane (Scenario II)
Here, we have three hyperplanes (A, B, and C), and all of them segregate the classes well.

Now, how can we identify the right hyperplane?

Maximizing the distance between the nearest data point (of either class) and the hyperplane will help us decide the right hyperplane. This distance is called the margin.

Looking at the diagram for this scenario, you can see that the margin for hyperplane C is larger than for both A and B. Hence, we choose C as the right hyperplane.

Another compelling reason for selecting the hyperplane with the larger margin is robustness: if we select a hyperplane with a small margin, there is a high chance of misclassification.
Identifying the Right Hyperplane (Scenario III)
Use the rules discussed in the previous slides to identify the right hyperplane.

Some of you may have selected hyperplane B, as it has a higher margin compared to A.

But here is the catch: SVM selects the hyperplane which classifies the classes accurately prior to maximizing the margin.

Here, hyperplane B has a classification error, while A has classified all points correctly.

Therefore, the right hyperplane is A.


Identifying the Right Hyperplane (Scenario IV)
Here we are unable to segregate the two classes using a straight line, as one of the stars lies in the territory of the other (circle) class as an outlier.

The SVM algorithm has a feature to ignore outliers and find the hyperplane with the maximum margin. Hence, we can say that SVM classification is robust to outliers.
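
The sketch below illustrates this behavior under the assumption that scikit-learn is used on hypothetical toy data; the soft-margin parameter C (not discussed in the slides) controls how strongly a single stray point can pull the boundary, so the outlier is simply treated as a margin violation.

```python
# Sketch (assumes scikit-learn): a soft-margin SVM tolerating a single outlier.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
circles = rng.normal(loc=[0, 0], scale=0.5, size=(20, 2))   # class 0 cluster
stars = rng.normal(loc=[4, 4], scale=0.5, size=(20, 2))     # class 1 cluster
outlier = np.array([[0.2, 0.2]])                            # a "star" inside the circles

X = np.vstack([circles, stars, outlier])
y = np.array([0] * 20 + [1] * 21)

# A moderate C lets the optimizer treat the outlier as a margin violation
# instead of bending the hyperplane around it.
clf = SVC(kernel="linear", C=1.0).fit(X, y)
print("number of support vectors per class:", clf.n_support_)
```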
Identifying the Right Hyperplane (Scenario V)
In this scenario, we cannot have a linear hyperplane between the two classes, so how does SVM classify them? Until now, we have only looked at linear hyperplanes.

SVM can solve this problem easily, by introducing an additional feature. Here, we will add a new feature z = x^2 + y^2.

Now, let's plot the data points on the x and z axes:

In the above plot, the points to consider are:
● All values of z will always be positive, because z is the sum of the squares of x and y.
● In the original plot, the red circles appear close to the origin of the x and y axes, leading to lower values of z, while the stars lie relatively far from the origin, resulting in higher values of z.

With this new feature, it is easy for the SVM classifier to place a linear hyperplane between these two classes.
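
A small sketch of this manual transformation, using hypothetical ring-shaped toy data and NumPy only: points near the origin get a small z, points far from the origin get a large z, and a simple threshold on z now separates the two classes.

```python
# Sketch: make circularly-arranged data linearly separable by adding z = x^2 + y^2.
import numpy as np

rng = np.random.default_rng(1)
angles = rng.uniform(0, 2 * np.pi, 30)

inner = np.c_[np.cos(angles), np.sin(angles)] * 1.0   # "red circles" near the origin
outer = np.c_[np.cos(angles), np.sin(angles)] * 3.0   # "stars" far from the origin

def add_z(points):
    """Append the new feature z = x^2 + y^2 to each 2D point."""
    z = (points ** 2).sum(axis=1, keepdims=True)
    return np.hstack([points, z])

print(add_z(inner)[:, 2].max())   # about 1: small z for the inner points
print(add_z(outer)[:, 2].min())   # about 9: large z for the outer points
# In the (x, z) plane a horizontal line, e.g. z = 5, now separates the classes.
```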
But another burning question arises: do we need to add this feature manually to obtain a hyperplane?

No, the SVM algorithm has a technique called the kernel trick.

The SVM kernel is a function that takes a low-dimensional input space and transforms it into a higher-dimensional space, i.e., it converts a non-separable problem into a separable problem. It is mostly useful in non-linear separation problems.
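
As a hedged illustration (again assuming scikit-learn and toy data), the same ring-shaped data can be separated with an RBF kernel without ever constructing the z feature by hand; the kernel computes the similarities in the higher-dimensional space implicitly.

```python
# Sketch (assumes scikit-learn): the kernel trick replaces manual feature engineering.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(2)
angles = rng.uniform(0, 2 * np.pi, 60)
radii = np.where(np.arange(60) < 30, 1.0, 3.0)        # inner ring vs outer ring
X = np.c_[radii * np.cos(angles), radii * np.sin(angles)]
y = (radii > 2).astype(int)

clf = SVC(kernel="rbf", gamma="scale").fit(X, y)      # no explicit z feature needed
print("training accuracy:", clf.score(X, y))
```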
Terminologies Used in SVM
The points closest to the hyperplane are called the support vectors, and the distance of these vectors from the hyperplane is called the margin.

The basic intuition to develop here is that the farther the support vectors are from the hyperplane, the higher the probability of correctly classifying the points in their respective classes.

Support vectors are critical in determining the hyperplane, because if the position of these vectors changes, the hyperplane's position is altered. Technically, this hyperplane can also be called the margin-maximizing hyperplane.
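
To make these terms concrete, the sketch below (assuming scikit-learn and hypothetical toy data) fits a linear SVM and reads off the support vectors and the margin width, which for a linear kernel equals 2/||w||.

```python
# Sketch (assumes scikit-learn): inspect support vectors and margin of a linear SVM.
import numpy as np
from sklearn.svm import SVC

X = np.array([[1, 2], [2, 3], [3, 3], [6, 1], [7, 2], [8, 3]], dtype=float)
y = np.array([1, 1, 1, -1, -1, -1])

clf = SVC(kernel="linear", C=1e3).fit(X, y)   # large C approximates a hard margin

w = clf.coef_[0]
print("support vectors:\n", clf.support_vectors_)
print("margin width = 2/||w|| =", 2 / np.linalg.norm(w))
```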
Hyperplane (Decision Surface)
The hyperplane is the function which is used to differentiate between the classes.

In 2D, the function used to separate the classes is a line; in 3D, it is a plane; similarly, the function which separates the points in higher dimensions is called a hyperplane.

Now that you know about the hyperplane, let's move back to SVM.
Let's say there are m dimensions; then the equation of the hyperplane in m-dimensional space can be given as

    w · x + b = w1·x1 + w2·x2 + … + wm·xm + b = 0

where the wi are the components of the weight vector, the xi are the input variables, and b is the bias term.
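
As a tiny worked example with hypothetical numbers, the sign of w·x + b tells us on which side of the hyperplane a point x lies:

```python
# Sketch: classify a point by the sign of w.x + b for a hypothetical hyperplane.
import numpy as np

w = np.array([2.0, -1.0, 0.5])    # example weight vector (m = 3 dimensions)
b = -1.0                          # bias term
x = np.array([1.0, 0.5, 2.0])     # a point to classify

decision = np.dot(w, x) + b       # w1*x1 + w2*x2 + w3*x3 + b
print("decision value:", decision)
print("predicted class:", 1 if decision >= 0 else -1)
```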


Basic Steps
The basic steps of the SVM are:

1. select two parallel hyperplanes (in 2D, lines) which separate the data with no points between them (the red lines)
2. maximize their distance (the margin)
3. the average line (here, the line halfway between the two red lines) will be the decision boundary

This is very nice and easy, but finding the best margin is a non-trivial optimization problem (it is easy in 2D, when we have only two attributes, but what if we have N dimensions, with N a very big number?).

To solve this optimization problem, we use Lagrange multipliers, as sketched below.
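
The slides do not go into the optimization itself. As a hedged sketch (assuming scikit-learn and toy data), SVC solves this dual problem internally, and the resulting non-zero Lagrange multipliers, scaled by the labels, are exposed as dual_coef_, from which the weight vector can be reconstructed.

```python
# Sketch (assumes scikit-learn): the solver returns the non-zero Lagrange multipliers.
import numpy as np
from sklearn.svm import SVC

X = np.array([[1, 2], [2, 3], [3, 3], [6, 1], [7, 2], [8, 3]], dtype=float)
y = np.array([1, 1, 1, -1, -1, -1])

clf = SVC(kernel="linear", C=1e3).fit(X, y)

# dual_coef_ holds y_i * alpha_i for the support vectors only;
# all other training points have alpha_i = 0 and drop out of the solution.
print("support vector indices:", clf.support_)
print("y_i * alpha_i:", clf.dual_coef_[0])

# The weight vector is recovered as w = sum_i (y_i * alpha_i) * x_i.
print("w from duals:", clf.dual_coef_[0] @ clf.support_vectors_)
print("w from model:", clf.coef_[0])
```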
