0% found this document useful (0 votes)

13 views20 pages

Unit3 SVM

Support Vector Machine (SVM) is a supervised machine learning algorithm that identifies the optimal hyperplane for classifying data into two categories by maximizing the margin between them. Unlike logistic regression, which is probabilistic, SVM focuses on support vectors that are closest to the hyperplane and can handle both linear and non-linear data through the use of kernel functions. SVM can be implemented with hard and soft margins to accommodate misclassifications in real-world datasets.

Uploaded by

pra Bee In adhikari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views20 pages

Unit3 SVM

Uploaded by

pra Bee In adhikari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 20

Support Vector Machine(SVM)

It is a supervised machine learning problem where we try to find a hyperplane that best separates
the two classes.

Note: Don’t get confused between SVM and logistic regression. Both the algorithms try to find the
best hyperplane, but the main difference is logistic regression is a probabilistic approach whereas
support vector machine is based on statistical approaches.
Now the question is which hyperplane does it select? There can be an infinite number of
hyperplanes passing through a point and classifying the two classes perfectly. So, which one is the
best?

Well, SVM does this by finding the maximum margin between the hyperplanes that means
maximum distances between the two classes.

Logistic Regression vs Support Vector Machine (SVM)

Depending on the number of features you have you can either choose Logistic Regression or SVM.
SVM works best when the dataset is small and complex. It is usually advisable to first use logistic
regression and see how does it performs, if it fails to give a good accuracy you can go for SVM
without any kernel (will talk more about kernels in the later section). Logistic regression and SVM
without any kernel have similar performance but depending on your features, one may be more
efficient than the other.

Types of Support Vector Machine (SVM) Algorithms

• Linear SVM: When the data is perfectly linearly separable only then we can use Linear
SVM. Perfectly linearly separable means that the data points can be classified into 2 classes
by using a single straight line(if 2D).
• Non-Linear SVM: When the data is not linearly separable then we can use Non-Linear
SVM, which means when the data points cannot be separated into 2 classes by using a
straight line (if 2D) then we use some advanced techniques like kernel tricks to classify
them. In most real-world applications we do not find linearly separable datapoints hence we
use kernel trick to solve them.

Important Terms
Now let’s define two main terms which will be repeated again and again in this article:
• Support Vectors: These are the points that are closest to the hyperplane. A separating line
will be defined with the help of these data points.
• Margin: it is the distance between the hyperplane and the observations closest to the
hyperplane (support vectors). In SVM large margin is considered a good margin. There are
two types of margins hard margin and soft margin.
How Does Support Vector Machine Work?
SVM is defined such that it is defined in terms of the support vectors only, we don’t have to worry
about other observations since the margin is made using the points which are closest to the
hyperplane (support vectors), whereas in logistic regression the classifier is defined over all the
points. Hence SVM enjoys some natural speed-ups.
Let’s understand the working of SVM using an example. Suppose we have a dataset that has two
classes (green and blue). We want to classify that the new data point as either blue or green.

To classify these points, we can have many decision boundaries, but the question is which is the
best and how do we find it?
NOTE: Since we are plotting the data points in a 2-dimensional graph we call this decision
boundary a straight line but if we have more dimensions, we call this decision boundary a
“hyperplane”

The best hyperplane is that plane that has the maximum distance from both the classes, and this is
the main aim of SVM. This is done by finding different hyperplanes which classify the labels in the
best way then it will choose the one which is farthest from the data points or the one which has a
maximum margin.
Mathematical Intuition Behind Support Vector Machine
Many people skip the math intuition behind this algorithm because it is pretty hard to digest. Here
in this section, we’ll try to understand each and every step working under the hood. SVM is a broad
topic and people are still doing research on this algorithm. If you are planning to do research, then
this might not be the right place for you.
Here we will understand only that part that is required in implementing this algorithm. You must
have heard about the primal formulation, dual formulation, Lagranges multiplier etc.

Before getting into the nitty-gritty details of this topic first let’s understand what a dot product is.

Understanding Dot-Product
We all know that a vector is a quantity that has magnitude as well as direction and just like numbers
we can use mathematical operations such as addition, multiplication. In this section, we will try to
learn about the multiplication of vectors which can be done in two ways, dot product, and cross
product. The difference is only that the dot product is used to get a scalar value as a resultant
whereas cross-product is used to obtain a vector again.
The dot product can be defined as the projection of one vector along with another, multiply by the
product of another vector.

Image 2
Here a and b are 2 vectors, to find the dot product between these 2 vectors we first find the
magnitude of both the vectors and to find magnitude we use the Pythagorean theorem or the
distance formula.
After finding the magnitude we simply multiply it with the cosine angle between both the vectors.
Mathematically it can be written as:
A . B = |A| cosθ * |B|
Where |A| cosθ is the projection of A on B
And |B| is the magnitude of vector B
Now in SVM we just need the projection of A not the magnitude of B, I’ll tell you why later. To just
get the projection we can simply take the unit vector of B because it will be in the direction of B but
its magnitude will be 1. Hence now the equation becomes:
A.B = |A| cosθ * unit vector of B
Now let’s move to the next part and see how we will use this in SVM.

Use of Dot Product in SVM

Consider a random point X and we want to know whether it lies on the right side of the plane or the
left side of the plane (positive or negative).

To find this first we assume this point is a vector (X) and then we make a vector (w) which is
perpendicular to the hyperplane. Let’s say the distance of vector w from origin to decision boundary
is ‘c’. Now we take the projection of X vector on w.

We already know that projection of any vector or another vector is called dot-product. Hence, we
take the dot product of x and w vectors. If the dot product is greater than ‘c’ then we can say that the
point lies on the right side. If the dot product is less than ‘c’ then the point is on the left side and if
the dot product is equal to ‘c’ then the point lies on the decision boundary.
You must be having this doubt that why did we take this perpendicular vector w to the hyperplane?
So what we want is the distance of vector X from the decision boundary and there can be infinite
points on the boundary to measure the distance from. So that’s why we come to standard, we simply
take perpendicular and use it as a reference and then take projections of all the other data points on
this perpendicular vector and then compare the distance.
In SVM we also have a concept of margin. In the next section, we will see how we find the equation
of a hyperplane and what exactly do we need to optimize in SVM.

Margin in Support Vector Machine

We all know the equation of a hyperplane is w.x+b=0 where w is a vector normal to hyperplane and
b is an offset.
To classify a point as negative or positive we need to define a decision rule. We can define decision
rule as:

If the value of w.x+b>0 then we can say it is a positive point otherwise it is a negative point. Now
we need (w,b) such that the margin has a maximum distance. Let’s say this distance is ‘d’.

To calculate ‘d’ we need the equation of L1 and L2. For this, we will take few assumptions that the
equation of L1 is w.x+b=1 and for L2 it is w.x+b=-1.

Now the question comes

• Why the magnitude is equal, why didn’t we take 1 and -2?
• Why did we only take 1 and -1, why not any other value like 24 and -100?
• Why did we assume this line?

Let’s try to answer these questions

• We want our plane to have equal distance from both the classes that means L should pass
through the center of L1 and L2 that’s why we take magnitude equal.
• Let’s say the equation of our hyperplane is 2x+y=2, we observe that even if we multiply the
whole equation with some other number the line doesn’t change (try plotting on a graph).
Hence for mathematical convenience, we take it as 1.
• Now the main question is exactly why there’s a need to assume only this line? To answer
this, I’ll try to take the help of graphs.
Suppose the equation of our hyperplane is 2x+y=2:

Let’s create margin for this hyperplane,

If you multiply these equations by 10, we will see that the parallel line (red and green) gets closer to
our hyperplane. For more clarity look at this graph
(https://www.desmos.com/calculator/dvjo3vacyp)
We also observe that if we divide this equation by 10 then these parallel lines get bigger. Look at
this graph (https://www.desmos.com/calculator/15dbwehq9g).
By this I wanted to show you that the parallel lines depend on (w,b) of our hyperplane, if we
multiply the equation of hyperplane with a factor greater than 1 then the parallel lines will shrink
and if we multiply with a factor less than 1, they expand.
We can now say that these lines will move as we do changes in (w,b) and this is how this gets
optimized. But what is the optimization function? Let’s calculate it.
We know that the aim of SVM is to maximize this margin that means distance (d). But there are few
constraints for this distance (d). Let’s look at what these constraints are.
Optimization Function and its Constraints
In order to get our optimization function, there are few constraints to consider. That constraint is
that “We’ll calculate the distance (d) in such a way that no positive or negative point can cross
the margin line”. Let’s write these constraints mathematically:

Rather than taking 2 constraints forward, we’ll now try to simplify these two constraints into 1. We
assume that negative classes have y=-1 and positive classes have y=1.
We can say that for every point to be correctly classified this condition should always be true:

Suppose a green point is correctly classified that means it will follow w.x+b>=1, if we multiply this
with y=1 we get this same equation mentioned above. Similarly, if we do this with a red point with
y=-1 we will again get this equation. Hence, we can say that we need to maximize (d) such that this
constraint holds true.

We will take 2 support vectors, 1 from the negative class and 2nd from the positive class. The
distance between these two vectors x1 and x2 will be (x2-x1) vector. What we need is, the shortest
distance between these two points which can be found using a trick we used in the dot product. We
take a vector ‘w’ perpendicular to the hyperplane and then find the projection of (x2-x1) vector on
‘w’. Note: this perpendicular vector should be a unit vector then only this will work. Why this
should be a unit vector? This has been explained in the dot-product section. To make this ‘w’ a unit
vector we divide this with the norm of ‘w’.
Finding Projection of a Vector on Another Vector Using Dot Product
We already know how to find the projection of a vector on another vector. We do this by dot-
product of both vectors. So let’s see how

Since x2 and x1 are support vectors and they lie on the hyperplane, hence they will follow yi*
(2.x+b)=1 so we can write it as:

Putting equations (2) and (3) in equation (1) we get:

Hence the equation which we have to maximize is:

We have now found our optimization function but there is a catch here that we don’t find this type
of perfectly linearly separable data in the industry, there is hardly any case we get this type of data
and hence we fail to use this condition we proved here. The type of problem which we just studied
is called Hard Margin SVM now we shall study soft margin which is similar to this but there are
few more interesting tricks we use in Soft Margin SVM.

Soft Margin SVM

In real-life applications, we rarely encounter datasets that are perfectly linearly separable. Instead,
we often come across datasets that are either nearly linearly separable or entirely non-linearly
separable. Unfortunately, the trick demonstrated above for linearly separable datasets is not
applicable in these cases. This is where Support Vector Machines (SVM) come into play. These are
a powerful tool in machine learning that can effectively handle both almost linearly separable and
non-linearly separable datasets, providing a robust solution to classification problems in diverse
real-world scenarios.
To tackle this problem what we do is modify that equation in such a way that it allows few
misclassifications that means it allows few points to be wrongly classified.
We know that max[f(x)] can also be written as min[1/f(x)], it is common practice to minimize a cost
function for optimization problems; therefore, we can invert the function.

To make a soft margin equation we add 2 more terms to this equation which is zeta and multiply
that by a hyperparameter ‘c’

For all the correctly classified points our zeta will be equal to 0 and for all the incorrectly
classified points the zeta is simply the distance of that particular point from its correct hyperplane
that means if we see the wrongly classified green points the value of zeta will be the distance of
these points from L1 hyperplane and for wrongly classified redpoint zeta will be the distance of that
point from L2 hyperplane.
So now we can say that our that are SVM Error = Margin Error + Classification Error. The
higher the margin, the lower would-be margin error, and vice versa.
Let’s say you take a high value of ‘c’ =1000, this would mean that you don’t want to focus on
margin error and just want a model which doesn’t misclassify any data point.
Look at the figure below:

If someone asks you which is a better model, the one where the margin is maximum and has 2
misclassified points or the one where the margin is very less, and all the points are correctly
classified?
Well, there’s no correct answer to this question, but rather we can use SVM Error = Margin Error +
Classification Error to justify this. If you don’t want any misclassification in the model then you
can choose figure 2. That means we’ll increase ‘c’ to decrease Classification Error but if you want
that your margin should be maximized then the value of ‘c’ should be minimized. That’s why ‘c’ is
a hyperparameter and we find the optimal value of ‘c’ using GridsearchCV and cross-validation.

Kernels in Support Vector Machine

The most interesting feature of SVM is that it can even work with a non-linear dataset and for this,
we use “Kernel Trick” which makes it easier to classifies the points. Suppose we have a dataset like
this:
Here we see we cannot draw a single line or say hyperplane which can classify the points correctly.
So what we do is try converting this lower dimension space to a higher dimension space using some
quadratic functions which will allow us to find a decision boundary that clearly divides the data
points. These functions which help us do this are called Kernels and which kernel to use is purely
determined by hyperparameter tuning.

Different Kernel Functions

Some kernel functions which you can use in SVM are given below:

1. Polynomial Kernel
Following is the formula for the polynomial kernel:

Here d is the degree of the polynomial, which we need to specify manually.

Suppose we have two features X1 and X2 and output variable as Y, so using polynomial kernel we
can write it as:
So we basically need to find X12 , X22 and X1.X2, and now we can see that 2 dimensions got
converted into 5 dimensions.

2. Sigmoid Kernel
We can use it as the proxy for neural networks. Equation is:

It is just taking your input, mapping them to a value of 0 and 1 so that they can be separated by a
simple straight line.
Image Source: https://dataaspirant.com/svm-kernels/#t-1608054630725

3. RBF Kernel
What it actually does is to create non-linear combinations of our features to lift your samples onto a
higher-dimensional feature space where we can use a linear decision boundary to separate your
classes It is the most used kernel in SVM classifications, the following formula explains it
mathematically:

where,
1. ‘σ’ is the variance and our hyperparameter
2. ||X₁ – X₂|| is the Euclidean Distance between two points X₁ and X₂
4. Bessel function kernel
It is mainly used for eliminating the cross term in mathematical functions. Following is the formula
of the Bessel function kernel:

5. Anova Kernel
It performs well on multidimensional regression problems. The formula for this kernel function is:

How to Choose the Right Kernel?

I am well aware of the fact that you must be having this doubt about how to decide which kernel
function will work efficiently for your dataset. It is necessary to choose a good kernel function
because the performance of the model depends on it.
Choosing a kernel totally depends on what kind of dataset are you working on. If it is linearly
separable then you must opt. for linear kernel function since it is very easy to use and the
complexity is much lower compared to other kernel functions. I’d recommend you start with a
hypothesis that your data is linearly separable and choose a linear kernel function.
You can then work your way up towards the more complex kernel functions. Usually, we use SVM
with RBF and linear kernel function because other kernels like polynomial kernel are rarely used
due to poor efficiency. But what if linear and RBF both give approximately similar results? Which
kernel do we choose now?

Example
Let’s understand this with the help of an example, for simplicity I’ll only take 2 features that mean 2
dimensions only. In the figure below I have plotted the decision boundary of a linear SVM on 2
features of the iris dataset:
Here we see that a linear kernel works fine on this dataset, but now let’s see how will RBF kernel
work.
We can observe that both the kernels give similar results, both work well with our dataset but which
one should we choose?

Linear SVM is a parametric model. A Parametric Model is a concept used to describe a model in
which all its data is represented within its parameters. In short, the only information needed to
predict the future from the current value is the parameters.
The complexity of the RBF kernel grows as the training data size increases. In addition to the fact
that it is more expensive to prepare RBF kernel, we also have to keep the kernel matrix around, and
the projection into this “infinite” higher dimensional space where the data becomes linearly
separable is more expensive as well during prediction. If the dataset is not linear then using linear
kernel doesn’t make sense we’ll get a very low accuracy if we do so.
So for this kind of dataset, we can use RBF without even a second thought because it makes
decision boundary like this:

Presentation On Support Vector Machine (SVM)
100% (2)
Presentation On Support Vector Machine (SVM)
22 pages
Unit 2 PPT - Part 2
100% (1)
Unit 2 PPT - Part 2
81 pages
Support Vector Machines PDF
100% (1)
Support Vector Machines PDF
37 pages
Jimma University Jimma Institute of Technology Faculty of Electrical and Computer Engineering Power Stream
0% (1)
Jimma University Jimma Institute of Technology Faculty of Electrical and Computer Engineering Power Stream
46 pages
Support Vector Machine (SVM)
No ratings yet
Support Vector Machine (SVM)
103 pages
Math Behind SVM Part 1 (Support Vector Machine) - by MLMath - Io - Medium
No ratings yet
Math Behind SVM Part 1 (Support Vector Machine) - by MLMath - Io - Medium
15 pages
Module 3 ML 24
No ratings yet
Module 3 ML 24
65 pages
Support Vector Machine
No ratings yet
Support Vector Machine
12 pages
IVPML Unit III
No ratings yet
IVPML Unit III
139 pages
Support Vector Machines: (Vapnik, 1979)
No ratings yet
Support Vector Machines: (Vapnik, 1979)
34 pages
S V M (SVM) : Upport Ector Achine
No ratings yet
S V M (SVM) : Upport Ector Achine
67 pages
Support Vector Machine: Abinas Panda
No ratings yet
Support Vector Machine: Abinas Panda
52 pages
7.2. Machine Learning Support Vector Machine
No ratings yet
7.2. Machine Learning Support Vector Machine
52 pages
Support Vector Machine: Prof. Subodh Kumar Mohanty
No ratings yet
Support Vector Machine: Prof. Subodh Kumar Mohanty
52 pages
Support Vector Machine (SVM)
No ratings yet
Support Vector Machine (SVM)
4 pages
Deep Learn
No ratings yet
Deep Learn
48 pages
Lec17 SVM With Kernel Trick v5
No ratings yet
Lec17 SVM With Kernel Trick v5
62 pages
16 SVM
No ratings yet
16 SVM
41 pages
Unit 2 - SVM - 241016 - 104220
No ratings yet
Unit 2 - SVM - 241016 - 104220
47 pages
SVM 1
No ratings yet
SVM 1
36 pages
SVM (Repaired)
No ratings yet
SVM (Repaired)
39 pages
SVMs
No ratings yet
SVMs
30 pages
Exp 14
No ratings yet
Exp 14
27 pages
SVMs
No ratings yet
SVMs
30 pages
Support Vector Machine
No ratings yet
Support Vector Machine
40 pages
Machine Learning Unit-3.3
No ratings yet
Machine Learning Unit-3.3
38 pages
W12 SVM
No ratings yet
W12 SVM
52 pages
Support Vector Machine
No ratings yet
Support Vector Machine
19 pages
Support Vector Machine
No ratings yet
Support Vector Machine
32 pages
Linear Regression & SVM
No ratings yet
Linear Regression & SVM
33 pages
Support Vector Machine (SVM)
No ratings yet
Support Vector Machine (SVM)
28 pages
Support Vector Machine
No ratings yet
Support Vector Machine
21 pages
Support Vector Machine (SVM) - Kernel Functions
No ratings yet
Support Vector Machine (SVM) - Kernel Functions
20 pages
Overview of SVM: A Support Vector Machine (SVM) Performs by Finding The That The Margin Between The
No ratings yet
Overview of SVM: A Support Vector Machine (SVM) Performs by Finding The That The Margin Between The
20 pages
Support Vector Machine
No ratings yet
Support Vector Machine
19 pages
Unit-4 AI - SVM
No ratings yet
Unit-4 AI - SVM
21 pages
SVM Part A
No ratings yet
SVM Part A
16 pages
Unit 2
No ratings yet
Unit 2
47 pages
Support Vector Machine
No ratings yet
Support Vector Machine
31 pages
Support Vector Machine
No ratings yet
Support Vector Machine
52 pages
Support Vector Machine-1
No ratings yet
Support Vector Machine-1
12 pages
WWW Analyticsvidhya Com Blog 2021 10 Support Vector Machines
No ratings yet
WWW Analyticsvidhya Com Blog 2021 10 Support Vector Machines
21 pages
Unit - 2
No ratings yet
Unit - 2
15 pages
SVM
No ratings yet
SVM
11 pages
SVM 1
No ratings yet
SVM 1
8 pages
SVM Notes Unit 4
No ratings yet
SVM Notes Unit 4
8 pages
Support Vector Machine Algorithm
No ratings yet
Support Vector Machine Algorithm
8 pages
SVM Notes
No ratings yet
SVM Notes
4 pages
SVM Theory
No ratings yet
SVM Theory
7 pages
Basic of SVM Algorithm
No ratings yet
Basic of SVM Algorithm
10 pages
Support Vector Machine (SVM)
No ratings yet
Support Vector Machine (SVM)
4 pages
SVM Scribe Notes
No ratings yet
SVM Scribe Notes
16 pages
Final Year Report Presentation Edited
No ratings yet
Final Year Report Presentation Edited
52 pages
Support Vector Machine: Suraj Kumar Das
No ratings yet
Support Vector Machine: Suraj Kumar Das
10 pages
Introduction To Support Vector Machines: 1 Description
No ratings yet
Introduction To Support Vector Machines: 1 Description
15 pages
Chapter 3 - Support Vector Machine With Math. - Deep Math Machine Learning - Ai - Medium
No ratings yet
Chapter 3 - Support Vector Machine With Math. - Deep Math Machine Learning - Ai - Medium
11 pages
Pca PDF
No ratings yet
Pca PDF
10 pages
SAP Upgrade Service: IBM Global Business Services
No ratings yet
SAP Upgrade Service: IBM Global Business Services
2 pages
Canada NOC Code List PDF 2024 - In-Demand Jobs in Canada
No ratings yet
Canada NOC Code List PDF 2024 - In-Demand Jobs in Canada
363 pages
Activator Office 2016.Cmd
No ratings yet
Activator Office 2016.Cmd
1 page
50 Uses of Computers in My Area
100% (1)
50 Uses of Computers in My Area
4 pages
Libre Office Writer MCQ
No ratings yet
Libre Office Writer MCQ
13 pages
Conec DSub Hoods
No ratings yet
Conec DSub Hoods
54 pages
Piezo Electric Energy Harvesting
No ratings yet
Piezo Electric Energy Harvesting
16 pages
01 Decision Analysis Presentation Group D
No ratings yet
01 Decision Analysis Presentation Group D
78 pages
Project Management Process Groups and Knowledge Areas Mapping
100% (1)
Project Management Process Groups and Knowledge Areas Mapping
1 page
Unit3 KNN Examples
No ratings yet
Unit3 KNN Examples
7 pages
Statistical Computing With R: Masters in Data Sciences 503 (S28) Third Batch, SMS, TU, 2024
No ratings yet
Statistical Computing With R: Masters in Data Sciences 503 (S28) Third Batch, SMS, TU, 2024
35 pages
Matrikon Data Broker MQTT Publisher User Manual
No ratings yet
Matrikon Data Broker MQTT Publisher User Manual
66 pages
Bulk Insert To Oracle - Final
No ratings yet
Bulk Insert To Oracle - Final
13 pages
EFDP Symbiosis Brochure-June 2023
No ratings yet
EFDP Symbiosis Brochure-June 2023
2 pages
Unit4 Clustering Algorithms
No ratings yet
Unit4 Clustering Algorithms
43 pages
Statistical Computing With R: Masters in Data Science 503 (S15) Third Batch, SMS, TU, 2024
No ratings yet
Statistical Computing With R: Masters in Data Science 503 (S15) Third Batch, SMS, TU, 2024
40 pages
Unit4 Clustering Evaluation
No ratings yet
Unit4 Clustering Evaluation
53 pages
Unit2 AssociationAnalysis V2
No ratings yet
Unit2 AssociationAnalysis V2
46 pages
Unit4 Clustering
No ratings yet
Unit4 Clustering
46 pages
Unit1 Introduction
No ratings yet
Unit1 Introduction
38 pages
Practo - Doctors and Medicines
No ratings yet
Practo - Doctors and Medicines
13 pages
02 Decision Making Under Uncertainty and Risk
No ratings yet
02 Decision Making Under Uncertainty and Risk
12 pages
Epcom
100% (1)
Epcom
2 pages
Hyper Parameter Tuning
No ratings yet
Hyper Parameter Tuning
4 pages
Construction Schedule: Ks Saastha Enterprise
No ratings yet
Construction Schedule: Ks Saastha Enterprise
4 pages
Unit4 HAC Example
No ratings yet
Unit4 HAC Example
7 pages
DDB-distribution Database Important.
No ratings yet
DDB-distribution Database Important.
15 pages
Students File - S4
No ratings yet
Students File - S4
6 pages
Plustek IPcam P1000 Guide
No ratings yet
Plustek IPcam P1000 Guide
67 pages
Pleiades Panharpening and Orthorectification
No ratings yet
Pleiades Panharpening and Orthorectification
10 pages
Chapter 7 Supervised Learning
No ratings yet
Chapter 7 Supervised Learning
71 pages
Lifestyle CamScan Catalog V4 HighRes
No ratings yet
Lifestyle CamScan Catalog V4 HighRes
32 pages
06 Finite Elements Catalogs Options
No ratings yet
06 Finite Elements Catalogs Options
28 pages
Comp nd2 FT
No ratings yet
Comp nd2 FT
5 pages
Fullspec Can 26482s1
No ratings yet
Fullspec Can 26482s1
18 pages
Statement of Account No: 013610100020214 For Last 07 Days: Pavani Krishnapriya Dara
No ratings yet
Statement of Account No: 013610100020214 For Last 07 Days: Pavani Krishnapriya Dara
1 page
Ai Theory Assignmnet (120 E)
No ratings yet
Ai Theory Assignmnet (120 E)
6 pages
Controllable Sentence Simplification With A Unified Text-to-Text Transfer Transformer
No ratings yet
Controllable Sentence Simplification With A Unified Text-to-Text Transfer Transformer
12 pages
Vigneshwaran-Resume-Linux and Windows
No ratings yet
Vigneshwaran-Resume-Linux and Windows
6 pages
Main Project Notice
No ratings yet
Main Project Notice
1 page
Master the Fundamentals of Electromagnetism and EM-Induction
From Everand
Master the Fundamentals of Electromagnetism and EM-Induction
Space Learn
No ratings yet
Top Numerical Methods With Matlab For Beginners!
From Everand
Top Numerical Methods With Matlab For Beginners!
Andrei Besedin
No ratings yet
Bresenham Line Algorithm: Efficient Pixel-Perfect Line Rendering for Computer Vision
From Everand
Bresenham Line Algorithm: Efficient Pixel-Perfect Line Rendering for Computer Vision
Fouad Sabry
No ratings yet
Support Vector Machine: Fundamentals and Applications
From Everand
Support Vector Machine: Fundamentals and Applications
Fouad Sabry
No ratings yet
Direct Linear Transformation: Practical Applications and Techniques in Computer Vision
From Everand
Direct Linear Transformation: Practical Applications and Techniques in Computer Vision
Fouad Sabry
No ratings yet

Unit3 SVM

Uploaded by

Unit3 SVM

Uploaded by

Support Vector Machine(SVM)

Logistic Regression vs Support Vector Machine (SVM)

Types of Support Vector Machine (SVM) Algorithms

Use of Dot Product in SVM

Margin in Support Vector Machine

Now the question comes

Let’s try to answer these questions

Let’s create margin for this hyperplane,

Putting equations (2) and (3) in equation (1) we get:

Hence the equation which we have to maximize is:

Soft Margin SVM

Kernels in Support Vector Machine

Different Kernel Functions

Here d is the degree of the polynomial, which we need to specify manually.

How to Choose the Right Kernel?

You might also like