Support Vector Machine (SVM)

What is a support vector machine (SVM)?


A support vector machine (SVM) is a type of supervised learning
algorithm used in machine learning to solve classification and
regression tasks; SVMs are particularly good at solving binary
classification problems, which require classifying the elements of a
data set into two groups.

The aim of a support vector machine algorithm is to find the best
possible line, or decision boundary, that separates the data points of
different data classes. This boundary is called a hyperplane when
working in high-dimensional feature spaces. The idea is to maximize
the margin, which is the distance between the hyperplane and the
closest data points of each category, thus making it easy to
distinguish data classes.

SVMs are useful for analyzing complex data that can't be separated
by a simple straight line. Called nonlinear SVMs, they do this by
using a mathematical trick that transforms data into higher-
dimensional space, where it is easier to find a boundary.
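
As a minimal sketch of the idea (assuming the scikit-learn library, which
the article does not itself name), the following example fits an SVM on a
tiny two-class data set and inspects the support vectors that define the
maximum-margin boundary:

from sklearn.svm import SVC

# A toy data set: four points in two dimensions, two class labels
X = [[0, 0], [1, 1], [2, 2], [3, 3.5]]
y = [0, 0, 1, 1]

# Fit a linear SVM; the learned hyperplane maximizes the margin
clf = SVC(kernel="linear")
clf.fit(X, y)

# The support vectors are the training points closest to the boundary
print(clf.support_vectors_)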

How do support vector machines work?


The key idea behind SVMs is to transform the input data into a
higher-dimensional feature space. This transformation makes it
easier to find a linear separation or to more effectively classify the
data set.

To do this, SVMs use a kernel function. Instead of explicitly
calculating the coordinates of the transformed space, the kernel
function enables the SVM to implicitly compute the dot products
between the transformed feature vectors, avoiding the cost of
explicitly computing the mapping itself.
SVMs can handle both linearly separable and non-linearly separable
data. They do this by using different types of kernel functions, such
as the linear kernel, polynomial kernel or radial basis function (RBF)
kernel. These kernels enable SVMs to effectively capture complex
relationships and patterns in the data.

During the training phase, SVMs use a mathematical formulation to
find the optimal hyperplane in a higher-dimensional space, often
called the kernel space. This hyperplane is crucial because it
maximizes the margin between data points of different classes,
while minimizing the classification errors.

The kernel function plays a critical role in SVMs, as it makes it
possible to map the data from the original feature space to the
kernel space. The choice of kernel function can have a significant
impact on the performance of the SVM algorithm; choosing the best
kernel function for a particular problem depends on the
characteristics of the data.

Some of the most popular kernel functions for SVMs are the
following:

Linear kernel. This is the simplest kernel function; it leaves the data
in its original feature space and works well when the data is already
linearly separable, or nearly so.

Polynomial kernel. This kernel function is more powerful than the
linear kernel. It maps the data to a higher-dimensional space in which
data that is not linearly separable in the original space can become
linearly separable.

RBF kernel. This is the most popular kernel function for SVMs, and
it is effective for a wide range of classification problems.
Sigmoid kernel. This kernel function is similar to the RBF kernel,
but it has a different shape that can be useful for some classification
problems.
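
As a rough sketch (again assuming scikit-learn and its bundled iris data
set), each of these kernels can be plugged into the same classifier and
compared with cross-validation:

from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# Train the same SVM with each kernel and report mean cross-validated accuracy
for kernel in ["linear", "poly", "rbf", "sigmoid"]:
    scores = cross_val_score(SVC(kernel=kernel), X, y, cv=5)
    print(kernel, scores.mean())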

The choice of kernel function for an SVM algorithm is a tradeoff
between accuracy and complexity. The more powerful kernel
functions, such as the RBF kernel, can achieve higher accuracy
than the simpler kernel functions, but they also require more data
and computation time to train the SVM algorithm. But this is
becoming less of an issue due to technological advances.

Once trained, SVMs can classify new, unseen data points by
determining which side of the decision boundary they fall on. The
output of the SVM is the class label associated with the side of the
decision boundary.
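
In scikit-learn, for instance, the signed distance to the decision
boundary is exposed by decision_function, and the predicted label simply
reflects the side of the boundary a point falls on (a small sketch with
made-up points):

from sklearn.svm import SVC

X = [[0, 0], [1, 1], [2, 2], [3, 3]]
y = [0, 0, 1, 1]
clf = SVC(kernel="linear").fit(X, y)

new_point = [[0.6, 0.9]]
# A negative score falls on the class-0 side, a positive score on the class-1 side
print(clf.decision_function(new_point))
print(clf.predict(new_point))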

Types of support vector machines:


Support vector machines have different types and variants that
provide specific functionalities and address specific problem
scenarios. Here are two types of SVMs and their significance:

Linear SVM. Linear SVMs use a linear kernel to create a straight-line
decision boundary that separates different classes. They are
effective when the data is linearly separable or when a linear
approximation is sufficient. Linear SVMs are computationally
efficient and have good interpretability, as the decision boundary is a
hyperplane in the input feature space.
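
As an illustration (assuming scikit-learn), the weights and intercept of
the fitted hyperplane can be read directly off a linear SVM, which is
part of what makes it easy to interpret:

from sklearn.svm import LinearSVC

X = [[0, 0], [1, 0], [0, 2], [2, 3]]
y = [0, 0, 1, 1]

# LinearSVC is a linear-kernel SVM tuned for this special case
model = LinearSVC().fit(X, y)

# coef_ holds the hyperplane weights and intercept_ the bias term;
# the decision boundary is the set of x where coef_ . x + intercept_ = 0
print(model.coef_, model.intercept_)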

Nonlinear SVM. Nonlinear SVMs address scenarios where the data
cannot be separated by a straight line in the input feature space.
They achieve this by using kernel functions that implicitly map the
data into a higher-dimensional feature space, where a linear
decision boundary can be found. Popular kernel functions used in
this type of SVM include the polynomial kernel, Gaussian (RBF)
kernel and sigmoid kernel. Nonlinear SVMs can capture complex
patterns and achieve higher classification accuracy when compared
to linear SVMs.
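
A short sketch of this (assuming scikit-learn) uses the classic
two-moons data set, which no straight line can split cleanly, and fits
an RBF-kernel SVM:

from sklearn.datasets import make_moons
from sklearn.svm import SVC

# Two interleaving half-circles: not linearly separable in the input space
X, y = make_moons(n_samples=200, noise=0.1, random_state=0)

# The RBF kernel implicitly maps the points into a space where a linear
# boundary exists, which appears as a curved boundary in the original space
clf = SVC(kernel="rbf", gamma=1.0).fit(X, y)
print(clf.score(X, y))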

Advantages of SVMs:
SVMs are powerful machine learning algorithms that have the
following advantages:

Effective in high-dimensional spaces. High-dimensional data
refers to data in which the number of features is larger than the
number of observations, i.e., data points. SVMs perform well even
when the number of features is larger than the number of samples.
They can handle high-dimensional data efficiently, making them
suitable for applications with a large number of features.

Resistant to overfitting. SVMs are less prone to overfitting
compared to other algorithms, like decision trees -- overfitting is
where a model performs extremely well on the training data but
becomes too specific to that data and can't generalize to new data.
SVMs' use of the margin maximization principle helps in
generalizing well to unseen data.

Versatile. SVMs can be applied to both classification and
regression problems. They support different kernel functions,
enabling flexibility in capturing complex relationships in the data.
This versatility makes SVMs applicable to a wide range of tasks.

Effective in cases of limited data. SVMs can work well even when
the training data set is small. The use of support vectors ensures
that only a subset of data points influences the decision boundary,
which can be beneficial when data is limited.

Ability to handle nonlinear data. SVMs can implicitly handle
non-linearly separable data by using kernel functions. The kernel trick
enables SVMs to transform the input space into a higher-
dimensional feature space, making it possible to find linear decision
boundaries.

Disadvantages of SVMs:
While support vector machines are popular for the reasons listed
above, they also come with some limitations and potential issues:

Computationally intensive. SVMs can be computationally
expensive, especially when dealing with large data sets. The
training time and memory requirements increase significantly with
the number of training samples.

Sensitive to parameter tuning. SVMs have parameters such as
the regularization parameter and the choice of kernel function. The
performance of SVMs can be sensitive to these parameter settings.
Improper tuning can lead to suboptimal results or longer training
times.

Lack of probabilistic outputs. SVMs provide binary classification
outputs and do not directly estimate class probabilities. Additional
techniques, such as Platt scaling or cross-validation, are needed to
obtain probability estimates.
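
In scikit-learn, for example, Platt scaling is enabled with the
probability=True option, which fits the extra calibration step during
training (a sketch, with a made-up toy data set):

from sklearn.svm import SVC

X = [[0, 0], [0.5, 0.5], [1, 0], [0, 1], [1, 1],
     [3, 3], [3.5, 3], [4, 4], [3, 4], [4, 3]]
y = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]

# probability=True adds Platt scaling so predict_proba becomes available;
# it slows training because of the internal cross-validation it requires
clf = SVC(kernel="linear", probability=True).fit(X, y)
print(clf.predict_proba([[2, 2]]))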

Difficulty in interpreting complex models. SVMs can create
complex decision boundaries, especially when using nonlinear
kernels. This complexity may make it challenging to interpret the
model and understand the underlying patterns in the data.

Scalability issues. SVMs may face scalability issues when applied
to extremely large data sets. Training an SVM on millions of
samples can become impractical due to memory and computational
constraints.

Important support vector machine vocabulary


C parameter
The C parameter is the primary regularization parameter in SVMs. It
controls the tradeoff between maximizing the margin and minimizing
the misclassification of training data. A smaller C tolerates more
misclassification and yields a wider, softer margin, while a larger C
penalizes misclassification more heavily and imposes a stricter margin.
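
A quick sketch (assuming scikit-learn) of how C changes the behavior of
the same linear SVM on a toy data set with one deliberately noisy label:

from sklearn.svm import SVC

X = [[0, 0], [1, 1], [1.5, 1.2], [2, 2], [3, 3], [4, 4]]
y = [0, 0, 1, 0, 1, 1]  # the point at (1.5, 1.2) is labeled against the trend

# Small C: a soft, wide margin that tolerates misclassifying the noisy point
soft = SVC(kernel="linear", C=0.01).fit(X, y)

# Large C: the model tries much harder to classify every training point correctly
hard = SVC(kernel="linear", C=100.0).fit(X, y)

print(soft.support_vectors_.shape, hard.support_vectors_.shape)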

Classification
Classification is about sorting things into different groups or
categories based on their characteristics, akin to putting things into
labeled boxes. Sorting emails into spam or nonspam categories is
an example.

Decision boundary
A decision boundary is an imaginary line or boundary that separates
different groups or categories in a data set, placing data points into
different regions. For instance, an email decision boundary might
classify an email with over 10 exclamation marks as "spam" and an
email with under 10 marks as "not spam."

Grid search
A grid search is a technique used to find the optimal values of
hyperparameters in SVMs. It involves systematically searching
through a predefined set of hyperparameters and evaluating the
performance of the model.
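
A brief sketch of a grid search with scikit-learn's GridSearchCV (the
specific parameter values are arbitrary examples):

from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# Candidate hyperparameter values to search over
param_grid = {
    "C": [0.1, 1, 10, 100],
    "gamma": [0.001, 0.01, 0.1, 1],
    "kernel": ["rbf"],
}

# Evaluate every combination with 5-fold cross-validation and keep the best
search = GridSearchCV(SVC(), param_grid, cv=5)
search.fit(X, y)
print(search.best_params_, search.best_score_)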

Hyperplane
In n-dimensional space -- that is, a space with many dimensions -- a
hyperplane is defined as an (n-1)-dimensional subspace, a flat
surface that has one less dimension than the space itself. In a
two-dimensional space, a hyperplane is therefore one-dimensional: a line.
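
In the usual notation, with a weight vector w and bias b, the hyperplane
is the set of points x satisfying

w \cdot x + b = 0

so in a two-dimensional input space this equation describes a line.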

Kernel function
A kernel function is a mathematical function used in the kernel trick
to compute the inner product between two data points in the
transformed feature space. Common kernel functions include linear,
polynomial, Gaussian (RBF) and sigmoid.
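
For example, the Gaussian (RBF) kernel is commonly written as

K(x, x') = \exp\left(-\gamma \lVert x - x' \rVert^2\right)

where the parameter \gamma controls how quickly the similarity between
two points decays with distance.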

Kernel trick
The kernel trick is a technique used to transform low-dimensional data
into higher-dimensional data to find a linear decision boundary. It
avoids the computational complexity that arises when explicitly
mapping the data to a higher dimension.
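
A small numerical sketch (using NumPy, which is an assumption here) shows
the idea for a degree-2 polynomial kernel: the kernel value (x . z)^2
equals the dot product of explicit degree-2 feature maps, without ever
constructing those features:

import numpy as np

x = np.array([1.0, 2.0])
z = np.array([3.0, 0.5])

# Kernel trick: evaluate the kernel directly in the original 2-D space
k = (x @ z) ** 2

# Equivalent explicit mapping into the 3-D space of degree-2 monomials
def phi(v):
    return np.array([v[0] ** 2, np.sqrt(2) * v[0] * v[1], v[1] ** 2])

print(k, phi(x) @ phi(z))  # both print the same value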

Margin
The margin is the distance between the decision boundary and the
support vectors. An SVM aims to maximize this margin to improve
generalization and reduce overfitting.
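
With the hyperplane written as w \cdot x + b = 0 and the data scaled so
that the support vectors satisfy |w \cdot x + b| = 1, the margin works
out to

\frac{2}{\lVert w \rVert}

which is why maximizing the margin amounts to minimizing the norm of w.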

One-vs-All
One-vs-All, or OvA, is a technique for multiclass classification using
SVMs. It trains a binary SVM classifier for each class, treating it as
the positive class and all other classes as the negative class.

One-vs-One
One-vs-One, or OvO, is a technique for multiclass classification
using SVMs. It trains a binary SVM classifier for each pair of classes
and combines predictions to determine the final class.
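
A brief sketch (assuming scikit-learn) of both multiclass strategies
wrapped around a binary SVM; note that SVC itself already applies
one-vs-one internally when given more than two classes:

from sklearn.datasets import load_iris
from sklearn.multiclass import OneVsOneClassifier, OneVsRestClassifier
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)  # three classes

# One-vs-All: one binary SVM per class, each class against the rest
ova = OneVsRestClassifier(SVC(kernel="linear")).fit(X, y)

# One-vs-One: one binary SVM per pair of classes, predictions combined by voting
ovo = OneVsOneClassifier(SVC(kernel="linear")).fit(X, y)

print(ova.score(X, y), ovo.score(X, y))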

Regression
Regression is predicting or estimating a numerical value based on
other known information. It's similar to making an educated guess
based on given patterns or trends. Predicting the price of a house
based on its size, location and other features is an example.

Regularization
Regularization is a technique used to prevent overfitting in SVMs.
Regularization introduces a penalty term in the objective function,
encouraging the algorithm to find a simpler decision boundary rather
than fitting the training data perfectly.

Support vector
A support vector is a data point or node lying closest to the decision
boundary or hyperplane. These points play a vital role in defining the
decision boundary and the margin of separation.

Support vector regression
Support vector regression (SVR) is a variant of SVM used for
regression tasks. SVR aims to find an optimal hyperplane that
predicts continuous values, while maintaining a margin of tolerance.
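
A minimal sketch of SVR with scikit-learn (the toy data is invented for
the example):

from sklearn.svm import SVR

# Toy regression data: y grows roughly twice as fast as x
X = [[0], [1], [2], [3], [4], [5]]
y = [0.1, 2.1, 3.9, 6.2, 8.0, 9.9]

# epsilon sets the margin of tolerance: errors smaller than epsilon are
# ignored by the loss function
reg = SVR(kernel="linear", C=10.0, epsilon=0.5).fit(X, y)
print(reg.predict([[6]]))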
