Support Vector Machine
Support Vector Machine (SVM) is a supervised machine learning algorithm that can be used for both classification and regression challenges.
Core Concepts
1. Hyperplane:
o A decision boundary that separates data points of different classes. In 2D it is a line, in 3D it is a plane, and in higher dimensions it is a hyperplane.
2. Margin:
o The distance between the hyperplane and the nearest data points
(support vectors). SVM aims to maximize this margin.
3. Support Vectors:
o Data points closest to the hyperplane that influence its position and
orientation.
4. Kernel Trick:
o SVM uses kernel functions to implicitly map the data into a higher-dimensional space, making it easier to find a separating hyperplane when the data is not linearly separable in its original space.
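To make these concepts concrete, here is a minimal sketch (not part of the original notes) that fits a linear SVM with scikit-learn's SVC on a small toy dataset and inspects the support vectors that define the margin; the data points and parameter values are illustrative assumptions.

```python
# Minimal sketch: fitting a linear SVM and inspecting its support vectors.
# The toy data and parameter values are illustrative assumptions.
import numpy as np
from sklearn.svm import SVC

# Two well-separated 2-D classes
X = np.array([[1, 2], [2, 3], [3, 3], [6, 5], [7, 8], [8, 8]])
y = np.array([0, 0, 0, 1, 1, 1])

clf = SVC(kernel="linear", C=1.0)
clf.fit(X, y)

# Support vectors are the points closest to the hyperplane;
# they alone determine its position and orientation.
print("Support vectors:\n", clf.support_vectors_)
print("Hyperplane: w =", clf.coef_, " b =", clf.intercept_)
```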
Hyperparameters in SVM
1. C (Regularization Parameter)
• Purpose: Balances the trade-off between maximizing the margin and
minimizing classification error.
• Effect:
o Large C: Focuses on correctly classifying all training points, resulting
in a smaller margin and potential overfitting.
o Small C: Allows more misclassifications but achieves a wider margin,
improving generalization.
Real-life Analogy for C:
Imagine you're designing a security system for a museum:
1. Large C:
o The security guard checks every single person and every detail of
their belongings. This ensures no one suspicious gets through but
slows down entry for everyone (overfitting).
o Applied to SVM: This results in a very tight decision boundary that
might not generalize well to new visitors (new data).
2. Small C:
o The guard only checks for large, obvious threats (e.g., large bags or
unusual behavior). Some small errors may occur, but it ensures quick
and smooth entry for most people (better generalization).
o Applied to SVM: The decision boundary is looser, focusing on the
broader picture and tolerating some mistakes.
Scenario: Classifying emails as "spam" or "not spam".
• Large C: The model tries to perfectly classify every email in the training
data. If one legitimate email contains the word "win" (often found in spam),
the model might overfit and treat all emails with "win" as spam.
• Small C: The model tolerates a few misclassified emails in the training data
but finds a broader, more generalized rule for spam classification.
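A hypothetical sketch of the spam scenario above, assuming a tiny hand-made corpus and arbitrary C values: a large C pushes the linear SVM to fit every training email, while a small C tolerates a few training mistakes in exchange for a wider margin.

```python
# Hypothetical sketch: how C changes a linear SVM's tolerance for training errors.
# The tiny corpus and the C values are illustrative assumptions, not real data.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.svm import LinearSVC

emails = [
    "win a free prize now",       # spam
    "claim your free money",      # spam
    "meeting agenda for monday",  # not spam
    "did you win the contract",   # legitimate email that contains "win"
]
labels = [1, 1, 0, 0]

X = CountVectorizer().fit_transform(emails)

# Large C: the model works hard to classify every training email correctly,
# which can overfit to words like "win".
strict = LinearSVC(C=100.0).fit(X, labels)

# Small C: the model tolerates a few training mistakes in exchange for a
# wider margin and (usually) better generalization.
lenient = LinearSVC(C=0.01).fit(X, labels)

print("Large C training accuracy:", strict.score(X, labels))
print("Small C training accuracy:", lenient.score(X, labels))
```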
2. Kernel
• Purpose: Determines the transformation applied to the data.
• Types:
o Linear Kernel: For linearly separable data.
o Polynomial Kernel: For more complex patterns.
o Radial Basis Function (RBF) Kernel: Popular for non-linear data due
to its flexibility.
o Sigmoid Kernel: Acts like a neural network activation function.
• Advice:
o Use a linear kernel for datasets where features are already linearly
separable or have high dimensionality.
o Use RBF as a default for non-linear data.
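As a quick illustration of this advice, the sketch below trains SVC with each kernel on scikit-learn's make_moons dataset (a non-linear toy problem); the dataset, noise level, and parameter values are assumptions chosen only for demonstration.

```python
# Illustrative sketch: comparing SVM kernels on the same non-linear toy data.
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = make_moons(n_samples=300, noise=0.2, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

for kernel in ["linear", "poly", "rbf", "sigmoid"]:
    clf = SVC(kernel=kernel, C=1.0, gamma="scale")
    clf.fit(X_train, y_train)
    print(f"{kernel:8s} test accuracy: {clf.score(X_test, y_test):.2f}")
```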
Hard Margin vs. Soft Margin
So far, we’ve used hard margin classification: all training samples must lie on the “correct side of the street”. This has two drawbacks:
• It only works if the data is linearly separable (left figure).
• It is sensitive to outliers, so the resulting model may not generalize well (right figure).
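A rough sketch of the difference, assuming a very large C can stand in for a hard margin (an approximation, not an exact equivalence); the toy points, including one outlier, and the C values are illustrative.

```python
# Illustrative sketch: approximating a hard margin with a very large C and
# comparing it with a soft margin on data containing one outlier.
import numpy as np
from sklearn.svm import SVC

X = np.array([[1, 1], [2, 1], [1, 2],      # class 0
              [5, 5], [6, 5], [5, 6],      # class 1
              [1.5, 1.5]])                 # outlier labeled as class 1
y = np.array([0, 0, 0, 1, 1, 1, 1])

# Very large C ~ hard margin: the boundary contorts to classify the outlier.
hard = SVC(kernel="linear", C=1e6).fit(X, y)

# Moderate C ~ soft margin: the outlier may sit inside (or across) the margin.
soft = SVC(kernel="linear", C=1.0).fit(X, y)

print("C=1e6 (≈ hard margin) w:", hard.coef_[0])
print("C=1.0 (soft margin)   w:", soft.coef_[0])
```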
SVM Regression
The trick is to reverse the objective: instead of trying to fit the largest possible street between two classes while limiting margin violations, SVM Regression tries to fit as many instances as possible on the street while limiting margin violations (i.e., instances off the street). The width of the street is controlled by the hyperparameter ε (epsilon).
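A minimal sketch of SVM Regression using scikit-learn's SVR, assuming synthetic sine-shaped data and arbitrary epsilon values; a wider street (larger epsilon) typically needs fewer support vectors.

```python
# Illustrative sketch: SVM Regression with scikit-learn's SVR.
# The synthetic data and the epsilon/C values are assumptions for demonstration.
import numpy as np
from sklearn.svm import SVR

rng = np.random.default_rng(42)
X = np.sort(rng.uniform(0, 5, size=(60, 1)), axis=0)
y = np.sin(X).ravel() + rng.normal(scale=0.1, size=60)

# epsilon sets the width of the "street": points inside it incur no loss.
wide_street = SVR(kernel="rbf", C=1.0, epsilon=0.5).fit(X, y)
narrow_street = SVR(kernel="rbf", C=1.0, epsilon=0.05).fit(X, y)

# A wider street needs fewer support vectors; a narrower one tracks the data more closely.
print("epsilon=0.5  support vectors:", len(wide_street.support_))
print("epsilon=0.05 support vectors:", len(narrow_street.support_))
```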
Multiclass Classification: One-vs-Rest (OVR)
SVM is inherently a binary classifier. The one-vs-rest (OVR) strategy handles multiclass problems by training one binary SVM per class (that class versus all the others) and predicting the class whose classifier produces the highest decision score.
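A minimal sketch of OVR with scikit-learn, assuming the built-in iris dataset and arbitrary parameter values; OneVsRestClassifier wraps SVC and trains one binary classifier per class.

```python
# Illustrative sketch: one-vs-rest SVM for a 3-class problem.
from sklearn.datasets import load_iris
from sklearn.multiclass import OneVsRestClassifier
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# OneVsRestClassifier trains one binary SVC per class.
ovr = OneVsRestClassifier(SVC(kernel="rbf", C=1.0))
ovr.fit(X, y)

print("Number of binary classifiers:", len(ovr.estimators_))  # 3, one per class
print("Training accuracy:", ovr.score(X, y))
```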
Kernels
• Polynomial: K(a, b) = (γ aᵀb + r)^d
• Gaussian (RBF): K(a, b) = exp(−γ ‖a − b‖²)
• Sigmoid: K(a, b) = tanh(γ aᵀb + r)
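As a cross-check of these formulas, the sketch below computes each kernel both by hand and with scikit-learn's pairwise helpers; the example vectors and the γ, r, d values are arbitrary assumptions.

```python
# Illustrative sketch: the three non-linear kernels computed explicitly and via
# scikit-learn's pairwise helpers. Vectors and parameter values are arbitrary.
import numpy as np
from sklearn.metrics.pairwise import polynomial_kernel, rbf_kernel, sigmoid_kernel

a = np.array([[1.0, 2.0]])
b = np.array([[3.0, 4.0]])
gamma, r, d = 0.5, 1.0, 3

# Polynomial: K(a, b) = (gamma * a.b + r) ** d
print(polynomial_kernel(a, b, degree=d, gamma=gamma, coef0=r))
print((gamma * a @ b.T + r) ** d)

# Gaussian RBF: K(a, b) = exp(-gamma * ||a - b||^2)
print(rbf_kernel(a, b, gamma=gamma))
print(np.exp(-gamma * np.sum((a - b) ** 2)))

# Sigmoid: K(a, b) = tanh(gamma * a.b + r)
print(sigmoid_kernel(a, b, gamma=gamma, coef0=r))
print(np.tanh(gamma * a @ b.T + r))
```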