
UNIT-2

Support Vector Machines (SVM)


Introduction to SVM
 Support Vector Machine (SVM) is a powerful supervised machine learning
model used for classification tasks.
 It finds the optimal hyperplane that separates different classes in the
dataset.
Key Concepts
1. Hyperplane:
 In one-dimensional space it is a point; in two dimensions it is a line; in
three dimensions it is a plane; and in higher dimensions it is called a
hyperplane.
 The goal of SVM is to find the hyperplane that maximizes the
margin between classes.
2. Margin:
 The margin is defined as the distance between the hyperplane and
the nearest data points from either class, known as support vectors.
 SVM is often referred to as a "maximum margin classifier" because
it seeks to maximize this margin.
3. Support Vectors:
 These are the data points closest to the hyperplane and are critical
in defining its position.
 Only support vectors influence the decision boundary; other points
do not affect it.
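The three concepts above can be made concrete in code. Below is a minimal sketch, assuming scikit-learn and NumPy are available; the data points, labels, and the large C value are purely illustrative and not taken from this unit.

```python
# Minimal sketch: fitting a linear SVM with scikit-learn (assumed available)
# and inspecting the learned hyperplane and its support vectors.
import numpy as np
from sklearn.svm import SVC

# Toy 2-D data: two linearly separable classes (illustrative values).
X = np.array([[1.0, 2.0], [2.0, 3.0], [3.0, 3.0],
              [6.0, 5.0], [7.0, 8.0], [8.0, 6.0]])
y = np.array([-1, -1, -1, 1, 1, 1])

clf = SVC(kernel="linear", C=1e6)  # very large C approximates a hard margin
clf.fit(X, y)

w = clf.coef_[0]          # weight vector of the hyperplane w·x + b = 0
b = clf.intercept_[0]     # bias term
print("support vectors:\n", clf.support_vectors_)
print("margin width 2/||w|| =", 2.0 / np.linalg.norm(w))
```

The fitted model exposes the support vectors directly, and the margin width follows from the learned weight vector.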
Mathematical Formulation
 To mathematically find the optimal hyperplane, we define it as:

w ⋅ x + b = 0

where w is the weight vector, x is the feature vector, and b is the bias.
 The objective is to maximize the margin, defined as:

Margin = 2 / ∥w∥

This leads to minimizing (1/2)∥w∥² subject to constraints based on the class
labels, i.e. yᵢ(w ⋅ xᵢ + b) ≥ 1 for every training point.
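As a quick numeric illustration of the formulation above, the sketch below (NumPy assumed) evaluates w ⋅ x + b and the margin width 2/∥w∥ for a hand-picked w and b; the values are made up, not learned.

```python
# Minimal sketch (NumPy assumed): evaluating w·x + b and the resulting margin
# for a hand-picked hyperplane; the numbers are illustrative, not learned.
import numpy as np

w = np.array([1.0, -1.0])    # assumed weight vector
b = -0.5                     # assumed bias

x = np.array([3.0, 1.0])     # a sample feature vector
decision = np.dot(w, x) + b  # sign gives the predicted class
margin_width = 2.0 / np.linalg.norm(w)

print("w·x + b =", decision)
print("margin width 2/||w|| =", margin_width)
```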
Lagrange Multipliers
 To solve this constrained optimization problem, we use Lagrange
multipliers.
 The Lagrangian can be expressed as:
L(w, b, α) = (1/2)∥w∥² − Σᵢ₌₁ᴺ αᵢ [ yᵢ(w ⋅ xᵢ + b) − 1 ]

where the yᵢ are the class labels and the αᵢ are the Lagrange multipliers.
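In practice, the constrained problem can also be handed to a generic numerical solver instead of being worked through by hand. The sketch below is one way to do that with SciPy's SLSQP method (assumed available); it solves the hard-margin primal, minimizing (1/2)∥w∥² subject to yᵢ(w ⋅ xᵢ + b) ≥ 1, on a tiny made-up dataset.

```python
# Minimal sketch: solving the hard-margin primal problem numerically with
# SciPy's SLSQP solver (assumed available). Variables are packed as [w1, w2, b].
import numpy as np
from scipy.optimize import minimize

X = np.array([[1.0, 2.0], [2.0, 1.0], [4.0, 5.0], [5.0, 4.0]])  # toy data
y = np.array([-1, -1, 1, 1])

def objective(v):
    w = v[:2]
    return 0.5 * np.dot(w, w)                    # (1/2)||w||^2

def constraint(v, i):
    w, b = v[:2], v[2]
    return y[i] * (np.dot(w, X[i]) + b) - 1.0    # y_i(w·x_i + b) - 1 >= 0

cons = [{"type": "ineq", "fun": constraint, "args": (i,)} for i in range(len(X))]
res = minimize(objective, x0=np.zeros(3), method="SLSQP", constraints=cons)
print("w =", res.x[:2], "b =", res.x[2])
```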
Hard Margin vs. Soft Margin
 Hard Margin SVM: Assumes data is linearly separable without any noise
or outliers.
 Soft Margin SVM: Introduces slack variables to allow some
misclassifications, accommodating non-linearly separable data and
outliers. The optimization problem becomes:

min over (w, b, ξ):   (1/2)∥w∥² + C Σᵢ₌₁ᴺ ξᵢ

where C controls the trade-off between maximizing the margin and minimizing
classification errors.
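The effect of C can be observed directly. The sketch below, assuming scikit-learn and NumPy are available, fits a linear soft-margin SVM on slightly overlapping synthetic clusters for several C values; the data and the particular C values are illustrative only.

```python
# Minimal sketch: the effect of the C parameter on a soft-margin SVM,
# using scikit-learn (assumed available) on illustrative, slightly noisy data.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (20, 2)), rng.normal(2.5, 1, (20, 2))])
y = np.array([-1] * 20 + [1] * 20)

for C in (0.01, 1.0, 100.0):
    clf = SVC(kernel="linear", C=C).fit(X, y)
    print(f"C={C:>6}: {clf.n_support_.sum()} support vectors, "
          f"train accuracy={clf.score(X, y):.2f}")
```

A small C tolerates more margin violations (more support vectors, wider margin); a large C penalizes misclassifications heavily.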
Kernel Trick
 For non-linearly separable data, SVM can apply kernel functions to
transform data into higher dimensions where a linear separation is
possible.
 Common kernels include:

 Polynomial Kernel: K(xᵢ, xⱼ) = (xᵢ ⋅ xⱼ + c)^d

 Radial Basis Function (RBF) Kernel: K(xᵢ, xⱼ) = exp(−γ ∥xᵢ − xⱼ∥²)
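The two kernels above are easy to evaluate directly. The sketch below (NumPy assumed) computes them for a pair of feature vectors; the hyperparameters c, d, and γ are illustrative defaults, not prescribed values.

```python
# Minimal sketch (NumPy assumed): evaluating the polynomial and RBF kernels
# for two feature vectors; c, d, and gamma are illustrative hyperparameters.
import numpy as np

def polynomial_kernel(xi, xj, c=1.0, d=3):
    return (np.dot(xi, xj) + c) ** d

def rbf_kernel(xi, xj, gamma=0.5):
    return np.exp(-gamma * np.linalg.norm(xi - xj) ** 2)

xi = np.array([1.0, 2.0])
xj = np.array([2.0, 0.5])
print("polynomial:", polynomial_kernel(xi, xj))
print("RBF:", rbf_kernel(xi, xj))
```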
Conclusion
 SVM is a robust method for classification that effectively handles both
linear and non-linear problems through its mathematical framework and
kernel trick. It emphasizes maximizing margins while being flexible enough
to accommodate noise and outliers in datasets.
Applications of SVM
1. Face Detection
 SVMs classify parts of images as either face or non-face, creating
boundaries around detected faces. This technology is commonly
used in security systems and social media platforms for automatic
tagging and recognition.
2. Text and Hypertext Categorization
 SVMs are employed to categorize documents into various classes,
such as news articles or emails. They analyze text data and assign
categories based on scores compared against predefined
thresholds, enhancing content organization and retrieval systems.

3. Image Classification
 In image processing, SVMs improve accuracy in classifying images
compared to traditional methods. They are used for object detection
and image retrieval, significantly enhancing search results in visual
databases.
4. Bioinformatics
 SVMs play a crucial role in biological data analysis, including protein
classification and cancer diagnosis. They help identify gene
expressions and classify patients based on genetic information,
aiding in personalized medicine.
5. Handwriting Recognition
 This application involves recognizing handwritten characters and is
widely used in postal services and document digitization. SVMs
analyze character features to enable accurate transcription of
handwritten text.
6. Spam Detection
 In natural language processing (NLP), SVMs are effective for filtering
spam emails by classifying messages based on their content,
improving email delivery systems like those used by Gmail.
7. Financial Forecasting
 SVMs are applied in the financial sector for stock market analysis
and fraud detection. Their ability to handle high-dimensional data
makes them suitable for predicting market trends and identifying
unusual patterns indicative of fraudulent activities.

8. Medical Diagnosis
 Beyond cancer detection, SVMs assist in diagnosing various
diseases by analyzing complex medical datasets, helping healthcare
professionals make informed decisions based on predictive analytics.
9. Remote Homology Detection
 In computational biology, SVMs are used to detect similarities
between protein structures, which is essential for understanding
biological functions and evolutionary relationships.

10. Generalized Predictive Control (GPC)


 SVM-based GPC is used to manage chaotic systems in engineering
applications, allowing more effective control over dynamic processes.
These applications demonstrate the versatility of SVMs across different domains,
highlighting their importance in both research and practical implementations.
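As a small illustration of the text categorization and spam detection applications above, the sketch below builds a generic TF-IDF plus linear SVM pipeline with scikit-learn (assumed available); the tiny corpus and labels are made up and do not reflect any real system.

```python
# Minimal sketch: text categorization with a linear SVM, assuming scikit-learn
# is available. The tiny corpus and labels below are made up for illustration.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

texts = ["win a free prize now", "meeting agenda attached",
         "cheap loans available", "project status update"]
labels = ["spam", "ham", "spam", "ham"]

model = make_pipeline(TfidfVectorizer(), LinearSVC())
model.fit(texts, labels)
print(model.predict(["free loans, claim your prize"]))
```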
Separating data with the maximum margin
Support vector machines
Pros: Low generalization error, computationally inexpensive, easy to interpret
results
Cons: Sensitive to tuning parameters and kernel choice; natively only handles
binary classification
Works with: Numeric values, nominal values
To introduce the subject of support vector machines I need to explain a few
concepts. Consider the data in frames A–D in figure 6.1; could you draw a
straight line to put all of the circles on one side and all of the squares on another
side? Now consider the data in figure 6.2, frame A. There are two groups of data,
and the data points are separated enough that you could draw a straight line on
the figure with all the points of one class on one side of the line and all the points
of the other class on the other side of the line. If such a situation exists, we say
the data is linearly separable. Don’t worry if this assumption seems too perfect.
We’ll later make some changes where the data points can spill over the line.
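A rough, practical way to probe linear separability is sketched below, assuming scikit-learn is available: if a linear SVM with a very large C classifies every training point correctly, the data is at least empirically separable. The points are made up.

```python
# Minimal sketch: a rough empirical check for linear separability, assuming
# scikit-learn is available. Perfect training accuracy with a near-hard margin
# suggests a separating line exists.
import numpy as np
from sklearn.svm import SVC

X = np.array([[0.0, 0.0], [1.0, 1.0], [4.0, 4.0], [5.0, 5.5]])  # toy points
y = np.array([-1, -1, 1, 1])

clf = SVC(kernel="linear", C=1e6).fit(X, y)
print("linearly separable (empirically):", clf.score(X, y) == 1.0)
```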
Framing the optimization problem in terms of our classifier
I’ve talked about the classifier but haven’t mentioned how it works.
Understanding how the classifier works will help you to understand the
optimization problem. We’ll have a simple equation like the sigmoid where we
can enter our data values and get a class label out. We’re going to use
something like the Heaviside step function, f(wTx+b), where the
function f(u) gives us -1 if u<0, and 1 otherwise. This is different from logistic
regression in the previous chapter where the class labels were 0 or 1.
Why did we switch from class labels of 0 and 1 to -1 and 1? This makes the math
manageable, because -1 and 1 are only different by the sign. We can write a
single equation to describe the margin or how close a data point is to our
separating hyperplane and not have to worry if the data is in the -1 or +1 class.
When we’re doing this and deciding where to place the separating line, this
margin is calculated by label*(wTx+b). This is where the -1 and 1 class labels
help out. If a point is far away from the separating plane on the positive side,
then wTx+b will be a large positive number, and label*(wTx+b) will give us a
large number. If it’s far from the negative side and has a negative
label, label*(wTx+b) will also give us a large positive number.
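The classifier and the functional margin described above can be written in a few lines. The sketch below (NumPy assumed) uses made-up values for w, b, and the data point.

```python
# Minimal sketch (NumPy assumed): the step-style classifier f(w·x + b) with
# -1/+1 outputs, and the functional margin label*(wTx + b) discussed above.
import numpy as np

def f(u):
    return -1 if u < 0 else 1          # class label from the decision value

w = np.array([2.0, -1.0])              # assumed weights
b = 0.5                                # assumed bias

x, label = np.array([3.0, 1.0]), 1     # a point on the positive side
u = np.dot(w, x) + b
print("prediction:", f(u))
print("functional margin label*(wTx + b):", label * u)
```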
The goal now is to find the w and b values that will define our classifier. To do
this, we must find the points with the smallest margin. These are the support
vectors briefly mentioned earlier. Then, when we find the points with the
smallest margin, we must maximize that margin. This can be written as

arg max over (w, b) of { min_n [ label·(wTx + b) ] · 1/‖w‖ }

Solving this problem directly is pretty difficult, so we can convert it into another
form that we can solve more easily. Let’s look at the inside of the previous
equation, the part inside the curly braces. Optimizing multiplications can be
nasty, so what we do is hold one part fixed and then maximize the other part. If
we set label*(wTx+b) to be 1 for the support vectors, then we can maximize
‖w‖⁻¹ (equivalently, minimize ‖w‖) and we’ll have a solution. Not all of the
label*(wTx+b) values will be equal to 1, only those of the points closest to the
separating hyperplane. For values farther away from the hyperplane, this
product will be larger.
The optimization problem we now have is a constrained optimization problem
because we must find the best values, provided they meet some constraints.
Here, our constraint is that label*(wTx+b) will be 1.0 or greater. There’s a well-
known method for solving these types of constrained optimization problems,
using something called Lagrange multipliers. Using Lagrange multipliers, we can
write the problem in terms of our constraints. Because our constraints are our
data points, we can write the values of our hyperplane in terms of our data
points. The optimization function turns out to be

max over α of [ Σᵢ αᵢ − (1/2) Σᵢ,ⱼ labelⁱ · labelʲ · αᵢ · αⱼ · ⟨x⁽ⁱ⁾, x⁽ʲ⁾⟩ ]

subject to the constraints α ≥ 0 and Σᵢ αᵢ · labelⁱ = 0.
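For illustration, the dual objective above can be evaluated for a given set of multipliers. The sketch below (NumPy assumed) uses arbitrary, unoptimized alphas; a real solver such as SMO would choose them to maximize this quantity.

```python
# Minimal sketch (NumPy assumed): evaluating the dual objective
# sum(alpha) - 1/2 * sum_ij label_i*label_j*alpha_i*alpha_j*<x_i, x_j>
# for illustrative alphas; a real solver (e.g. SMO) would optimize them.
import numpy as np

X = np.array([[1.0, 1.0], [2.0, 2.0], [4.0, 4.0], [5.0, 5.0]])
labels = np.array([-1, -1, 1, 1])
alphas = np.array([0.1, 0.0, 0.1, 0.0])   # assumed, not optimized

K = X @ X.T                                # Gram matrix of inner products
dual = alphas.sum() - 0.5 * np.sum(
    np.outer(labels * alphas, labels * alphas) * K)
print("dual objective:", dual)
```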

Instance-based learning
Machine learning systems categorized as instance-based learning are systems
that learn the training examples by heart and then generalize to new instances
based on some similarity measure. It is called instance-based because it builds
its hypotheses from the training instances themselves. It is also known as
memory-based learning or lazy learning (because processing is delayed until a
new instance must be classified). The time complexity of such an algorithm
depends on the size of the training data. Each time a new query is encountered,
the previously stored data is examined and a target function value is assigned
to the new instance.
The worst-case time complexity of this algorithm is O(n), where n is the number
of training instances. For example, if we were to create a spam filter with an
instance-based learning algorithm, instead of just flagging emails that are
already marked as spam, our spam filter would also be programmed to flag
emails that are very similar to them. This requires a measure of resemblance
between two emails; a similarity measure could be based on having the same
sender, repeated use of the same keywords, or some other feature.
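A minimal sketch of such an instance-based spam filter follows; the Jaccard word-overlap similarity and the tiny training set are illustrative choices, not a prescribed method.

```python
# Minimal sketch: an instance-based "spam filter" that keeps every training
# email in memory and classifies a new one by its most similar stored example.
# The similarity measure (shared-word overlap) and the data are illustrative.
def similarity(a, b):
    wa, wb = set(a.lower().split()), set(b.lower().split())
    return len(wa & wb) / max(len(wa | wb), 1)   # Jaccard overlap of words

training = [("win a free prize now", "spam"),
            ("weekly project status update", "ham"),
            ("free loans apply now", "spam")]

def classify(email):
    # lazy learning: all work happens at query time, scanning stored instances
    return max(training, key=lambda t: similarity(email, t[0]))[1]

print(classify("claim your free prize"))   # expected: spam
```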
Advantages:
1. Instead of estimating for the entire instance set, local approximations can
be made to the target function.
2. This algorithm can adapt easily to new data, which is collected as we go.
Disadvantages:
1. Classification costs are high.
2. A large amount of memory is required to store the data, and each query
involves building a local model from scratch.
Some of the instance-based learning algorithms are:
1. K Nearest Neighbor (KNN)
2. Self-Organizing Map (SOM)
3. Learning Vector Quantization (LVQ)
4. Locally Weighted Learning (LWL)
5. Case-Based Reasoning
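Of these, k-Nearest Neighbors is the most common. The sketch below, assuming scikit-learn is available, shows the lazy-learning pattern: fitting only stores the instances, and the work happens at prediction time. The data is illustrative.

```python
# Minimal sketch: k-Nearest Neighbors, the most common instance-based learner,
# using scikit-learn (assumed available) on illustrative numeric data.
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

X = np.array([[1.0], [2.0], [8.0], [9.0]])   # stored training instances
y = np.array([0, 0, 1, 1])

knn = KNeighborsClassifier(n_neighbors=3).fit(X, y)   # fit() just stores X, y
print(knn.predict([[7.5]]))                           # classified at query time
```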
