UNIT-II-Support Vector Machine Algorithm
Support Vector Machine, or SVM, is one of the most popular Supervised Learning algorithms, used for Classification as well as Regression problems. However, it is primarily used for Classification problems in Machine Learning.
The goal of the SVM algorithm is to create the best line or decision boundary that can segregate
n-dimensional space into classes so that we can easily put the new data point in the correct
category in the future. This best decision boundary is called a hyperplane.
SVM chooses the extreme points/vectors that help in creating the hyperplane. These extreme cases are called support vectors, and hence the algorithm is termed a Support Vector Machine.
Consider the below diagram in which there are two different categories that are classified using
a decision boundary or hyperplane:
Example: SVM can be understood with the example that we have used in the KNN classifier. Suppose we see a strange cat that also has some features of dogs. If we want a model that can accurately identify whether it is a cat or a dog, such a model can be created by using the SVM algorithm. We will first train our model with lots of images of cats and dogs so that it can learn about the different features of cats and dogs, and then we test it with this strange creature. Since SVM creates a decision boundary between these two classes (cat and dog) and chooses the extreme cases (support vectors), it will look at the extreme cases of cats and dogs. On the basis of the support vectors, it will classify the creature as a cat. Consider the below diagram:
SVM algorithm can be used for Face detection, image classification, text categorization, etc.
Types of SVM
o Linear SVM: Linear SVM is used for linearly separable data, which means that if a dataset can be classified into two classes by using a single straight line, then such data is termed linearly separable data, and the classifier used is called a Linear SVM classifier.
o Non-linear SVM: Non-linear SVM is used for non-linearly separable data, which means that if a dataset cannot be classified by using a straight line, then such data is termed non-linear data, and the classifier used is called a Non-linear SVM classifier.
The dimension of the hyperplane depends on the number of features present in the dataset: if there are 2 features (as shown in the image), the hyperplane will be a straight line, and if there are 3 features, the hyperplane will be a two-dimensional plane.
We always create the hyperplane that has the maximum margin, which means the maximum distance between the hyperplane and the nearest data points of either class.
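For reference, in standard SVM notation (not spelled out in the text above), a hyperplane in n-dimensional space is the set of points x satisfying
w · x + b = 0,
where w is a weight vector perpendicular to the hyperplane and b is a bias term. The margin between the two classes works out to 2/||w||, so maximizing the margin amounts to solving
minimize (1/2)||w||²  subject to  yᵢ(w · xᵢ + b) ≥ 1 for every training point (xᵢ, yᵢ), with yᵢ ∈ {−1, +1}.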
Support Vectors:
The data points or vectors that are closest to the hyperplane and which affect the position of the hyperplane are termed support vectors. Since these vectors support the hyperplane, they are called support vectors.
Linear SVM:
The working of the SVM algorithm can be understood by using an example. Suppose we have a
dataset that has two tags (green and blue), and the dataset has two features x1 and x2. We want
a classifier that can classify the pair (x1, x2) of coordinates as either green or blue. Consider the below image:
As it is a 2-d space, we can easily separate these two classes just by using a straight line. But there can be multiple lines that separate these classes. Consider the below image:
Hence, the SVM algorithm helps to find the best line or decision boundary; this best boundary or region is called a hyperplane. The SVM algorithm finds the points from both classes that are closest to the boundary. These points are called support vectors. The distance between these vectors and the hyperplane is called the margin, and the goal of SVM is to maximize this margin.
The hyperplane with the maximum margin is called the optimal hyperplane.
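As a concrete sketch of this idea (the green/blue points below are made up for illustration; SVC, fit, predict, and the support_vectors_ attribute are standard scikit-learn), a linear SVM can be fitted and its support vectors inspected directly:

# A minimal sketch: fit a linear SVM on made-up 2-d points
import numpy as nm
from sklearn.svm import SVC
x = nm.array([[1, 2], [2, 3], [2, 1], [6, 5], [7, 7], [8, 6]])  # features x1, x2
y = nm.array([0, 0, 0, 1, 1, 1])                                # 0 = green, 1 = blue
classifier = SVC(kernel='linear')
classifier.fit(x, y)
print(classifier.support_vectors_)    # the extreme points that fix the hyperplane
print(classifier.predict([[3, 3]]))   # classify a new (x1, x2) pair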
Non-Linear SVM:
If data is linearly arranged, then we can separate it by using a straight line, but for non-linear data,
we cannot draw a single straight line. Consider the below image:
So to separate these data points, we need to add one more dimension. For linear data, we have
used two dimensions x and y, so for non-linear data, we will add a third dimension z. It can be
calculated as:
z = x² + y²
By adding the third dimension, the sample space will become as below image:
So now, SVM will divide the datasets into classes in the following way. Consider the below image:
Since we are in 3-d space, the decision boundary looks like a plane parallel to the x-axis. If we convert it back to 2-d space with z = 1, it becomes a circular boundary, as in the below image:
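A minimal sketch of this lifting idea (the concentric points below are made up; SVC, fit, and score are standard scikit-learn), followed by the kernel shortcut that achieves the same effect implicitly:

# Made-up circular data: an inner disc (class 0) surrounded by a ring (class 1)
import numpy as nm
from sklearn.svm import SVC
rng = nm.random.default_rng(0)
angles = rng.uniform(0, 2 * nm.pi, 100)
radii = nm.concatenate([rng.uniform(0, 1, 50), rng.uniform(2, 3, 50)])
x = nm.column_stack([radii * nm.cos(angles), radii * nm.sin(angles)])
y = nm.concatenate([nm.zeros(50), nm.ones(50)])
# Lift into 3-d with the extra dimension z = x1^2 + x2^2 from the text
z = x[:, 0] ** 2 + x[:, 1] ** 2
x3d = nm.column_stack([x, z])
# In (x1, x2, z) space a linear SVM now separates the classes cleanly
print(SVC(kernel='linear').fit(x3d, y).score(x3d, y))   # close to 1.0
# In practice, a kernel does this lifting implicitly in the original 2-d space
print(SVC(kernel='rbf').fit(x, y).score(x, y))          # also close to 1.0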
Now we will implement the SVM algorithm using Python. Here we will use the same
dataset user_data, which we have used in Logistic regression and KNN classification.
Till the Data pre-processing step, the code will remain the same. Below is the code:
#Data Pre-processing Step
# importing libraries
import numpy as nm
import matplotlib.pyplot as mtp
import pandas as pd

#importing datasets
data_set = pd.read_csv('user_data.csv')

#Extracting the independent and dependent variables
x = data_set.iloc[:, [2, 3]].values
y = data_set.iloc[:, 4].values
After executing the above code, we will pre-process the data. The code will give the dataset as:
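The dataset splitting and feature scaling steps of the pre-processing code are not reproduced above; assuming the same choices as in the earlier Logistic regression and KNN chapters (a 75/25 train/test split and standard scaling), they would look like:

# Splitting the dataset into a training set and a test set (assumed 75/25 split)
from sklearn.model_selection import train_test_split
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.25, random_state=0)
# Feature scaling, so Age and EstimatedSalary are on a comparable scale
from sklearn.preprocessing import StandardScaler
st_x = StandardScaler()
x_train = st_x.fit_transform(x_train)
x_test = st_x.transform(x_test)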
The scaled output for the test set will be:
Fitting the SVM classifier to the training set:
Now the training set will be fitted to the SVM classifier. To create the SVM classifier, we will import the SVC class from the sklearn.svm library. Below is the code for it:
from sklearn.svm import SVC   # the Support Vector Classifier class
classifier = SVC(kernel='linear', random_state=0)
classifier.fit(x_train, y_train)
In the above code, we have used kernel='linear', as here we are creating an SVM for linearly separable data; however, we can change it for non-linear data. We then fitted the classifier to the training dataset (x_train, y_train).
Output:
The model performance can be altered by changing the value of C (the regularization parameter), gamma, and the kernel.
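For example (the values below are illustrative, not tuned for this dataset), a non-linear classifier could be obtained just by swapping the kernel and adjusting these parameters:

# Illustrative only: an RBF-kernel SVM with explicit C and gamma values
classifier = SVC(kernel='rbf', C=1.0, gamma=0.1, random_state=0)
classifier.fit(x_train, y_train)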
Predicting the test set result:
Now we will predict the output for the test set. Below is the code for it:
#Predicting the test set result
y_pred = classifier.predict(x_test)
After getting the y_pred vector, we can compare the result of y_pred and y_test to check the
difference between the actual value and predicted value.
Output: Below is the output for the prediction of the test set:
o Creating the confusion matrix:
Now we will see the performance of the SVM classifier, i.e., how many incorrect
predictions there are as compared to the Logistic regression classifier. To create the
confusion matrix, we need to import the confusion_matrix function of the sklearn
library. After importing the function, we will call it and store the result in a new variable cm. The function takes two parameters, mainly y_true (the actual values) and y_pred (the predicted values returned by the classifier). Below is the code for it:
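#Creating the Confusion matrix
from sklearn.metrics import confusion_matrix
cm = confusion_matrix(y_test, y_pred)   # rows: actual values, columns: predicted values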
Output:
As we can see in the above output image, there are 66 + 24 = 90 correct predictions and 8 + 2 = 10 incorrect predictions. Therefore, we can say that our SVM model improved as compared to the Logistic regression model.
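The same check can be done programmatically; with the counts above, the accuracy is (66 + 24) / 100 = 0.90:

# Accuracy = correct predictions / all predictions
from sklearn.metrics import accuracy_score
print(accuracy_score(y_test, y_pred))   # 0.90 for the counts quoted above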
Visualizing the training set result:
Now we will visualize the training set result. Below is the code for it:
#Visualizing the training set result
from matplotlib.colors import ListedColormap
x_set, y_set = x_train, y_train
x1, x2 = nm.meshgrid(nm.arange(start=x_set[:, 0].min() - 1, stop=x_set[:, 0].max() + 1, step=0.01),
                     nm.arange(start=x_set[:, 1].min() - 1, stop=x_set[:, 1].max() + 1, step=0.01))
mtp.contourf(x1, x2, classifier.predict(nm.array([x1.ravel(), x2.ravel()]).T).reshape(x1.shape),
             alpha=0.75, cmap=ListedColormap(('red', 'green')))
mtp.xlim(x1.min(), x1.max())
mtp.ylim(x2.min(), x2.max())
for i, j in enumerate(nm.unique(y_set)):
    mtp.scatter(x_set[y_set == j, 0], x_set[y_set == j, 1],
                c=ListedColormap(('red', 'green'))(i), label=j)
mtp.title('SVM classifier (Training set)')
mtp.xlabel('Age')
mtp.ylabel('Estimated Salary')
mtp.legend()
mtp.show()
Output:
Visualizing the test set result:
Now we will visualize the test set result in the same way. Below is the code for it:
#Visualizing the test set result
from matplotlib.colors import ListedColormap
x_set, y_set = x_test, y_test
x1, x2 = nm.meshgrid(nm.arange(start=x_set[:, 0].min() - 1, stop=x_set[:, 0].max() + 1, step=0.01),
                     nm.arange(start=x_set[:, 1].min() - 1, stop=x_set[:, 1].max() + 1, step=0.01))
mtp.contourf(x1, x2, classifier.predict(nm.array([x1.ravel(), x2.ravel()]).T).reshape(x1.shape),
             alpha=0.75, cmap=ListedColormap(('red', 'green')))
mtp.xlim(x1.min(), x1.max())
mtp.ylim(x2.min(), x2.max())
for i, j in enumerate(nm.unique(y_set)):
    mtp.scatter(x_set[y_set == j, 0], x_set[y_set == j, 1],
                c=ListedColormap(('red', 'green'))(i), label=j)
mtp.title('SVM classifier (Test set)')
mtp.xlabel('Age')
mtp.ylabel('Estimated Salary')
mtp.legend()
mtp.show()
Output:
As we can see in the above output image, the SVM classifier has divided the users into two regions
(Purchased or Not purchased). Users who purchased the SUV are in the red region with the red
scatter points. And users who did not purchase the SUV are in the green region with green scatter
points. The hyperplane has separated the two classes, Purchased and Not purchased.