DSUP Exp6

The document presents an experiment comparing Support Vector Machines (SVM) and Random Forest Classifier using the Iris dataset to predict flower species. It discusses the theoretical background of machine learning, the characteristics of both algorithms, and their performance outcomes, concluding that SVM generally performs better on this specific dataset. The choice between the two algorithms depends on the dataset's nature, interpretability needs, and available computational resources.

Uploaded by

Chetan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views5 pages

DSUP Exp6

Uploaded by

Chetan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

Name: Nikhil Namade

Roll No. A16

ID - TU4F2223015

Experiment No. 6

AIM:
Implement and compare any one case study using SVM classifier and
Random Forest Classifier with web deployment

Theory:
Machine Learning is the field of study that gives computers the capability to
learn without being explicitly programmed. ML is one of the most exciting
technologies that one would have ever come across. As it is evident from
the name, it gives the computer that makes it more similar to humans: The
ability to learn. Machine learning is actively being used today, perhaps in
many more places than one would expect.
Machine Learning is an essential skill for any aspiring data analyst and data
scientist, and also for those who wish to transform a massive amount of raw
data into trends and predictions.
Supervised learning is the types of machine learning in which machines are
trained using well "labelled" training data, and on basis of that data,
machines predict the output. The labelled data means some input data is
already tagged with the correct output. In supervised learning, the training
data provided to the machines work as the supervisor that teaches the
machines to predict the output correctly.
Supervised learning is a process of providing input data as well as correct
output data to the machine learning model. The aim of a supervised
learning algorithm is to find a mapping function to map the input
variable(x) with the output variable(y).

Fortunately, with libraries such as Scikit Learn, it’s now easy to study
structured or unstructured data using scientific methods, algorithms and
systems to extract knowledge.

Here we are going to discuss two of the most popular algorithms — Support
Vector Machines abbreviated as SVMs and Random Forests.

SUPPORT VECTOR MACHINES:

Name: Nikhil Namade
Roll No. A16
ID - TU4F2223015

Support Vector Machine is a supervised learning model which can be used

for both classification or regression challenges. However, it is mostly used
in classification problems where the data is sparse (easy to classify). We
perform classification by finding the hyper-plane that differentiates
between the two classes very well .

RANDOM FOREST:
Random Forest is also one of the most used algorithms in machine learning.
It can be used for both classification and regression tasks. The “forest” it
builds, is an ensemble of decision trees, usually trained with the “bagging”
method. The general idea of the bagging method is to create a combination
of learning models which improves the overall result. Basically, Random
forest uses multiple decision trees and merges them together to get an
accurate and stable prediction.

Support Vector Machines

Aspect Random Forests
(SVM)
Model Type Discriminative model Ensemble model (Bagging)
Constructs multiple decision
Finds optimal separating
Algorithm trees and combines their
hyperplane
outputs
Less interpretable (black More interpretable (can
Interpretability
box) extract feature importances)
Handling Can handle nonlinear
Inherently handles nonlinear
relationships using kernel
Nonlinearity relationships
tricks
Handling Outliers Sensitive to outliers More robust to outliers
Feature No direct feature Provides feature importance
Importance importance measure scores
Less prone to overfitting for Less prone to overfitting due
Overfitting
nonlinear kernels to ensemble approach
Does not require scaling of
Scaling Requires scaling of features
features
Parallelization possible for
Parallelization Limited parallelization
training and prediction
Multiclass Handles multiclass
Handles multiclass problems
problems using one-vs-one
Problems natively
or one-vs-rest
Scales poorly with large
Scales well with number of
Memory Usage number of samples and
samples
features
Name: Nikhil Namade
Roll No. A16
ID - TU4F2223015

Kernel type, regularization Number of trees, max depth,

Hyperparameters
parameter, gamma max features, etc.

Implementation:
Dataset:
We are going to discuss SVM VS Random forests by taking an example of Iris
dataset (data of flowers). Here we have to predict the species of the flower
with certain features, namely, sepal width, sepal length, petal width and petal
length.

Code:

Random Forest Classifier:

SVM
Classifier:
Name: Nikhil Namade
Roll No. A16
ID - TU4F2223015

Output:

Accuracy of Random Forest Classifier:

Accuracy of SVM Classifier:

Name: Nikhil Namade
Roll No. A16
ID - TU4F2223015

Conclusion:

It’s because in this dataset, data is sparse and easy to classify, hence SVM
works faster and provides better results. However, random forest also gives
good results but does not match SVM for this particular dataset. The choice
of algorithm depends upon the desired outcome. Although both of the
models are good at their place, but, it very much depends upon the quality
of data when it comes to algorithm’s performance. The choice between SVM
and Random Forest depends on the specific requirements of your project,
including the nature of your dataset, the importance of interpretability, and
the computational resources available. In some cases, SVMs may
outperform Random Forests, especially in tasks that require a clear
separation between classes or when interpretability is crucial. Conversely,
Random Forests may be more suitable for tasks with a large number of
features or when dealing with complex, non-linearly separable data.

FY24 EMEA TAC Sec Workshop - Firewall - ASAFTD High-Availability
No ratings yet
FY24 EMEA TAC Sec Workshop - Firewall - ASAFTD High-Availability
43 pages
ML Mod1
No ratings yet
ML Mod1
48 pages
UNIT-3 Notes
No ratings yet
UNIT-3 Notes
12 pages
Random Forest
No ratings yet
Random Forest
25 pages
Lecture 6
No ratings yet
Lecture 6
24 pages
Assessing A Single Classification Algorithm and Two Classification Algorithms
No ratings yet
Assessing A Single Classification Algorithm and Two Classification Algorithms
12 pages
UNIT 2-Part2
No ratings yet
UNIT 2-Part2
9 pages
ML Unit-3 Part-1
No ratings yet
ML Unit-3 Part-1
17 pages
13 PracticalMachineLearning
100% (1)
13 PracticalMachineLearning
84 pages
U21amg05 Aif and ML Unit 04 Notes
No ratings yet
U21amg05 Aif and ML Unit 04 Notes
42 pages
Report of Comparing 5 Classification Algorithms of Machine Learning PDF
No ratings yet
Report of Comparing 5 Classification Algorithms of Machine Learning PDF
4 pages
Implementation of Credit Card Fraud Detection Using Random Forest Algorithm
100% (1)
Implementation of Credit Card Fraud Detection Using Random Forest Algorithm
10 pages
ML Unit 3
No ratings yet
ML Unit 3
22 pages
ML Unit-3
No ratings yet
ML Unit-3
28 pages
Unit 3
No ratings yet
Unit 3
20 pages
Three Machine Learning Algorithms
No ratings yet
Three Machine Learning Algorithms
11 pages
A) What Is Motivation Behind Ensemble Methods? Give Your Answer in Probabilistic Terms
100% (1)
A) What Is Motivation Behind Ensemble Methods? Give Your Answer in Probabilistic Terms
6 pages
ML Module 3
No ratings yet
ML Module 3
44 pages
UNIT-3 Material
No ratings yet
UNIT-3 Material
19 pages
Classification Algorithms
No ratings yet
Classification Algorithms
68 pages
End SEM V IMP DSE 2
No ratings yet
End SEM V IMP DSE 2
9 pages
ML Unit 3 V1
No ratings yet
ML Unit 3 V1
25 pages
Comparative Study
No ratings yet
Comparative Study
17 pages
Jntuk Machine Learning 3-2 Unit-3
No ratings yet
Jntuk Machine Learning 3-2 Unit-3
33 pages
Unit 3
No ratings yet
Unit 3
63 pages
Unit 3 &4 BDA Notes
No ratings yet
Unit 3 &4 BDA Notes
20 pages
Machine Learning - Iii
No ratings yet
Machine Learning - Iii
53 pages
Unit 3
No ratings yet
Unit 3
12 pages
Classification
No ratings yet
Classification
4 pages
Mid2 Answers
No ratings yet
Mid2 Answers
42 pages
What Is An SVM
No ratings yet
What Is An SVM
24 pages
Wart Treatment Using Machine Learning Support Vector Algorithm
No ratings yet
Wart Treatment Using Machine Learning Support Vector Algorithm
6 pages
Algorithms 1
No ratings yet
Algorithms 1
23 pages
Lecture-12 Machine Learning With Python
No ratings yet
Lecture-12 Machine Learning With Python
18 pages
ML Unit-3
No ratings yet
ML Unit-3
16 pages
ML Unit-3
No ratings yet
ML Unit-3
15 pages
Unit 3 Ds
No ratings yet
Unit 3 Ds
10 pages
Eda - M4
No ratings yet
Eda - M4
7 pages
ML Mod 4
No ratings yet
ML Mod 4
13 pages
Classification
No ratings yet
Classification
10 pages
SVM Unit3
No ratings yet
SVM Unit3
23 pages
Unit 3
No ratings yet
Unit 3
59 pages
Unit 3 PDF
No ratings yet
Unit 3 PDF
7 pages
Module 2
No ratings yet
Module 2
34 pages
Unit 3 Aam
No ratings yet
Unit 3 Aam
30 pages
Machine Learning Lecture 2,3,4
No ratings yet
Machine Learning Lecture 2,3,4
26 pages
RandomForest Vs SVM Comparison
No ratings yet
RandomForest Vs SVM Comparison
1 page
Unit 4 (Ensemble Methods)
No ratings yet
Unit 4 (Ensemble Methods)
24 pages
D3 IT Random Forest Apr 2023
No ratings yet
D3 IT Random Forest Apr 2023
32 pages
Random Forest
No ratings yet
Random Forest
29 pages
Classifying Data Using Support Vector Machines (SVMS) in Python
No ratings yet
Classifying Data Using Support Vector Machines (SVMS) in Python
5 pages
QUESTIONS
No ratings yet
QUESTIONS
20 pages
ML Unit 3 (DS)
No ratings yet
ML Unit 3 (DS)
31 pages
Kanksha2021 Chapter SupervsedLearnngAlgorthmASu
No ratings yet
Kanksha2021 Chapter SupervsedLearnngAlgorthmASu
9 pages
AI Chapter 3 Part 3
No ratings yet
AI Chapter 3 Part 3
49 pages
3.unit 3 ML Part-1 Q&A
No ratings yet
3.unit 3 ML Part-1 Q&A
39 pages
Untitled Presentation
No ratings yet
Untitled Presentation
21 pages
Types of Kernels in Support Vector Machines
No ratings yet
Types of Kernels in Support Vector Machines
14 pages
Kernel Methods: Fundamentals and Applications
From Everand
Kernel Methods: Fundamentals and Applications
Fouad Sabry
No ratings yet
Java 9 Data Structures and Algorithms
From Everand
Java 9 Data Structures and Algorithms
Debasish Ray Chawdhuri
No ratings yet
Python Machine Learning By Example: Unlock machine learning best practices with real-world use cases
From Everand
Python Machine Learning By Example: Unlock machine learning best practices with real-world use cases
Yuxi (Hayden) Liu
No ratings yet
Movie Recommendation System Report
No ratings yet
Movie Recommendation System Report
18 pages
Accenture
No ratings yet
Accenture
8 pages
DSUP Exp7
No ratings yet
DSUP Exp7
6 pages
DSUP Exp4
No ratings yet
DSUP Exp4
6 pages
Malvan 20250305 191946 0000
No ratings yet
Malvan 20250305 191946 0000
8 pages
DSUP Exp3
No ratings yet
DSUP Exp3
22 pages
ET
No ratings yet
ET
778 pages
350-601-11 01 2022 PDF
No ratings yet
350-601-11 01 2022 PDF
55 pages
International Journal of Data Mining & Knowledge Management Process (IJDKP)
No ratings yet
International Journal of Data Mining & Knowledge Management Process (IJDKP)
3 pages
SBS Product Catalog 2018
No ratings yet
SBS Product Catalog 2018
53 pages
SMMO 2017-2023 Problems
No ratings yet
SMMO 2017-2023 Problems
32 pages
IOT in 5G Training and Certification by TELCOMA Global
100% (1)
IOT in 5G Training and Certification by TELCOMA Global
150 pages
The Complete Servicenow System Administrator Course: Section 6 - User Administration
No ratings yet
The Complete Servicenow System Administrator Course: Section 6 - User Administration
19 pages
IT-2205 Lec 03 Error Detection & Correction-1
No ratings yet
IT-2205 Lec 03 Error Detection & Correction-1
45 pages
Intro-Data Center
No ratings yet
Intro-Data Center
22 pages
Kleene's Theorem
No ratings yet
Kleene's Theorem
6 pages
TFX Power 3 Data Sheet en
No ratings yet
TFX Power 3 Data Sheet en
3 pages
Tallernning 31634053d07c239
No ratings yet
Tallernning 31634053d07c239
2 pages
Ahmad Javaid - Software Engineer
No ratings yet
Ahmad Javaid - Software Engineer
1 page
Case-Study-Dos - 19070123
No ratings yet
Case-Study-Dos - 19070123
13 pages
HashMap HashSet LeetCode Questions
No ratings yet
HashMap HashSet LeetCode Questions
2 pages
Digital Marketing
No ratings yet
Digital Marketing
41 pages
Azure Book 126
No ratings yet
Azure Book 126
1 page
ICDL Documents Syllabus 6.0 1
No ratings yet
ICDL Documents Syllabus 6.0 1
6 pages
DT2485 - DT-BUS Data Logger
No ratings yet
DT2485 - DT-BUS Data Logger
2 pages
Assignment 5 Ageing Chchcs 001 Chcccs 025
No ratings yet
Assignment 5 Ageing Chchcs 001 Chcccs 025
39 pages
PTD Lab Manual
No ratings yet
PTD Lab Manual
16 pages
Subnetting A Network With IP Addresses To Share Among Different Sites
No ratings yet
Subnetting A Network With IP Addresses To Share Among Different Sites
5 pages
Advanced Java Programming Chapter 5 - Network Programming
No ratings yet
Advanced Java Programming Chapter 5 - Network Programming
39 pages
X86 Sale
No ratings yet
X86 Sale
11 pages
Unit Three DBMS Notes-1
No ratings yet
Unit Three DBMS Notes-1
31 pages
The Evolution of Internet
100% (2)
The Evolution of Internet
5 pages
This Study Resource Was Shared Via
20% (5)
This Study Resource Was Shared Via
2 pages
Rs Syll
No ratings yet
Rs Syll
3 pages
EDA Manual
No ratings yet
EDA Manual
20 pages
Computer Interface Design: Dr. Ghassan Abu Samhadana
No ratings yet
Computer Interface Design: Dr. Ghassan Abu Samhadana
37 pages