Iris Dataset Project Report
Classification Project
with Machine Learning
Dipali Mistry
01/11/2022
Contents
1. INTRODUCTION
1.1 Problem statement
1.2 Prepare the data
2. Methodology
2.1 Pre Processing
2.1.1 Exploratory Data Analysis
2.1.2 Outlier Analysis
2.1.3 Box plot grid
3. Model implementation
3.1 Logistic Regression
3.2 K-Nearest Neighbour (KNN)
3.3 Support Vector Machine (SVM)
3.4 Decision Trees
3.5 Naive Bayes classifier
4. Conclusion
1. INTRODUCTION
Every machine learning project begins with understanding the data and defining
the objectives. While applying machine learning algorithms to a data set, you
are understanding, building and analysing the data in order to reach the end result.
3] Explore and Analyse the data
4] Apply the algorithms
5] Reduce the errors
6] Predict the result
To understand various machine learning algorithms, let us use the Iris data set, one of
the most famous datasets available.
This data set consists of the physical parameters of three species of iris flower:
Versicolor, Setosa and Virginica. The numeric parameters the dataset contains
are sepal width, sepal length, petal width and petal length. We will predict the
species of the flowers based on these parameters. The data consists of
continuous numeric values which describe the dimensions of the respective features,
and we will train the model on these features.
The dataset was created by Ronald Fisher in 1936. It contains the petal length,
petal width, sepal length and sepal width of 150 iris flowers from 3 different
species. The variables present in the given dataset are SepalLengthCm,
SepalWidthCm, PetalLengthCm, PetalWidthCm, and Species.
Now, view the info of the data frame, which shows details such as the count of
non-null values and each column's datatype along with the column names. It will
also show the memory usage.
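The inspection step above can be sketched as follows. This is a minimal sketch: `load_iris` from scikit-learn is used here as a stand-in for the report's CSV file, and the `Species` column is reconstructed from the numeric target.

```python
# Load the Iris data and inspect the resulting data frame.
import pandas as pd
from sklearn.datasets import load_iris

iris = load_iris(as_frame=True)
df = iris.frame.drop(columns="target")          # the four measurement columns
df["Species"] = iris.target_names[iris.target]  # map 0/1/2 back to species names

df.info()         # non-null counts, dtypes, column names, memory usage
print(df.head())  # first few rows as a sanity check
```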
2. Methodology
2.1 Pre Processing
Any predictive modeling requires that we look at the data before we start
modeling. However, in data mining terms, looking at data refers to much more
than just looking: it means exploring the data, cleaning the data, and
visualizing the data through graphs and plots. This is often called
Exploratory Data Analysis.
2.1.1 Exploratory Data Analysis
Bivariate scatterplots and univariate histograms are drawn in the same figure.
2.1.2 Outlier Analysis
Each feature is visualized in Seaborn through a boxplot.
The Iris-setosa species is separated from the other two across all feature
combinations.
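A sketch of one such boxplot, grouped by species; petal length is chosen here because it shows setosa's separation most clearly (the report may have plotted a different feature):

```python
# Boxplot of a single feature, grouped by species.
import matplotlib
matplotlib.use("Agg")  # render off-screen
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.datasets import load_iris

iris = load_iris(as_frame=True)
df = iris.frame.drop(columns="target")
df["Species"] = iris.target_names[iris.target]

ax = sns.boxplot(data=df, x="Species", y="petal length (cm)")
plt.savefig("petal_length_boxplot.png")
```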
2.1.3. Box plot grid
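The section title suggests one boxplot per feature arranged in a grid; a sketch under that assumption, with a 2x2 layout for the four measurements:

```python
# Grid of boxplots: one subplot per feature, grouped by species.
import matplotlib
matplotlib.use("Agg")  # render off-screen
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.datasets import load_iris

iris = load_iris(as_frame=True)
df = iris.frame.drop(columns="target")
df["Species"] = iris.target_names[iris.target]

fig, axes = plt.subplots(2, 2, figsize=(10, 8))
for ax, feature in zip(axes.flat, iris.feature_names):
    sns.boxplot(data=df, x="Species", y=feature, ax=ax)
fig.tight_layout()
fig.savefig("iris_boxplot_grid.png")
```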
A parallel coordinates plot places each feature on a separate column and then
draws lines connecting the feature values for each data sample.
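Pandas ships a helper for exactly this kind of plot; a sketch using it:

```python
# Parallel coordinates: one vertical axis per feature, one line per sample,
# coloured by the class column.
import matplotlib
matplotlib.use("Agg")  # render off-screen
import matplotlib.pyplot as plt
from pandas.plotting import parallel_coordinates
from sklearn.datasets import load_iris

iris = load_iris(as_frame=True)
df = iris.frame.drop(columns="target")
df["Species"] = iris.target_names[iris.target]

ax = parallel_coordinates(df, class_column="Species")
plt.savefig("iris_parallel_coordinates.png")
```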
3. Model implementation
We will train our model with several commonly used algorithms and check how accurate
each one is. The algorithms implemented for comparison are:
1] Logistic Regression
2] K – Nearest Neighbour (KNN)
3] Support Vector Machine (SVM)
4] Decision Trees
5] Naive Bayes classifier
3.1. Logistic Regression
We start with the first algorithm, Logistic Regression, and can build the model as follows:
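A minimal sketch of the training step; the train/test split (40% held out, fixed random state) and `max_iter` setting are assumptions, as the report's exact parameters are not shown:

```python
# Train a Logistic Regression classifier on Iris and report test accuracy.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.4, random_state=0)

model = LogisticRegression(max_iter=200)  # extra iterations so the solver converges
model.fit(X_train, y_train)
acc = accuracy_score(y_test, model.predict(X_test))
print(f"Logistic Regression accuracy: {acc:.2f}")
```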
3.2. K – Nearest Neighbour (KNN)
Now, let us see the scores with the K-Nearest Neighbours technique.
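A sketch of the KNN step; `n_neighbors=5` (scikit-learn's default) and the split are assumptions:

```python
# Train a K-Nearest Neighbours classifier on Iris and report test accuracy.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.4, random_state=0)

knn = KNeighborsClassifier(n_neighbors=5)  # classify by majority vote of 5 neighbours
knn.fit(X_train, y_train)
acc = accuracy_score(y_test, knn.predict(X_test))
print(f"KNN accuracy: {acc:.2f}")
```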
3.3. Support Vector Machine (SVM)
Third, we train a Support Vector Machine (SVM).
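A sketch of the SVM step; the default RBF kernel and the split are assumptions:

```python
# Train a Support Vector Machine on Iris and report test accuracy.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.4, random_state=0)

svm = SVC()  # default RBF kernel
svm.fit(X_train, y_train)
acc = accuracy_score(y_test, svm.predict(X_test))
print(f"SVM accuracy: {acc:.2f}")
```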
3.4. Decision Trees
Next is a yes/no type of algorithm: decision trees.
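A sketch of the decision tree step; `random_state=0` is an assumption to make the tree reproducible:

```python
# Train a Decision Tree classifier on Iris and report test accuracy.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.4, random_state=0)

tree = DecisionTreeClassifier(random_state=0)  # fixed seed for reproducible splits
tree.fit(X_train, y_train)
acc = accuracy_score(y_test, tree.predict(X_test))
print(f"Decision Tree accuracy: {acc:.2f}")
```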
3.5. Naive Bayes classifier
And lastly, the Naive Bayes classifier, including its variants, which are
displayed together in the same graph.
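A sketch comparing the four variants named in the conclusion. Note that Multinomial, Bernoulli and Complement NB expect non-negative, ideally count-like features, so their scores on the continuous iris measurements are only indicative; the split is again an assumption:

```python
# Compare the Naive Bayes variants on Iris.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import (GaussianNB, MultinomialNB,
                                 BernoulliNB, ComplementNB)
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.4, random_state=0)

scores = {}
for clf in (GaussianNB(), MultinomialNB(), BernoulliNB(), ComplementNB()):
    clf.fit(X_train, y_train)
    name = type(clf).__name__
    scores[name] = accuracy_score(y_test, clf.predict(X_test))
    print(f"{name}: {scores[name]:.2f}")
```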
4. Conclusion
Flower classification is a simple and fundamental project for any machine learning
student, and every machine learning student should be thorough with the Iris flowers
dataset. This classification can be done with many classification algorithms; in this
report we used Logistic Regression (accuracy: 0.98), K-Nearest Neighbour
(accuracy: 1.0), Support Vector Machine (accuracy: 1.0), Decision Trees
(accuracy: 0.966), and the Naive Bayes classifiers: Gaussian Naive Bayes
(accuracy: 1.0), Multinomial Naive Bayes (accuracy: 0.83), Bernoulli Naive Bayes
(accuracy: 0.20), and Complement Naive Bayes (accuracy: 0.567).