Machine Learning: Lecture 7: Create Your First Project

This document provides instructions for creating a machine learning project to classify iris flowers using the iris dataset, which includes 150 samples described by 4 features. It outlines loading and exploring the iris data, splitting it into training and test sets, building a decision tree classifier model, and evaluating the model's accuracy on both the training and test sets. Additionally, it suggests some homework extensions including applying normalization, comparing other classifier models, and finding the best predictive model.

Uploaded by

Bisnu Sarkar

Machine Learning

Lecture 7: Create Your First Project

COURSE CODE: CSE490
2019
Course Teacher
Dr. Mrinal Kanti Baowaly
Assistant Professor
Department of Computer Science and
Engineering, Bangabandhu Sheikh
Mujibur Rahman Science and
Technology University, Bangladesh.

Email: [email protected]
Iris flower classification
Iris dataset
• 150 samples
• 3 labels/categories: species of Iris (Iris setosa, Iris virginica and Iris versicolor)
• 4 features: sepal length, sepal width, petal length and petal width, in cm
Iris dataset instances
Import libraries
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn import tree
from sklearn.metrics import accuracy_score
Load the dataset
iris_data = pd.read_csv('IRIS.csv')
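If a local IRIS.csv file is not at hand, roughly the same table can be built from the copy of the dataset bundled with scikit-learn. This is only a sketch: the helper name load_iris_frame and the short column names are assumptions chosen so that the 'species' column matches the rest of these slides; the actual IRIS.csv may use different feature column names.

```python
import pandas as pd
from sklearn.datasets import load_iris


def load_iris_frame():
    """Build a DataFrame shaped like the slides' IRIS.csv (hypothetical helper)."""
    iris = load_iris()
    # Column names are an assumption; the real CSV may label features differently.
    frame = pd.DataFrame(
        iris.data,
        columns=['sepal_length', 'sepal_width', 'petal_length', 'petal_width'],
    )
    # Map the integer class codes (0, 1, 2) to species names.
    frame['species'] = iris.target_names[iris.target]
    return frame


iris_data = load_iris_frame()
print(iris_data.shape)
print(iris_data['species'].unique())
```

The resulting frame has 150 rows, the 4 feature columns and one 'species' column, so the later steps should run on it unchanged.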
Summarize the dataset
# dimensions (no. of rows & columns)
print(iris_data.shape)
# list of columns/features
print(iris_data.columns)
# peek some data
print(iris_data.head(10))
# statistical summary
print(iris_data.describe())
Specify the target variable and its distribution
# target variable
target = iris_data['species']

# distribution of class labels or categories
# (Series.value_counts() replaces the deprecated top-level pd.value_counts)
print(target.value_counts())

# alternative way of finding the class distribution
print(iris_data.groupby('species').size())
Split dataset into training and test data
seed = 7
train_data, test_data = train_test_split(iris_data, test_size=0.3, random_state=seed)
# shape of the datasets
print('\nShape of training data :', train_data.shape)
print('\nShape of testing data :', test_data.shape)
# class distribution of the training data
print(train_data['species'].value_counts())
# class distribution of the test data
print(test_data['species'].value_counts())
Balanced split of the dataset
seed = 7
train_data, test_data = train_test_split(iris_data, test_size=0.3,
random_state=seed, stratify=target)
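To see what the stratify argument changes, the sketch below compares the class counts of a plain split and a stratified split. It uses scikit-learn's bundled copy of the iris data (an assumption, so that it runs without IRIS.csv), where the classes are coded 0, 1 and 2 instead of species names.

```python
from collections import Counter

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

iris = load_iris()
labels = iris.target  # 50 samples for each of the classes 0, 1, 2

# Plain split: class counts in the test set depend on the random shuffle.
_, _, plain_train_y, plain_test_y = train_test_split(
    iris.data, labels, test_size=0.3, random_state=7)

# Stratified split: class proportions are preserved in both parts.
_, _, strat_train_y, strat_test_y = train_test_split(
    iris.data, labels, test_size=0.3, random_state=7, stratify=labels)

print('Plain test counts     :', Counter(plain_test_y.tolist()))
print('Stratified test counts:', Counter(strat_test_y.tolist()))
print('Stratified train counts:', Counter(strat_train_y.tolist()))
```

Because the dataset has 50 samples per class, the stratified split puts 15 samples of each class in the 45-sample test set and 35 of each in the training set.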
Separate the independent and target variables
# separate the independent and target variables from training data
# (columns=[...] already selects columns, so axis=1 is not needed)
train_x = train_data.drop(columns=['species'])
train_y = train_data['species']

# separate the independent and target variables from test data
test_x = test_data.drop(columns=['species'])
test_y = test_data['species']
Build the model
# create a classifier object/model
model = tree.DecisionTreeClassifier()

# train the model with the fit function
model.fit(train_x, train_y)
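Once a tree has been fitted, it can help to look at the rules it learned. The sketch below is not part of the original slides: it fits a tree on scikit-learn's bundled iris data (an assumption, to stay independent of IRIS.csv), fixes random_state for repeatability, and prints the rules with tree.export_text.

```python
from sklearn import tree
from sklearn.datasets import load_iris

iris = load_iris()

# random_state is set only so that repeated runs build the same tree.
model = tree.DecisionTreeClassifier(random_state=7)
model.fit(iris.data, iris.target)

# Text rendering of the learned if/else rules, one line per node.
rules = tree.export_text(model, feature_names=list(iris.feature_names))
print(rules)
print('Tree depth:', model.get_depth())
```

Each leaf line ends with the predicted class, so the output shows which feature thresholds separate the three species.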
Make predictions
# make predictions on training data
predictions_train = model.predict(train_x)
print('\nTraining Accuracy :', accuracy_score(train_y,
predictions_train))

# make predictions on test data

predictions_test = model.predict(test_x)
print('\nTest Accuracy :', accuracy_score(test_y, predictions_test))
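The accuracy reported above is simply the fraction of samples whose predicted label equals the true label. A minimal hand-written version, shown only to illustrate the idea (the function name accuracy and the toy labels are made up for this example):

```python
def accuracy(true_labels, predicted_labels):
    """Fraction of positions where the prediction equals the true label."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError('both label lists must have the same length')
    matches = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return matches / len(true_labels)


# Toy example: 3 of the 4 predictions are correct.
true_y = ['setosa', 'versicolor', 'virginica', 'setosa']
pred_y = ['setosa', 'versicolor', 'setosa', 'setosa']
print(accuracy(true_y, pred_y))  # 0.75
```

An unpruned decision tree often scores at or near 1.0 on its own training data, so the test accuracy is the more informative of the two numbers.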
Homework for the lab
• Apply normalization or standardization
• Apply different classifiers and compare their performances
   • Logistic Regression (LR)
   • K-Nearest Neighbors (KNN)
   • Support Vector Machines (SVM)
• Find the best model for the prediction task
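One possible starting point for the homework is sketched below. It is not a reference solution: it assumes scikit-learn's bundled iris data instead of IRIS.csv, uses StandardScaler for standardization, and keeps each classifier's default settings apart from max_iter and n_neighbors, which are arbitrary choices.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

iris = load_iris()
train_x, test_x, train_y, test_y = train_test_split(
    iris.data, iris.target, test_size=0.3, random_state=7, stratify=iris.target)

# The three classifiers named in the homework list.
candidates = {
    'LR': LogisticRegression(max_iter=1000),
    'KNN': KNeighborsClassifier(n_neighbors=5),
    'SVM': SVC(),
}

test_accuracy = {}
for name, classifier in candidates.items():
    # The scaler is fitted on the training data only, then applied to the test data.
    model = make_pipeline(StandardScaler(), classifier)
    model.fit(train_x, train_y)
    test_accuracy[name] = accuracy_score(test_y, model.predict(test_x))

# Print the models from best to worst test accuracy.
for name, score in sorted(test_accuracy.items(), key=lambda item: item[1], reverse=True):
    print(f'{name}: {score:.3f}')
```

With only 45 test samples, small differences between models may not be meaningful; averaging over several splits (for example with cross-validation) would give a steadier comparison.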
Some example projects
• Iris classification [Link1, Link2]
• Machine Learning-Let’s Get Started [Link]
• Your First Machine Learning Project in Python Step-By-Step [Link]
• 24 Data Science Projects To Boost Your Knowledge and Skills [Link]
• 6 Complete Machine Learning Projects [Link]
