Machine Learning: Lecture 7: Create Your First Project
Machine Learning: Lecture 7: Create Your First Project
Email: [email protected]
Iris flower classification
Iris dataset
150 samples
3 labels/categories: Species of Iris (Iris setosa, Iris virginica and Iris
versicolor)
4 features: Sepal length, Sepal width, Petal length, Petal Width in
cm
Iris dataset instances
Import libraries
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn import tree
from sklearn.metrics import accuracy_score
Load the dataset
iris_data = pd.read_csv('IRIS.csv')
Summarize the dataset
# dimensions (no. of rows & columns)
print(iris_data.shape)
# list of columns/features
print(iris_data.columns)
# peek some data
print(iris_data.head(10))
# statistical summary
print(iris_data.describe())
Specify the target variable and its
distribution
# target variable
target = iris_data['species']