Assignment 5

The document provides instructions for using Scikit-learn to analyze the Iris dataset, including printing keys, dimensions, feature names, and a description of the dataset. It details the dataset's characteristics, such as the number of instances and attributes, along with summary statistics. Additionally, it demonstrates how to manipulate the dataset using pandas and numpy.

Uploaded by

devashish250303

TAGALI ATHRAV DHAREPPA

MIS: 642310018
BRANCH: MECHANICAL
BATCH: H
ASSIGNMENT NO. 5
Use Scikit-learn to print the keys, the number of rows and columns, the feature
names and the description of the Iris data.
INPUT:
from sklearn.datasets import load_iris
iris = load_iris()
print("Keys of the dataset:\n", iris.keys())
print("\nNumber of rows and columns:\n", iris.data.shape)
print("\nFeature names:", iris.feature_names)
print("\nDataset description:\n", iris.DESCR)
OUTPUT:
Keys of the dataset:
dict_keys(['data', 'target', 'frame', 'target_names', 'DESCR', 'feature_names', 'filename', 'data_module'])

Number of rows and columns:
(150, 4)

Feature names: ['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)']

Dataset description:
.. _iris_dataset:

Iris plants dataset
--------------------

Data Set Characteristics:

:Number of Instances: 150 (50 in each of three classes)

:Number of Attributes: 4 numeric, predictive attributes and the class
:Attribute Information:
- sepal length in cm
- sepal width in cm
- petal length in cm
- petal width in cm
- class:
- Iris-Setosa
- Iris-Versicolour
- Iris-Virginica

:Summary Statistics:

============== ==== ==== ======= ===== ====================
                Min  Max   Mean    SD   Class Correlation
============== ==== ==== ======= ===== ====================
sepal length:   4.3  7.9   5.84   0.83    0.7826
sepal width:    2.0  4.4   3.05   0.43   -0.4194
petal length:   1.0  6.9   3.76   1.76    0.9490  (high!)
petal width:    0.1  2.5   1.20   0.76    0.9565  (high!)
============== ==== ==== ======= ===== ====================

:Missing Attribute Values: None

:Class Distribution: 33.3% for each of 3 classes.
:Creator: R.A. Fisher
:Donor: Michael Marshall (MARSHALL%PLU@io.arc.nasa.gov)
:Date: July, 1988

The famous Iris database, first used by Sir R.A. Fisher. The dataset is taken
from Fisher's paper. Note that it's the same as in R, but not as in the UCI
Machine Learning Repository, which has two wrong data points.

This is perhaps the best known database to be found in the
pattern recognition literature. Fisher's paper is a classic in the field and
is referenced frequently to this day. (See Duda & Hart, for example.) The
data set contains 3 classes of 50 instances each, where each class refers to a
type of iris plant. One class is linearly separable from the other 2; the
latter are NOT linearly separable from each other.

|details-start|
**References**
|details-split|

- Fisher, R.A. "The use of multiple measurements in taxonomic problems"
Annual Eugenics, 7, Part II, 179-188 (1936); also in "Contributions to
Mathematical Statistics" (John Wiley, NY, 1950).
- Duda, R.O., & Hart, P.E. (1973) Pattern Classification and Scene Analysis.
(Q327.D83) John Wiley & Sons. ISBN 0-471-22361-1. See page 218.
- Dasarathy, B.V. (1980) "Nosing Around the Neighborhood: A New System
Structure and Classification Rule for Recognition in Partially Exposed
Environments". IEEE Transactions on Pattern Analysis and Machine
Intelligence, Vol. PAMI-2, No. 1, 67-71.
- Gates, G.W. (1972) "The Reduced Nearest Neighbor Rule". IEEE Transactions
on Information Theory, May 1972, 431-433.
- See also: 1988 MLC Proceedings, 54-64. Cheeseman et al"s AUTOCLASS II
conceptual clustering system finds 3 classes in the data.
- Many, many more ...
|details-end|
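The 'frame' key listed in the output above is only populated when the loader is asked for a DataFrame. The following is a minimal sketch, assuming a scikit-learn version that supports the as_frame argument of load_iris; the names iris_bunch, iris_frame, species and class_counts are illustrative and not part of the assignment. It also checks the "50 in each of three classes" statement from the description.

```python
from sklearn.datasets import load_iris

# as_frame=True asks scikit-learn to also build pandas objects
# (assumption: a scikit-learn release that accepts this argument)
iris_bunch = load_iris(as_frame=True)

# The frame holds the 4 feature columns plus a 'target' column
iris_frame = iris_bunch.frame
print(iris_frame.shape)

# Map the integer class codes to species names and count rows per class
species = iris_bunch.target.map(lambda code: iris_bunch.target_names[code])
class_counts = species.value_counts().sort_index()
print(class_counts)
```

With as_frame left at its default, iris.frame is None and the data stays a plain NumPy array, which is what the assignment code above works with.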
INPUT:
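import pandas as pd
from sklearn.datasets import load_iris

iris = load_iris()
# Build a DataFrame from the feature matrix, then summarize each column
iris_data = pd.DataFrame(data=iris.data, columns=iris.feature_names)
stats = iris_data.describe()
print(stats)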

OUTPUT:
       sepal length (cm)  sepal width (cm)  petal length (cm)  \
count         150.000000        150.000000         150.000000
mean            5.843333          3.057333           3.758000
std             0.828066          0.435866           1.765298
min             4.300000          2.000000           1.000000
25%             5.100000          2.800000           1.600000
50%             5.800000          3.000000           4.350000
75%             6.400000          3.300000           5.100000
max             7.900000          4.400000           6.900000

       petal width (cm)
count        150.000000
mean           1.199333
std            0.762238
min            0.100000
25%            0.300000
50%            1.300000
75%            1.800000
max            2.500000
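The mean and std rows of the summary can be cross-checked directly with NumPy. This is a rough sketch rather than part of the assignment; manual_mean and manual_std are illustrative names. The point it shows is that describe() reports the sample standard deviation (ddof=1), while NumPy's std defaults to ddof=0.

```python
import numpy as np
import pandas as pd
from sklearn.datasets import load_iris

iris = load_iris()
iris_data = pd.DataFrame(iris.data, columns=iris.feature_names)
stats = iris_data.describe()

# Column-wise mean and sample standard deviation computed by hand
manual_mean = iris.data.mean(axis=0)
manual_std = iris.data.std(axis=0, ddof=1)  # ddof=1 matches pandas

mean_matches = np.allclose(stats.loc["mean"].to_numpy(), manual_mean)
std_matches = np.allclose(stats.loc["std"].to_numpy(), manual_std)
print(mean_matches, std_matches)
```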

INPUT:
import numpy as np
from scipy.sparse import csr_matrix
dense_matrix=np.eye(5)
sparse_matrix = csr_matrix(dense_matrix)
print(sparse_matrix)

OUTPUT:
(0, 0) 1.0
(1, 1) 1.0
(2, 2) 1.0
(3, 3) 1.0
(4, 4) 1.0
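Each printed pair above is a (row, column) position followed by the stored value. The sketch below, which is an illustration and not part of the assignment, looks at the three arrays a CSR matrix keeps internally and converts the matrix back to a dense array; round_trip is an illustrative name.

```python
import numpy as np
from scipy.sparse import csr_matrix

sparse_matrix = csr_matrix(np.eye(5))

print(sparse_matrix.nnz)      # number of stored (non-zero) entries
print(sparse_matrix.data)     # the stored values
print(sparse_matrix.indices)  # column index of each stored value
print(sparse_matrix.indptr)   # row i owns data[indptr[i]:indptr[i + 1]]

# Converting back to a dense array recovers the identity matrix
round_trip = sparse_matrix.toarray()
print(np.array_equal(round_trip, np.eye(5)))
```

For a 5x5 identity matrix only 5 of the 25 entries are stored, which is why the sparse form pays off as matrices grow larger and emptier.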
INPUT:
import pandas as pd

df = pd.read_csv('iris.csv')
print(df.head())
# Drop the 'sepal.length' column and the row with index label 2 in one call
df_modified = df.drop(columns=['sepal.length'], index=2)
print("\nModified DataFrame (without 'sepal.length' column and row 2)")
print(df_modified.head())

OUTPUT:
   sepal.length  sepal.width  petal.length  petal.width variety
0           5.1          3.5           1.4          0.2  Setosa
1           4.9          3.0           1.4          0.2  Setosa
2           4.7          3.2           1.3          0.2  Setosa
3           4.6          3.1           1.5          0.2  Setosa
4           5.0          3.6           1.4          0.2  Setosa

Modified DataFrame (without 'sepal.length' column and row 2)
   sepal.width  petal.length  petal.width variety
0          3.5           1.4          0.2  Setosa
1          3.0           1.4          0.2  Setosa
3          3.1           1.5          0.2  Setosa
4          3.6           1.4          0.2  Setosa
5          3.9           1.7          0.4  Setosa
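The index labels in the modified output skip from 1 to 3 because drop removes a row by its label and does not renumber the rest. The sketch below reproduces this without needing iris.csv, using only the first five rows shown in the output above; df_small and df_renumbered are illustrative names, and reset_index is an optional extra step the assignment itself does not take.

```python
import pandas as pd

# First five rows copied from the printed output above
df_small = pd.DataFrame({
    "sepal.length": [5.1, 4.9, 4.7, 4.6, 5.0],
    "sepal.width": [3.5, 3.0, 3.2, 3.1, 3.6],
    "petal.length": [1.4, 1.4, 1.3, 1.5, 1.4],
    "petal.width": [0.2, 0.2, 0.2, 0.2, 0.2],
    "variety": ["Setosa"] * 5,
})

# drop returns a new DataFrame; df_small itself is left unchanged
df_modified = df_small.drop(columns=["sepal.length"], index=2)
print(df_modified.index.tolist())

# Optional: renumber the rows so the labels are consecutive again
df_renumbered = df_modified.reset_index(drop=True)
print(df_renumbered.index.tolist())
```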
