0% found this document useful (0 votes)

38 views11 pages

Experiment-2-1-Ml Kritika

The document loads and analyzes the Iris dataset using Pandas. It displays the first few rows, renames columns, extracts numeric columns, calculates statistics like mean and standard deviation, and shows subsets of rows.

Uploaded by

KRITIKA DAS

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views11 pages

Experiment-2-1-Ml Kritika

Uploaded by

KRITIKA DAS

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

Experiment-2.

1
Aim : Study of Different Python Libraries

Pandas Library:
Load a dataset (Iris dataset: https://www.kaggle.com/datasets/uciml/iris) using pandas
Display the first few rows to understand its structure.
Calculate basic statistics (mean, median, standard deviation, etc.) for a numerical column in
the dataset.
Perform data filtering to extract rows based on specific conditions (e.g., SepalLengthCm>5.0).

In [1]:
import numpy as np
import pandas as pd

In [3]:
iris_df = pd.read_csv('C:/Users/kriti/Downloads/Iris.csv')
print(iris_df.head())

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa
1 2 4.9 3.0 1.4 0.2 Iris-setosa
2 3 4.7 3.2 1.3 0.2 Iris-setosa
3 4 4.6 3.1 1.5 0.2 Iris-setosa
4 5 5.0 3.6 1.4 0.2 Iris-setosa

In [4]:
new_col_name = ["ID","SepalLengthCm","SepalWidthCm" , "PetalLengthCm" , "PetalWi
iris_df.columns = new_col_name
iris_df.head()

Out[4]: ID SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

In [5]:
x = iris_df[iris_df.columns[1:-1]]
x.head()

Out[5]: SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm

0 5.1 3.5 1.4 0.2

1 4.9 3.0 1.4 0.2

2 4.7 3.2 1.3 0.2

3 4.6 3.1 1.5 0.2

SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm

4 5.0 3.6 1.4 0.2

In [6]:
y = iris_df[iris_df.columns[-1]]
y.head()

Out[6]: 0 Iris-setosa
1 Iris-setosa
2 Iris-setosa
3 Iris-setosa
4 Iris-setosa
Name: Species, dtype: object

In [7]:
sepal_length_stats = iris_df["SepalLengthCm"].describe()
print(sepal_length_stats)

count 150.000000
mean 5.843333
std 0.828066
min 4.300000
25% 5.100000
50% 5.800000
75% 6.400000
max 7.900000
Name: SepalLengthCm, dtype: float64

In [8]:
sepal_length_stats = iris_df["PetalWidthCm"].describe()
print(sepal_length_stats)

count 150.000000
mean 1.198667
std 0.763161
min 0.100000
25% 0.300000
50% 1.300000
75% 1.800000
max 2.500000
Name: PetalWidthCm, dtype: float64

In [9]:
iris_df.head(10)

Out[9]: ID SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

5 6 5.4 3.9 1.7 0.4 Iris-setosa

6 7 4.6 3.4 1.4 0.3 Iris-setosa

7 8 5.0 3.4 1.5 0.2 Iris-setosa

8 9 4.4 2.9 1.4 0.2 Iris-setosa

ID SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

9 10 4.9 3.1 1.5 0.1 Iris-setosa

In [10]:
iris_df.tail(10)

Out[10]: ID SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

140 141 6.7 3.1 5.6 2.4 Iris-virginica

141 142 6.9 3.1 5.1 2.3 Iris-virginica

142 143 5.8 2.7 5.1 1.9 Iris-virginica

143 144 6.8 3.2 5.9 2.3 Iris-virginica

144 145 6.7 3.3 5.7 2.5 Iris-virginica

145 146 6.7 3.0 5.2 2.3 Iris-virginica

146 147 6.3 2.5 5.0 1.9 Iris-virginica

147 148 6.5 3.0 5.2 2.0 Iris-virginica

148 149 6.2 3.4 5.4 2.3 Iris-virginica

149 150 5.9 3.0 5.1 1.8 Iris-virginica

In [11]:
iris_df[15:50]

Out[11]: ID SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

15 16 5.7 4.4 1.5 0.4 Iris-setosa

16 17 5.4 3.9 1.3 0.4 Iris-setosa

17 18 5.1 3.5 1.4 0.3 Iris-setosa

18 19 5.7 3.8 1.7 0.3 Iris-setosa

19 20 5.1 3.8 1.5 0.3 Iris-setosa

20 21 5.4 3.4 1.7 0.2 Iris-setosa

21 22 5.1 3.7 1.5 0.4 Iris-setosa

22 23 4.6 3.6 1.0 0.2 Iris-setosa

23 24 5.1 3.3 1.7 0.5 Iris-setosa

24 25 4.8 3.4 1.9 0.2 Iris-setosa

25 26 5.0 3.0 1.6 0.2 Iris-setosa

26 27 5.0 3.4 1.6 0.4 Iris-setosa

27 28 5.2 3.5 1.5 0.2 Iris-setosa

28 29 5.2 3.4 1.4 0.2 Iris-setosa

29 30 4.7 3.2 1.6 0.2 Iris-setosa

30 31 4.8 3.1 1.6 0.2 Iris-setosa

31 32 5.4 3.4 1.5 0.4 Iris-setosa

32 33 5.2 4.1 1.5 0.1 Iris-setosa

ID SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

33 34 5.5 4.2 1.4 0.2 Iris-setosa

34 35 4.9 3.1 1.5 0.1 Iris-setosa

35 36 5.0 3.2 1.2 0.2 Iris-setosa

36 37 5.5 3.5 1.3 0.2 Iris-setosa

37 38 4.9 3.1 1.5 0.1 Iris-setosa

38 39 4.4 3.0 1.3 0.2 Iris-setosa

39 40 5.1 3.4 1.5 0.2 Iris-setosa

40 41 5.0 3.5 1.3 0.3 Iris-setosa

41 42 4.5 2.3 1.3 0.3 Iris-setosa

42 43 4.4 3.2 1.3 0.2 Iris-setosa

43 44 5.0 3.5 1.6 0.6 Iris-setosa

44 45 5.1 3.8 1.9 0.4 Iris-setosa

45 46 4.8 3.0 1.4 0.3 Iris-setosa

46 47 5.1 3.8 1.6 0.2 Iris-setosa

47 48 4.6 3.2 1.4 0.2 Iris-setosa

48 49 5.3 3.7 1.5 0.2 Iris-setosa

49 50 5.0 3.3 1.4 0.2 Iris-setosa

In [12]:
iris_df.groupby("Species").head(5)

Out[12]: ID SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

50 51 7.0 3.2 4.7 1.4 Iris-versicolor

51 52 6.4 3.2 4.5 1.5 Iris-versicolor

52 53 6.9 3.1 4.9 1.5 Iris-versicolor

53 54 5.5 2.3 4.0 1.3 Iris-versicolor

54 55 6.5 2.8 4.6 1.5 Iris-versicolor

100 101 6.3 3.3 6.0 2.5 Iris-virginica

101 102 5.8 2.7 5.1 1.9 Iris-virginica

102 103 7.1 3.0 5.9 2.1 Iris-virginica

103 104 6.3 2.9 5.6 1.8 Iris-virginica

104 105 6.5 3.0 5.8 2.2 Iris-virginica

In [13]:
filter = iris_df["SepalLengthCm"] > 5.0
sel = iris_df[filter]
sel

Out[13]: ID SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

5 6 5.4 3.9 1.7 0.4 Iris-setosa

10 11 5.4 3.7 1.5 0.2 Iris-setosa

14 15 5.8 4.0 1.2 0.2 Iris-setosa

15 16 5.7 4.4 1.5 0.4 Iris-setosa

... ... ... ... ... ... ...

145 146 6.7 3.0 5.2 2.3 Iris-virginica

146 147 6.3 2.5 5.0 1.9 Iris-virginica

147 148 6.5 3.0 5.2 2.0 Iris-virginica

148 149 6.2 3.4 5.4 2.3 Iris-virginica

149 150 5.9 3.0 5.1 1.8 Iris-virginica

118 rows × 6 columns

In [14]:
iris_df.shape

Out[14]: (150, 6)

2. Matplotlib Library:
Create a line plot to visualize the trend of a numerical variable over time.
Generate a histogram to understand the distribution of a numerical variable in the dataset.
Create a bar chart to compare the performance of different categories.
Plot a scatter plot to explore the relationship between two numerical variables.
Customize your plots with labels, titles, colors, and styles.

In [15]:
import matplotlib.pyplot as plt

plt.plot(iris_df["SepalWidthCm"], '-', label="Line")

plt.plot(iris_df["SepalWidthCm"], 'o', label="Dots")
plt.xlabel("Index")
plt.ylabel("Sepal Width (cm)")
plt.title("Trend of Sepal Width")
plt.legend()
plt.show()
In [16]:
!pip install plotly

Requirement already satisfied: plotly in c:\users\kriti\anaconda3\lib\site-packa

ges (5.16.1)
Requirement already satisfied: packaging in c:\users\kriti\anaconda3\lib\site-pa
ckages (from plotly) (20.9)
Requirement already satisfied: tenacity>=6.2.0 in c:\users\kriti\anaconda3\lib\s
ite-packages (from plotly) (8.2.3)
Requirement already satisfied: pyparsing>=2.0.2 in c:\users\kriti\anaconda3\lib
\site-packages (from packaging->plotly) (2.4.7)

In [17]:
import plotly.express as px

species_count = iris_df["Species"].value_counts()
figure = px.pie(iris_df, values=species_count, names=species_count.index)
figure.show()

Iris-setosa
Iris-versicolor
Iris-virginica

33.3% 33.3%

33.3%
In [18]:
figure = px.histogram(iris_df , x = "SepalLengthCm")
figure.show()

20
count

0
4 5 6 7 8

SepalLengthCm

In [19]:
plt.scatter(iris_df["SepalLengthCm"] , iris_df["PetalLengthCm"])
plt.xlabel("Sepal Length")

Out[19]: Text(0.5, 0, 'Sepal Length')

In [20]:
species_counts = iris_df["Species"].value_counts()
plt.bar(species_counts.index , species_counts.values)
plt.xlabel("Species")
plt.ylabel("count")
plt.show()

3. Seaborn Library:
Create a box plot to visualize the distribution of a numerical variable across different
categories.
Generate a heatmap to explore the correlation between numerical variables.
Customize the appearance of seaborn plots using various parameters.

In [21]:
import seaborn as sns
sns.boxplot(x="Species",y="SepalLengthCm",data = iris_df)
plt.xlabel("Species")
plt.ylabel("Sepal Length (cm)")
plt.title("Distribution of Sepal Length across species")
plt.show()
In [22]:
# Heatmap to explore the correlation between numerical variables

df_filter = iris_df.select_dtypes(include = [np.number])

sns.heatmap(df_filter.corr())

Out[22]: <AxesSubplot:>

4. NumPy Library:
Create a NumPy array and perform basic operations like addition, subtraction, and
multiplication.
Use NumPy functions to calculate statistical measures like mean, median, and standard
deviation.
Reshape and slice NumPy arrays to extract specific data elements.
Perform element-wise operations and broadcasting with NumPy arrays.
Apply mathematical functions (e.g., exponential, logarithm) to NumPy arrays.
In [23]:
x = np.array([25 , 7 ,8 , 9 , 10 , 12])
y = np.array([10 , 20 , 58 , 100 , 204 , 7])

z = x + y

w = x - y

j = x * y

print("Addition : ", z)

print("Substraction : ", w)

print("Multiplication : ", j)

Addition : [ 35 27 66 109 214 19]

Substraction : [ 15 -13 -50 -91 -194 5]
Multiplication : [ 250 140 464 900 2040 84]

In [24]:
#statistics in numpy

print("Mean : ",np.mean(x))
print("Std Deviation : ",np.std(x))
print("Variance : ",np.var(x))

Mean : 11.833333333333334
Std Deviation : 6.094168432927407
Variance : 37.138888888888886

In [25]:
x = np.arange(1,11)
x1 = np.reshape(x , (2,5))
x1

Out[25]: array([[ 1, 2, 3, 4, 5],

[ 6, 7, 8, 9, 10]])

In [26]:
# numpy slicing
x1[0:1 , 2:5]

Out[26]: array([[3, 4, 5]])

In [27]:
# scalar broadcasting
x2 = x1 + 5
print(x2)

[[ 6 7 8 9 10]
[11 12 13 14 15]]

In [28]:
# logarithmic function
y = np.log(x)
plt.subplot(1,2,1)
plt.plot(x,y)
plt.title("Logarithmic Function")

# exponential function
plt.subplot(1,2,2)
f = np.exp(x)
plt.plot(x,f)
plt.title("Exponential Function")

Out[28]: Text(0.5, 1.0, 'Exponential Function')

5. SciPy Library:
Use SciPy to perform numerical integration for a given mathematical function.

In [29]:
from scipy.integrate import quad

# x = np.arange(0 , 2*np.pi , 0.1)

# y = np.sin(x)

def integrand(m):
return np.sin(m)

fun_intr , error = quad( integrand , 0 , np.pi)

print(fun_intr)
print(error)

# plt.plot(x , fun_intr)

2.0
2.220446049250313e-14
Name-Kritika Das

Regd no-2101020068

Rollno-CSE21068

PVS980 - 5MW Pre Commissioning Instruction
No ratings yet
PVS980 - 5MW Pre Commissioning Instruction
6 pages
TOLA Notes
No ratings yet
TOLA Notes
22 pages
Exno 4
No ratings yet
Exno 4
13 pages
Assignment - 10 - Pandas
No ratings yet
Assignment - 10 - Pandas
53 pages
# Common Datatype: Print Type Print Type Print Type Print Type Print Type
No ratings yet
# Common Datatype: Print Type Print Type Print Type Print Type Print Type
4 pages
ML LabReport Final Index Edited
No ratings yet
ML LabReport Final Index Edited
35 pages
Dsbda Ouput 1-10
No ratings yet
Dsbda Ouput 1-10
89 pages
BDA pr2
No ratings yet
BDA pr2
2 pages
Pandas Exercises
No ratings yet
Pandas Exercises
15 pages
Practical No - 1
No ratings yet
Practical No - 1
5 pages
Lab Manual
No ratings yet
Lab Manual
32 pages
Assignment 5'
No ratings yet
Assignment 5'
4 pages
Practical of Professional Skills
No ratings yet
Practical of Professional Skills
4 pages
ML N PY Programs
No ratings yet
ML N PY Programs
17 pages
Chap5 - Wei - Ipynb - Colab
No ratings yet
Chap5 - Wei - Ipynb - Colab
29 pages
Trần Mạnh Hùng 20192643.Ipynb - Colab
No ratings yet
Trần Mạnh Hùng 20192643.Ipynb - Colab
6 pages
Session-25 - Jupyter Notebook
No ratings yet
Session-25 - Jupyter Notebook
20 pages
Assigntment 3 Python Lab
No ratings yet
Assigntment 3 Python Lab
1 page
Data Visualization With Maplotlib
No ratings yet
Data Visualization With Maplotlib
8 pages
25 - Assignment10.ipynb - Colaboratory
No ratings yet
25 - Assignment10.ipynb - Colaboratory
13 pages
Exp 07 (ML)
No ratings yet
Exp 07 (ML)
4 pages
Dsbda 10
No ratings yet
Dsbda 10
8 pages
EXP 07 (ML) - Darshu
No ratings yet
EXP 07 (ML) - Darshu
4 pages
Session-24 - Jupyter Notebook
No ratings yet
Session-24 - Jupyter Notebook
13 pages
EXP 07 (ML) - Ashu
No ratings yet
EXP 07 (ML) - Ashu
4 pages
Batch1 Ds
No ratings yet
Batch1 Ds
15 pages
MLRecord
No ratings yet
MLRecord
24 pages
Name:-Nisha Ambike: Roll No: - 02
No ratings yet
Name:-Nisha Ambike: Roll No: - 02
2 pages
b21 DSBDA Assignment No 10
No ratings yet
b21 DSBDA Assignment No 10
1 page
EXP 07 (ML) - Sarthak
No ratings yet
EXP 07 (ML) - Sarthak
4 pages
Exp 5,6,7
No ratings yet
Exp 5,6,7
2 pages
DL Experiment - 1
No ratings yet
DL Experiment - 1
10 pages
Ads Exp 1 Code
No ratings yet
Ads Exp 1 Code
3 pages
Data Visualization and Matplot
No ratings yet
Data Visualization and Matplot
11 pages
Introduction To Python (Part III)
No ratings yet
Introduction To Python (Part III)
29 pages
Mlpy 2
No ratings yet
Mlpy 2
18 pages
Task 1
No ratings yet
Task 1
14 pages
Cota12 6
No ratings yet
Cota12 6
4 pages
Ai Lab 01
No ratings yet
Ai Lab 01
6 pages
5-1 Dataframes Intro Load Inspect - Instruction
No ratings yet
5-1 Dataframes Intro Load Inspect - Instruction
2 pages
Dsa 1
No ratings yet
Dsa 1
8 pages
KRAI LabManual
No ratings yet
KRAI LabManual
77 pages
Dsbda 3B
No ratings yet
Dsbda 3B
5 pages
Assignment 3 Iris
No ratings yet
Assignment 3 Iris
2 pages
Python Matplotlib Hands On - Compress
No ratings yet
Python Matplotlib Hands On - Compress
6 pages
Python Matplotlib Hands On
100% (1)
Python Matplotlib Hands On
6 pages
Untitled5 1
No ratings yet
Untitled5 1
13 pages
Tarea - 1.ipynb - Colab Jose
No ratings yet
Tarea - 1.ipynb - Colab Jose
12 pages
Iris - Ipynb - Colaboratory
No ratings yet
Iris - Ipynb - Colaboratory
8 pages
Data Science Lab Manual Full
No ratings yet
Data Science Lab Manual Full
47 pages
FDS Lab Manual
No ratings yet
FDS Lab Manual
48 pages
Chapter4 Pandas
No ratings yet
Chapter4 Pandas
43 pages
Ploomber Notebook Conversion - 2
No ratings yet
Ploomber Notebook Conversion - 2
14 pages
Assignment 1
No ratings yet
Assignment 1
6 pages
Experiment 3
No ratings yet
Experiment 3
4 pages
DP Prog
No ratings yet
DP Prog
10 pages
Dsbda 10
No ratings yet
Dsbda 10
5 pages
ML Lab File
No ratings yet
ML Lab File
43 pages
Practical 10 Code
No ratings yet
Practical 10 Code
5 pages
3-Numpy Pandas
No ratings yet
3-Numpy Pandas
37 pages
Assignment No - 10
No ratings yet
Assignment No - 10
3 pages
FreeRTOS With Arduino Tutorial - How To Create Tasks
No ratings yet
FreeRTOS With Arduino Tutorial - How To Create Tasks
14 pages
Cola2 Manual
No ratings yet
Cola2 Manual
29 pages
Logcat CSC Update Log
No ratings yet
Logcat CSC Update Log
2,493 pages
50 REAL TIME LINUX Multiple Choice Questions and Answers-LINUX Multiple Choice Questions
60% (5)
50 REAL TIME LINUX Multiple Choice Questions and Answers-LINUX Multiple Choice Questions
16 pages
Lab 04 - Composition
No ratings yet
Lab 04 - Composition
3 pages
English Paper 1: Stage 9
No ratings yet
English Paper 1: Stage 9
48 pages
GR 2x Tutorial
No ratings yet
GR 2x Tutorial
68 pages
1.1. How Should We Define AI
No ratings yet
1.1. How Should We Define AI
14 pages
Panasonic VRF AHU Catalog 190626 Lo-Res
No ratings yet
Panasonic VRF AHU Catalog 190626 Lo-Res
11 pages
Eproc Tenders
No ratings yet
Eproc Tenders
104 pages
Open University Learning Analytics Dataset
No ratings yet
Open University Learning Analytics Dataset
6 pages
Application Form For Job
No ratings yet
Application Form For Job
3 pages
2) Instruction Manual
No ratings yet
2) Instruction Manual
1,340 pages
BAE5 - Tutorial 2 - 2023-1
No ratings yet
BAE5 - Tutorial 2 - 2023-1
2 pages
CP1404 - Assignment 2 - Movies To Watch 2.0 (Part 1 ONLY) : Task
No ratings yet
CP1404 - Assignment 2 - Movies To Watch 2.0 (Part 1 ONLY) : Task
5 pages
Data Analysis and Interpretation
No ratings yet
Data Analysis and Interpretation
32 pages
Lecture 3 Revision Questions
No ratings yet
Lecture 3 Revision Questions
3 pages
DLD Lab 7
No ratings yet
DLD Lab 7
9 pages
Offers: Why Choose Intrcity Smartbus ?
100% (1)
Offers: Why Choose Intrcity Smartbus ?
6 pages
ORAIMO 20000 Mah Power Bank (12 W, Fast Charging) Price in India - Buy ORAIMO 20000 Mah Power Bank (12 W, Fast Charging) Online at
No ratings yet
ORAIMO 20000 Mah Power Bank (12 W, Fast Charging) Price in India - Buy ORAIMO 20000 Mah Power Bank (12 W, Fast Charging) Online at
4 pages
Hemochron Elite - Itc Usa
No ratings yet
Hemochron Elite - Itc Usa
4 pages
Civil 3d Road Design General Workflow
100% (1)
Civil 3d Road Design General Workflow
3 pages
TCS NQT 24th Oct 8 Am To 11 Am Slot Analysis
No ratings yet
TCS NQT 24th Oct 8 Am To 11 Am Slot Analysis
35 pages
Wifi Pasword
No ratings yet
Wifi Pasword
1 page
Nilai Uh Statistika
No ratings yet
Nilai Uh Statistika
14 pages
GVP - CEMS Internship
No ratings yet
GVP - CEMS Internship
2 pages
Gis Exam Questions
67% (3)
Gis Exam Questions
2 pages
Manual Testing Important
No ratings yet
Manual Testing Important
44 pages