0% found this document useful (0 votes)

15 views17 pages

Packages in Python

The document outlines experiments conducted using Python Pandas and Matplotlib for data manipulation, analysis and visualization. It includes steps to load datasets, perform operations like grouping, merging, EDA using aggregates and null handling, and generate different plot types like line, bar, histogram and scatter plots.

Uploaded by

Bharath M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views17 pages

Packages in Python

Uploaded by

Bharath M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

S.

n Experiment Date Remark

1
LAB EXERCISES
Data Manipulation – Loading and Filtration
PACKAGES IN PYTHON
2 Grouping and Merging data using Pandas

3 Exploratory Data Analysis using Pandas

4 Plotting Using Matplotlib -1

5 Plotting Using Matplotlib -2

TABLE OF CONTENT
EXP NO DATE
1 Data Manipulation – Loading and Filtration

AIM:
To create a python – Pandas module to perform basic data manipulation tasks
using Jupyter Notebook

PROGRAM :
import pandas as pd
df=pd.read_csv('titanic.csv')
df
#MAKE 1ST COLUMN AS INDEX
df.set_index('Name')

#SELECT SINGLE COL AND PRINT DATA

df [ 'Pclass' ]
#SELECT MULTIPLE COLUMNS
df[['Name', 'Age', 'Sex']]

#SELECT SINGLE COLUMNS AND PRINT LAST 5 ELEMENTS

df['Name'].tail(5)

# SELECT MULTIPLE ROWS AND PRINT FIRST 5 ELEMENTS

df.iloc[784:789].head()

#SELECT MULTIPLE ROWS & COL FROM DATASET AND PRINT IT

df.iloc[ 0:5 , 0:5 ]

# SELECT ALL ROWS AND SOME COL(MORE THAN 2) AND PRINT IT

df.iloc[ : , 0:5 ]
# Deleting a Column
del df['Sex']
df

#CHANGE THE 1ST, 2ND & 3RD COL NAME AND PRINT IT
c=df.rename(columns={ 'Ticket':'Ticket_No', 'Name':'Passenger_name'})
c

OUTPUT:
RESULT:
Thus, the python program to perform basic data manipulation tasks using
Jupyter Notebook was executed and output is verified successfull

EXP NO DATE
2 Grouping and Merging data using Pandas

AIM:
To create a python – Pandas module to perform different grouping and merging
operations using Jupyter Notebook

PROCEDURE:
Step – 1 : Install the required packages from Command Prompt using pip
Step – 2 : Launch Jupyter notebook from command Prompt
Step – 3 : Import necessary library files at the beginning of the module
Step – 4: Upload the ‘Online_Attendance’ , ‘covid_vaccine_statewise ‘ and
‘StatewiseTestingDetails’ datasets and import the same using pandas
Functions
Step – 5: Perform the necessary programming

PROGRAM:
#(i) Grouping
import pandas as pd
df=pd.read_csv("Online_Attendance.csv")
df

#Grouping based on 'Category' Column

g=df.groupby("Category")
new=g.get_group("IBM")
new

#Exporting to a New CSV

new.to_csv("New_data_set.csv")

# With Corona Dataset

df1=pd.read_csv("StatewiseTestingDetails.csv")
df1

#Grouping By Date Column

g1=df1.groupby("Date")
g1

#Grouping for a Particular Date

new1=g1.get_group("14-02-2021")
new1

#Exporting New Dataset

new1.to_csv("New_data1_set.csv")

# Grouping for TamilNadu

df2=pd.read_csv("covid_vaccine_statewise.csv")
g2=df2.groupby("State")
new2=g2.get_group("Tamil Nadu")
new2

# Joining two different Datasets

#Combining Df1 with Df2 using Left Join
join_data=pd.merge(df1,df2,on="State",how="left")
join_data

#Combining Df1 with Df2 using Right Join

join_data=pd.merge(df1,df2,on="State",how="right")
join_data

# Combining Df1 with Df2 using inner Join

join_data=pd.merge(df1,df2,on="State",how="inner")
join_data

# Combining Df1 with Df2 using Outer Join

join_data=pd.merge(df1,df2,on="State",how="outer")
join_data
OUTPUT:
RESULT:
Thys a python – Pandas module was created to perform different grouping and
merging operations using Jupyter Notebook and the output is verified
successfully

EXP NO DATE
3 Exploratory Data Analysis using Pandas

AIM:
To create a python – Pandas module to perform an Exploratory data analysis
using Jupyter Notebook

Program :
#Exploratory Data Analysis
import pandas as pd
df=pd.read_csv("Loan_Data.csv")
df

#Display Number of rows and Columns

df.shape
#Checking Number of Null Values in Each Column
df.isnull().sum()

#Displaying Data types of Individual Columns

df.dtypes

#Display Last 5 Columns

df.tail()

#Replacing Gender Column's Null Value with mode()

df['Gender'].fillna(df['Gender'].mode()[0], inplace=True)
df
df.isnull().sum()

#Gender - Null Value is now Zero

#Replacing every column Null Value with mode() and mean()
df['Married'].fillna(df['Married'].mode()[0], inplace=True)
df['Dependents']. fillna(df['Dependents'].mode()[0], inplace=True)
df['Self_Employed'].fillna(df[ 'Self_Employed'].mode() [0],
inplace=True)
df['Loan_Amount_Term'].fillna(df['Loan_Amount_Term'].mean(),
inplace=True)
df['LoanAmount'].fillna(df['LoanAmount'].mean(), inplace=True)
df['Credit_History'].fillna(df['Credit_History'].mode()
[0],inplace=True)

#Checking Null Values Again

df.isnull().sum()

#Exporing to an External CSV File

df.to_csv("FINAL.CSV")

#Converting Yes to 1 and No to 0

df["Self_Employed"].replace(to_replace="Yes", value=1, inplace=True)
df["Self_Employed"].replace(to_replace="No", value=0, inplace=True)
df

OUTPUT:
RESULT :
Thus, a python – Pandas module to perform an Exploratory data analysis using
Jupyter Notebook was created and output is verified successfully.

EXP NO DATE
4 Plotting Using Matplotlib -1

AIM:
To create a python – Pandas module to generate basic graphs in matplotlib
using Jupyter Notebook

PROCEDURE:
Step – 1 : Install the required packages from Command Prompt using pip
Step – 2 : Launch Jupyter notebook from command Prompt
Step – 3 : Import necessary library files at the beginning of the module
Step – 4: Upload the ‘C19INDIA’ datasets and import the same using pandas
Functions
Step – 5 : Perform the necessary programming
PROGRAM :
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
#Loading the Dataset
df=pd.read_csv("C19INDIA.CSV")
df
df.head()
df.describe()

#Converting Sno as Index and dropping the excess

df.index.name="Sno"
df.drop ("Sno", axis=1, inplace=True)
df

#Dropping the Zeros

new_df = df.drop(0)
new_df
new_df.describe()

#Plotting State vs Confirmed

plt.figure(figsize=(10,10) )
plt.bar(new_df['State/UnionTerritory'],new_df['Confirmed'])
plt.xticks(rotation=90)
plt.show()
#Using the Second Dataset
df1 = pd.read_csv("Vaccine.csv")
df1

#Plotting
plt. figure (figsize=(20,25))
plt.plot(df1['CoviShield (Doses Administered)'],color='Blue',
linestyle='--', marker='o')
plt.xlabel("Days")
plt.ylabel("Vaccine Number")
plt.show()

OUTPUT:
RESULT:
Thus, a python – Pandas module to generate basic graphs in matplotlib using
Jupyter Notebook was executed and output is verified successfully
EXP NO DATE
5 Plotting Using Matplotlib -2

AIM:
To create a python – Pandas module to generate basic graphs in matplotlib
using Jupyter Notebook

PROGRAM:
#Importing
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import matplotlib.style

#Line Graph
x=[5,6,8,10,15]
y=[20,30,40,50,55]
plt.plot(x,y)
plt.title('STUDENT DATA-LINE GRAPH')
plt.ylabel('Present ')
plt.xlabel('Roll.no')
plt.show()

# LINE GRAPH WITH STYLE:

x=[5,6,8,10,15]
y=[20,30,40,50,55]
x2=[2,13,16,20,18]
y2=[25,35,16,23.5,40]
plt.plot(x,y,'c',label='A',linewidth=6)
plt.plot(x2,y2,'purple',label='B',linewidth=6)
plt.title('STUDENT DATA-LINE GRAPH WITH STYLE')
plt.ylabel('Present %')
plt.xlabel('Roll.no')
plt.legend()
plt.show()

# BAR GRAPH:
studentnames = ['Jack','Daniel','Bira','Antiquity','Heineken']
marks = [850,1350,220,900,190]
plt.bar(studentnames,marks,color='purple')
plt.title('STUDENT DATA-BAR GRAPH VERTICAL')
plt.xlabel('NAMES')
plt.ylabel('MARKS')
plt.show()

# Horizontal Bar Graph

studentnames = ['Jack','Daniel','Bira','Antiquity','Heineken']
marks = [850,1350,220,900,190]
plt.barh(studentnames,marks,color='orange')
plt.title('STUDENT DATA-BAR GRAPH VERTICAL')
plt.xlabel('NAMES')
plt.ylabel('MARKS')
plt.show()

# Histogram
student_marks=[45,12,13,26,15,55,100,98,95,54,58,56,52,24,71,6
6,66.5,12,23,55,78,10,9,5,10,22,35,65,45]
bins=[0,10,20,30,40,50,60,70,80,90,100]
plt.hist(student_marks,bins,rwidth=0.8,color='purple')
plt.xlabel('MARKS')
plt.ylabel('NUMBER OF STUDENT')
plt.title('STUDENT DATA-HISTOGTAM')
plt.show()

# SCATTER PLOT:
x=[5,6,8,10,15]
y=[20,30,40,50,55]
x2=[2,13,16,20,18]
y2=[25,35,16,23.5,40]
plt.scatter(x,y,color='red')
plt.scatter(x2,y2,color='black')
plt.title=('STUDENT DATA-SCATTER PLOT')
plt.ylabel('Present %')
plt.xlabel('Roll.no')
plt.show()

OUTPUT:

RESULT:
Thus, a python – Pandas module to generate basic graphs in matplotlib using
Jupyter Notebook was executed and output is verified successfully

Unit 4 Fod
100% (1)
Unit 4 Fod
21 pages
Informatics Practices CBSE Project File Class 12
0% (1)
Informatics Practices CBSE Project File Class 12
40 pages
Aiml Lab Manaual R23
100% (1)
Aiml Lab Manaual R23
10 pages
Data Exploration and Visualization Laboratory - AD3301 - Lab Manual
No ratings yet
Data Exploration and Visualization Laboratory - AD3301 - Lab Manual
55 pages
Advanced Programming Final Client Report
No ratings yet
Advanced Programming Final Client Report
27 pages
Data Science Lab Manual..
No ratings yet
Data Science Lab Manual..
54 pages
28 03 2024 Sample Paper Grade 12 Informatics Practices 2023 24
No ratings yet
28 03 2024 Sample Paper Grade 12 Informatics Practices 2023 24
8 pages
CP - Syllabus FINAL
No ratings yet
CP - Syllabus FINAL
4 pages
Data Science Lab
No ratings yet
Data Science Lab
61 pages
01 Introduction To Python
No ratings yet
01 Introduction To Python
36 pages
DAL EXT 1 and 2
No ratings yet
DAL EXT 1 and 2
125 pages
Aditya Kumar - Internship Report
No ratings yet
Aditya Kumar - Internship Report
3 pages
Cleaning Dirty Data With Pandas & Python - DevelopIntelligence Blog PDF
No ratings yet
Cleaning Dirty Data With Pandas & Python - DevelopIntelligence Blog PDF
8 pages
Data Science With Python - Lesson 07 - Data Manipulation With Python - Pandas
No ratings yet
Data Science With Python - Lesson 07 - Data Manipulation With Python - Pandas
72 pages
Final Project Report 1
No ratings yet
Final Project Report 1
74 pages
Data Analytics FULL Course For Begi
No ratings yet
Data Analytics FULL Course For Begi
2 pages
CS 3362 FDS
No ratings yet
CS 3362 FDS
53 pages
Creation of Series Using List, Dictionary & Ndarray
No ratings yet
Creation of Series Using List, Dictionary & Ndarray
65 pages
Practical File Question 28.09.2022
No ratings yet
Practical File Question 28.09.2022
15 pages
Unit 5
No ratings yet
Unit 5
27 pages
Pandas in Python
No ratings yet
Pandas in Python
59 pages
3rd Semester DDM AI DAA DEV Print Pages For Spiral Record 25-1-24 - Removed
No ratings yet
3rd Semester DDM AI DAA DEV Print Pages For Spiral Record 25-1-24 - Removed
28 pages
(The Ultimate PDF) Practical File For I.P. Practical 2023-24
No ratings yet
(The Ultimate PDF) Practical File For I.P. Practical 2023-24
45 pages
ML File Updated
No ratings yet
ML File Updated
60 pages
Data Science Notes
No ratings yet
Data Science Notes
4 pages
Python For Machine Learning
No ratings yet
Python For Machine Learning
66 pages
Fdsa Lab Manual Final
No ratings yet
Fdsa Lab Manual Final
70 pages
Fds Merged
No ratings yet
Fds Merged
102 pages
Python Interview Questions
No ratings yet
Python Interview Questions
8 pages
FDS Lab Manual
No ratings yet
FDS Lab Manual
62 pages
Manishadav
No ratings yet
Manishadav
27 pages
CS3362 Data Science Laboratory Manual 2022-23
No ratings yet
CS3362 Data Science Laboratory Manual 2022-23
54 pages
Eda Lab Manual
No ratings yet
Eda Lab Manual
34 pages
DV Lab Manual Modified
No ratings yet
DV Lab Manual Modified
31 pages
AD3301 DEV Lab Manual
No ratings yet
AD3301 DEV Lab Manual
26 pages
DEV Lab Material
No ratings yet
DEV Lab Material
16 pages
UNIT 3 (Chapter 2) Pandas
No ratings yet
UNIT 3 (Chapter 2) Pandas
43 pages
ML (Sudhanshu)
No ratings yet
ML (Sudhanshu)
24 pages
Final Dev Record
No ratings yet
Final Dev Record
49 pages
Labdev
No ratings yet
Labdev
57 pages
Lab Record Dev
No ratings yet
Lab Record Dev
20 pages
L6 and 7-Data Preprocessing-Coding
No ratings yet
L6 and 7-Data Preprocessing-Coding
34 pages
Exp No. 1-3 (MLC)
No ratings yet
Exp No. 1-3 (MLC)
12 pages
FDA Lab Manual Final
No ratings yet
FDA Lab Manual Final
42 pages
Pierian Data - Python For Finance & Algorithmic Trading Course Notes
No ratings yet
Pierian Data - Python For Finance & Algorithmic Trading Course Notes
11 pages
DS Final
No ratings yet
DS Final
46 pages
Informatic Practices HHW
No ratings yet
Informatic Practices HHW
59 pages
Data Visualization - Lab - Manual - 2024
No ratings yet
Data Visualization - Lab - Manual - 2024
13 pages
CS3361-Data Science Lab Manual - B.rethina Kumar
No ratings yet
CS3361-Data Science Lab Manual - B.rethina Kumar
36 pages
Rajni Ip File Final
No ratings yet
Rajni Ip File Final
42 pages
Data Analysis Lab - Final - 23-24
No ratings yet
Data Analysis Lab - Final - 23-24
11 pages
Machine Learning With Python (Vasavi)
No ratings yet
Machine Learning With Python (Vasavi)
20 pages
FDS Record-1-4
No ratings yet
FDS Record-1-4
18 pages
Dev Lab Manual Org
No ratings yet
Dev Lab Manual Org
28 pages
FDS Aim Algorithm
No ratings yet
FDS Aim Algorithm
18 pages
Swastika
No ratings yet
Swastika
60 pages
04-Data Manipulation With Pandas
No ratings yet
04-Data Manipulation With Pandas
28 pages
FDS Lab
No ratings yet
FDS Lab
43 pages
Dev Record Aids
No ratings yet
Dev Record Aids
24 pages
Fdsa Lab Manual
No ratings yet
Fdsa Lab Manual
53 pages
Data Exploration Preparation
No ratings yet
Data Exploration Preparation
12 pages
Masters AIML 3rd Course Jan2025
No ratings yet
Masters AIML 3rd Course Jan2025
27 pages
QP-1PB-IP-2024 Set 1
No ratings yet
QP-1PB-IP-2024 Set 1
9 pages
L CsvReadWrite
No ratings yet
L CsvReadWrite
10 pages
IP Lab Record
No ratings yet
IP Lab Record
23 pages
Kendriya Vidyalaya Sangathan, Mumbai Region 1 Pre-Board Examination 2019-20
No ratings yet
Kendriya Vidyalaya Sangathan, Mumbai Region 1 Pre-Board Examination 2019-20
11 pages
Informatic Practices HHW
No ratings yet
Informatic Practices HHW
21 pages
EX-02-Data Manipulation Pandas Matplot
No ratings yet
EX-02-Data Manipulation Pandas Matplot
9 pages
NumPy and Pandas Tutorial
No ratings yet
NumPy and Pandas Tutorial
8 pages
NumPy and Pandas
No ratings yet
NumPy and Pandas
12 pages
Unit 4 DSE
No ratings yet
Unit 4 DSE
9 pages
Documentclass
No ratings yet
Documentclass
6 pages
Index
No ratings yet
Index
4 pages
20ad41e2 - Data Science
No ratings yet
20ad41e2 - Data Science
2 pages
Commands SQL, Python (BASICS)
No ratings yet
Commands SQL, Python (BASICS)
7 pages
Syllabus - CKST 9
No ratings yet
Syllabus - CKST 9
4 pages
Lab #2 - Data Analysis With NumPy and Pandas
No ratings yet
Lab #2 - Data Analysis With NumPy and Pandas
7 pages
Resume Deepak
No ratings yet
Resume Deepak
3 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
PW2 DataCleaning
No ratings yet
PW2 DataCleaning
6 pages
Dev Lab Record
No ratings yet
Dev Lab Record
21 pages
Cyber Threat Detection Based On Artificial Neural Networks
No ratings yet
Cyber Threat Detection Based On Artificial Neural Networks
5 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
Class 12 IP Practice Assignment Series 9
No ratings yet
Class 12 IP Practice Assignment Series 9
3 pages
PYTHONPROGRAMMING
No ratings yet
PYTHONPROGRAMMING
2 pages
OCS353 - Review Questions
No ratings yet
OCS353 - Review Questions
3 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
ATS Resume
No ratings yet
ATS Resume
1 page
Hugging Face
No ratings yet
Hugging Face
1 page
Mastering Pandas in Python: Course Book
From Everand
Mastering Pandas in Python: Course Book
Pedro Martins
No ratings yet

Packages in Python

Uploaded by

Packages in Python

Uploaded by

S.

n Experiment Date Remark

3 Exploratory Data Analysis using Pandas

4 Plotting Using Matplotlib -1

5 Plotting Using Matplotlib -2

#SELECT SINGLE COL AND PRINT DATA

#SELECT SINGLE COLUMNS AND PRINT LAST 5 ELEMENTS

# SELECT MULTIPLE ROWS AND PRINT FIRST 5 ELEMENTS

#SELECT MULTIPLE ROWS & COL FROM DATASET AND PRINT IT

# SELECT ALL ROWS AND SOME COL(MORE THAN 2) AND PRINT IT

#Grouping based on 'Category' Column

#Exporting to a New CSV

# With Corona Dataset

#Grouping By Date Column

#Grouping for a Particular Date

#Exporting New Dataset

# Grouping for TamilNadu

# Joining two different Datasets

#Combining Df1 with Df2 using Right Join

# Combining Df1 with Df2 using inner Join

# Combining Df1 with Df2 using Outer Join

#Display Number of rows and Columns

#Displaying Data types of Individual Columns

#Display Last 5 Columns

#Replacing Gender Column's Null Value with mode()

#Gender - Null Value is now Zero

#Checking Null Values Again

#Exporing to an External CSV File

#Converting Yes to 1 and No to 0

#Converting Sno as Index and dropping the excess

#Dropping the Zeros

#Plotting State vs Confirmed

# LINE GRAPH WITH STYLE:

# Horizontal Bar Graph

You might also like