Project_Prog

The document provides various examples of using the Pandas library in Python for data manipulation and visualization. It includes programs to count rows and columns in a DataFrame, select data based on conditions, handle missing values, import/export CSV files, and create different types of charts. Each example is accompanied by code snippets and expected outputs.

Uploaded by

tapaskumarmahato


Write a Pandas program to count the number of rows and columns of a DataFrame, with a practical example.

Name    Score  Age  Qualify_label
Amit       98   20  yes
Kamal      80   25  yes
Ram        60   22  no
Riya       85   24  yes
Anup       49   21  no
Suman      92   20  yes

Ans:

# importing pandas
import pandas as pd

# data from the table above
result_data = {'name': ['Amit', 'Kamal', 'Ram', 'Riya', 'Anup', 'Suman'],
               'score': [98, 80, 60, 85, 49, 92],
               'age': [20, 25, 22, 24, 21, 20],
               'qualify_label': ['yes', 'yes', 'no', 'yes', 'no', 'yes']}

# creating dataframe
df = pd.DataFrame(result_data)

# computing number of rows (length of the row axis)
rows = len(df.axes[0])

# computing number of columns (length of the column axis)
cols = len(df.axes[1])

print("Number of Rows:", rows)
print("Number of Columns:", cols)

Output:

Number of Rows: 6
Number of Columns: 4
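
The same count can also be read from the DataFrame's shape attribute. This is a minimal sketch of that alternative, assuming the same table data as in the question; it is not the only way to do it.

```python
import pandas as pd

# Same data as the table in the question
result_data = {
    'name': ['Amit', 'Kamal', 'Ram', 'Riya', 'Anup', 'Suman'],
    'score': [98, 80, 60, 85, 49, 92],
    'age': [20, 25, 22, 24, 21, 20],
    'qualify_label': ['yes', 'yes', 'no', 'yes', 'no', 'yes'],
}
df = pd.DataFrame(result_data)

# df.shape is a (rows, columns) tuple, so both counts come from one attribute
rows, cols = df.shape
print("Number of Rows:", rows)
print("Number of Columns:", cols)
```

Unpacking the tuple avoids indexing into df.axes, which some readers find less obvious.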

Write a Pandas program to select the names of persons whose height is between 5 and 5.5 (both values inclusive).

Sample data:
'name': ['Asha', 'Radha', 'Kamal', 'Divy', 'Anjali'],
'height': [5.5, 5, np.nan, 5.9, np.nan],
'age': [11, 23, 22, 33, 22]

Solution:

import pandas as pd
import numpy as np

pers_data = {'name': ['Asha', 'Radha', 'Kamal', 'Divy', 'Anjali'],
             'height': [5.5, 5, np.nan, 5.9, np.nan],
             'age': [11, 23, 22, 33, 22]}
labels = ['a', 'b', 'c', 'd', 'e']
df = pd.DataFrame(pers_data, index=labels)

print("Persons whose height is between 5 and 5.5")
# rows with a missing (NaN) height fail both comparisons and are left out
in_range = (df['height'] >= 5) & (df['height'] <= 5.5)
# keep only the 'name' column, since the question asks for names
print(df.loc[in_range, 'name'])
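
The two comparisons joined with & can also be written with Series.between, which includes both end points by default. The sketch below assumes the same sample data and collects the matching names into a list; the variable names mask and names are illustrative.

```python
import numpy as np
import pandas as pd

pers_data = {'name': ['Asha', 'Radha', 'Kamal', 'Divy', 'Anjali'],
             'height': [5.5, 5, np.nan, 5.9, np.nan],
             'age': [11, 23, 22, 33, 22]}
df = pd.DataFrame(pers_data, index=['a', 'b', 'c', 'd', 'e'])

# between() is inclusive on both ends by default; NaN heights give False
mask = df['height'].between(5, 5.5)
names = df.loc[mask, 'name'].tolist()
print(names)  # ['Asha', 'Radha']
```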

Write a Pandas program to select the rows where the score is between 15 and 20 (inclusive).

import pandas as pd
import numpy as np

exam_data = {'name': ['Anastasia', 'Dima', 'Katherine', 'James', 'Emily', 'Michael', 'Matthew', 'Laura', 'Kevin', 'Jonas'],
             'score': [12.5, 9, 16.5, np.nan, 9, 20, 14.5, np.nan, 8, 19],
             'attempts': [1, 3, 2, 3, 2, 3, 1, 1, 2, 1],
             'qualify': ['yes', 'no', 'yes', 'no', 'no', 'yes', 'yes', 'no', 'no', 'yes']}
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']

df = pd.DataFrame(exam_data, index=labels)
print("Rows where score between 15 and 20 (inclusive):")
# between() includes both end points by default
print(df[df['score'].between(15, 20)])

Output:

Rows where score between 15 and 20 (inclusive):
        name  score  attempts qualify
c  Katherine   16.5         2     yes
f    Michael   20.0         3     yes
j      Jonas   19.0         1     yes

Write a Pandas program to find the placeholder values in a given DataFrame that carry no useful information and replace them with NaN.

Example:
Missing values: ?, --
Replace those values with NaN

Test Data:
    ord_no purch_amt    ord_date customer_id salesman_id
0    70001     150.5           ?        3002        5002
1      NaN    270.65  2012-09-10        3001        5003
2    70002     65.26         NaN        3001           ?
3    70004     110.5  2012-08-17        3003        5001
4      NaN     948.5  2012-09-10        3002         NaN
5    70005    2400.6  2012-07-27        3001        5002
6       --      5760  2012-09-10        3001        5001
7    70010         ?  2012-10-10        3004           ?
8    70003     12.43  2012-10-10          --        5003
9    70012    2480.4  2012-06-27        3002        5002
10     NaN    250.45  2012-08-17        3001        5003
11   70013    3045.6  2012-04-25        3001          --

Sample Solution:

Python Code:

import pandas as pd
import numpy as np

pd.set_option('display.max_rows', None)
#pd.set_option('display.max_columns', None)

df = pd.DataFrame({
    'ord_no': [70001, np.nan, 70002, 70004, np.nan, 70005, "--", 70010, 70003, 70012, np.nan, 70013],
    'purch_amt': [150.5, 270.65, 65.26, 110.5, 948.5, 2400.6, 5760, "?", 12.43, 2480.4, 250.45, 3045.6],
    'ord_date': ['?', '2012-09-10', np.nan, '2012-08-17', '2012-09-10', '2012-07-27', '2012-09-10', '2012-10-10', '2012-10-10', '2012-06-27', '2012-08-17', '2012-04-25'],
    'customer_id': [3002, 3001, 3001, 3003, 3002, 3001, 3001, 3004, "--", 3002, 3001, 3001],
    'salesman_id': [5002, 5003, "?", 5001, np.nan, 5002, 5001, "?", 5003, 5002, 5003, "--"]})

print("Original Orders DataFrame:")
print(df)

print("\nReplace the missing values with NaN:")
result = df.replace({"?": np.nan, "--": np.nan})
print(result)

Sample Output:

Original Orders DataFrame:
    ord_no purch_amt    ord_date customer_id salesman_id
0    70001     150.5           ?        3002        5002
1      NaN    270.65  2012-09-10        3001        5003
2    70002     65.26         NaN        3001           ?
3    70004     110.5  2012-08-17        3003        5001
4      NaN     948.5  2012-09-10        3002         NaN
5    70005    2400.6  2012-07-27        3001        5002
6       --      5760  2012-09-10        3001        5001
7    70010         ?  2012-10-10        3004           ?
8    70003     12.43  2012-10-10          --        5003
9    70012    2480.4  2012-06-27        3002        5002
10     NaN    250.45  2012-08-17        3001        5003
11   70013    3045.6  2012-04-25        3001          --

Replace the missing values with NaN:
     ord_no  purch_amt    ord_date  customer_id  salesman_id
0   70001.0     150.50         NaN       3002.0       5002.0
1       NaN     270.65  2012-09-10       3001.0       5003.0
2   70002.0      65.26         NaN       3001.0          NaN
3   70004.0     110.50  2012-08-17       3003.0       5001.0
4       NaN     948.50  2012-09-10       3002.0          NaN
5   70005.0    2400.60  2012-07-27       3001.0       5002.0
6       NaN    5760.00  2012-09-10       3001.0       5001.0
7   70010.0        NaN  2012-10-10       3004.0          NaN
8   70003.0      12.43  2012-10-10          NaN       5003.0
9   70012.0    2480.40  2012-06-27       3002.0       5002.0
10      NaN     250.45  2012-08-17       3001.0       5003.0
11  70013.0    3045.60  2012-04-25       3001.0          NaN
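
When data of this kind comes from a CSV file, the placeholders can also be turned into NaN while the file is read, through the na_values parameter of pd.read_csv. The sketch below is only an illustration: the CSV text is a hypothetical string holding the first three rows of the test data, read through io.StringIO instead of a real file.

```python
import io
import pandas as pd

# Hypothetical CSV text with the first three rows of the test data
csv_text = """ord_no,purch_amt,ord_date,customer_id,salesman_id
70001,150.5,?,3002,5002
,270.65,2012-09-10,3001,5003
70002,65.26,,3001,?
"""

# na_values adds "?" and "--" to the strings pandas already treats as missing
df = pd.read_csv(io.StringIO(csv_text), na_values=["?", "--"])

# number of missing values in each column
missing = df.isna().sum()
print(missing)
```

Empty fields are already read as NaN by default, so only the two placeholder strings need to be listed.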

Write a program to import and export data between Pandas and a CSV file.

import pandas as pd

# import: read a CSV file into a DataFrame
df = pd.read_csv("C:\\Users\\Desktop\\covid19.csv")

# export: convert a DataFrame to CSV
data = {'Name': ['Smith', 'Parker'], 'ID': [101, 102], 'Language': ['Python', 'JavaScript']}
info = pd.DataFrame(data)
print('DataFrame Values:\n', info)
# default CSV: with no file path given, to_csv() returns the CSV text as a string
csv_data = info.to_csv()
print('\nCSV String Values:\n', csv_data)
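
To see export and import working together on an actual file, the DataFrame can be written to disk and read back. This is a minimal sketch, assuming a temporary folder and a hypothetical file name info.csv so that it does not depend on any fixed path.

```python
import os
import tempfile
import pandas as pd

info = pd.DataFrame({'Name': ['Smith', 'Parker'],
                     'ID': [101, 102],
                     'Language': ['Python', 'JavaScript']})

# A temporary folder keeps the sketch independent of any fixed path
with tempfile.TemporaryDirectory() as tmp_dir:
    csv_path = os.path.join(tmp_dir, 'info.csv')  # hypothetical file name
    info.to_csv(csv_path, index=False)  # export, leaving out the index column
    loaded = pd.read_csv(csv_path)      # import the same file again

print(loaded)
```

Passing index=False keeps the row labels out of the file, so the data read back has the same three columns as the original.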

Given the school result data, analyse the performance of the students on different parameters, e.g. subject-wise or class-wise.

import pandas as pd
import matplotlib.pyplot as plt

# Simple line chart with labels for the X and Y axes,
# a title for the chart and a colour for the line
subject = ['Physics', 'Chemistry', 'Mathematics', 'Biology', 'Computer']
marks = [80, 75, 70, 78, 82]
# To draw the line in red colour, with a star marker at each point
plt.plot(subject, marks, 'r', marker='*')
# To write the title of the line chart
plt.title('Marks Scored')
# To put the label on the X axis
plt.xlabel('SUBJECT')
# To put the label on the Y axis
plt.ylabel('MARKS')
plt.show()

Output: a red line chart titled 'Marks Scored', with a star marker at each subject.
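
Besides the chart, a subject-wise summary can be computed directly from the same marks. The sketch below assumes the five subject marks from the example and uses a Pandas Series; the variable name result is illustrative.

```python
import pandas as pd

subject = ['Physics', 'Chemistry', 'Mathematics', 'Biology', 'Computer']
marks = [80, 75, 70, 78, 82]

# A Series indexed by subject lets the summary be looked up by name
result = pd.Series(marks, index=subject)

print("Average marks:", result.mean())                      # 77.0
print("Highest:", result.idxmax(), result.max())            # Computer 82
print("Lowest:", result.idxmin(), result.min())             # Mathematics 70
```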

Write a program to create a bar chart of the five countries most affected by coronavirus in 2020. Read the data from a CSV file.

import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

# the CSV file is expected to hold cases, recoveries and deaths
# in columns 'c', 'r' and 'd', one row per country
a = pd.read_csv("C:\\Download\\Covid.csv")
# five group positions, evenly spaced from 1 to 61
x = np.linspace(1, 61, 5)
# place each country label under the middle bar of its group
plt.xticks(x + 6/2, ['China', 'Italy', 'India', 'Bangladesh', 'USA'])
plt.bar(x, a['c'], width=3, color='blue', label='Cases')
plt.bar(x + 3, a['r'], width=3, color='green', label='Recover')
plt.bar(x + 6, a['d'], width=3, color='red', label='Death')
plt.title("Most affected countries due to covid19")
plt.legend()
plt.xlabel("Countries")
plt.ylabel("Number")
plt.show()

Draw the histogram based on the production of wheat in different years.

Year: 2000, 2002, 2004, 2006, 2008, 2010, 2012, 2014, 2016, 2018
Production: 4, 6, 7, 15, 24, 2, 19, 5, 16, 4

import pandas as pd
import matplotlib.pyplot as plt

data = {'Year': [2000, 2002, 2004, 2006, 2008, 2010, 2012, 2014, 2016, 2018],
        'Production': [4, 6, 7, 15, 24, 2, 19, 5, 16, 4]}
d = pd.DataFrame(data)
print(d)
# histogram of the Production column, grouped into 5 equal-width bins
d.hist(column='Production', bins=5, grid=True)
plt.show()
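
To check what the histogram bars represent, the bin counts can be computed without drawing anything. The sketch below uses numpy.histogram on the same production values with 5 bins; it is an illustrative check, not part of the original answer.

```python
import numpy as np

production = [4, 6, 7, 15, 24, 2, 19, 5, 16, 4]

# counts[i] is the number of values between edges[i] and edges[i + 1]
counts, edges = np.histogram(production, bins=5)

print(counts.tolist())  # [5, 1, 1, 2, 1]
print(edges)
```

The five bins split the range from 2 to 24 into equal widths of 4.4, and five of the ten values fall in the lowest bin.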

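The table shows passenger car fuel rates in miles per gallon for several years. Make a LINE GRAPH of the data. During which 2-year period did the fuel rate decrease?

YEAR: 2000  2002  2004  2006
RATE: 21.0  20.7  21.2  21.6

import matplotlib.pyplot as p

Yr = [2000, 2002, 2004, 2006]
rate = [21.0, 20.7, 21.2, 21.6]
p.plot(Yr, rate)
# show only the four years from the table on the X axis
p.xticks(Yr)
p.show()

The fuel rate decreased during the 2-year period 2000 to 2002, when it fell from 21.0 to 20.7 miles per gallon.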
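
The period of decrease can also be found by calculation instead of reading it off the graph. This is a small sketch, assuming the four readings from the table and using Series.diff to compare each reading with the previous one; the names change and decreases are illustrative.

```python
import pandas as pd

# fuel rate in miles per gallon, indexed by year
rate = pd.Series([21.0, 20.7, 21.2, 21.6], index=[2000, 2002, 2004, 2006])

# diff() gives the change from the previous reading (NaN for the first year)
change = rate.diff()

# years whose reading is lower than the one two years earlier
decreases = change[change < 0].index.tolist()
for year in decreases:
    print(f"Fuel rate decreased between {year - 2} and {year}")
```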

The number of bed-sheets manufactured by a factory during five consecutive weeks is given below.

Week                  First  Second  Third  Fourth  Fifth
Number of Bed-sheets    600     850    700     300    900

Draw the bar graph representing the above data.

import matplotlib.pyplot as p

x = ['First', 'Second', 'Third', 'Fourth', 'Fifth']
y = [600, 850, 700, 300, 900]

p.title('Production By Factory')
p.xlabel('Week')
p.ylabel('No. of Bed Sheets')
# one bar per week, half as wide as the slot for each week
p.bar(x, y, color='blue', width=.50)
p.show()
