0% found this document useful (0 votes)

37 views26 pages

DAV Practicals

The document contains questions and answers related to data analysis using Python. It includes examples of manipulating dataframes, plotting charts, handling missing values and converting date formats. Relevant NumPy and Pandas functions are used to group, aggregate, filter and visualize the data.

Uploaded by

108 Anirban

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views26 pages

DAV Practicals

Uploaded by

108 Anirban

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 26

NAME: ANIRBAN

BHATTACHARJEE
ROLL NO.- 21HCS4116
EXAMINATION ROLL NO.-
SEMESTER: Vth
PAPER: DATA ANALYSIS &
VISUALIZATION PRACTICALS

Q1. Given below is a dictionary having two keys ‘Boys’ and ‘Girls’ and having two lists of
heights of five Boys and Five Girls respectively as values associated with these keys

Original dictionary of lists:

{'Boys': [72, 68, 70, 69, 74], 'Girls': [63, 65, 69, 62, 61]}

From the given dictionary of lists create the following list of dictionaries:

[{'Boys': 72, 'Girls': 63}, {'Boys': 68, 'Girls': 65}, {'Boys': 70, 'Girls': 69}, {'Boys': 69, 'Girls': 62},
{‘Boys’:74, ‘Girls’:61]
Answer:-
def list_of_dict(heights):
keys=heights.keys()
# print(keys)
values = zip(*[heights[k] for k in keys])
# print(values)
result = [dict(zip(keys,v )) for v in values]
return result

heights = {'Boys':[72,68,70,69,74], 'Girls':[63,65,69,62,61]}

print("\n ORIGINAL DICTIONARY OF LISTS :" , heights)
print("\n NOW LIST OF DICTIONARIES : \n",list_of_dict(heights))

OUTPUT :

ORIGINAL DICTIONARY OF LISTS : {'Boys': [72, 68, 70, 69, 74], 'Girls': [63, 65, 69, 62, 61]}

NOW LIST OF DICTIONARIES :

[{'Boys': 72, 'Girls': 63}, {'Boys': 68, 'Girls': 65}, {'Boys': 70, 'Girls': 69}, {'Boys': 69, 'Girls': 62},
{'Boys': 74, 'Girls': 61}]

Q2. Write programs in Python using NumPy library to do the following:

a. Compute the mean, standard deviation, and variance of a two dimensional random
integer array along the second axis.

b. Get the indices of the sorted elements of a given array.

a. B = [56, 48, 22, 41, 78, 91, 24, 46, 8, 33]

c. Create a 2-dimensional array of size m x n integer elements, also print the shape, type
and data type of the array and then reshape it into nx m array, n and m are user inputs
given at the run time.

d. Test whether the elements of a given array are zero, non-zero and NaN. Record the
indices of these elements in three separate arrays.

Answer:- (a)
import numpy as np
arr = np.random.randint(1,50,(4,6))
arr

#along the second axis

#Mean
print('Mean of the array: ',arr.mean(axis=1))
#standard deviation
print('Standard Deviation of the array: ',arr.std(axis=1))
#variance
print('Variance of the array: ',arr.var(axis=1))

(b)
B = [56, 48, 22, 41, 78, 91, 24, 46, 8, 33]
arr1 = np.array(B)
#arr1
print("Sorted array: ",np.sort(arr1))
print("Indices of the sorted elements of a given array:
",np.argsort(arr1))

(c)

m = int(input('Enter the number of rows(m): '))

n = int(input('Enter the number of columns(n): '))
arr2 = np.random.randint(1,100,(m,n))
print(arr2)
print('Shape: ',arr2.shape)
print('Type: ',type(arr2))
print('Data Type: ',arr2.dtype)
arr2 = arr2.reshape(n,m)
print('After reshaping: \n',arr2)
print('New Shape: ',arr2.shape)

(d)

x = np.array([1, 0, 3, 4])
print("ORIGINAL ARRAY ::-> ",x)
print("\nTest if none of the elements of the said array is
zero ::-> ", np.all(x))

res = np.where(x == 0)[0]

print("The index of the zero elements is :: ",res)

x = np.array([1, 0, 0, 3, 2, 0])
print("\n")
print("\nORIGINAL ARRAY ::-> ",x)
print("\nTest whether any of the elements of a given array is
non-zero ::",np.any(x))
res = np.where(x != 0)[0]
print("The index of the non- zero elements is :: ",res)
x = np.array([0, 0, 0, 0])

a = np.array([1, 0, np.nan, 3, np.nan])

print("\n")
print("\nORIGINAL ARRAY ::-> ",a)
print("\nTest element-wise for NaN :: ",np.isnan(a))
res = np.where(np.isnan(a) == True)[0]
print("The index of the zero elements is :: ",res)

OUTPUT :

array([[17, 20, 31, 12, 16, 10],

[44, 22, 32, 42, 30, 6],
[49, 46, 33, 6, 3, 14],
[34, 39, 35, 17, 29, 20]])

(a)
Mean of the array: [17.66666667 29.33333333 25.16666667 29.
]
Standard Deviation of the array: [ 6.79869268 12.78888406
18.46994556 8.02080628]
Variance of the array: [ 46.22222222 163.55555556 341.13888889
64.33333333]

(b)
Sorted array: [ 8 22 24 33 41 46 48 56 78 91]
Indices of the sorted elements of a given array: [8 2 6 9 3 7 1
0 4 5]

(c)
[[ 6 77 89]
[55 43 24]]
Shape: (2, 3)
Type: <class 'numpy.ndarray'>
Data Type: int32
After reshaping:
[[ 6 77]
[89 55]
[43 24]]
New Shape: (3, 2)

(d)
ORIGINAL ARRAY ::-> [1 0 3 4]

Test if none of the elements of the said array is zero ::->

False
The index of the zero elements is :: [1]

ORIGINAL ARRAY ::-> [1 0 0 3 2 0]

Test whether any of the elements of a given array is non-zero ::
True
The index of the non- zero elements is :: [0 3 4]

ORIGINAL ARRAY ::-> [ 1. 0. nan 3. nan]

Test element-wise for NaN :: [False False True False True]

The index of the zero elements is :: [2 4]

Q3. Create a dataframe having at least 3 columns and 50 rows to store numeric data
generated using a random function. Replace 10% of the values by null values whose
index positions are generated using random function. Do the following:

 a. Identify and count missing values in a dataframe.

 b. Drop the column having more than 5 null values.
 c. Identify the row label having maximum of the sum of all values in a row and
drop that row.
 d. Sort the dataframe on the basis of the first column.
 e. Remove all duplicates from the first column.
 f. Find the correlation between first and second column and covariance between
second and third column.
 g. Detect the outliers and remove the rows having outliers.
 h. Discretize second column and create 5 bins

Answer:-

import pandas as pd
import numpy as np
df = pd.DataFrame(np.random.randint(0,100,size=(50,3)),
columns=list('123'))
df.head()

for c in df.sample(int(df.shape[0]*df.shape[1]*0.10)).index:
df.loc[c,str(np.random.randint(1,4))]=np.nan
df

(a)
print(df.isnull().sum().sum())
(b)
for col in df.columns:
print(col,df[col].isnull().sum())
df.dropna(axis = 1,thresh=(df.shape[0]-5)).head()

(c)
sum=df.sum(axis=1)
print("SUM IS :\n",sum)
print("\nMAXIMUM SUM IS :",sum.max())
max_sum_row = df.sum(axis=1).idxmax()
print("\nRow index having maximum sum is :" ,max_sum_row)

df = df.drop(max_sum_row ,axis =0)

print("\nDATA Frame AFTER REMOVING THE ROW HAVING MAXIMUM SUM
VALUE")
df

(d)
sortdf=df.sort_values('1')
sortdf.head()

(e)
df =df.drop_duplicates(subset='1',keep = "first")
print(df)

(f)
correlation = df['1'].corr(df['2'])
print("CORRELATION between column 1 and 2 : ", correlation)
covariance = df['2'].cov(df['3'])
print("COVARIANCE between column 2 and 3 :",covariance)

(g)
df.plot.box()

(h)
df1 = pd.cut(df['2'],bins=5).head()
df1

OUTPUT :
(a)
Q4. Consider two excel files having attendance of a workshop’s participants for two
days. Each file has three fields ‘Name’, ‘Time of joining’, duration (in minutes) where
names are unique within a file. Note that duration may take one of three values (30, 40,
50) only. Import the data into two dataframes and do the following:

 a.Perform merging of the two dataframes to find the names of students who had
attended the workshop on both days.
 b. Find names of all students who have attended workshop on either of the days.
 c. Merge two data frames row-wise and find the total number of records in the
data frame.
 d. Merge two data frames and use two columns names and duration as multi-row
indexes. Generate descriptive statistics for this multi-index.

Answer:-
import numpy as np
import pandas as pd
dfDay1 = pd.read_excel('Day1_anirbn.xlsx')
dfDay2 = pd.read_excel('Day2_anirbn.xlsx')
print(dfDay1.head(),"\n")
print(dfDay2.head())

(a)
pd.merge(dfDay1,dfDay2,how='inner',on='Name')

(b)
either_day = pd.merge(dfDay1,dfDay2,how='outer',on='Name')
either_day

(d)
both_days =
pd.merge(dfDay1,dfDay2,how='outer',on=['Name','Duration']).copy()
# creates a copy of an existing list

both_days.fillna(value='-',inplace=True) # to fill out the

missing values in the given series object

both_days.set_index(['Name','Duration']) # a method to set a

List as index of a Data Frame

OUTPUT :
Q5. Taking Iris data, plot the following with proper legend and axis labels: (Download
IRIS data from: https://archive.ics.uci.edu/ml/datasets/iris or import it from
sklearn.datasets)

 a. Plot bar chart to show the frequency of each class label in the data.
 b. Draw a scatter plot for Petal width vs sepal width.
 c. Plot density distribution for feature petal length.
 d. Use a pair plot to show pairwise bivariate distribution in the Iris Dataset.

Answer:-

import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
iris = sns.load_dataset('iris')

(a)
sns.countplot(x='species',data=iris,palette='Set2')
plt.xlabel('Species')
plt.ylabel('Frequency')
plt.title('Frequency of Each class label')

(b)
plt.scatter(x='petal_width',y='sepal_width',data=iris)
plt.xlabel('Petal Width')
plt.ylabel('Sepal Width')
plt.title("Scatter plot Petel width vs Sepal Width")

(d)
sns.pairplot(iris,hue='species',palette='coolwarm')

OUTPUT :
Next Page -----
Q6. Consider any sales training/ weather forecasting dataset

a. Compute mean of a series grouped by another series

b. Fill an intermittent time series to replace all missing dates with values of previous
non-missing date.

c. Perform appropriate year-month string to dates conversion.

d. Split a dataset to group by two columns and then sort the aggregated results within
the groups.

e. Split a given dataframe into groups with bin counts.

Answer:-
import pandas as pd
import numpy as np

data = {
'Date': pd.date_range(start='2022-01-01', end='2022-01-10'),
'Sales': [100, 120, np.nan, 150, 200, 180, np.nan, 220, 250,
300],
'Product': ['A', 'B', 'A', 'B', 'A', 'B', 'A', 'B', 'A', 'B']
}

df = pd.DataFrame(data)
print("Original Dataset:")
print(df)
print("\n")

# a. Compute mean of 'Sales' grouped by 'Product'

mean_sales = df.groupby('Product')['Sales'].mean()
print("Mean Sales Grouped by Product:")
print(mean_sales)
print("\n")

# b. Fill missing values in 'Sales' with the previous non-missing

date
df['Date'] = pd.to_datetime(df['Date'])
df = df.set_index('Date')
df = df.resample('D').ffill()
print("Dataset after filling missing values:")
print(df)
print("\n")

# c. Perform year-month string to date conversion

df['YearMonth'] =
pd.to_datetime(df.index.to_period('M')).to_timestamp()
print("Dataset after year-month string to date conversion:")
print(df)
print("\n")

# d. Split a dataset to group by two columns and then sort the

aggregated results within the groups
sorted_sales = df.groupby(['Product', 'Date'])
['Sales'].sum().sort_values(ascending=False)
print("Sorted Sales Grouped by Product and Date:")
print(sorted_sales)
print("\n")

# e. Split a given dataframe into groups with bin counts

num_bins = 3
df['SalesBins'] = pd.cut(df['Sales'], bins=num_bins)
bin_counts = df.groupby('SalesBins').size()
print("Bin Counts for Sales:")
print(bin_counts)

OUTPUT:-

Original Dataset:
Date Sales Product
0 2022-01-01 100.0 A
1 2022-01-02 120.0 B
2 2022-01-03 NaN A
3 2022-01-04 150.0 B
4 2022-01-05 200.0 A
5 2022-01-06 180.0 B
6 2022-01-07 NaN A
7 2022-01-08 220.0 B
8 2022-01-09 250.0 A
9 2022-01-10 300.0 B

Mean Sales Grouped by Product:

Product
A 182.5
B 212.5
Name: Sales, dtype: float64

Dataset after filling missing values:

Sales Product
Date
2022-01-01 100.0 A
2022-01-02 120.0 B
2022-01-03 120.0 A
2022-01-04 150.0 B
2022-01-05 200.0 A
2022-01-06 180.0 B
2022-01-07 180.0 A
2022-01-08 220.0 B
2022-01-09 250.0 A
2022-01-10 300.0 B

Dataset after year-month string to date conversion:

Sales Product YearMonth
Date
2022-01-01 100.0 A 2022-01-01
2022-01-02 120.0 B 2022-01-01
2022-01-03 120.0 A 2022-01-01
2022-01-04 150.0 B 2022-01-01
2022-01-05 200.0 A 2022-01-01
2022-01-06 180.0 B 2022-01-01
2022-01-07 180.0 A 2022-01-01
2022-01-08 220.0 B 2022-01-01
2022-01-09 250.0 A 2022-01-01
2022-01-10 300.0 B 2022-01-01

Sorted Sales Grouped by Product and Date:

Product Date
B 2022-01-10 300.0
2022-01-08 220.0
2022-01-06 180.0
2022-01-04 150.0
2022-01-02 120.0
A 2022-01-09 250.0
2022-01-05 200.0
2022-01-01 100.0
2022-01-07 180.0
2022-01-03 120.0
Name: Sales, dtype: float64

Bin Counts for Sales:

SalesBins
(99.7, 140.0] 3
(140.0, 180.0] 4
(180.0, 300.0] 3
dtype: int64

Q7. Consider a data frame containing data about students i.e. name, gender and
passing division:

a. Perform one hot encoding of the last two columns of categorical data using the
get_dummies() function.

b. Sort this data frame on the “Birth Month” column (i.e. January to December). Hint:
Convert Month to Categorical.

Answer: -
import pandas as pd

# Creating the student DataFrame

data = {
'Name': ['Mudit Chauhan', 'Seema Chopra', 'Rani Gupta',
'Aditya Narayan', 'Sanjeev Sahni', 'Prakash Kumar',
'Ritu Agarwal', 'Akshay Goel', 'Meeta Kulkarni',
'Preeti Ahuja', 'Sunil Das Gupta', 'Sonali Sapre',
'Rashmi Talwar', 'Ashish Dubey', 'Kiran Sharma',
'Sameer Bansal'],
'Birth_Month': ['December', 'January', 'March', 'October',
'February', 'December', 'September', 'August',
'July', 'November', 'April', 'January',
'June', 'May', 'February', 'October'],
'Gender': ['M', 'F', 'F', 'M', 'M', 'M', 'F', 'M', 'F', 'F',
'M', 'F', 'F', 'M', 'F', 'M'],
'Pass_Division': ['III', 'II', 'I', 'I', 'II', 'III', 'I',
'I', 'II', 'II', 'III', 'I', 'III', 'II', 'II', 'I']
}

df = pd.DataFrame(data)

# a. Perform one hot encoding of the last two columns using

get_dummies()
df_encoded = pd.get_dummies(df, columns=['Gender',
'Pass_Division'])

# b. Sort the DataFrame on the "Birth Month" column

month_order = ['January', 'February', 'March', 'April', 'May',
'June', 'July', 'August', 'September', 'October', 'November',
'December']
df_encoded['Birth_Month'] =
pd.Categorical(df_encoded['Birth_Month'], categories=month_order,
ordered=True)
df_encoded = df_encoded.sort_values(by='Birth_Month')

# Displaying the resulting DataFrame

print("DataFrame after one-hot encoding and sorting:")
print(df_encoded)

OUTPUT:-
Q8. Consider the following data frame containing a family name, gender of the family
member and her/his monthly income in each record.

Write a program in Python using Pandas to perform the following:

a. Calculate and display familywise gross monthly income.

b. Calculate and display the member with the highest monthly income in a family.

c. Calculate and display monthly income of all members with income greater than Rs.
60000.00.

d. Calculate and display the average monthly income of the female members in the
Shah family.

Answer:-
import pandas as pd

# Creating the DataFrame

data = {
'Name': ['Shah', 'Vats', 'Vats', 'Kumar', 'Vats', 'Kumar',
'Shah', 'Shah', 'Kumar', 'Vats'],
'Gender': ['Male', 'Male', 'Female', 'Female', 'Female',
'Male', 'Male', 'Female', 'Female', 'Male'],
'MonthlyIncome': [114000.00, 65000.00, 43150.00, 69500.00,
155000.00, 103000.00, 55000.00, 112400.00, 81030.00, 71900.00]
}

df = pd.DataFrame(data)

# a. Calculate and display familywise gross monthly income

familywise_income = df.groupby('Name')['MonthlyIncome'].sum()
print("Familywise Gross Monthly Income:")
print(familywise_income)
print("\n")

# b. Calculate and display the member with the highest monthly

income in each family
max_income_member = df.loc[df.groupby('Name')
['MonthlyIncome'].idxmax()]
print("Member with the Highest Monthly Income in Each Family:")
print(max_income_member)
print("\n")

# c. Calculate and display monthly income of all members with

income greater than Rs. 60000.00
high_income_members = df[df['MonthlyIncome'] > 60000.00]
print("Monthly Income of Members with Income Greater than Rs.
60000.00:")
print(high_income_members[['Name', 'Gender', 'MonthlyIncome']])
print("\n")

# d. Calculate and display the average monthly income of the

female members in the Shah family
average_female_income_shah = df[(df['Name'] == 'Shah') &
(df['Gender'] == 'Female')]['MonthlyIncome'].mean()
print("Average Monthly Income of Female Members in the Shah
Family:")
print(average_female_income_shah)

OUTPUT:-

Abu Dhabi Ports Company (PJSC) : Shamal Development - New 33/11Kv Substation
No ratings yet
Abu Dhabi Ports Company (PJSC) : Shamal Development - New 33/11Kv Substation
52 pages
2023 Data Analysis and Visualization Using Python
100% (2)
2023 Data Analysis and Visualization Using Python
9 pages
Sarthak Python
No ratings yet
Sarthak Python
6 pages
ENGLISH-8-Quarter 2-Week 5
100% (1)
ENGLISH-8-Quarter 2-Week 5
6 pages
HW3
0% (1)
HW3
3 pages
DAV Practical
No ratings yet
DAV Practical
12 pages
21hcs4108 Davpracticals
No ratings yet
21hcs4108 Davpracticals
29 pages
QP - Ip PB19-01QP
No ratings yet
QP - Ip PB19-01QP
7 pages
Assignment 4 - Jupyter Notebook
No ratings yet
Assignment 4 - Jupyter Notebook
6 pages
Conjugate Beam Method SLU
No ratings yet
Conjugate Beam Method SLU
41 pages
Abhiml ML File
No ratings yet
Abhiml ML File
74 pages
Service Manual, PM7100, English PT00112534 Rev A Release 8-2020
No ratings yet
Service Manual, PM7100, English PT00112534 Rev A Release 8-2020
64 pages
Data Science Practicals - Ipynb
No ratings yet
Data Science Practicals - Ipynb
54 pages
Cs Sem III Dav Upc 2343012002 Sl. No. Qp. 1673 Dec '23
No ratings yet
Cs Sem III Dav Upc 2343012002 Sl. No. Qp. 1673 Dec '23
12 pages
Ai Tools and Applications-Lab
No ratings yet
Ai Tools and Applications-Lab
33 pages
Code 2
No ratings yet
Code 2
3 pages
Code
No ratings yet
Code
3 pages
CS3361 Set2
No ratings yet
CS3361 Set2
6 pages
Manishadav
No ratings yet
Manishadav
27 pages
QP DAV 3rd Sem Dec 2023
No ratings yet
QP DAV 3rd Sem Dec 2023
12 pages
23bet10114 Naman Gupta Assignment-1
No ratings yet
23bet10114 Naman Gupta Assignment-1
17 pages
FDS Slot 1
No ratings yet
FDS Slot 1
19 pages
Ge - Computer Science Data Analysis
No ratings yet
Ge - Computer Science Data Analysis
16 pages
Batch2 FDS Printout
No ratings yet
Batch2 FDS Printout
38 pages
GE Python Visualization 2023
No ratings yet
GE Python Visualization 2023
16 pages
FDS Lab 1 Manuel .1..1new
No ratings yet
FDS Lab 1 Manuel .1..1new
38 pages
The Relationship of Endodontic-Periodontic Lesions
No ratings yet
The Relationship of Endodontic-Periodontic Lesions
7 pages
FDS Lab 1 Manuel .1..1new
No ratings yet
FDS Lab 1 Manuel .1..1new
34 pages
HW 1
No ratings yet
HW 1
11 pages
Ge Sem II Dav Upc 2344001201 Sl. No. Qp. 2012 July 2023
No ratings yet
Ge Sem II Dav Upc 2344001201 Sl. No. Qp. 2012 July 2023
16 pages
DAV Practical File 234003
No ratings yet
DAV Practical File 234003
14 pages
Chapter Three
No ratings yet
Chapter Three
15 pages
DAV Practicle File
No ratings yet
DAV Practicle File
28 pages
Practical File
No ratings yet
Practical File
19 pages
23HCS4142 PDF
No ratings yet
23HCS4142 PDF
24 pages
WAH5 - Functional Language Worksheets
No ratings yet
WAH5 - Functional Language Worksheets
6 pages
Print Print Print Print: Import As
No ratings yet
Print Print Print Print: Import As
6 pages
GE Practical Sem 2
No ratings yet
GE Practical Sem 2
28 pages
Assignment 2 Ds
No ratings yet
Assignment 2 Ds
8 pages
Mayank Chaudhary DEV Practicals
No ratings yet
Mayank Chaudhary DEV Practicals
14 pages
Data Science Practical Book - Ipynb
No ratings yet
Data Science Practical Book - Ipynb
21 pages
Dal Programs With Output
No ratings yet
Dal Programs With Output
11 pages
Numpy Dataframe
No ratings yet
Numpy Dataframe
12 pages
DWDM Lab Manual
No ratings yet
DWDM Lab Manual
32 pages
Story of Creation
No ratings yet
Story of Creation
3 pages
Gec Practicals
No ratings yet
Gec Practicals
31 pages
Vanshika Goyal Gec Practicals
No ratings yet
Vanshika Goyal Gec Practicals
31 pages
DP Prog
No ratings yet
DP Prog
10 pages
Sowmi DS
No ratings yet
Sowmi DS
27 pages
DAVPy 2024GE
No ratings yet
DAVPy 2024GE
12 pages
DXE 24gksmknvj
No ratings yet
DXE 24gksmknvj
16 pages
Practical 1 and 2-1
No ratings yet
Practical 1 and 2-1
33 pages
Even Students
No ratings yet
Even Students
36 pages
PW2 DataCleaning
No ratings yet
PW2 DataCleaning
6 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
Python 1
No ratings yet
Python 1
16 pages
L-2 (Data Frame Part 1) .Ipynb - Colab
No ratings yet
L-2 (Data Frame Part 1) .Ipynb - Colab
5 pages
PYQ Data Analysis and Visualisation Using Python GE May 2024
No ratings yet
PYQ Data Analysis and Visualisation Using Python GE May 2024
6 pages
DSC Lab Programs
No ratings yet
DSC Lab Programs
24 pages
GE - Computer Scien 4ogygeb
No ratings yet
GE - Computer Scien 4ogygeb
8 pages
PP DWDM 4 5
No ratings yet
PP DWDM 4 5
26 pages
GE02 (DAVP) Assignment
No ratings yet
GE02 (DAVP) Assignment
3 pages
Fda Batch2program
No ratings yet
Fda Batch2program
18 pages
AE II Simulation File PDF
No ratings yet
AE II Simulation File PDF
32 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
GE - Computer Scien EaQvs42
No ratings yet
GE - Computer Scien EaQvs42
6 pages
Data Analysis 6060
No ratings yet
Data Analysis 6060
6 pages
Chapter 2
No ratings yet
Chapter 2
179 pages
Pebeo
No ratings yet
Pebeo
1 page
Mercedes-Benz: Faculty of Political Science
No ratings yet
Mercedes-Benz: Faculty of Political Science
7 pages
Acceleration-Deceleration Behaviour of Various Vehicle Types PDF
No ratings yet
Acceleration-Deceleration Behaviour of Various Vehicle Types PDF
29 pages
Unit2-Database System Concepts and Archi Notes
No ratings yet
Unit2-Database System Concepts and Archi Notes
6 pages
Boiling: 1. Neutralization of Magma Gas in Host Rock at Deep Location
No ratings yet
Boiling: 1. Neutralization of Magma Gas in Host Rock at Deep Location
84 pages
Hanover Report 1978
100% (1)
Hanover Report 1978
10 pages
Trabajo Final de Ingles Técnico
No ratings yet
Trabajo Final de Ingles Técnico
5 pages
Project Charter Template
No ratings yet
Project Charter Template
9 pages
FICM Unit 3
No ratings yet
FICM Unit 3
6 pages
Unit4-Relational Data Model Notes
No ratings yet
Unit4-Relational Data Model Notes
7 pages
Chapter Six
No ratings yet
Chapter Six
12 pages
TH 2
No ratings yet
TH 2
4 pages
Nature 14432
No ratings yet
Nature 14432
17 pages
2022ce11566 Srijan Lab
No ratings yet
2022ce11566 Srijan Lab
9 pages
Curs 2-Formatare Conditionala
No ratings yet
Curs 2-Formatare Conditionala
12 pages
Update On Renewed Effort To Strengthen Routine Immunization
No ratings yet
Update On Renewed Effort To Strengthen Routine Immunization
49 pages
Hopf Bifurcation Normal Form
100% (2)
Hopf Bifurcation Normal Form
3 pages
Web of Science Core Collection:: Journal Evaluation Process and Selection Criteria
No ratings yet
Web of Science Core Collection:: Journal Evaluation Process and Selection Criteria
35 pages
Abdullah Shakeel, Bscs 3b 19arid5127
No ratings yet
Abdullah Shakeel, Bscs 3b 19arid5127
6 pages
Chapter 7 Software Reuse
No ratings yet
Chapter 7 Software Reuse
30 pages
Abrahams & Millar (2008)
No ratings yet
Abrahams & Millar (2008)
27 pages
Chapter Four
No ratings yet
Chapter Four
10 pages
Active Assisted Knee Flexion and Extension
No ratings yet
Active Assisted Knee Flexion and Extension
46 pages
Semitic Alphabets
No ratings yet
Semitic Alphabets
16 pages
Porsche Case Study
No ratings yet
Porsche Case Study
4 pages
M P5 Rev1 Sem2 2024 2025
No ratings yet
M P5 Rev1 Sem2 2024 2025
6 pages
IGNOU BCA Computer Oriented Numerical Technique Previous Year Unsolved Papers BCS 054
From Everand
IGNOU BCA Computer Oriented Numerical Technique Previous Year Unsolved Papers BCS 054
Manish Soni
No ratings yet

DAV Practicals

Uploaded by

DAV Practicals

Uploaded by

NAME: ANIRBAN

Original dictionary of lists:

heights = {'Boys':[72,68,70,69,74], 'Girls':[63,65,69,62,61]}

NOW LIST OF DICTIONARIES :

Q2. Write programs in Python using NumPy library to do the following:

b. Get the indices of the sorted elements of a given array.

a. B = [56, 48, 22, 41, 78, 91, 24, 46, 8, 33]

#along the second axis

m = int(input('Enter the number of rows(m): '))

res = np.where(x == 0)[0]

a = np.array([1, 0, np.nan, 3, np.nan])

array([[17, 20, 31, 12, 16, 10],

Test if none of the elements of the said array is zero ::->

ORIGINAL ARRAY ::-> [1 0 0 3 2 0]

ORIGINAL ARRAY ::-> [ 1. 0. nan 3. nan]

Test element-wise for NaN :: [False False True False True]

 a. Identify and count missing values in a dataframe.

df = df.drop(max_sum_row ,axis =0)

both_days.fillna(value='-',inplace=True) # to fill out the

both_days.set_index(['Name','Duration']) # a method to set a

a. Compute mean of a series grouped by another series

c. Perform appropriate year-month string to dates conversion.

e. Split a given dataframe into groups with bin counts.

# a. Compute mean of 'Sales' grouped by 'Product'

# b. Fill missing values in 'Sales' with the previous non-missing

# c. Perform year-month string to date conversion

# d. Split a dataset to group by two columns and then sort the

# e. Split a given dataframe into groups with bin counts

Mean Sales Grouped by Product:

Dataset after filling missing values:

Dataset after year-month string to date conversion:

Sorted Sales Grouped by Product and Date:

Bin Counts for Sales:

# Creating the student DataFrame

# a. Perform one hot encoding of the last two columns using

# b. Sort the DataFrame on the "Birth Month" column

# Displaying the resulting DataFrame

Write a program in Python using Pandas to perform the following:

a. Calculate and display familywise gross monthly income.

# Creating the DataFrame

# a. Calculate and display familywise gross monthly income

# b. Calculate and display the member with the highest monthly

# c. Calculate and display monthly income of all members with

# d. Calculate and display the average monthly income of the

You might also like