0% found this document useful (0 votes)

7 views30 pages

Ids 1

The document outlines a practical file for a BCA course on Data Science, detailing various experiments involving data manipulation using Python's pandas library. Each experiment includes code snippets and expected outputs, covering topics such as creating Series and DataFrames, statistical calculations, and data visualization techniques. The file serves as a comprehensive guide for students to apply their theoretical knowledge in practical scenarios.

Uploaded by

rawatsumit9902

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views30 pages

Ids 1

Uploaded by

rawatsumit9902

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 30

DEPARTMENT OF INFORMATION COMMUNICATION &

TECHNOLOGY
PRACTICAL FILE
OF
BCA 212P
(INTRODUCTION OF DATA SCIENCE)
Academic session: 2024-25
Batch: 2023-26

Submitted to: Submitted by:

Ms. Mansi Jaiswal Name: Harsh Negi
(Assistant Professor) Enrolment no: 00717002023
Program: BCA
Semester: 4th
Shift: 1st
Division: A
1
INDEX
S.no Experiments Sign
1 Create a pandas series from a dictionary of values and an
ndarray.
2 Create a Series and print all the elements that are above
75th percentile.
3 Perform sorting on Series data and DataFrames
4 Write a program to implement pivot() and pivot-table() on a
DataFrame.
5 Write a program to find mean absolute deviation on a
DataFrame.
6 Two Series object, Population stores the details of four
metro cities of India and another object AvgIncome stores
the total average income reported in four years in these
cities. Calculate income per capita for each of these metro
cities.
7 Create a DataFrame based on E-Commerce data and
generate mean, mode, median.
8 Create a DataFrame based on employee data and generate
quartile and variance.
9 Program to implement Skewness on Random data.
10 Create a DateFrame on any Data and compute statistical
function of Kurtosis.
11 Series objects Temp1, temp2, temp3, temp 4 stores the
temperature of days of week 1, week 2, week 3, week 4.
Write a script to:-
a. Print average temperature per week
b. Print average temperature of entire month
12 Write a Program to read a CSV file and create its
DataFrame.
13 Consider the DataFrame QtrSales where each row contains
the item category, item name and expenditure and group
the rows by category, and print the average expenditure per
category.
14 Create a DataFrame having age, name, weight of five
students. Write a program to display only the weight of first
and fourth rows.
15 Write a program to create a DataFrame to store weight, age
and name of three people. Print the DataFrame and its
transpose.

2
Experiment -1
Create a pandas series from a dictionary of values and an
ndarray.
Code: -
import pandas as pd
import numpy as np
data=np.array([1,2,3,4,5])
Series1=pd.Series(data)
print(Series1)
data_dict={"a":10,"b":20,"c":30}
Series2=pd.Series(data_dict)
print(Series2)

Output: -

3
Experiment-2
Create a Series and print all the elements that are above 75th
percentile.
Code: -
import pandas as pd

import numpy as np

# Create a random Series

np.random.seed(42) # For reproducibility

s = pd.Series(np.random.randint(1, 100, 10)) # 10 random integers between 1 and 100

print("Original Series:\n", s)

# Calculate 75th percentile

percentile_75 = s.quantile(0.75)

print("\n75th Percentile:", percentile_75)

# Filter and print elements above 75th percentile

above_75th = s[s > percentile_75]

print("\nElements above 75th percentile:\n", above_75th)

Output: -

4
Experiment-3
Perform sorting on Series data and DataFrames.
Code: -
import pandas as pd

# Create a Series

my_series = pd.Series([5, 1, 9, 2, 7])

print("Original Series:\n", my_series)

# Sort the Series (smallest to largest)

sorted_series = my_series.sort_values()

print("\nSorted Series:\n", sorted_series)

# Sort Series from largest to smallest

sorted_series_desc = my_series.sort_values(ascending=False)

print("\nSorted Series (Descending):\n", sorted_series_desc)

# --- Sorting DataFrames (Easy) ---

# Create a DataFrame

data = {'Name': ['Charlie', 'Alice', 'Bob'],

'Age': [25, 30, 22]}

my_df = pd.DataFrame(data)

print("\nOriginal DataFrame:\n", my_df)

# Sort the DataFrame by 'Age' (youngest to oldest)

sorted_df = my_df.sort_values(by='Age')

print("\nSorted DataFrame by Age:\n", sorted_df)

5
# Sort the DataFrame by 'Name' (alphabetical order)

sorted_df_name = my_df.sort_values(by='Name')

print("\nSorted DataFrame by Name:\n", sorted_df_name)

# Sort the DataFrame by 'Age' (oldest to youngest)

sorted_df_desc_age = my_df.sort_values(by='Age', ascending=False)

print("\nSorted DataFrame by Age (Descending):\n", sorted_df_desc_age)

Output: -

6
7
Experiment-4
Write a program to implement pivot() and pivot-table() on a
DataFrame.
Code: -
import pandas as pd

# Sample DataFrame
data = {
'Date': ['2023-01-01', '2023-01-01', '2023-01-02', '2023-01-02', '2023-01-03'],
'Category': ['A', 'B', 'A', 'B', 'A'],
'Value': [10, 20, 15, 25, 30]
}

df = pd.DataFrame(data)

# Display the original DataFrame

print("Original DataFrame:")
print(df)

# Using pivot() to reshape the DataFrame

pivot_df = df.pivot(index='Date', columns='Category', values='Value')
print("\nPivoted DataFrame using pivot():")
print(pivot_df)

# Using pivot_table() to reshape the DataFrame

# Here we will use pivot_table to handle potential duplicates by taking the mean
pivot_table_df = df.pivot_table(index='Date', columns='Category', values='Value',
aggfunc='mean')

8
print("\nPivoted DataFrame using pivot_table():")
print(pivot_table_df)

Output: -

9
Experiment-5
Write a program to find mean absolute deviation on a
DataFrame.
Code: -
import pandas as pd

# Sample DataFrame
data = {
'A': [1, 2, 3, 4, 5],
'B': [5, 6, 7, 8, 9],
'C': [10, 11, 12, 13, 14]
}

df = pd.DataFrame(data)
print("Original DataFrame:\n", df)

# Function to calculate Mean Absolute Deviation

def mean_absolute_deviation(df):
# Calculate the mean of each column
mean = df.mean()
# Calculate the absolute deviation from the mean
absolute_deviation = abs(df - mean)
# Calculate the mean of the absolute deviations
mad = absolute_deviation.mean()
return mad

# Calculate Mean Absolute Deviation for the DataFrame

mad_result = mean_absolute_deviation(df)

10
# Display the result
print("Mean Absolute Deviation for each column:")
print(mad_result)

Output: -

11
Experiment-6
Two Series object, Population stores the details of four metro
cities of India and another object AvgIncome stores the total
average income reported in four years in these cities.
Calculate income per capita for each of these metro cities.
Code:-
import pandas as pd

# Example data for Population (in millions)

Population = pd.Series({
'DehraDun': 20.4,
'Almora': 18.9,
'Nanital': 12.3,
})
print("Population of Different cities:")
print(Population,end="\n\n")

# Example data for AvgIncome (in millions)

AvgIncome = pd.Series({
'DehraDun': 150,
'Almora': 120,
'Nanital': 100,
})
print("Average Income of Different cities:")
print(AvgIncome,end="\n\n")
# Calculate income per capita
12
IncomePerCapita = AvgIncome / Population

# Display the result

print("IncomePerCapita of Different Cities:")
print(IncomePerCapita)

Output:-

13
Experiment-7
Create a DataFrame based on E-Commerce data and generate
mean, mode, median.
Code:-
import pandas as pd

# Sample E-Commerce data

data = {
'OrderID': [1, 2, 3, 4, 5, 6, 7, 8, 9, 10],
'Product': ['Laptop', 'Smartphone', 'Tablet', 'Laptop', 'Smartphone',
'Tablet', 'Laptop', 'Smartphone', 'Tablet', 'Laptop'],
'Quantity': [1, 2, 1, 1, 3, 2, 1, 1, 2, 1],
'Price': [1000, 500, 300, 1000, 500, 300, 1000, 500, 300, 1000]
}

# Create DataFrame
ecommerce_df = pd.DataFrame(data)

# Display the DataFrame

print("E-Commerce Dataframe:")
print(ecommerce_df)
print("\n")

# Calculate mean
mean_price = ecommerce_df['Price'].mean()

14
# Calculate mode
mode_price = ecommerce_df['Price'].mode()[0]

# Calculate median
median_price = ecommerce_df['Price'].median()

# Display the results

print(f"Mean Price: {mean_price}")
print(f"Mode Price: {mode_price}")
print(f"Median Price: {median_price}")

Output:-

15
Experiment-8
Create a DataFrame based on employee data and generate
quartile and variance.
Code:-
import pandas as pd
# Sample employee data
data = {
'EmployeeID': [1, 2, 3, 4, 5, 6, 7, 8, 9, 10],
'Name': ['Krishna', 'Murali', 'Chaitanya', 'Shyam', 'Govind',
'Madhav', 'Gopal', 'Gopal', 'Murari', 'Keshava'],
'Age': [25, 30, 35, 40, 28, 32, 45, 50, 29, 38],
'Salary': [50000, 60000, 70000, 80000, 55000,
62000, 75000, 90000, 58000, 72000],
'YearsAtCompany': [1, 2, 3, 4, 1, 2, 5, 6, 2, 3]
}
# Create DataFrame
employee_df = pd.DataFrame(data)
# Display the DataFrame
print("Employees Data:")
print(employee_df)
# Calculate quartiles
quartiles_salary = employee_df['Salary'].quantile([0.25, 0.5, 0.75])
quartiles_years = employee_df['YearsAtCompany'].quantile([0.25, 0.5, 0.75])

# Calculate variance

16
variance_salary = employee_df['Salary'].var()
variance_years = employee_df['YearsAtCompany'].var()
# Display the results
print("\nQuartiles for Salary:")
print(quartiles_salary)
print("\nQuartiles for Years at Company:")
print(quartiles_years)
print(f"\nVariance for Salary: {variance_salary}")
print(f"Variance for Years at Company: {variance_years}")

Output: -

17
Experiment-9
Program to implement Skewness on Random data.
Code: -
# Program to implement Skewness on Random data.
import numpy as np
from scipy.stats import skew
# Generate random data
data = data = np.random.normal(1, 100, 15)
print("Random Numbers:")
print(data)
# Calculate skewness
data_skewness = skew(data)
# Print the skewness
print(f"\nSkewness of the data: {data_skewness}")

Output: -

18
Experiment-10
Create a DateFrame on any Data and compute statistical
function of Kurtosis.
Code: -
import pandas as pd
from scipy.stats import kurtosis

# Step 1: Create a sample DataFrame

data = {
'EmployeeID': [1, 2, 3, 4, 5, 6, 7, 8, 9, 10],
'Name':['Krishna', 'Murali', 'Chaitanya', 'Shyam', 'Govind',
'Madhav', 'Gopal', 'Gopal', 'Murari', 'Keshava'],
'Age': [25, 30, 35, 40, 28, 32, 45, 50, 29, 38],
'Salary': [50000, 60000, 70000, 80000, 55000,
62000, 75000, 90000, 58000, 72000],
'YearsAtCompany': [1, 2, 3, 4, 1, 2, 5, 6, 2, 3]
}

# Create DataFrame
employee_df = pd.DataFrame(data)

# Display the DataFrame

print("Employee DataFrame:")
print(employee_df)

# Step 2: Compute kurtosis for the 'Salary' column

kurtosis_salary = kurtosis(employee_df['Salary'], fisher=True) # Fisher's definition (subtracts
3)

19
# Display the kurtosis result
print(f"\nKurtosis of Salary: {kurtosis_salary}")

Output: -

20
Experiment-11
Series objects Temp1, temp2, temp3, temp 4 stores the
temperature of days of week 1, week 2, week 3, week 4.
Write a script to:-
a. Print average temperature per week
b. Print average temperature of entire month
Code: -
import pandas as pd

# Sample temperature data for four weeks (7 days each)

data = {
'Week 1': [30, 32, 31, 29, 28, 30, 31], # Week 1
'Week 2': [31, 30, 29, 32, 33, 31, 30], # Week 2
'Week 3': [28, 29, 30, 31, 32, 30, 29], # Week 3
'Week 4': [30, 31, 32, 33, 34, 30, 31] # Week 4
}

# Create DataFrame
temperature_df = pd.DataFrame(data)

# Display the DataFrame

print("Temperature DataFrame:")
print(temperature_df)

# a. Print average temperature per week

avg_temp_per_week = temperature_df.mean()
print("\nAverage temperature per week:")

21
print(avg_temp_per_week)

# b. Print average temperature of entire month

avg_temp_month = temperature_df.values.flatten().mean()
print(f"\nAverage temperature for the entire month: {avg_temp_month:.2f}°C")

Output: -

22
Experiment-12
Write a Program to read a CSV file and create its DataFrame.
Code: -
CSV File
EmployeeID,Name,Age,Salary
1,Shyam,30,50000
2,Gopal,25,60000
3,Madhav,35,70000
4,keshava,40,80000
5,Murari,28,55000
Python File
import pandas as pd

# Step 1: Read the CSV file

file_path = 'L12.csv' # Make sure this path is correct
employee_df = pd.read_csv(file_path)

# Step 2: Display the DataFrame

print("Employee DataFrame:")
print(employee_df)

# Optional: Display basic information about the DataFrame

print("\nBasic Information about the DataFrame:")
print(employee_df.info())

# Optional: Display the first few rows of the DataFrame

print("\nFirst few rows of the DataFrame:")
print(employee_df.head())

23
Output: -

24
Experiment-13
Consider the DataFrame QtrSales where each row contains
the item category, item name and expenditure and group the
rows by category, and print the average expenditure per
category.
Code: -
import pandas as pd

# Sample data for QtrSales DataFrame

data = {
'Category': ['Electronics', 'Electronics', 'Clothing', 'Clothing', 'Groceries',
'Groceries'],
'Item': ['Laptop', 'Smartphone', 'T-shirt', 'Jeans', 'Milk', 'Bread'],
'Expenditure': [1200, 800, 50, 60, 30, 20]
}

# Create DataFrame
QtrSales = pd.DataFrame(data)

# Display the DataFrame

print("QtrSales DataFrame:")
print(QtrSales)

# Group by 'Category' and calculate the average expenditure

average_expenditure = QtrSales.groupby('Category')['Expenditure'].mean()

25
# Display the average expenditure per category
print("\nAverage Expenditure per Category:")
print(average_expenditure)

Output: -

26
Experiment-14
Create a DataFrame having age, name, weight of five
students. Write a program to display only the weight of first
and fourth rows.
Code: -
import pandas as pd

# Sample data for five students

data = {
'Name': ['Madhav', 'Shyam', 'Murari', 'Gopal', 'Keshava'],
'Age': [20, 21, 19, 22, 20],
'Weight': [55, 70, 60, 80, 65] # Weight in kg
}

# Create DataFrame
students_df = pd.DataFrame(data)

# Display the DataFrame

print("Students DataFrame:")
print(students_df)

# Display the weight of the first and fourth rows

weights = students_df.iloc[[0, 3]]['Weight']

print("\nWeight of the first and fourth students:")

print(weights)
27
Output: -

28
Experiment-15
Write a program to create a DataFrame to store weight, age
and name of three people. Print the DataFrame and its
transpose.
Code: -
import pandas as pd

# Sample data for three people

data = {
'Name': ['Keshava', 'Madhav', 'Murari'],
'Age': [25, 30, 35],
'Weight': [55, 70, 80] # Weight in kg
}

# Create DataFrame
people_df = pd.DataFrame(data)

# Display the DataFrame

print("DataFrame:")
print(people_df)

# Print the transpose of the DataFrame

print("\nTranspose of the DataFrame:")
print(people_df.T)

29
Output: -

Practical File 2024
No ratings yet
Practical File 2024
25 pages
12th Practical
No ratings yet
12th Practical
21 pages
18bba098 Alison PDF
No ratings yet
18bba098 Alison PDF
8 pages
IP Lab Record
No ratings yet
IP Lab Record
23 pages
Lab Report Exel
No ratings yet
Lab Report Exel
20 pages
IT Skill LAB-2 Practical Question
No ratings yet
IT Skill LAB-2 Practical Question
2 pages
MCQ Questions
No ratings yet
MCQ Questions
23 pages
Property Portfolio: Postcode Type Location No Bedrooms No Bathrooms Reception Rooms Garden Size Date On Market
100% (1)
Property Portfolio: Postcode Type Location No Bedrooms No Bathrooms Reception Rooms Garden Size Date On Market
8 pages
Report Builder
No ratings yet
Report Builder
129 pages
Dhruv 1121
No ratings yet
Dhruv 1121
24 pages
Sanyam Data Science
No ratings yet
Sanyam Data Science
33 pages
Data Journalism Heist
100% (1)
Data Journalism Heist
43 pages
PBI Desktop Fundamentals Training Session 1
No ratings yet
PBI Desktop Fundamentals Training Session 1
70 pages
Practical File Sai Lalit
No ratings yet
Practical File Sai Lalit
32 pages
XII IP Practical File
No ratings yet
XII IP Practical File
52 pages
Lab Report 565
No ratings yet
Lab Report 565
18 pages
2025 It Sba Problem Statemnt
No ratings yet
2025 It Sba Problem Statemnt
7 pages
Aanik Info Practical 3261
No ratings yet
Aanik Info Practical 3261
61 pages
Fdsa Record Ai&Ds
No ratings yet
Fdsa Record Ai&Ds
26 pages
Ipclass 12
No ratings yet
Ipclass 12
21 pages
Python Codes
No ratings yet
Python Codes
28 pages
Foundation of Data Science Lab Manual Full
No ratings yet
Foundation of Data Science Lab Manual Full
8 pages
Pandasmatplotlib Practical File
No ratings yet
Pandasmatplotlib Practical File
15 pages
BDA Lab Manual UPDATED
No ratings yet
BDA Lab Manual UPDATED
45 pages
Iprf
No ratings yet
Iprf
78 pages
Ip Practical (2) (Autosaved)
No ratings yet
Ip Practical (2) (Autosaved)
21 pages
Informatics Practices Record Class 12
No ratings yet
Informatics Practices Record Class 12
60 pages
Xii Ip Practical File 24-25
No ratings yet
Xii Ip Practical File 24-25
111 pages
Informatics Practicals 12th (Personal)
No ratings yet
Informatics Practicals 12th (Personal)
89 pages
1001 Microsoft Excel Shortcuts by FinPolNomics ANALYTICA
No ratings yet
1001 Microsoft Excel Shortcuts by FinPolNomics ANALYTICA
22 pages
National Public School: Name-Karan Choudhary Class-XII Subject - Informatics Practices (065) Board Roll No.
No ratings yet
National Public School: Name-Karan Choudhary Class-XII Subject - Informatics Practices (065) Board Roll No.
24 pages
Oddstudents
No ratings yet
Oddstudents
35 pages
List of Programs For Informatics - XII - IP
No ratings yet
List of Programs For Informatics - XII - IP
26 pages
Eportfolio Activity 2 - UU100 - SII-2020-03
No ratings yet
Eportfolio Activity 2 - UU100 - SII-2020-03
5 pages
Excel Tables, Formulas & Pivot Tables: What You Learn
No ratings yet
Excel Tables, Formulas & Pivot Tables: What You Learn
10 pages
Xii - Ip - Holiday HW
No ratings yet
Xii - Ip - Holiday HW
2 pages
DS - Lab Manual
No ratings yet
DS - Lab Manual
31 pages
Practical File Infomatics Practices 2024-25
No ratings yet
Practical File Infomatics Practices 2024-25
39 pages
Even Students
No ratings yet
Even Students
36 pages
PitchBook Due Diligence Guide
No ratings yet
PitchBook Due Diligence Guide
23 pages
IP Grade 12 Record
No ratings yet
IP Grade 12 Record
12 pages
Mohd Adnan File Draft 2
No ratings yet
Mohd Adnan File Draft 2
37 pages
Bca212 Ids 2023
No ratings yet
Bca212 Ids 2023
3 pages
IP Practical File Project
No ratings yet
IP Practical File Project
60 pages
Ip Project Work 2
No ratings yet
Ip Project Work 2
52 pages
DAX Zero To Hero
No ratings yet
DAX Zero To Hero
113 pages
12 Ip HW
No ratings yet
12 Ip HW
10 pages
Ip Practical File Final
No ratings yet
Ip Practical File Final
50 pages
Practical (Data Science)
No ratings yet
Practical (Data Science)
13 pages
IP Record Final-1
No ratings yet
IP Record Final-1
34 pages
Document 1
No ratings yet
Document 1
16 pages
IP Record Python 23-24 Aryan
No ratings yet
IP Record Python 23-24 Aryan
42 pages
Power Bi Interview Question AND ANSWER
88% (8)
Power Bi Interview Question AND ANSWER
36 pages
Questionnaire
0% (1)
Questionnaire
135 pages
Pandas Practicals - Term-1
100% (1)
Pandas Practicals - Term-1
18 pages
Class 12 IP File 23 24
No ratings yet
Class 12 IP File 23 24
27 pages
Ankit Python
No ratings yet
Ankit Python
26 pages
12 IP Practical
No ratings yet
12 IP Practical
14 pages
Index
No ratings yet
Index
4 pages
DAELab Cycle-23-24 - 240703 - 171843
No ratings yet
DAELab Cycle-23-24 - 240703 - 171843
9 pages
IP Practic MINE
No ratings yet
IP Practic MINE
30 pages
Model Practical Examination 2024-25 Python Pandas QP
No ratings yet
Model Practical Examination 2024-25 Python Pandas QP
3 pages
National Public School: Name-Mohit Kumar Class-XII Subject - Informatics Practices (065) Board Roll No.
No ratings yet
National Public School: Name-Mohit Kumar Class-XII Subject - Informatics Practices (065) Board Roll No.
35 pages
National Public School: Name-Karan Choudhary Class-XII Subject - Informatics Practices (065) Board Roll No.
No ratings yet
National Public School: Name-Karan Choudhary Class-XII Subject - Informatics Practices (065) Board Roll No.
35 pages
Course
No ratings yet
Course
12 pages
Practical File IP
No ratings yet
Practical File IP
27 pages
Database Analytics
No ratings yet
Database Analytics
29 pages
Data Science
No ratings yet
Data Science
18 pages
Class 12 Practical File Informatics Practices
No ratings yet
Class 12 Practical File Informatics Practices
28 pages
12 IP Practical Exampl
No ratings yet
12 IP Practical Exampl
6 pages
Hydro Excel 116
No ratings yet
Hydro Excel 116
46 pages
FDS Record-1-4
No ratings yet
FDS Record-1-4
18 pages
239 Excel Shortcuts For Windows - My Online Training Hub
No ratings yet
239 Excel Shortcuts For Windows - My Online Training Hub
1 page
MS Excel PivotTable Deleted Items Remain - Excel and Access
No ratings yet
MS Excel PivotTable Deleted Items Remain - Excel and Access
1 page
DWBI Venky Final Print
No ratings yet
DWBI Venky Final Print
39 pages
12 Ip Practical List With Solution Complete
No ratings yet
12 Ip Practical List With Solution Complete
5 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
Journal 12
No ratings yet
Journal 12
54 pages
Practical List Questions-1
No ratings yet
Practical List Questions-1
6 pages
Project Management Dashboard Non 365
No ratings yet
Project Management Dashboard Non 365
24 pages
Class 12 IP - Program List - Term1
No ratings yet
Class 12 IP - Program List - Term1
2 pages
7 - Data Analysis and Presentation
No ratings yet
7 - Data Analysis and Presentation
27 pages
Practical File Question 28.09.2022
No ratings yet
Practical File Question 28.09.2022
15 pages
Excel Lab Manual-2
No ratings yet
Excel Lab Manual-2
62 pages
Revit Warning Guide
No ratings yet
Revit Warning Guide
19 pages
XII - Informatics Practices (LAB MANUAL)
100% (1)
XII - Informatics Practices (LAB MANUAL)
42 pages
Ip Practical File
No ratings yet
Ip Practical File
20 pages
Practical File Part 1
No ratings yet
Practical File Part 1
17 pages
IP Practical 2023-24 (1 To 34)
100% (1)
IP Practical 2023-24 (1 To 34)
32 pages
C Language Programming Codes
From Everand
C Language Programming Codes
Durgesh
No ratings yet