0% found this document useful (0 votes)

9 views

c

The document outlines a laboratory session for a Data Science course using Python and Pandas, detailing pre-lab and post-lab tasks. It includes explanations of key concepts such as DataFrames, the concat() function, and methods for filtering and adding columns. The lab tasks provide practical coding examples for creating DataFrames, concatenating them, and applying conditions to filter data.

Uploaded by

nagachaitanyaprathipati97

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

c

Uploaded by

nagachaitanyaprathipati97

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

REGD. NO.

238W1A5449 DATA SCIENCE USING PYTHON LABORATORY-23AI&DS4354 ACADEMIC YEAR: 2024-

2025

Lab Session 06: Perform following operations using pandas

Date of the Session: 17/02/2025 Time of the Session:10:20AM to 1:00PM

Pre-Lab Task: Write answers before entering into lab.

Writing space for pre task :( For Student’s use only)
1. What is a DataFrame in Pandas, and how is it different from a NumPy array?
A. A Pandas DataFrame (DF) is a 2D, heterogeneous data structure with labeled axes. It's different from
a NumPy array in that:
- DF has labeled rows and columns
- DF can handle missing data and different data types per column
- DF has advanced data manipulation and analysis capabilities
NumPy arrays are ideal for numerical computations, while Pandas DataFrames are designed for data
manipulation, analysis, and visualization.

2. How can we create a DataFrame in Pandas using a dictionary?

A. DataFrame in Pandas using a dictionary by passing the dictionary to the pd.DataFrame()
constructor. Example:
import pandas as pd
data = {'Name': ['John', 'Anna', 'Peter', 'Linda'],
'Age': [28, 24, 35, 32],
'Country': ['USA', 'UK', 'Australia', 'Germany']}
df = pd.DataFrame(data)
print(df)
OUTPUT:

3. What is the purpose of the concat() function in Pandas?

A. The concat() function in Pandas is used to concatenate two or more DataFrames, Series, or panels
along a particular axis. This allows you to:
- Combine data from different sources into a single DataFrame
- Merge data with different structures or indices
- Create a new DataFrame by stacking or joining existing ones
The concat() function can concatenate along either the rows (axis=0) or columns (axis=1)
of the DataFrames.

4. How can we filter rows in a DataFrame based on a condition?

A. You can filter rows in a DataFrame based on a condition using the following methods:
-Boolean Indexing: df[df['column_name'] > value]
-Query Function: df.query('column_name > value')
-Loc Function: df.loc[df['column_name'] > value]
These methods allow you to select rows where the condition is true.

LAB No.6 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

REGD. NO. 238W1A5449 DATA SCIENCE USING PYTHON LABORATORY-23AI&DS4354 ACADEMIC YEAR: 2024-
2025

5. Why might we need to add a new column to a DataFrame, and how can we do it in Pandas?
A. We might need to add a new column to a DataFrame to:
- Perform calculations based on existing columns
- Add new data from an external source
- Transform existing data into a new format
- Create a new feature for data analysis or modeling
To add a new column in Pandas, you can use the following methods:
1. Assign a new column: df['new_column'] = values
2. Use the assign function: df.assign(new_column=values)
3. Use the insert function: df.insert(loc, 'new_column', values)

In Lab Task:
1. Creating dataframe.
SOURCE CODE:
import pandas as pd
data = {'Name': ['John', 'Anna', 'Peter', 'Linda'],
'Age': [28, 24, 35, 32],
'Country': ['USA', 'UK', 'Australia', 'Germany']}
df = pd.DataFrame(data)
print(df)
OUTPUT:

2. concat()
SOURCE CODE:
import pandas as pd
df1 = pd.DataFrame({'Name': ['John', 'Anna'],
'Age': [28, 24],
'Country': ['USA', 'UK']})
df2 = pd.DataFrame({'Name': ['Peter', 'Linda'],
'Age': [35, 32],
'Country': ['Australia',
'Germany']}) df_concat = pd.concat([df1, df2])
print(df_concat)
OUTPUT:

3. Setting conditions
SOURCE CODE:
import pandas as pd
data = {'Name': ['John', 'Anna', 'Peter', 'Linda'],
'Age': [28, 24, 35, 32],
'Country': ['USA', 'UK', 'Australia', 'Germany']}
df = pd.DataFrame(data)
filtered_df = df[(df['Age'] > 25) & (df['Country'] != 'Australia')]

LAB No.6 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

REGD. NO. 238W1A5449 DATA SCIENCE USING PYTHON LABORATORY-23AI&DS4354 ACADEMIC YEAR: 2024-
2025

print(filtered_df)
OUTPUT:

4. Adding a new column

SOURCE CODE:
import pandas as pd
data = {'Name': ['John', 'Anna', 'Peter', 'Linda'],
'Score': [85, 90, 78, 92]}
df = pd.DataFrame(data)
df['Grade'] = df['Score'].apply(lambda x: 'A' if x >= 90 else 'B' if x >= 80 else 'C' if x >= 70 else 'D' if x
>= 60 else 'F')
print(df)
OUTPUT:

Post Lab Task:

1. Write a Python code snippet to create a Pandas DataFrame with at least three columns and five rows
SOURCE CODE:
import pandas as pd
# Create a sample DataFrame
data = {'Name': ['Alice', 'Bob', 'Charlie', 'David', 'Eve'],
'Age': [25, 30, 22, 28, 24],
'City': ['New York', 'London', 'Paris', 'Tokyo',
'Sydney']} df = pd.DataFrame(data)
print(df)
OUTPUT:

2. Given two DataFrames, df1 and df2, how would you concatenate them vertically and horizontally?
SOURCE CODE:
import pandas as pd
# Create a sample DataFrame
data = {'Name': ['Alice', 'Bob', 'Charlie', 'David', 'Eve'],
'Age': [25, 30, 22, 28, 24],
'City': ['New York', 'London', 'Paris', 'Tokyo',
'Sydney']} df = pd.DataFrame(data)
print(df)
# Concatenate DataFrames vertically (row-wise)
df_vertical = pd.concat([df1, df2], ignore_index=True)
print("\nVertical Concatenation:\n", df_vertical)

LAB No.6 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

REGD. NO. 238W1A5449 DATA SCIENCE USING PYTHON LABORATORY-23AI&DS4354 ACADEMIC YEAR: 2024-
2025

# Concatenate DataFrames horizontally (column-wise)

df_horizontal = pd.concat([df1, df2], axis=1) print("\
nHorizontal Concatenation:\n", df_horizontal) OUTPUT:

3. How would you filter out rows where the values in the “Age” column are greater than 25?
SOURCE CODE:
import pandas as pd
# Create a sample DataFrame
data = {'Name': ['Alice', 'Bob', 'Charlie', 'David', 'Eve'],
'Age': [25, 30, 22, 28, 24],
'City': ['New York', 'London', 'Paris', 'Tokyo',
'Sydney']} df = pd.DataFrame(data)
print(df)
# Filter rows where Age > 25
filtered_df = df[df['Age'] > 25]
print("\nFiltered DataFrame (Age > 25):\n", filtered_df)
OUTPUT:

4. If you have a DataFrame containing employee names and salaries, how would you add a new
column for a "Bonus" (10% of salary)?
SOURCE CODE:
import pandas as pd
# Create a sample DataFrame
data = {'Name': ['Alice', 'Bob', 'Charlie', 'David', 'Eve'],
'Age': [25, 30, 22, 28, 24],

LAB No.6 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

REGD. NO. 238W1A5449 DATA SCIENCE USING PYTHON LABORATORY-23AI&DS4354 ACADEMIC YEAR: 2024-
2025

'City': ['New York', 'London', 'Paris', 'Tokyo',

'Sydney']} df = pd.DataFrame(data)
df['Salary']= [60000, 75000, 50000,45000,55000]
df['Bonus'] = df['Salary'] * 0.10 print("\
nDataFrame with Bonus:\n", df) OUTPUT:

5. Explain a real-world scenario where using Pandas operations like concatenation and filtering
conditions would be beneficial.
A. Customer Data Analysis for Marketing Campaigns
Suppose you're a marketing analyst at an online retail company, and you need to analyze customer data
to create targeted marketing campaigns.
You have three datasets:
1. Customer Information: Contains customer demographics, such as name, email, age, and location.
2. Purchase History: Contains customer purchase history, including product IDs, purchase dates,
and amounts.
3. Product Catalog: Contains product information, including product IDs, names, categories, and
prices. You need to:
1. Combine the customer information and purchase history datasets.
2. Filter out customers who haven't made a purchase in the last 6 months.
3. Identify customers who have purchased products from specific categories (e.g., electronics, clothing).
4. Create targeted marketing campaigns based on customer demographics and purchase
behavior. By using Pandas operations like concatenation and filtering conditions, you can:
- Efficiently analyze large datasets
- Identify specific customer segments
- Create targeted marketing campaigns
- Improve customer engagement and sales

Students Signature
(For Evaluator’s use only)
Comment of the Evaluator (if Any) Evaluator’s Observation

Marks Secured: out of

Signature of the Evaluator with Date:

LAB No.6 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

Python Cheat Sheet 2.0
100% (1)
Python Cheat Sheet 2.0
10 pages
Python Cheat Sheet: Pandas - Numpy - Sklearn Matplotlib - Seaborn BS4 - Selenium - Scrapy
100% (4)
Python Cheat Sheet: Pandas - Numpy - Sklearn Matplotlib - Seaborn BS4 - Selenium - Scrapy
11 pages
Principal of Marketing
No ratings yet
Principal of Marketing
121 pages
Bba - A Study On Brand Loyalty Towards Bescal Steels
No ratings yet
Bba - A Study On Brand Loyalty Towards Bescal Steels
84 pages
Presentation Case Study Calyx & Corolla
100% (2)
Presentation Case Study Calyx & Corolla
10 pages
64[6]
No ratings yet
64[6]
5 pages
python interviews
No ratings yet
python interviews
154 pages
Working With Panda
No ratings yet
Working With Panda
13 pages
python 2.1.3 (2)
No ratings yet
python 2.1.3 (2)
6 pages
Pandas+With+Python+ +DATAhill+Solutions
No ratings yet
Pandas+With+Python+ +DATAhill+Solutions
24 pages
Learn Data Analysis With Pandas - Introduction
No ratings yet
Learn Data Analysis With Pandas - Introduction
2 pages
DS Practical
No ratings yet
DS Practical
30 pages
12 Pandas
100% (1)
12 Pandas
21 pages
Ip Practical File
No ratings yet
Ip Practical File
20 pages
Pandas
No ratings yet
Pandas
94 pages
Python Cheat Sheet For Excel Users
100% (2)
Python Cheat Sheet For Excel Users
5 pages
Lab Record IP
No ratings yet
Lab Record IP
13 pages
Data Analysis With Python
No ratings yet
Data Analysis With Python
60 pages
Data Frame Notes1
No ratings yet
Data Frame Notes1
7 pages
Data Frame Demo
No ratings yet
Data Frame Demo
73 pages
a5
No ratings yet
a5
28 pages
Informatics Practices Practical File
No ratings yet
Informatics Practices Practical File
8 pages
Python Notes by Prof T
No ratings yet
Python Notes by Prof T
10 pages
1
No ratings yet
1
12 pages
Chapter 2 Python Pandas - II
No ratings yet
Chapter 2 Python Pandas - II
19 pages
Python CheatSheet
No ratings yet
Python CheatSheet
2 pages
Data Wrangling and Analysis
100% (1)
Data Wrangling and Analysis
36 pages
Rapids Cheatsheet
100% (1)
Rapids Cheatsheet
2 pages
99c949c0-5910-425f-9ac5-155882800fa5
No ratings yet
99c949c0-5910-425f-9ac5-155882800fa5
36 pages
List of Practical Ip065 Xii Session 2025 Ckc Academy
No ratings yet
List of Practical Ip065 Xii Session 2025 Ckc Academy
19 pages
LIST OF PRACTICAL IP065 XII SESSION 2025 CKC ACADEMY
No ratings yet
LIST OF PRACTICAL IP065 XII SESSION 2025 CKC ACADEMY
19 pages
Pandas
No ratings yet
Pandas
5 pages
python 2.1.2 (2)
No ratings yet
python 2.1.2 (2)
7 pages
Screenshot 2023-12-27 at 7.05.37 PM
No ratings yet
Screenshot 2023-12-27 at 7.05.37 PM
23 pages
Content Pandas Cheat Sheet
No ratings yet
Content Pandas Cheat Sheet
9 pages
Data Analysis With Pandas - Introduction To Pandas Cheatsheet - Codecademy PDF
No ratings yet
Data Analysis With Pandas - Introduction To Pandas Cheatsheet - Codecademy PDF
3 pages
Data Analysis With Pandas - Introduction To Pandas Cheatsheet - Codecademy PDF
No ratings yet
Data Analysis With Pandas - Introduction To Pandas Cheatsheet - Codecademy PDF
3 pages
Data Analysis With Pandas - Introduction To Pandas Cheatsheet - Codecademy PDF
100% (1)
Data Analysis With Pandas - Introduction To Pandas Cheatsheet - Codecademy PDF
3 pages
Practical File IP
No ratings yet
Practical File IP
27 pages
Acknowledgement
No ratings yet
Acknowledgement
25 pages
I.p file
No ratings yet
I.p file
20 pages
Chapter-2 Python Pandas
100% (2)
Chapter-2 Python Pandas
33 pages
practical file class xii
No ratings yet
practical file class xii
25 pages
EXP-3
No ratings yet
EXP-3
10 pages
PDF&Rendition=1
No ratings yet
PDF&Rendition=1
47 pages
Python For Data Science Cheat Sheet 2.0
No ratings yet
Python For Data Science Cheat Sheet 2.0
11 pages
Pandas Cheat Sheet Final
No ratings yet
Pandas Cheat Sheet Final
1 page
EDA with Pandas
No ratings yet
EDA with Pandas
8 pages
Python Cheat Sheet For Excel Users
No ratings yet
Python Cheat Sheet For Excel Users
5 pages
Classwork For GGIS XII 2024-25
No ratings yet
Classwork For GGIS XII 2024-25
1 page
Python - Pandas Merging, Joining, and Concatenating
No ratings yet
Python - Pandas Merging, Joining, and Concatenating
1 page
Class 12 Practical File Informatics Practices (1)
No ratings yet
Class 12 Practical File Informatics Practices (1)
22 pages
Pandas - Digitalocean
No ratings yet
Pandas - Digitalocean
15 pages
pandas_merged
No ratings yet
pandas_merged
2 pages
Pandas
No ratings yet
Pandas
13 pages
HHHH
No ratings yet
HHHH
22 pages
Python & MySQL for Data Analysis
No ratings yet
Python & MySQL for Data Analysis
45 pages
Chapter Notes - Data Handling Using Pandas DataFrame
No ratings yet
Chapter Notes - Data Handling Using Pandas DataFrame
16 pages
Pandas Cheat Sheet
100% (1)
Pandas Cheat Sheet
2 pages
Exercise 7 - Pandas
No ratings yet
Exercise 7 - Pandas
2 pages
dataframing_in_csv
No ratings yet
dataframing_in_csv
14 pages
Mastering Pandas in Python: Course Book
From Everand
Mastering Pandas in Python: Course Book
Pedro Martins
No ratings yet
Learning Oracle 12c: A PL/SQL Approach
From Everand
Learning Oracle 12c: A PL/SQL Approach
Prof. Sham Tickoo
No ratings yet
Brison cv15 0815rev2
No ratings yet
Brison cv15 0815rev2
16 pages
Report of Miu Miu PDF Market Segmentation Brand
No ratings yet
Report of Miu Miu PDF Market Segmentation Brand
4 pages
Customer Relationship Management
No ratings yet
Customer Relationship Management
17 pages
Cat Fight in Pet Food Industry Case Write Up 61920771 PDF
No ratings yet
Cat Fight in Pet Food Industry Case Write Up 61920771 PDF
2 pages
01 Activity 1
No ratings yet
01 Activity 1
1 page
Marketing Practices of Fertilizer Produc
No ratings yet
Marketing Practices of Fertilizer Produc
7 pages
Last Mile 5category Layouts
No ratings yet
Last Mile 5category Layouts
24 pages
Garnier MPR
No ratings yet
Garnier MPR
78 pages
Customer Behaviour
100% (3)
Customer Behaviour
11 pages
Final Chap 7 Planning Sales Call
No ratings yet
Final Chap 7 Planning Sales Call
15 pages
1.formulation of Competitive Strategies
No ratings yet
1.formulation of Competitive Strategies
103 pages
Buss 1
No ratings yet
Buss 1
7 pages
Brand Impact Assessment Worksheet
No ratings yet
Brand Impact Assessment Worksheet
7 pages
Milestone One
100% (1)
Milestone One
10 pages
Safeguard Marketing..taimoor TK
100% (11)
Safeguard Marketing..taimoor TK
25 pages
Feasib J Coffee Pods Revised Final Paper
No ratings yet
Feasib J Coffee Pods Revised Final Paper
106 pages
E - Commerce in India
No ratings yet
E - Commerce in India
10 pages
3.Digital Marketing Notes
No ratings yet
3.Digital Marketing Notes
23 pages
MM576 Group 1 TWG TEA PDF
No ratings yet
MM576 Group 1 TWG TEA PDF
28 pages
Kotler 1964
No ratings yet
Kotler 1964
7 pages
Case Study Edited
No ratings yet
Case Study Edited
14 pages
Economics 21.11.2021
No ratings yet
Economics 21.11.2021
48 pages
Efe, Ife, CPM, QSPM Matrix
100% (1)
Efe, Ife, CPM, QSPM Matrix
6 pages
BB3003 Xnur Dmo Week 1 Juni 2024
No ratings yet
BB3003 Xnur Dmo Week 1 Juni 2024
23 pages
Aida Model Notes
No ratings yet
Aida Model Notes
3 pages
Inventory Concepts in SCM
No ratings yet
Inventory Concepts in SCM
45 pages
MBA Project Advertising Effectiveness
No ratings yet
MBA Project Advertising Effectiveness
17 pages

c

Uploaded by

c

Uploaded by

REGD. NO.

238W1A5449 DATA SCIENCE USING PYTHON LABORATORY-23AI&DS4354 ACADEMIC YEAR: 2024-

Lab Session 06: Perform following operations using pandas

Date of the Session: 17/02/2025 Time of the Session:10:20AM to 1:00PM

Pre-Lab Task: Write answers before entering into lab.

2. How can we create a DataFrame in Pandas using a dictionary?

3. What is the purpose of the concat() function in Pandas?

4. How can we filter rows in a DataFrame based on a condition?

LAB No.6 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

LAB No.6 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

4. Adding a new column

Post Lab Task:

LAB No.6 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

# Concatenate DataFrames horizontally (column-wise)

LAB No.6 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

'City': ['New York', 'London', 'Paris', 'Tokyo',

Marks Secured: out of

Signature of the Evaluator with Date:

LAB No.6 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

You might also like