0% found this document useful (0 votes)

7 views3 pages

Batch 1

The document outlines three distinct analysis tasks using Python: Sales Analysis, Employee Performance Analysis, and Housing Market Analysis. Each task includes a code snippet for generating a dataset and a series of analytical steps to perform, such as calculating averages, creating plots, and identifying trends. The tasks focus on different domains, including sales revenue, employee performance scores, and housing prices.

Uploaded by

Ankit Sharma

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views3 pages

Batch 1

Uploaded by

Ankit Sharma

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Question 1: Sales Analysis

Problem Statement:

1. Use the following Python code snippet to generate the dataset:

1 import pandas as pd
2 import numpy as np
3
4 np . random . seed (42)
5 dates = pd . date_range ( start = " 2023 -01 -01 " , end = " 2023 -12 -31
" , freq = " D " )
6 categories = [ " Electronics " , " Clothing " , " Furniture " ]
7 data = {
8 " Date " : np . tile ( dates , 1) ,
9 " Product_Category " : np . random . choice ( categories , len (
dates ) ) ,
10 " Revenue " : np . random . randint (500 , 5000 , len ( dates ) ) ,
11 " Discount " : np . random . uniform (0.05 , 0.25 , len ( dates ) )
,
12 " Units_Sold " : np . random . randint (1 , 20 , len ( dates ) ) ,
13 }
14 sales_data = pd . DataFrame ( data )
15 sales_data . to_csv ( " sales_data . csv " , index = False )
16 print ( sales_data . head () )

Listing 1: Sales Data Generation

2. Perform the following analysis:

(a) Calculate the average revenue per product category.

(b) Identify the best-performing product category in terms of
total revenue.
(c) Generate a time series plot of revenue over the year for each
product category.
(d) Create a scatter plot of Units Sold vs. Revenue, colored by
Product Category.
(e) Generate a heatmap showing the correlation between Revenue,
Units Sold, and Discount.

1
Question 2: Employee Performance Analysis
Problem Statement:

1. Use the following Python code snippet to generate the dataset:

1 import pandas as pd
2 import numpy as np
3
4 np . random . seed (42)
5 departments = [ " HR " , " Finance " , " Marketing " , " IT " ]
6 data = {
7 " Employee_ID " : np . arange (1 , 101) ,
8 " Department " : np . random . choice ( departments , 100) ,
9 " Y e a rs _ o f_ E x pe r i en c e " : np . random . randint (1 , 25 , 100) ,
10 " Salary " : np . random . randint (50000 , 200000 , 100) ,
11 " Perfo rmance _Score " : np . random . uniform (50 , 100 , 100) ,
12 }
13 employee_data = pd . DataFrame ( data )
14 employee_data . to_csv ( " employee_data . csv " , index = False )
15 print ( employee_data . head () )

Listing 2: Employee Performance Data Generation

2. Perform the following analysis:

(a) Calculate the average salary per department.

(b) Identify employees whose performance score is above 90 and
group them by department.
(c) Create a bar plot showing the average salary per department.
(d) Generate a boxplot to visualize the distribution of Performance Score
for each department.
(e) Plot a scatter plot of Years of Experience vs. Salary, and
color the points based on Performance Score ranges (e.g., 50–70,
70–90, 90+).

2
Question 3: Housing Market Analysis
Problem Statement:

1. Use the following Python code snippet to generate the dataset:

1 import pandas as pd
2 import numpy as np
3
4 np . random . seed (42)
5 locations = [ " Urban " , " Suburban " , " Rural " ]
6 data = {
7 " House_ID " : np . arange (1 , 201) ,
8 " Price " : np . random . randint (50000 , 500000 , 200) ,
9 " Bedrooms " : np . random . randint (1 , 6 , 200) ,
10 " Square_Feet " : np . random . randint (500 , 4000 , 200) ,
11 " Location " : np . random . choice ( locations , 200) ,
12 }
13 housing_data = pd . DataFrame ( data )
14 housing_data . to_csv ( " housing_data . csv " , index = False )
15 print ( housing_data . head () )

Listing 3: Housing Market Data Generation

2. Perform the following analysis:

(a) Identify and visualize outliers in Price using the IQR method.
(b) Calculate the average price for each Location.
(c) Generate a scatter plot of Price vs. Square Feet, colored by
Bedrooms.
(d) Create a correlation matrix for Price, Bedrooms, and Square Feet.
Visualize it using a heatmap.
(e) Generate a boxplot of Price by Location.

PySpark Slides
No ratings yet
PySpark Slides
30 pages
Linear Regression Assignment
0% (2)
Linear Regression Assignment
8 pages
Data Analysis and Data Science Task - 2
No ratings yet
Data Analysis and Data Science Task - 2
3 pages
Chemguide PDF
100% (2)
Chemguide PDF
2,267 pages
Data Analytics Fundamentals-2
No ratings yet
Data Analytics Fundamentals-2
34 pages
Kushal Kadayat
No ratings yet
Kushal Kadayat
33 pages
Module 2notes
No ratings yet
Module 2notes
44 pages
Assignment
No ratings yet
Assignment
12 pages
Khadeeja - DS - PRACTICAL 4
No ratings yet
Khadeeja - DS - PRACTICAL 4
24 pages
Data Mining Journal 1 Kashan
No ratings yet
Data Mining Journal 1 Kashan
13 pages
Supermarket Sales Analysis Project
No ratings yet
Supermarket Sales Analysis Project
8 pages
DWM Lab Workbook Sample
No ratings yet
DWM Lab Workbook Sample
10 pages
PRT 2 Q's
No ratings yet
PRT 2 Q's
7 pages
ELC Assignment
No ratings yet
ELC Assignment
4 pages
Assingment 1
No ratings yet
Assingment 1
2 pages
List of Practicals Python 2024 - 25
No ratings yet
List of Practicals Python 2024 - 25
13 pages
Exp 8 - LM
No ratings yet
Exp 8 - LM
10 pages
Data Preprocessing & Visualization1
No ratings yet
Data Preprocessing & Visualization1
2 pages
Salaries For San Francisco Employee - ML - FA - DA Projects
No ratings yet
Salaries For San Francisco Employee - ML - FA - DA Projects
33 pages
Python - Pandas - Numpy Interview Q&A
No ratings yet
Python - Pandas - Numpy Interview Q&A
12 pages
BIDA Practical Print
No ratings yet
BIDA Practical Print
56 pages
Act 6
No ratings yet
Act 6
1 page
Pandas Prac
No ratings yet
Pandas Prac
4 pages
Geo Python Doc (1) 7,8 Bavesh
No ratings yet
Geo Python Doc (1) 7,8 Bavesh
9 pages
Prac 1
No ratings yet
Prac 1
5 pages
Assignment 7
No ratings yet
Assignment 7
2 pages
ML File
No ratings yet
ML File
6 pages
Programming Notes 3
No ratings yet
Programming Notes 3
3 pages
Capstone Project Assignment
No ratings yet
Capstone Project Assignment
3 pages
ML Lab Manual
No ratings yet
ML Lab Manual
36 pages
Practical File Questions
No ratings yet
Practical File Questions
2 pages
4BUIS014W Business Computing-Portfolio
No ratings yet
4BUIS014W Business Computing-Portfolio
7 pages
Index
No ratings yet
Index
4 pages
IP Practical File
No ratings yet
IP Practical File
23 pages
Practical File Class 12 2025-26
No ratings yet
Practical File Class 12 2025-26
19 pages
Edap Lab
No ratings yet
Edap Lab
47 pages
DK Phase2
No ratings yet
DK Phase2
5 pages
ML 1-11
No ratings yet
ML 1-11
27 pages
DAP Lab Manual
No ratings yet
DAP Lab Manual
20 pages
Data Analysis
No ratings yet
Data Analysis
4 pages
L6 and 7-Data Preprocessing-Coding
No ratings yet
L6 and 7-Data Preprocessing-Coding
34 pages
DAP Writeups - Merged
No ratings yet
DAP Writeups - Merged
33 pages
DS Question Bank Unit-1 Part-2
No ratings yet
DS Question Bank Unit-1 Part-2
3 pages
IS5312 Mini Project-2
No ratings yet
IS5312 Mini Project-2
5 pages
IP Practicals Filed
No ratings yet
IP Practicals Filed
5 pages
Python Practice Questions
No ratings yet
Python Practice Questions
5 pages
Data Science Sample
No ratings yet
Data Science Sample
5 pages
Three-Dimensional Analysis of Train-Rail-Bridge Interaction Problems
No ratings yet
Three-Dimensional Analysis of Train-Rail-Bridge Interaction Problems
37 pages
EDA Report Week2
No ratings yet
EDA Report Week2
15 pages
DSBDA Manual
No ratings yet
DSBDA Manual
76 pages
20dcs009 Internal 1 Presentation
No ratings yet
20dcs009 Internal 1 Presentation
13 pages
UNIT 5 Scenario
No ratings yet
UNIT 5 Scenario
5 pages
Supermarket Sales Data Analysis
No ratings yet
Supermarket Sales Data Analysis
6 pages
PMOS NMOS Equations and Examples
100% (1)
PMOS NMOS Equations and Examples
3 pages
Data Science
No ratings yet
Data Science
18 pages
ESE Ques Pattern
No ratings yet
ESE Ques Pattern
3 pages
Prac 1
No ratings yet
Prac 1
5 pages
Data Exploration Preparation
No ratings yet
Data Exploration Preparation
12 pages
Some Exercises
No ratings yet
Some Exercises
9 pages
Practice Questions2
No ratings yet
Practice Questions2
2 pages
Module 2
No ratings yet
Module 2
20 pages
Irc 097-1987
No ratings yet
Irc 097-1987
10 pages
The Six Days of Genesis
94% (18)
The Six Days of Genesis
125 pages
Base Plate Calculation
100% (1)
Base Plate Calculation
8 pages
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
DLL - SCIENCE 5-3rd Quarter Week 1-9
100% (6)
DLL - SCIENCE 5-3rd Quarter Week 1-9
42 pages
Ip Worksheet 3 - Q'S
No ratings yet
Ip Worksheet 3 - Q'S
6 pages
Lecture 3 EARTH WORKS AND MASS HAUL DIAGRAM
100% (2)
Lecture 3 EARTH WORKS AND MASS HAUL DIAGRAM
18 pages
Micro
No ratings yet
Micro
17 pages
Math8 - q1 - w4 - d1 - Adding and Subtracting Rational Algebraic Expression - M8AL Ia B 1 - v1
No ratings yet
Math8 - q1 - w4 - d1 - Adding and Subtracting Rational Algebraic Expression - M8AL Ia B 1 - v1
4 pages
Synthesis and Characterization of ZnCo2O4 Nanomaterial For Symmetric Supercapacitor Applications
100% (8)
Synthesis and Characterization of ZnCo2O4 Nanomaterial For Symmetric Supercapacitor Applications
4 pages
Parallel Postulates Revised
No ratings yet
Parallel Postulates Revised
73 pages
Brochure IIT Mandi 2023
No ratings yet
Brochure IIT Mandi 2023
19 pages
AT Commands Manual: UMTS/HSPA Module Series
No ratings yet
AT Commands Manual: UMTS/HSPA Module Series
130 pages
Aldehydes and Ketones-02 Solved Problems
No ratings yet
Aldehydes and Ketones-02 Solved Problems
13 pages
Class 12 Maths Project Helpful
No ratings yet
Class 12 Maths Project Helpful
23 pages
NLP Transformer-Based Models Used For Sentiment Analysis
No ratings yet
NLP Transformer-Based Models Used For Sentiment Analysis
45 pages
GPS Guidebook
No ratings yet
GPS Guidebook
66 pages
GSM Network: S.H.Jamali
No ratings yet
GSM Network: S.H.Jamali
42 pages
DeepMicrobes Taxonomic Classification For Metagenomics Using Deep Learning
No ratings yet
DeepMicrobes Taxonomic Classification For Metagenomics Using Deep Learning
13 pages
Sistem Pendukung Keputusan Penilaian Kinerja Pegawai Dengan Metode Multi
No ratings yet
Sistem Pendukung Keputusan Penilaian Kinerja Pegawai Dengan Metode Multi
58 pages
9-Simple Distillation (P)
No ratings yet
9-Simple Distillation (P)
3 pages
Mathematics Lec4 Vector Space-1-36
No ratings yet
Mathematics Lec4 Vector Space-1-36
36 pages
CORP AM P006 Preparation of Drawing Documents
No ratings yet
CORP AM P006 Preparation of Drawing Documents
36 pages
Adobe Scan 10-Nov-2022
No ratings yet
Adobe Scan 10-Nov-2022
25 pages
PPR 3
No ratings yet
PPR 3
12 pages
Tutorial 3
No ratings yet
Tutorial 3
2 pages
SotC Modding-Hacking Tutorial
No ratings yet
SotC Modding-Hacking Tutorial
10 pages
Batch2 MASAI MTH101 Re MidTerm Exam Solution
No ratings yet
Batch2 MASAI MTH101 Re MidTerm Exam Solution
6 pages
Matching. Graph
No ratings yet
Matching. Graph
13 pages
Knapsack Problem
No ratings yet
Knapsack Problem
18 pages
139 Part 3
No ratings yet
139 Part 3
16 pages
Table 1 - Normal Tolerances For Radial Bearings, Except Tapered Roller Bearings
No ratings yet
Table 1 - Normal Tolerances For Radial Bearings, Except Tapered Roller Bearings
3 pages
Batch 2
No ratings yet
Batch 2
3 pages
Ma211: Advanced Calculus Assignment 1: Semester 1, 2023 TOTAL: 70 Marks
No ratings yet
Ma211: Advanced Calculus Assignment 1: Semester 1, 2023 TOTAL: 70 Marks
2 pages
The Mobius Strip 18
No ratings yet
The Mobius Strip 18
2 pages
Experimental Study On Bond Slip Relationship of Steel Sleeve
No ratings yet
Experimental Study On Bond Slip Relationship of Steel Sleeve
5 pages
Tutorial 4
No ratings yet
Tutorial 4
3 pages
Ephysicsl Experiment 6 - Torque - Finalreport
No ratings yet
Ephysicsl Experiment 6 - Torque - Finalreport
4 pages
Problem Sheet-1
No ratings yet
Problem Sheet-1
2 pages
5 4 Pressure and Gases
No ratings yet
5 4 Pressure and Gases
1 page
Tutorial 3
No ratings yet
Tutorial 3
2 pages
Tutorial - 2
No ratings yet
Tutorial - 2
2 pages
DDPG
No ratings yet
DDPG
1 page

Batch 1

Uploaded by

Batch 1

Uploaded by

Question 1: Sales Analysis

1. Use the following Python code snippet to generate the dataset:

Listing 1: Sales Data Generation

2. Perform the following analysis:

(a) Calculate the average revenue per product category.

1. Use the following Python code snippet to generate the dataset:

Listing 2: Employee Performance Data Generation

2. Perform the following analysis:

(a) Calculate the average salary per department.

1. Use the following Python code snippet to generate the dataset:

Listing 3: Housing Market Data Generation

2. Perform the following analysis:

You might also like