0% found this document useful (0 votes)

17 views15 pages

SMDM Project Business Report - Group Assignment-Copy2

This document discusses a dataset related to a marketing campaign for an auto company. It provides technical details of the dataset, performs preliminary data cleaning and analysis of the variables, explores relationships between variables through univariate and bivariate analysis, and evaluates statements made by employees regarding preferences for vehicle types.

Uploaded by

Basavaraj Ky

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views15 pages

SMDM Project Business Report - Group Assignment-Copy2

Uploaded by

Basavaraj Ky

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 15

DSBA – JULY 2023

Submitted by:
Sahil Shreshtha
SMDM
PROJEC
Basavaraj K Y
Nikhil Upadyaya
Lalit Soundar Venkataraman
Mittal Shah

T
Ravi Kumar Ethiraju

Problem 1 : Austo Motor Company

BUSINES
Austo Motor Company is a leading car manufacturer specializing in
SUV, Sedan, and Hatchback models. In its recent board meeting, the
members raised concerns about the efficiency of the marketing S
campaign currently being used. The board decides to rope in
analytics professional to improve the existing campaign.
REPORT
1A. What is the important technical information about the dataset that a database administrator would be
interested in?
 We have a 1581 business and working professions data as rows with 14 features
(Including both independent and dependent variables)
 The data set has 1 float64 value, 5 integer values and 8 objects.

If we look at some basic information about the data out of the 14 variables there are 6 numerical and 8
categorical variables Also, there are a few null values in the Gender and Partner_salary variables.

1B. Take a critical look at the data and do a preliminary analysis of the variables. Do a quality check of the data
so that the variables are consistent. Are there any discrepancies present in the data? If yes, perform preliminary
treatment of data.

Preliminary data analysis

From the above table we found that there are null values in Gender and Partner_salary variables
In Gender there are 53 nulls and
Partner_salary there are 106 nulls
Now to fill in the missing data or the nulls -
For Gender we can use the majority of the 2 outputs to fill in the nulls
In this case the nulls are imputed with ‘Male’ since there are in majority
For Partner_salary,
We are using conditional imputation since there are other variables related to salary-
Salary + Partner_salary = Total_salary
The condition is that if the Partner_working is YES then the
Partner_salary = Total_salary–Salary
If the Partner_working is NO the Partner_salary = 0
There were two spelling error found in the column “Gender” – ‘Femal & Femle’
This spelling error for both the error has been corrected and replaced
Null value (Total – 53) is replaced with Mode of the ‘Gender’ column
Replaced NaN value with – Yes in ‘Partner_salary’ with below calculation

Replacing the missing values in Gender and Partner_salary. Now the dataset is consistent and free from null
values and the discrepancies are resolved. Hereby the data cleaning is done and the dataset is good to
proceed with data analysis

Numerical Description of Data

1C. Explore all the features of the data separately by using appropriate visualizations and draw insights that can
be utilized by the business.

Univariate Analysis
 Analyzing the car buyers on Age variable

Observation: Younger age group (Range 20- 30) tends to buy more cars as compared to the middle aged
(Range 31-45) and older age group (range from 46-55). Also there is fluctuation in buying pattern for the
age group between 35-40.

 Analyzing the car buyers on Price variable

Observation: The number of cars purchased is higher in the lower price range compared to the costlier
cars.
 Analyzing the salary distribution of the car buyers
Observation: The highest number of cars purchased occurs within the salary range of 50000 to 70000.

 Analyzing the car buyers from their partner salary

Observation: Most car buyers have partners whose salaries are below 10,000, and the next most
frequent salary range falls between 30,000 and 40,000.

 Analyzing the car buyers from their Total salary

Observation: The majority of car buyers have total income range of 60000 to 100000.
Also there is a decline in car buyers within the salary range of 100000 to 160000.

 Analyzing of cars for each unique make

Count plot

Observation: From the above graph, we can conclude that sedan type cars are buying more than the
hatchback and SUV cars.

 Analyzing the car buyers by Gender

Observation: From the above graph, we can say that the male car buyers are higher than the female
buyers.
 Number of car buyers based on their education level

Observation: Post graduates have shown more interest in purchasing cars compared to graduates.

 Number of car buyers by Marital status

Observation: Married males are more interested in buying cars more than female in any category.

 Number of car buyers by Profession and Gender

Observation: The Salaried profession in male category has more numbers in buying cars than
comparing with other categories.

 Analyzing the car buyers on number of dependents

Observation: We can see that the data of dependents is bimodal i.e. there are two modes in the number
of dependents data in the dataset i.e. 2 and 3.

 Average salary of car buyers based on their loan status

Observation: From the above graph, we can understand that almost 50% of car buyers having personal
loan, and the car buyers with House loan is quite lesser.

1D. Understanding the relationships among the variables in the dataset is crucial for every analytical project.
Perform analysis on the data fields to gain deeper insights. Comment on your understanding of the data.

For understanding the relationship between the variables, we need to do bivariate analysis, to better
understand the dataset.

 Relationship between the level of education and type of car

Observation: Post graduates have shown more interest in purchasing cars compared to graduates.

 Relationship between car buyers Profession and the car type

Observation: By above graph we can see that majority of population in the dataset prefer to have
Sedan, in both Business and Salaried class.
 Relationship between car buyers marital status and car type
Observation: By above graph we can see that bar of Make and Martial status, there is a higher
preference for Sedan overall.

 Relationship between working partner and type of car

Observation: By above graph we can see that in general, the preference for Sedan is on higher side,
whether the partner is working or not.

 Relationship between House loan and type of car

Observation: By above graph we can see that the proportion of customers availing House loan are more
than 50% prefer Sedan, followed by Hatchback and SUV. While, the proportion of customers not availing
house loan are more than 41% prefer Sedan, followed by Hatchback and SUV.
 Relationship between Salary and Type of car
Observation: By above graph we can see that average salary of the customers who prefer SUV is greater than
Sedan and Hatchback. Which indirectly also implies that SUV is a high range car.

 Relationship between Total Salary and type of car

Observation: By above graph we can see that average total salary of the customers who prefer SUV is greater than
Sedan and Hatchback. Which indirectly also implies that SUV is a high range car.

 Analysis using correlation and heatmap

Observation: From this data, Age has high positive correlation towards Salary, Total_salary and
Price of the vehicle.
o This means as age increases, the salary increases and also the price of the vehicle increases.
o Also, there is high correlation between the salary and total salary. Similarly, high correlation between
Partner salary and total salary, which is understandable. Between the rests of variables, either there is
very weak or negative correlation.

Multivariate analysis
Observation: From the above pair plot we can see that in most of the variables of the dataset, there is a weak or
no correlation. However, there is a correlation between the data points for variables Salary and Total Salary,
Total Salary and Age etc.

1E. Employees working on the existing marketing campaign have made the following remarks. Based on the data
and your analysis state whether you agree or disagree with their observations. Justify your answer Based on the
data available.

E1) Steve Roger says “Men prefer SUV by a large margin, compared to the women”.
Observation: From the above graph and table, we can see that the E1 statement i.e. Steve Roger saying “Men
prefer SUV by a large margin, compared to the women”, does not hold true.

E2) Ned Stark believes that a salaried person is more likely to buy a Sedan.

Observation: From the above graph we can conclude that the statement E2 holds true.
If we compare the preference of salaried class for the type of car preferred we see that the total salaried data
comparison to SUV, Sedan and Hatchback.
Hence, the probability of owning Sedan amongst the salaried class is high.
E3) Sheldon Cooper does not believe any of them; he claims that a salaried male is an easier target for a SUV
sale over a Sedan Sale.
Observation: From the above graph we can conclude that that the statement E3 doesn’t hold true. A salaried
male is an easier target for a Sedan sale than SUV sale.

Problem 2: A physiotherapist with a male football team is interested in studying the

relationship between foot injuries and the positions at which the players play from
the data collected
Striker Forward Attacking Midfielder Winger Total

Players
Injured 45 56 24 20 145

Players Not
Injured Hatchback 582 11 9 90

Total SUV 297 35 29 235

2.1 What is the probability that a randomly chosen player would suffer an injury?

The likelihood of a randomly selected player experiencing an injury is 61.7%.

2.2 What is the probability that a player is a forward or a winger?

The probability that a player is either a forward or a winger is approximately 0.523 or 52.3%.

2.3 What is the probability that a randomly chosen player plays in a striker position and has a foot injury?

The probability that a randomly chosen player plays in a striker position and has a foot injury is approximately
0.191 or 19.1%.

2.4 What is the probability that a randomly chosen injured player is a striker?

The probability that a randomly chosen injured player is a striker is approximately 0.310 or 31%.

2.5 What is the probability that a randomly chosen injured player is either a forward or an attacking midfielder?

The probability that a randomly chosen injured player is either a forward or an attacking midfielder is
approximately 0.552 or 55.2%.

SMDM Project Report
100% (2)
SMDM Project Report
35 pages
Kewal Kumar Singh
No ratings yet
Kewal Kumar Singh
16 pages
EDA Loan Case Study PPT - Ver 1.1
80% (5)
EDA Loan Case Study PPT - Ver 1.1
22 pages
Austo Case Study
No ratings yet
Austo Case Study
19 pages
SMDM Coded Project - Vidya Sawant
No ratings yet
SMDM Coded Project - Vidya Sawant
27 pages
SMDM Project Sample Report
No ratings yet
SMDM Project Sample Report
30 pages
Cars4u Project: Proprietary Content. © Great Learning. All Rights Reserved. Unauthorized Use or Distribution Prohibited
100% (2)
Cars4u Project: Proprietary Content. © Great Learning. All Rights Reserved. Unauthorized Use or Distribution Prohibited
30 pages
Project Report Abhay PDF
100% (1)
Project Report Abhay PDF
20 pages
SMDM Project Doc FaizanAliSayyed 16-07-2023
No ratings yet
SMDM Project Doc FaizanAliSayyed 16-07-2023
48 pages
Business Report
No ratings yet
Business Report
23 pages
Autos Automobile.. EDA Project by Anjali Sinha
No ratings yet
Autos Automobile.. EDA Project by Anjali Sinha
26 pages
SMDM-Project Sample Business Report
No ratings yet
SMDM-Project Sample Business Report
47 pages
Business Report - PDS
No ratings yet
Business Report - PDS
11 pages
Problem 1 BrahmaChari
No ratings yet
Problem 1 BrahmaChari
17 pages
Austo Automobile
No ratings yet
Austo Automobile
20 pages
Business Project Report
No ratings yet
Business Project Report
23 pages
PDS Coded Project
No ratings yet
PDS Coded Project
20 pages
Yeniyeniduzelcek
No ratings yet
Yeniyeniduzelcek
37 pages
Business Report Suchita Bhovar Coded Project
No ratings yet
Business Report Suchita Bhovar Coded Project
18 pages
DVT Project
No ratings yet
DVT Project
35 pages
SMDM Project Report - Set2
No ratings yet
SMDM Project Report - Set2
21 pages
Assignment ML
100% (2)
Assignment ML
21 pages
Kailash BusinessReport (AutoRecovered) 1
No ratings yet
Kailash BusinessReport (AutoRecovered) 1
16 pages
SMDM Project Business
80% (5)
SMDM Project Business
13 pages
Price Analysis of BMW Cars in Dealerships
No ratings yet
Price Analysis of BMW Cars in Dealerships
25 pages
Business Report
No ratings yet
Business Report
10 pages
Business Report
No ratings yet
Business Report
17 pages
SMDM Project Business Report - Ketan Sawalkar: (Document Title)
100% (2)
SMDM Project Business Report - Ketan Sawalkar: (Document Title)
17 pages
Soba C.project
No ratings yet
Soba C.project
20 pages
Project 4 - Cars-Datasets PDF
100% (2)
Project 4 - Cars-Datasets PDF
44 pages
SMDM Project Report
100% (1)
SMDM Project Report
19 pages
SMDM Project - Vinay Rangari - 25-June-2023
No ratings yet
SMDM Project - Vinay Rangari - 25-June-2023
14 pages
Certificate For Absence of Drug Personalities
No ratings yet
Certificate For Absence of Drug Personalities
2 pages
Business Report
No ratings yet
Business Report
43 pages
Business Report Pradeep Chauhan 11june'23
100% (1)
Business Report Pradeep Chauhan 11june'23
25 pages
Fin081 Liquidity Ratio Au Fa1 Bsa2 Main6
No ratings yet
Fin081 Liquidity Ratio Au Fa1 Bsa2 Main6
11 pages
Great Learning SMDM Project
No ratings yet
Great Learning SMDM Project
16 pages
Assignment Python
No ratings yet
Assignment Python
25 pages
Detail Project Report SMDM
100% (1)
Detail Project Report SMDM
25 pages
Project - Analyzing The Impact of Car Features On Price and Profitability
No ratings yet
Project - Analyzing The Impact of Car Features On Price and Profitability
8 pages
Business Report SMDM Project - Coded
No ratings yet
Business Report SMDM Project - Coded
27 pages
Business Report - 1 (Austo Automobies)
No ratings yet
Business Report - 1 (Austo Automobies)
19 pages
Dino Vs Jardines Case Digest
No ratings yet
Dino Vs Jardines Case Digest
2 pages
Unwto - Kyrgyzstan
No ratings yet
Unwto - Kyrgyzstan
2 pages
Business Report
No ratings yet
Business Report
12 pages
SMDM Project Report - Shubham Bakshi - 07.05.2023
0% (1)
SMDM Project Report - Shubham Bakshi - 07.05.2023
23 pages
MGSI - Undertaking Against Abscondment & Violation of Immigration Laws - Bilingual 03.18.2024
No ratings yet
MGSI - Undertaking Against Abscondment & Violation of Immigration Laws - Bilingual 03.18.2024
3 pages
SMDM Project Report
No ratings yet
SMDM Project Report
39 pages
MBMA Lunch and Learn - Safety Culture 042214rev
No ratings yet
MBMA Lunch and Learn - Safety Culture 042214rev
15 pages
Arpita Saha SMDM Coded Project Module 2 10 01 2024 G2 Business Report
No ratings yet
Arpita Saha SMDM Coded Project Module 2 10 01 2024 G2 Business Report
21 pages
Peso Guideline
No ratings yet
Peso Guideline
16 pages
The Art of My Neighbor Totoro - Viz Media, Subs. of Shogakukan Inc (2005) - Text
No ratings yet
The Art of My Neighbor Totoro - Viz Media, Subs. of Shogakukan Inc (2005) - Text
45 pages
Poly Lecture
No ratings yet
Poly Lecture
66 pages
Britannica - Islamic Thought
No ratings yet
Britannica - Islamic Thought
54 pages
History III
No ratings yet
History III
6 pages
Sample Project - IP - 12
No ratings yet
Sample Project - IP - 12
14 pages
Keps 2 Ps
No ratings yet
Keps 2 Ps
14 pages
Isaiah 5310 Dec 17 Final
No ratings yet
Isaiah 5310 Dec 17 Final
44 pages
Price Analysis of BMW Cars in Dealerships
No ratings yet
Price Analysis of BMW Cars in Dealerships
25 pages
Impact of Car Features
No ratings yet
Impact of Car Features
9 pages
Car Features Case Study
No ratings yet
Car Features Case Study
10 pages
SMDM Project Report
No ratings yet
SMDM Project Report
27 pages
Ethiopian Writing System - Baye Yimam PDF
100% (1)
Ethiopian Writing System - Baye Yimam PDF
9 pages
Business Report 16 April 2023
No ratings yet
Business Report 16 April 2023
16 pages
SMDM - Project Report - Lakshmi
No ratings yet
SMDM - Project Report - Lakshmi
26 pages
MGT515AC SP24 GroupAssignment Case
No ratings yet
MGT515AC SP24 GroupAssignment Case
2 pages
SMDM Project
No ratings yet
SMDM Project
17 pages
Mid Term Exam-SectionC - C1
No ratings yet
Mid Term Exam-SectionC - C1
1 page
Deriving Insights From Data
No ratings yet
Deriving Insights From Data
8 pages
Unit 31 Statistics For ManagementAssignment 1 (LO1 - LOs) 1
0% (1)
Unit 31 Statistics For ManagementAssignment 1 (LO1 - LOs) 1
3 pages
EGR Handouts
100% (1)
EGR Handouts
13 pages
Arens AAS17 SM 08
No ratings yet
Arens AAS17 SM 08
23 pages
SMDM Prjoect Instructions
No ratings yet
SMDM Prjoect Instructions
5 pages
Coursehero ch03
No ratings yet
Coursehero ch03
22 pages
Dangarembga Tsitsi. Nervous Conditions Thesis Proposal - Maya Lama
No ratings yet
Dangarembga Tsitsi. Nervous Conditions Thesis Proposal - Maya Lama
13 pages
I, Don Quixote
No ratings yet
I, Don Quixote
209 pages
Pranjal - Singh - 30.10.2022 SMDM PROJECT REPORT
No ratings yet
Pranjal - Singh - 30.10.2022 SMDM PROJECT REPORT
9 pages
OCBC Titanium Credit Card
No ratings yet
OCBC Titanium Credit Card
2 pages
Race and The Idea of The Aesthetic
No ratings yet
Race and The Idea of The Aesthetic
24 pages
Edc Question
No ratings yet
Edc Question
2 pages
Puer
100% (1)
Puer
12 pages
MONEY PECHU - July 14
No ratings yet
MONEY PECHU - July 14
7 pages
Hybrid Democracy in Pakistan: A Case Study of The PDM Government
No ratings yet
Hybrid Democracy in Pakistan: A Case Study of The PDM Government
11 pages
Referencia Técnica
No ratings yet
Referencia Técnica
25 pages
DJI Remote Identification Whitepaper 3-22-17
No ratings yet
DJI Remote Identification Whitepaper 3-22-17
10 pages
Competition: A Marxist View: Giulio Palermo
No ratings yet
Competition: A Marxist View: Giulio Palermo
27 pages
Redemption of Reward Points Towards Card Outstanding & Air Miles
No ratings yet
Redemption of Reward Points Towards Card Outstanding & Air Miles
3 pages
Anterior Segment Aqueous Drainage Device PDF
No ratings yet
Anterior Segment Aqueous Drainage Device PDF
5 pages
Role of Digital Marketing
No ratings yet
Role of Digital Marketing
2 pages
The Broker’s Bible: The Way Forward
From Everand
The Broker’s Bible: The Way Forward
Nancy Gardner
No ratings yet

SMDM Project Business Report - Group Assignment-Copy2

Uploaded by

SMDM Project Business Report - Group Assignment-Copy2

Uploaded by

DSBA – JULY 2023

Problem 1 : Austo Motor Company

Preliminary data analysis

Numerical Description of Data

 Analyzing the car buyers on Price variable

 Analyzing the car buyers from their partner salary

 Analyzing the car buyers from their Total salary

 Analyzing of cars for each unique make

 Analyzing the car buyers by Gender

 Number of car buyers by Marital status

 Number of car buyers by Profession and Gender

 Analyzing the car buyers on number of dependents

 Average salary of car buyers based on their loan status

 Relationship between the level of education and type of car

 Relationship between car buyers Profession and the car type

 Relationship between working partner and type of car

 Relationship between House loan and type of car

 Relationship between Total Salary and type of car

 Analysis using correlation and heatmap

Problem 2: A physiotherapist with a male football team is interested in studying the

Total SUV 297 35 29 235

The likelihood of a randomly selected player experiencing an injury is 61.7%.

2.2 What is the probability that a player is a forward or a winger?

You might also like