0% found this document useful (0 votes)
143 views1 page

SMDM Extended Project

The document provides instructions for an extended project involving analysis of wholesale customer spending data and college data. For the first problem, participants are asked to conduct descriptive statistical analysis to identify the highest and lowest spending regions and channels, describe spending patterns across items/regions/channels, identify items with most and least inconsistent behavior, check for outliers, and provide business recommendations based on the analysis. For the second problem, participants are asked to perform exploratory data analysis including univariate, bivariate and multivariate techniques on a college dataset, and draw insights from the analysis. Code and outputs must be submitted in a Jupyter notebook along with detailed responses and explanations in a PDF/Word business report. Plagiarized or late submissions will not be graded.

Uploaded by

srashti tripathi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
143 views1 page

SMDM Extended Project

The document provides instructions for an extended project involving analysis of wholesale customer spending data and college data. For the first problem, participants are asked to conduct descriptive statistical analysis to identify the highest and lowest spending regions and channels, describe spending patterns across items/regions/channels, identify items with most and least inconsistent behavior, check for outliers, and provide business recommendations based on the analysis. For the second problem, participants are asked to perform exploratory data analysis including univariate, bivariate and multivariate techniques on a college dataset, and draw insights from the analysis. Code and outputs must be submitted in a Jupyter notebook along with detailed responses and explanations in a PDF/Word business report. Plagiarized or late submissions will not be graded.

Uploaded by

srashti tripathi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 1

SMDM Extended Project

Dear Participants,

Please find below SMDM Project instructions:

 You have to submit 2 files : 


1. Answer Report: In this, you need to submit all the answers to all the questions in a
sequential manner. It should include a detailed explanation of the approach used,
insights, inferences, and all outputs of codes like graphs, tables, etc. Your report
should not be filled with codes. You will be evaluated based on the business report
only. Hence please ensure that your business report is detailed and includes everything
apart from the code. THE REPORT HAS TO BE STRICTLY SUBMITTED IN A
PDF/DOC FORMAT. ANY OTHER FORMAT WILL NOT BE CONSIDERED FOR
GRADING. 6 Marks are allotted for the "Quality of Business Report".
2. Jupyter Notebook file: This is a must and will be used for reference while evaluating
 Any assignment found copied/ plagiarized by another person will not be graded and marked
as zero.
 Please ensure timely submission as a post-deadline assignment will not be accepted.

Problem 1

Wholesale Customers Analysis (Download Data) (attached)

 Problem Statement:

A wholesale distributor operating in different regions of Portugal has information on the annual
spending of several items in their stores across different regions and channels. The data consists
of 440 large retailers’ annual spending on 6 different varieties of products in 3 different regions
(Lisbon, Oporto, Other) and across different sales channels (Hotel, Retail).

1.1 Use methods of descriptive statistics to summarize data. Which Region and which


Channel spent the most? Which Region and which Channel spent the least?

1.2 There are 6 different varieties of items that are considered. Describe and
comment/explain all the varieties across Region and Channel? Provide a detailed
justification for your answer.

1.3 On the basis of a descriptive measure of variability, which item shows the most
inconsistent behaviour? Which items show the least inconsistent behaviour?

1.4 Are there any outliers in the data? Back up your answer with a suitable plot/technique
with the help of detailed comments.

1.5 On the basis of your analysis, what are your recommendations for the business? How
can your analysis help the business to solve its problem? Answer from the business
perspective

Problem 2:

The dataset Education - Post 12th Standard.csv  (attached)contains information on various


colleges. You are expected to do a Principal Component Analysis for this case study according
to the instructions given. The data dictionary of the 'Education - Post 12th Standard.csv' can be
found in the following file: Data Dictionary.xlsx. (attached)

 Perform Exploratory Data Analysis [Univariate, Bivariate, and Multivariate analysis to be


performed]. What insight do you draw from the EDA?

You might also like