0% found this document useful (0 votes)
34 views20 pages

Business Report On Foodhub Data Analysis

Uploaded by

Sunayana Jindal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
34 views20 pages

Business Report On Foodhub Data Analysis

Uploaded by

Sunayana Jindal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 20

BUSINESS REPORT

FOODHUB DATA ANALYSIS

sunayana jindal 9/22/24 [Course title]


Contents

Problem Definition -1-2


Understanding Data / Data Overview 1-2

1. How many rows and columns are present in the data? -3


2. What are the datatypes of the different columns in the dataset? -3
3. Are there any missing values in the data? If yes, treat them using an appropriate method. -3
4. Check the statistical summary of the data. What is the minimum, average, and maximum time it
takes for food to be prepared once an order is placed? -3
5. How many orders are not rated? -4
6. Explore all the variables and provide observations on their distributions. (Generally, histograms,
boxplots, countplots, etc. are used for univariate exploration.) 4- 10
7. Which are the top 5 restaurants in terms of the number of orders received? - 10
8. Which is the most popular cuisine on weekends? -10
9. What percentage of the orders cost more than 20 dollars? -10
10. What is the mean order delivery time? -10
11. The company has decided to give 20% discount vouchers to the top 3 most frequent customers. Find
the IDs of these customers and the number of orders they placed. - 10
12. Perform a multivariate analysis to explore relationships between the important variables in the
dataset. (It is a good idea to explore relations between numerical variables as well as relations
between numerical and categorical variables) -10 - 15
13. The company wants to provide a promotional offer in the advertisement of the restaurants. The
condition to get the offer is that the restaurants must have a rating count of more than 50 and the
average rating should be greater than 4. Find the restaurants fulfilling the criteria to get the
promotional offer. -15
14. The company charges the restaurant 25% on the orders having cost greater than 20 dollars and 15%
on the orders having cost greater than 5 dollars. Find the net revenue generated by the company
across all orders. -16
15. The company wants to analyze the total time required to deliver the food. What percentage of
orders take more than 60 minutes to get delivered from the time the order is placed? (The food has
to be prepared and then delivered.) - 16
16. The company wants to analyze the delivery time of the orders on weekdays and weekends. How
does the mean delivery time vary during weekdays and weekends? -16

Actionable Insights and Recommendations 16-17


Problem Definition

Context
The number of restaurants in New York is increasing day by day. Lots of students and busy
professionals rely on those restaurants due to their hectic lifestyles. Online food delivery
service is a great option for them. It provides them with good food from their favourite
restaurants. A food aggregator company FoodHub offers access to multiple restaurants
through a single smartphone app.
The app allows the restaurants to receive a direct online order from a customer. The app
assigns a delivery person from the company to pick up the order after it is confirmed by the
restaurant. The delivery person then uses the map to reach the restaurant and waits for the
food package. Once the food package is handed over to the delivery person, he/she confirms
the pick-up in the app and travels to the customer's location to deliver the food. The delivery
person confirms the drop-off in the app after delivering the food package to the customer.
The customer can rate the order in the app. The food aggregator earns money by collecting a
fixed margin of the delivery order from the restaurants.
Objective
The food aggregator company has stored the data of the different orders made by the
registered customers in their online portal. They want to analyze the data to get a fair idea
about the demand of different restaurants which will help them in enhancing their customer
experience. Suppose you are a Data Scientist at Foodhub and the Data Science team has
shared some of the key questions that need to be answered. Perform the data analysis to
find answers to these questions that will help the company to improve the business.

Understanding Data / Data Overview

Data Description
The data contains the different data related to a food order. The detailed data dictionary is
given below.
Data Dictionary
• order_id: Unique ID of the order
• customer_id: ID of the customer who ordered the food
• restaurant_name: Name of the restaurant
• cuisine_type: Cuisine ordered by the customer
• cost_of_the_order: Cost of the order

Page |1
• day_of_the_week: Indicates whether the order is placed on a weekday or weekend
(The weekday is from Monday to Friday and the weekend is Saturday and Sunday)
• rating: Rating given by the customer out of 5
• food_preparation_time: Time (in minutes) taken by the restaurant to prepare the
food. This is calculated by taking the difference between the timestamps of the
restaurant's order confirmation and the delivery person's pick-up confirmation.
• delivery time: Time (in minutes) taken by the delivery person to deliver the food
package. This is calculated by taking the difference between the timestamps of the
delivery person's pick-up confirmation and drop-off information

Page |2
Question 1: How many rows and columns are present in the data?

There are 1898 rows and 9 columns

Question 2: What are the datatypes of the different columns in the dataset?

• There are 5 numerical columns and 4 object type columns in the dataset.

• Order_id and customer_id are identifiers but not numbers.

• Rating is read as object type column but it can be a numerical column.

• Cuisine_type and day_of_the_week are categories.

• All the data seems to be filled in because Non-Null count is equal to the number of rows for
each columns.

• cost of the order are float in the dataset

Question 3: Are there any missing values in the data? If yes, treat them using an appropriate
method.

There is no missing value in the data

Question 4: Check the statistical summary of the data. What is the minimum, average, and
maximum time it takes for food to be prepared once an order is placed?

The minimum, average and maximum time taken for the food to be prepared is 20, 27.3, 35

Page |3
Question 5: How many orders are not rated?

The rating is not given to 736 orders.

Exploratory Data Analysis (EDA)


Univariate Analysis

Question 6: Explore all the variables and provide observations on their distributions. (Generally,
histograms, boxplots, countplots, etc. are used for univariate exploration.)

Below are the unique values to the dataset

Order ID – 1898

Customer ID – 1200

Restaurant name -178

Cuisine type -14

Observation for Countplots


cuisine
<Axes: xlabel='cuisine _type', y label='count'>

Page |4
The type of cuisine most in demand is American cuisine, then Japanese cuisine and then
Italian Cuisine

Cost of the order

Observation

• The distribution of cost_of_the_order is right skewed.

Page |5
• There is no outliers in this variable.

• Most of the orders are lower than 23 $.

• The range of cost is between 4.5 and 35.5 $.

Day of the week

Observation
The number of orders is higher on weekend then on weekday

Rating

Page |6
Observation
• No one has given rating 1 and 2
• Most of the customers have not given rating.
• Rating 4 is given to less than 400 restaurants
• Rating 3 is given to less than 200 restaurants
• Rating 5 is given to less than 600 restaurants

Food Preparation time

Page |7
Observation
• The variable has a normal distribution.
• The mean and the median are almost the same.
• The minimum time of preparation is 20 min and the maximum is 35 min.
• There is no outlier in the box plot

Page |8
Delivery time

Observation
• The distribution of Delivery time is left skewed.
• There is no outliers in this variable.

Page |9
• 75% of the orders are delivered in less than 28.
• The rang of the delivery is between 15 and 33 minutes.

Question 7: Which are the top 5 restaurants in terms of the number of orders received?
The top 5 restaurants in terms of the number of orders received are mentioned below-
1. Shake Shack – 219
2. The Meatball Shop – 132
3. Blue Ribbon Sushi - 119
4. Blue Ribbon Fried Chicken - 96
5. Parm - 68

Question 8: Which is the most popular cuisine on weekends?


The most popular cuisine on weekends is American.

Question 9: What percentage of the orders cost more than 20 dollars?


The number of total orders that cost above 20 dollars is: 555
Percentage of orders above 20 dollars: 29.24 %

Question 10: What is the mean order delivery time?


The mean delivery time for this dataset is 24.16 minutes

Question 11: The company has decided to give 20% discount vouchers to the top 3 most
frequent customers. Find the IDs of these customers and the number of orders they
placed.
The top 3 most frequent customers ID’s is given below
customer_id
52832 13
47440 10
83287 9

Multivariate Analysis

P a g e | 10
Question 12: Perform a multivariate analysis to explore relationships between the
important variables in the dataset. (It is a good idea to explore relations between
numerical variables as well as relations between numerical and categorical variables)

Cuisine vs Cost of the order

Observation
• Average order cost is less than $20 for all type of cuisine.
• Korean, Mediterranean and Vietnamese cuisine have outliers
• Korean and Vietnamese cuisine are the cheapest.
• Mediterranean cuisine has the most expensive order.

Cuisine vs Food Preparation time

P a g e | 11
Observation
• Korean cuisine has outlier.
• Average food preparation time is less than 28 minutes.
• Korean cuisine food preparation time is fast.

Day of the Week vs Delivery time

Obesrvation

P a g e | 12
The food delivery time is less on weekend and more on weekday

Run the below code and write your observations on the revenue generated by the restaurants

Rating vs Delivery time

Observation

The delivery time is high so the customer has given rating 3

Rating vs Food preparation time

P a g e | 13
Observation

• Average raitng for food prepation time is almost 27.4 minutes.

• Food preparation time is maximum in rating 3.

Rating vs Cost of the order

Observation

• The rating is given 5 for the cost of the order which is more expensive.

• The rating is given 3 for the cost of the order which is of less value.

P a g e | 14
Correlation among variables

Observation
• Food preparation time positively corelates with delivery time.
• cost of the order negatively corelates with delivery time.

Question 13: The company wants to provide a promotional offer in the advertisement of
the restaurants. The condition to get the offer is that the restaurants must have a rating
count of more than 50 and the average rating should be greater than 4. Find the
restaurants fulfilling the criteria to get the promotional offer.

The restaurants fulfilling the criteria to get the promotional offer are given below-

P a g e | 15
Question 14: The company charges the restaurant 25% on the orders having cost greater
than 20 dollars and 15% on the orders having cost greater than 5 dollars. Find the net
revenue generated by the company across all orders.

The net revenue generated by the company across all orders is around 6166.3 dollars

Question 15: The company wants to analyze the total time required to deliver the food.
What percentage of orders take more than 60 minutes to get delivered from the time the
order is placed? (The food has to be prepared and then delivered.)
The percentage of orders that have more than 60 minutes of total delivery time is 10.54 %

Question 16: The company wants to analyze the delivery time of the orders on weekdays
and weekends. How does the mean delivery time vary during weekdays and weekends?
The mean delivery time on weekday is around 28 minutes
The mean delivery time on weekend is around 22 minutes

Actionable Insights and Recommendations

Actionable Insights:
• The number of orders is higher on weekends.

• American cuisine is the most requested by customers, followed by Japanese and Italian
cuisine.

• Customer preferences are not changed on weekends and weekdays.

• 15 restaurants receive more than 50 % of orders.

• 40 % of orders are not rated, which impact the business in two ways : 1- It's difficult to
understand overall customer satisfaction. 2- Customers won't be interested in the service.

• 70 % of the orders cost less than 20 dollars.

• 65 % of customers in the dataset made only one order.

• 28% of restaurants in the dataset received only one order.

• There is no difference in preparation time between weekends and weekdays.

• The total average delivery time is high and can be more than 1 hour.

• Weekend delivery is faster than weekdays.

P a g e | 16
• External factors may impact the delivery time like traffic jam and proximity.

Recommendations:

• Encourage customers to rate their orders.

• Make restaurants aware of the importance of rating, so that they will work on it.

• Encourage customers to make more orders (offering coupons, discounts...).

• Work with restaurants to increase meals under 20 $.

• Work with restaurants to increase the American, Japanese and Italian cuisine choices.

• Work with new restaurants specializing in these types of cuisine.

• Collaborate with top performing restaurants to create exclusive deals to further increase
order volume.

• Leverage the popularity of American, Japanese, and Italian cuisines during weekends for
promotional campaigns.

• Optimize staffing during weekends to handle higher order volumes and ensure quicker
delivery times.

• Implement strategies to reduce food preparation and delivery times to enhance customer
satisfaction.

• Adjust pricing or introduce premium services for orders costing more than $20 to boost
revenue further.

P a g e | 17
P a g e | 18

You might also like