Business Report On Foodhub Data Analysis
Business Report On Foodhub Data Analysis
Context
The number of restaurants in New York is increasing day by day. Lots of students and busy
professionals rely on those restaurants due to their hectic lifestyles. Online food delivery
service is a great option for them. It provides them with good food from their favourite
restaurants. A food aggregator company FoodHub offers access to multiple restaurants
through a single smartphone app.
The app allows the restaurants to receive a direct online order from a customer. The app
assigns a delivery person from the company to pick up the order after it is confirmed by the
restaurant. The delivery person then uses the map to reach the restaurant and waits for the
food package. Once the food package is handed over to the delivery person, he/she confirms
the pick-up in the app and travels to the customer's location to deliver the food. The delivery
person confirms the drop-off in the app after delivering the food package to the customer.
The customer can rate the order in the app. The food aggregator earns money by collecting a
fixed margin of the delivery order from the restaurants.
Objective
The food aggregator company has stored the data of the different orders made by the
registered customers in their online portal. They want to analyze the data to get a fair idea
about the demand of different restaurants which will help them in enhancing their customer
experience. Suppose you are a Data Scientist at Foodhub and the Data Science team has
shared some of the key questions that need to be answered. Perform the data analysis to
find answers to these questions that will help the company to improve the business.
Data Description
The data contains the different data related to a food order. The detailed data dictionary is
given below.
Data Dictionary
• order_id: Unique ID of the order
• customer_id: ID of the customer who ordered the food
• restaurant_name: Name of the restaurant
• cuisine_type: Cuisine ordered by the customer
• cost_of_the_order: Cost of the order
Page |1
• day_of_the_week: Indicates whether the order is placed on a weekday or weekend
(The weekday is from Monday to Friday and the weekend is Saturday and Sunday)
• rating: Rating given by the customer out of 5
• food_preparation_time: Time (in minutes) taken by the restaurant to prepare the
food. This is calculated by taking the difference between the timestamps of the
restaurant's order confirmation and the delivery person's pick-up confirmation.
• delivery time: Time (in minutes) taken by the delivery person to deliver the food
package. This is calculated by taking the difference between the timestamps of the
delivery person's pick-up confirmation and drop-off information
Page |2
Question 1: How many rows and columns are present in the data?
Question 2: What are the datatypes of the different columns in the dataset?
• There are 5 numerical columns and 4 object type columns in the dataset.
• All the data seems to be filled in because Non-Null count is equal to the number of rows for
each columns.
Question 3: Are there any missing values in the data? If yes, treat them using an appropriate
method.
Question 4: Check the statistical summary of the data. What is the minimum, average, and
maximum time it takes for food to be prepared once an order is placed?
The minimum, average and maximum time taken for the food to be prepared is 20, 27.3, 35
Page |3
Question 5: How many orders are not rated?
Question 6: Explore all the variables and provide observations on their distributions. (Generally,
histograms, boxplots, countplots, etc. are used for univariate exploration.)
Order ID – 1898
Customer ID – 1200
Page |4
The type of cuisine most in demand is American cuisine, then Japanese cuisine and then
Italian Cuisine
Observation
Page |5
• There is no outliers in this variable.
Observation
The number of orders is higher on weekend then on weekday
Rating
Page |6
Observation
• No one has given rating 1 and 2
• Most of the customers have not given rating.
• Rating 4 is given to less than 400 restaurants
• Rating 3 is given to less than 200 restaurants
• Rating 5 is given to less than 600 restaurants
Page |7
Observation
• The variable has a normal distribution.
• The mean and the median are almost the same.
• The minimum time of preparation is 20 min and the maximum is 35 min.
• There is no outlier in the box plot
Page |8
Delivery time
Observation
• The distribution of Delivery time is left skewed.
• There is no outliers in this variable.
Page |9
• 75% of the orders are delivered in less than 28.
• The rang of the delivery is between 15 and 33 minutes.
Question 7: Which are the top 5 restaurants in terms of the number of orders received?
The top 5 restaurants in terms of the number of orders received are mentioned below-
1. Shake Shack – 219
2. The Meatball Shop – 132
3. Blue Ribbon Sushi - 119
4. Blue Ribbon Fried Chicken - 96
5. Parm - 68
Question 11: The company has decided to give 20% discount vouchers to the top 3 most
frequent customers. Find the IDs of these customers and the number of orders they
placed.
The top 3 most frequent customers ID’s is given below
customer_id
52832 13
47440 10
83287 9
Multivariate Analysis
P a g e | 10
Question 12: Perform a multivariate analysis to explore relationships between the
important variables in the dataset. (It is a good idea to explore relations between
numerical variables as well as relations between numerical and categorical variables)
Observation
• Average order cost is less than $20 for all type of cuisine.
• Korean, Mediterranean and Vietnamese cuisine have outliers
• Korean and Vietnamese cuisine are the cheapest.
• Mediterranean cuisine has the most expensive order.
P a g e | 11
Observation
• Korean cuisine has outlier.
• Average food preparation time is less than 28 minutes.
• Korean cuisine food preparation time is fast.
Obesrvation
P a g e | 12
The food delivery time is less on weekend and more on weekday
Run the below code and write your observations on the revenue generated by the restaurants
Observation
P a g e | 13
Observation
Observation
• The rating is given 5 for the cost of the order which is more expensive.
• The rating is given 3 for the cost of the order which is of less value.
P a g e | 14
Correlation among variables
Observation
• Food preparation time positively corelates with delivery time.
• cost of the order negatively corelates with delivery time.
Question 13: The company wants to provide a promotional offer in the advertisement of
the restaurants. The condition to get the offer is that the restaurants must have a rating
count of more than 50 and the average rating should be greater than 4. Find the
restaurants fulfilling the criteria to get the promotional offer.
The restaurants fulfilling the criteria to get the promotional offer are given below-
P a g e | 15
Question 14: The company charges the restaurant 25% on the orders having cost greater
than 20 dollars and 15% on the orders having cost greater than 5 dollars. Find the net
revenue generated by the company across all orders.
The net revenue generated by the company across all orders is around 6166.3 dollars
Question 15: The company wants to analyze the total time required to deliver the food.
What percentage of orders take more than 60 minutes to get delivered from the time the
order is placed? (The food has to be prepared and then delivered.)
The percentage of orders that have more than 60 minutes of total delivery time is 10.54 %
Question 16: The company wants to analyze the delivery time of the orders on weekdays
and weekends. How does the mean delivery time vary during weekdays and weekends?
The mean delivery time on weekday is around 28 minutes
The mean delivery time on weekend is around 22 minutes
Actionable Insights:
• The number of orders is higher on weekends.
• American cuisine is the most requested by customers, followed by Japanese and Italian
cuisine.
• 40 % of orders are not rated, which impact the business in two ways : 1- It's difficult to
understand overall customer satisfaction. 2- Customers won't be interested in the service.
• The total average delivery time is high and can be more than 1 hour.
P a g e | 16
• External factors may impact the delivery time like traffic jam and proximity.
Recommendations:
• Make restaurants aware of the importance of rating, so that they will work on it.
• Work with restaurants to increase the American, Japanese and Italian cuisine choices.
• Collaborate with top performing restaurants to create exclusive deals to further increase
order volume.
• Leverage the popularity of American, Japanese, and Italian cuisines during weekends for
promotional campaigns.
• Optimize staffing during weekends to handle higher order volumes and ensure quicker
delivery times.
• Implement strategies to reduce food preparation and delivery times to enhance customer
satisfaction.
• Adjust pricing or introduce premium services for orders costing more than $20 to boost
revenue further.
P a g e | 17
P a g e | 18