Python Hospitality Data Analysis Project
Python Hospitality Data Analysis Project
Problem statement
AtliQ Grand owns multiple five-star hotels across India . They have been in
the hospitality industry for past 20 years . Due to strategic moves from
other competitors and ineffective decision-making in the management ,
AtliQ grands are losing its market share and revenue in the luxury/business
hotels category .
As a strategic move , the managing director of AtliQ Grands wanted to
regain their market share and revenue . Their revenue management team
had decided to hire a 3rd party service provider to provide them with
insights from their historical data .
You are a data analyst who has been provided with sample data to provide
the revenue insights to the team
Team provided the 3 months bookings data of AtliQ Grand having around
1.4 lakh records .
Dataset contains 3 dimension tables and 2 fact tables
In between project, we were also provided with the August month data to
include it in the previous data
Importing datasets
dim_date dim_hotels
dim_rooms fact_aggregated_bookings
fact_bookings
If we look closely we find that minimum number of guests is -ve , which can’t be true
Since rows containing negative number of guests are less as compared to the
total rows , hence we can ignore them for insight generation .
So keeping the rows which has number of guests +ve
Since we have only 5 outliers , we can ignore them like we did earlier .
One observation we can have in above dataframe is that all rooms are RT4 type which means
Presidential Suite. Now since RT4 is a luxurious room it is likely their rent will be higher. To
make a fair analysis, we need to do data analysis only on RT4 room type
Here higher limit comes to be 50583 and in our dataframe above we can see that max
value for revenue realized is 45220. Hence we can conclude that there is no outlier
and we don't need to do any data cleaning on this particular column
There are various types of data transformations that you may have to perform
based on the need. Few examples of data transformations are,
1.Creating new columns
2.Normalization
3.Merging data- using merge function
4.Aggregation- sum , mean , etc