Explored hotel data using Python and libraries like Pandas, Matplotlib, and Seaborn. The project aims to analyze the dataset, uncover trends in hotel bookings, and draw meaningful conclusions about factors influencing the hotel business.
- This data set contains booking information for a city hotel and a resort hotel, and includes information such as when the booking was made, length of stay, the number of adults, children, and/or babies, and the number of available parking spaces, among other things.
- Data Overview
- Data Loading and Preprocessing
- Data Analysis and Exploration
- Data Visualization using Matplotlib and Seaborn
- Conclusion
- Which type of hotel is mostly prefered by the guests?
- What is the pecentage of cancellation?
- What is the Percentage of repeated guests?
- Which type of food is mostly preferred by the guests?
- In which month most of the bookings happened?
- Which year had the highest bookings?
- Which hotel type has the highest ADR?
- which hotel has longer waiting time?
- What is optimal stay length in both types of hotel?
- Which distribution channel contributed more to ADR in order to increase the income?
- Which Market Segment has the higest cancellation rate?
- Correlation Heatmap
- City hotels are the most preferred hotel type by the guests. We can say City hotel is the busiest hotel.
- 27.5 % bookings were got cancelled out of all the bookings.
- Only 3.9 % people were revisited the hotels. Rest 96.1 % were new guests. Thus retention rate is low.
- BB( Bed & Breakfast) is the most preferred type of meal by the guests.
- August month has most bookings followed by July.
- Most of the bookings for City hotels and Resort hotel were happened in 2016.
- City hotel has highest ADR. Highest ADR means more revenue.
- Waiting time period for City hotel is high as compared to resort hotels. That means city hotels are much busier than Resort hotels.
- Optimal stay in both the type hotel is less than 7 days. Usually people stay for a week.
- GDS distribution channel contributed most in ADR in city hotel but no contribution in resort hotel
- 'Online T/A' has the highest cancellation in both type of cities
- arrival_date_year and arrival_date_week_number columns has negative correlation which is -0.51.
- stays_in_weel_nights and total_stays has positive correlation which is 0.95.