0% found this document useful (0 votes)

55 views29 pages

ET - Project Presentation Solution

This document discusses analyzing customer data to predict which customers are likely to purchase a new travel package being offered by a tourism company. Key points discussed include: - Customers with passports, from tier 1 cities, who are younger, single, and have been contacted multiple times are more likely to purchase packages. - Higher income customers and those in high positions are less likely to purchase. Basic and standard packages have higher conversion rates. - The document outlines data preprocessing steps like dropping outliers and rare values in the dataset to prepare the data for modeling to predict customers likely to purchase the new wellness travel package.

Uploaded by

Sugrib K Shaha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

55 views29 pages

ET - Project Presentation Solution

Uploaded by

Sugrib K Shaha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 29

Travel Package Purchase

Prediction
[email protected]
D1GS97LPEQ

Visit with Us: Ensemble Technique

This file is meant for personal use by [email protected] only.

● Business Problem Overview and Solution Approach

● EDA Results

● Data Preprocessing
[email protected]
●
D1GS97LPEQ Model Performance Summary

● Appendix

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Executive Summary
● Our analysis shows that very few customers have passports and they are more likely to purchase
the travel package. The company should customize more international packages to attract more
such customers.

● We have customers from tier 1 and tier 3 cities but very few from tier 2 cities. The company
should expand its marketing strategies to increase the number of customers from tier 2 cities.

● We saw
[email protected] in our analysis that people with higher income or at high positions like AVP or VP are
D1GS97LPEQ
less likely to buy the product. The company can offer short-term travel packages and customize
the package for higher- income customers with added luxuries to target such customers.

● When implementing a marketing strategy, external factors, such as the number of follow-ups,
time of call, should also be carefully considered as our analysis shows that the customers who
have been followed up more are the ones buying the package.

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Executive Summary
● After we identify a potential customer, the company should pitch packages as per the customer's
monthly income, for example, do not pitch king packages to a customer with low income and such
packages can be pitched more to the higher-income customers.

● We saw in our analysis that young and single people are more likely to buy the offered packages.
The company can offer discounts or customize the package to attract more couples, families, and
customers above 30 years of age.
[email protected]
D1GS97LPEQ

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Business Problem Overview and Solution Approach
● Visit with us is a tourism company, and the policymaker wants to enable and establish a viable
business model to expand the customer base. A viable business model is a central concept that
helps you understand the existing ways of doing business and how to change the ways for the
beneﬁt of the tourism sector.

● One of the ways to expand the customer base is to introduce a new offering of packages.
Currently, there are 5 types of packages the company is offering - Basic, Standard, Deluxe, Super
[email protected]
D1GS97LPEQ Deluxe, and King. However, it was difﬁcult to identify the potential customers because customers
were contacted at random without looking at the available information.

● The company is now planning to launch a new product i.e. Wellness Tourism Package. Wellness
Tourism is deﬁned as Travel that allows the traveler to maintain, enhance or kick-start a healthy
lifestyle, and support or increase one's sense of well-being. This time company wants to harness
the available data of existing and potential customers to target the right customers.

● The task is to analyze the data and build a model to predict which customer is potentially going
to purchase the newly introduced travel package.
This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
EDA Results

[email protected]
D1GS97LPEQ

● The distribution for monthly income shows that most of the values lie between 20,000 to 40,000.

● Income is one of the important factors to consider while approaching a customer with a certain package.
We can explore this further in bivariate analysis.

● There are some observations on the left and some observations on the right of the boxplot which can be
considered as outliers.

This file is meant for personal use by [email protected] only.

Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
EDA Results

● There are approx 70% of customers who reached out to the company
ﬁrst i.e. self-inquiry.

● This shows the positive outreach of the company as most of the

inquires are initiated from the customer's end.

[email protected]
D1GS97LPEQ

● The company pitches Deluxe or Basic packages to their customers more

than the other packages.

● This might be because the company makes more proﬁt from Deluxe or Basic
packages or these packages are less expensive, so preferred by the majority
of the customers.
This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
EDA Results

● We have seen that married people are the most common customer
for the company but this graph shows that the conversion rate is
higher for single and unmarried customers as compared to the
married customers.

● The company can target single and unmarried customers more and
can modify packages as per these customers.
[email protected]
D1GS97LPEQ

● The conversion rate for large business owners is higher than salaried or
small business owners.

● This might be because large business owners have high income.

● Freelancer have 100% conversion rate but there is just 2 such

observation, so cannot give any conclusive insights.
This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
EDA Results

[email protected]
D1GS97LPEQ

● The conversion rate of customers is higher if the product pitched is Basic. This might be because
the basic package is less expensive.

● We saw earlier that company pitches the deluxe package more than the standard package, but the
standard package shows a higher conversion rate than the deluxe package. The company can pitch
standard packages more often.
This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
EDA Results

[email protected]
D1GS97LPEQ

● The Number of trips and age have a weak positive correlation, which makes sense as age increases
number of trips is expected to increase.

● Age and monthly income are positively correlated.

● ProdTaken has a weak negative correlation with age which agrees with our earlier observation that
as age increases the probability for purchasing a package decreases.

● No other variables have a high correlation among them.

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Data Preprocessing
● There are only two observations where the duration of pitch is greater than 37, so we will drop
these rows.

● There are only four observations where the monthly income is greater than 40,000 and less than
12000. Checked these observations and they seem to be the outliers.

● The percentage of categories for the number of trips 19 or above is very less. We can consider
these values as outliers. We can see that there are just four observations with a number of trips
[email protected]
D1GS97LPEQ
19 or greater, so we will drop these rows.

● There are missing values in a few of the numeric variables Age, Monthly income, and Number of
trips, so we will impute these values with a median.

● There are missing values in a few of the categorical variables Type of contact, Preferred property
star, and Number of children visiting, so we will impute these values with mode / most frequent.

● There are 6 categorical variables having string values, so we will be encoding these variables
with dummies.
This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Model Performance Summary
● We want to predict whether a liability customer will buy newly introduced travel package or not
using the information provided to us.

● We will use the Recall as the performance metric for our model because

● Predicting a customer will buy the product and the customer doesn't buy - Loss of
resources
[email protected]
D1GS97LPEQ ● Predicting a customer will not buy the product and the customer buys - Loss of opportunity

● We would want Recall to be maximized. The greater the Recall higher the chances of
minimizing false negatives

● Tuned XGBoost model indicates that the most signiﬁcant predictors of buying a travel package:

○ Passport
○ Designation
○ Marital Status
○ City tier This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Model Performance Summary

[email protected]
D1GS97LPEQ

Best performing model

This file is meant for personal use by [email protected] only.

APPENDIX

This file is meant for personal use by [email protected] only.

● The attributes include Age, Occupation, Income,Gender,Prod taken, Occupation, Passport, and
more.

● Average age of customers is 37 years, age of customers has a wide range from 18 to 61 years.

● Monthly income variable has some outliers at both ends.

[email protected]
D1GS97LPEQ
● Average income of customers is 25k dollars. Income has a wide range from 1k dollars to 98k
dollars. The distribution of Income is skewed to right.

● Half of the customers are married.

● 70% of the customers do not have passport.

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● 0 errors on the training set, each sample ● The decision tree model is overﬁtting
has been classiﬁed correctly.
[email protected]
D1GS97LPEQ
the data as expected and not able to
generalize well on the test set.
● Model has performed very well on the
training set. ● We will have to use hyperparameter
tuning with the decision tree.
● As we know, a decision tree will continue
to grow and classify each data point
correctly if no restrictions are applied as the
trees will learn all the patterns in the
training set.
This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Model Improvement: Decision Tree

Training Performance Testing Performance

● The performance of the model after hyperparameter tuning has become generalized.
[email protected]
● We are getting
D1GS97LPEQ a Recall of 0.663 and 0.652 for training and test set, respectively.

● Let’s try building some ensemble models and see if the metrics improve.

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● With default parameters, random forest is overﬁtting the training data.

[email protected]
● We'll try to
D1GS97LPEQ reduce overﬁtting and improve the performance by hyperparameter tuning.

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● We are getting a Recall of 0.881 and 0.662 for training and test set, respectively.
[email protected]
● After tuning
D1GS97LPEQ the hyperparameters the random forest is still overﬁtting

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● We are getting a Recall of 0.951 and 0.510 for training and test set, respectively, which is
a very big difference.
[email protected]
D1GS97LPEQ

● We'll try to reduce overﬁtting and improve the performance by hyperparameter tuning.

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● After tuning the hyperparameters the bagging classiﬁer is still overﬁtting.

[email protected]
● There's a
D1GS97LPEQ big difference in the training and the test recall.

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● The recall of both train and test set is very less.

[email protected]
● We'll try to
D1GS97LPEQ improve the performance by hyperparameter tuning.

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● The recall of both train and test set is improved but there is a big difference between both
the sets.
[email protected]
D1GS97LPEQ

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● The recall of both train and test set is very less.

[email protected]
● We'll try to
D1GS97LPEQ improve the performance by hyperparameter tuning.

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● The recall of both train and test set is improved but there is a difference between both the
sets.
[email protected]
D1GS97LPEQ

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● The XGBoost model on the training set has performed very well but it is not able to
generalize on the test set.
[email protected]
D1GS97LPEQ

● Let's try and tune the hyperparameters and see if the performance can be generalized.

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● The overﬁtting has reduced after hyperparameter tuning but is still an overﬁt model.
[email protected]
D1GS97LPEQ

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● For the Stacking Classifier, the tuned random forest, the tuned gradient boosting classifier
and the decision tree models were used as the initial estimators while the tuned xgboost
[email protected]
D1GS97LPEQ
classifier was used as the final estimator.
● We have received recall scores of 0.878 and 0.735 on the training and test set,
respectively.

This file is meant for personal use by [email protected] only.

Capstone Presentation: Telecom Churn Study
100% (3)
Capstone Presentation: Telecom Churn Study
19 pages
2023 UBS AC Prep Material
No ratings yet
2023 UBS AC Prep Material
6 pages
Lead Score Case Study - Presentation
33% (3)
Lead Score Case Study - Presentation
17 pages
Standard Operating Procedure On Coal Loss Accounting
100% (2)
Standard Operating Procedure On Coal Loss Accounting
62 pages
ML-2 Guided Project Report
No ratings yet
ML-2 Guided Project Report
63 pages
Address Proof
No ratings yet
Address Proof
1 page
Capstone Project 1
100% (1)
Capstone Project 1
20 pages
Assignment Guidelines
No ratings yet
Assignment Guidelines
14 pages
Telecom Customer Churn Project Report
50% (2)
Telecom Customer Churn Project Report
25 pages
Logistic Regression and Lda
75% (4)
Logistic Regression and Lda
27 pages
Ticket From Jessore To Dhaka
No ratings yet
Ticket From Jessore To Dhaka
3 pages
Management of Ventral Hernias
No ratings yet
Management of Ventral Hernias
22 pages
Social Media Geeta
No ratings yet
Social Media Geeta
33 pages
Tourism Adoption Project Report
No ratings yet
Tourism Adoption Project Report
14 pages
Reference Report 2
No ratings yet
Reference Report 2
43 pages
Enterprise Final Demo
No ratings yet
Enterprise Final Demo
8 pages
PM Guided Project Sample Business Report
100% (1)
PM Guided Project Sample Business Report
52 pages
Great Lakes Extraa - Learn Project Business Report - 2-Kavish-Rathod
No ratings yet
Great Lakes Extraa - Learn Project Business Report - 2-Kavish-Rathod
22 pages
Business Report - 17nov2024
No ratings yet
Business Report - 17nov2024
20 pages
Analysis For Holiday Package
No ratings yet
Analysis For Holiday Package
6 pages
PM Guided Project
No ratings yet
PM Guided Project
25 pages
Lead Score Case Study
No ratings yet
Lead Score Case Study
13 pages
Ga MRM 160225 SJK GR3
No ratings yet
Ga MRM 160225 SJK GR3
20 pages
Business Analytics Course
No ratings yet
Business Analytics Course
11 pages
Projects PDF
No ratings yet
Projects PDF
12 pages
Deema Hyper Market - Case Study
No ratings yet
Deema Hyper Market - Case Study
6 pages
Data Science Task-2
No ratings yet
Data Science Task-2
13 pages
Travel Agency Package
No ratings yet
Travel Agency Package
26 pages
Lead Score Case Study
No ratings yet
Lead Score Case Study
13 pages
Acquisition Analytics Assignment
No ratings yet
Acquisition Analytics Assignment
15 pages
Conclusion and Business Recommendations Predictive PDF
No ratings yet
Conclusion and Business Recommendations Predictive PDF
6 pages
LEAD SCORING CASE STUDY-converted-compressed
No ratings yet
LEAD SCORING CASE STUDY-converted-compressed
13 pages
How Predictive Analytics Can Deepen Customer Relationships
No ratings yet
How Predictive Analytics Can Deepen Customer Relationships
39 pages
Marketing Analytics PDF
No ratings yet
Marketing Analytics PDF
23 pages
Abhay Ankit Customer Churn Capstone Project
No ratings yet
Abhay Ankit Customer Churn Capstone Project
19 pages
Business Analytics
No ratings yet
Business Analytics
74 pages
Report
No ratings yet
Report
17 pages
SMT Capstone PPT Ayushi Rastogi PGPDSBA.O.MAY22.C
No ratings yet
SMT Capstone PPT Ayushi Rastogi PGPDSBA.O.MAY22.C
12 pages
Lab 3 Customer Behaviour Analysis 0826422
No ratings yet
Lab 3 Customer Behaviour Analysis 0826422
9 pages
Report Online Shoppers Intentions
No ratings yet
Report Online Shoppers Intentions
37 pages
Abigail Tsani Darmawan - Streamlining Bank Campaign Promotion (Batch 16)
No ratings yet
Abigail Tsani Darmawan - Streamlining Bank Campaign Promotion (Batch 16)
56 pages
Social Media Tourism - Capstone Project
No ratings yet
Social Media Tourism - Capstone Project
13 pages
Project 4 Data Mining Final v2
100% (1)
Project 4 Data Mining Final v2
19 pages
Business Report of Social Media Tourism Project
No ratings yet
Business Report of Social Media Tourism Project
21 pages
Lead Score
No ratings yet
Lead Score
23 pages
College Presentation
No ratings yet
College Presentation
9 pages
Amit-Soni
No ratings yet
Amit-Soni
1 page
MA230 JordanW8Project
No ratings yet
MA230 JordanW8Project
10 pages
Lead Score Case Study
No ratings yet
Lead Score Case Study
13 pages
Module - 2 - Template - Susan
No ratings yet
Module - 2 - Template - Susan
13 pages
Direct Marketing Data Analysis
No ratings yet
Direct Marketing Data Analysis
15 pages
Milestone 4 Sem2 Final
0% (1)
Milestone 4 Sem2 Final
28 pages
Lead Scoring Case Study
No ratings yet
Lead Scoring Case Study
14 pages
Predictive Analysis For Retail Banking
No ratings yet
Predictive Analysis For Retail Banking
28 pages
Divya Naukri
No ratings yet
Divya Naukri
2 pages
Lead Score Case Study
No ratings yet
Lead Score Case Study
9 pages
Data Analytics CASE
No ratings yet
Data Analytics CASE
14 pages
Analysis and Presentation For Bank Marketing Data: Vinay Kumar MS by Research Scholar IIT Kharagpur +91-8348575432
No ratings yet
Analysis and Presentation For Bank Marketing Data: Vinay Kumar MS by Research Scholar IIT Kharagpur +91-8348575432
20 pages
Cart-Rf-Ann: Prepared by Muralidharan N
67% (3)
Cart-Rf-Ann: Prepared by Muralidharan N
33 pages
BADM
No ratings yet
BADM
9 pages
Bike Company Market Analysis by Diseph
No ratings yet
Bike Company Market Analysis by Diseph
44 pages
East West Airlines NN
No ratings yet
East West Airlines NN
205 pages
Ignite Milestone 1
No ratings yet
Ignite Milestone 1
11 pages
Abhishek Singh Report
No ratings yet
Abhishek Singh Report
9 pages
Metals 218666 Peer Review v1 1
No ratings yet
Metals 218666 Peer Review v1 1
19 pages
PM FC0205 Sample1 Checked Jan27 15
No ratings yet
PM FC0205 Sample1 Checked Jan27 15
602 pages
Mechanical Behavior of Cast and Forged Magnesium Alloys and Their Microstructures
No ratings yet
Mechanical Behavior of Cast and Forged Magnesium Alloys and Their Microstructures
5 pages
QC Students Material
No ratings yet
QC Students Material
254 pages
Structure and Properties of Cast Al-Si Based Alloy With Zr-v-Ti
No ratings yet
Structure and Properties of Cast Al-Si Based Alloy With Zr-v-Ti
13 pages
How To Send An INTERAC E-Transfer - 20141210
No ratings yet
How To Send An INTERAC E-Transfer - 20141210
1 page
Abstract1 CMSC Dyuti Mar1 12 v5
No ratings yet
Abstract1 CMSC Dyuti Mar1 12 v5
1 page
Help - Grains (The Class GrainSet) (MTEX Toolbox)
No ratings yet
Help - Grains (The Class GrainSet) (MTEX Toolbox)
4 pages
Ieee Mems 2015 Conference Sample Abstract and Instructions For Abstract Preparation
No ratings yet
Ieee Mems 2015 Conference Sample Abstract and Instructions For Abstract Preparation
2 pages
CL30-AA-2-5%-250-3h-F Initial Length, L
No ratings yet
CL30-AA-2-5%-250-3h-F Initial Length, L
115 pages
Comparison With Experimental
No ratings yet
Comparison With Experimental
2 pages
Dll-Types of Chemical RXN
No ratings yet
Dll-Types of Chemical RXN
23 pages
Freshman Admission and Enrollment Procedure
No ratings yet
Freshman Admission and Enrollment Procedure
4 pages
Decimals: Skill 4 - 27B: Estimate Sums and Differences Directions: Estimate by Rounding. Rewrite Each Problem
No ratings yet
Decimals: Skill 4 - 27B: Estimate Sums and Differences Directions: Estimate by Rounding. Rewrite Each Problem
3 pages
ch07 Solution Manual Managerial Accounting Tools For Business Decision Making PDF
100% (2)
ch07 Solution Manual Managerial Accounting Tools For Business Decision Making PDF
63 pages
Geologia Econômica Kupferschiefer
No ratings yet
Geologia Econômica Kupferschiefer
2 pages
Update Plan
100% (1)
Update Plan
79 pages
Introduction To Hospitality - Food Safety
No ratings yet
Introduction To Hospitality - Food Safety
49 pages
Numerical Reasoning Test - Managers
No ratings yet
Numerical Reasoning Test - Managers
8 pages
Selenium Exception Handling 1744116046
No ratings yet
Selenium Exception Handling 1744116046
6 pages
Sherwin Govender Research Report 2018
No ratings yet
Sherwin Govender Research Report 2018
50 pages
Fungi CHP Question Paper
No ratings yet
Fungi CHP Question Paper
4 pages
Plumbing Symbol: Sewer Line Layout Ground Floor Water Line Layout Storm Drainage Layout
No ratings yet
Plumbing Symbol: Sewer Line Layout Ground Floor Water Line Layout Storm Drainage Layout
1 page
Ansari Mansur Ahammad Resume
No ratings yet
Ansari Mansur Ahammad Resume
5 pages
Water Resource - Watermark
No ratings yet
Water Resource - Watermark
4 pages
College of Teacher Education Modular Learning: Module Format For UEP
No ratings yet
College of Teacher Education Modular Learning: Module Format For UEP
4 pages
Accounting - Seneca - Toronto, Canada
No ratings yet
Accounting - Seneca - Toronto, Canada
7 pages
(Answers) 20200915172413prl3 - v1 - 0 - Exercise - Year - End - Federal - 2017 - 0120
100% (1)
(Answers) 20200915172413prl3 - v1 - 0 - Exercise - Year - End - Federal - 2017 - 0120
13 pages
Young Rewired State: White Paper V1.1
No ratings yet
Young Rewired State: White Paper V1.1
28 pages
Turnover Checklist
No ratings yet
Turnover Checklist
5 pages
Study of A Novel Cathode Tool Structure For Improving Heat Removal in Electrochemical Micro-Machining
No ratings yet
Study of A Novel Cathode Tool Structure For Improving Heat Removal in Electrochemical Micro-Machining
7 pages
Rent House Solo Baru Area
No ratings yet
Rent House Solo Baru Area
10 pages
Risk Management Procedure
No ratings yet
Risk Management Procedure
13 pages
(1905) Baltimore Bargain House Catalogue
No ratings yet
(1905) Baltimore Bargain House Catalogue
34 pages
Chapter 6-Well Completion
100% (4)
Chapter 6-Well Completion
49 pages
AFS19-SA094-Unisteel Scaffolding and Formwork-Alpino-31122019
No ratings yet
AFS19-SA094-Unisteel Scaffolding and Formwork-Alpino-31122019
5 pages
Zapper Frequency Generator Hulda Clark Royal Rife
No ratings yet
Zapper Frequency Generator Hulda Clark Royal Rife
48 pages

ET - Project Presentation Solution

Uploaded by

ET - Project Presentation Solution

Uploaded by

Travel Package Purchase

Visit with Us: Ensemble Technique

This file is meant for personal use by [email protected] only.

● Business Problem Overview and Solution Approach

This file is meant for personal use by [email protected] only.

This file is meant for personal use by [email protected] only.

This file is meant for personal use by [email protected] only.

This file is meant for personal use by [email protected] only.

● This shows the positive outreach of the company as most of the

● The company pitches Deluxe or Basic packages to their customers more

● This might be because large business owners have high income.

● Freelancer have 100% conversion rate but there is just 2 such

● Age and monthly income are positively correlated.

● No other variables have a high correlation among them.

Best performing model

This file is meant for personal use by [email protected] only.

This file is meant for personal use by [email protected] only.

● Monthly income variable has some outliers at both ends.

● Half of the customers are married.

● 70% of the customers do not have passport.

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

Training Performance Testing Performance

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● With default parameters, random forest is overﬁtting the training data.

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● After tuning the hyperparameters the bagging classiﬁer is still overﬁtting.

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● The recall of both train and test set is very less.

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

● The recall of both train and test set is very less.

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

This file is meant for personal use by [email protected] only.

Training Performance Testing Performance

This file is meant for personal use by [email protected] only.

This file is meant for personal use by [email protected] only.

You might also like