Case Study Summary

The document summarizes a case study on using logistic regression to score and rank leads for an online education company. The goal was to increase sales efficiency by identifying the most promising leads. Data on past leads was cleaned, explored, and split for model training and testing. A logistic regression model was built that achieved 80% accuracy on the test data. It generated conversion probability scores to classify leads as "hot" or not. This scoring system can help the sales team focus on prospects most likely to convert.

Uploaded by

Nitish Gupta

Case Study Summary

The aim of this report is to summarise the approach taken for the Lead Scoring case
study.

Let us divide the summary into 3 parts, namely What, How and Conclusion.

What: The X Education company offers online courses. Every day, many people
looking for such courses land on the company's website or learn about the courses
through other lead origins. All this data is stored in a dataset that the sales team
uses to approach leads. The process is currently inefficient. The target is to
increase efficiency by reducing the time spent on leads while keeping the same
conversion rate.

How: The way to achieve better efficiency is to build a logistic regression
model that predicts the probability of conversion from the existing dataset.
The steps taken to build this logistic model are below:
1. The data was loaded and its shape, info, etc. were inspected
2. "Select" was treated as a null value and therefore replaced with NaN
3. The data was cleaned in the following steps:
a. Columns with more than 50% missing values were dropped
b. On further examination, 4 columns had about 45% missing values
("Asymmetrique Activity Index", "Asymmetrique Profile Index", "Asymmetrique Activity
Score", "Asymmetrique Profile Score"). After further exploration, we concluded it
was better to drop these columns than to impute their missing values.
c. Skewed columns were dropped, using a cut-off of 85%
d. Rows with 5 or more missing values were dropped
e. The columns with 20-40% missing values were imputed with median values,
as they were all categorical columns
f. After cleaning, 99.54% of the data was retained
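
The cleaning steps above can be sketched with pandas roughly as follows. This is an illustrative sketch, not the code used in the case study: the function name `clean_leads` and its parameters are hypothetical, "skewed" is assumed to mean that a single value accounts for more than 85% of a column, and the remaining categorical gaps are filled with the mode, because a median is not defined for unordered categories (the summary itself says median).

```python
import numpy as np
import pandas as pd

def clean_leads(df, protect=("Converted",), extra_drop=(),
                col_cutoff=0.50, skew_cutoff=0.85, row_cutoff=5):
    """Hypothetical helper mirroring cleaning steps 2 and 3a-3e."""
    # Step 2: treat the placeholder "Select" as a missing value.
    out = df.replace("Select", np.nan)

    # Step 3a: drop columns with more than col_cutoff missing values.
    out = out.loc[:, out.isna().mean() <= col_cutoff]

    # Step 3b: drop columns named explicitly (e.g. the four Asymmetrique columns).
    out = out.drop(columns=list(extra_drop), errors="ignore")

    # Step 3c (assumed definition): a column is skewed when one value
    # covers more than skew_cutoff of its non-missing rows.
    top_share = out.apply(lambda s: s.value_counts(normalize=True).max())
    skewed = (top_share > skew_cutoff) & ~top_share.index.isin(list(protect))
    out = out.loc[:, ~skewed]

    # Step 3d: drop rows with row_cutoff or more missing values.
    out = out[out.isna().sum(axis=1) < row_cutoff].copy()

    # Step 3e (swapped technique): fill categorical gaps with the mode.
    for col in out.columns:
        if not pd.api.types.is_numeric_dtype(out[col]) and out[col].isna().any():
            out[col] = out[col].fillna(out[col].mode().iloc[0])
    return out
```

The share of rows retained (step 3f) can then be checked with `len(cleaned) / len(df)`.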

4. EDA was performed

a. Univariate analysis of the categorical variables showed the most and least
frequent categories. Some categorical columns had many categories, so, column by
column, categories with very low frequencies were merged into a single category
called "Other"
b. Univariate analysis of the numerical columns showed outliers in a few
columns, so the values in these columns were capped at the 99th percentile
c. Bivariate analysis was done with 'Converted' as the target variable, and
the columns that help conversion were identified from it
d. Multivariate analysis using a correlation matrix showed the most correlated
variables
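
Steps 4a and 4b can be illustrated with two small pandas helpers. This is only a sketch: the helper names are hypothetical, and the 1% frequency threshold is an assumption, since the summary says the rare categories were chosen per column without giving a threshold.

```python
import pandas as pd

def merge_rare_categories(series, min_share=0.01, other_label="Other"):
    """Replace categories rarer than min_share (assumed threshold) with one label."""
    share = series.value_counts(normalize=True)
    rare = share[share < min_share].index
    # Keep values that are not rare; missing values are left untouched.
    return series.where(~series.isin(rare), other_label)

def cap_at_quantile(series, upper=0.99):
    """Cap a numeric column at its upper quantile (99th percentile by default)."""
    return series.clip(upper=series.quantile(upper))
```

Capping keeps every row but limits the influence of extreme values, which fits the retention figure reported after cleaning.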

5. Data preparation steps:

a. Columns with too many categories were binned to reduce the number of
dummy variables
b. The data was split in a 70-30% ratio into train and test sets
c. A standard scaler was used for scaling, to help the algorithm converge faster
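
A minimal sketch of the preparation steps above with pandas and scikit-learn is shown here. The function name `prepare`, the `numeric_cols` argument and the random seed are assumptions for illustration; the summary does not say which columns were scaled or how the split was seeded.

```python
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

def prepare(df, target="Converted", numeric_cols=(), seed=42):
    """Hypothetical helper: dummy-encode, split 70-30, then scale numeric columns."""
    cols = list(numeric_cols)
    # One dummy per category level, dropping the first level of each column.
    X = pd.get_dummies(df.drop(columns=[target]), drop_first=True, dtype=float)
    X[cols] = X[cols].astype(float)
    y = df[target]

    X_train, X_test, y_train, y_test = train_test_split(
        X, y, train_size=0.7, random_state=seed
    )
    X_train, X_test = X_train.copy(), X_test.copy()

    if cols:
        # Fit the scaler on the train set only, then reuse it on the test set.
        scaler = StandardScaler()
        X_train[cols] = scaler.fit_transform(X_train[cols])
        X_test[cols] = scaler.transform(X_test[cols])
    return X_train, X_test, y_train, y_test
```

Fitting the scaler on the train set alone avoids leaking test-set statistics into training.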
6. Modelling:
a. Using RFE, 25 columns were selected initially
b. It took 14 model iterations to reach a stable model with p-values below 5%,
VIF below 5 and 80% accuracy
c. The ROC curve was plotted to check how sensitivity and specificity vary
d. The optimal cut-off probability was found by iterating over cut-off values
and plotting sensitivity, specificity and accuracy against them
e. The sensitivity, specificity and accuracy curves intersected at about 0.37, but
we had a prerequisite of 80 percent sensitivity, so the cut-off value was chosen
as 0.2, which yielded a sensitivity of 83 percent on the train data and 81 percent
on the test data
f. The precision-recall curve was plotted but not used for cut-off optimisation:
our target was to chase hot leads and not cold leads, so a good balance of
sensitivity and specificity was more important
g. The model was run on the test data and gave a sensitivity of 83%

Conclusion
Based on the conversion probabilities calculated by the model, a new column
called Score was created to rate the leads. It will help the sales team find hot
leads. A logistic regression model was built with the desired accuracy of 80% and
can be used to find the hot leads.
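
The cut-off search and the Score column can be sketched as below, assuming the true labels and the model's predicted probabilities are already available as arrays. The function names are hypothetical, the rule "take the highest cut-off that still meets the sensitivity requirement" is one reasonable reading of the selection described above, and scaling the score to 0-100 is an assumption, since the summary does not state the score range.

```python
import numpy as np
import pandas as pd

def cutoff_table(y_true, y_prob, cutoffs=None):
    """Sensitivity, specificity and accuracy for each candidate cut-off.

    Assumes both classes are present in y_true.
    """
    y_true = np.asarray(y_true).astype(bool)
    y_prob = np.asarray(y_prob, dtype=float)
    if cutoffs is None:
        cutoffs = np.round(np.linspace(0.05, 0.95, 19), 2)
    rows = []
    for c in cutoffs:
        pred = y_prob >= c
        tp = np.sum(pred & y_true)
        tn = np.sum(~pred & ~y_true)
        fp = np.sum(pred & ~y_true)
        fn = np.sum(~pred & y_true)
        rows.append({
            "cutoff": float(c),
            "sensitivity": tp / (tp + fn),
            "specificity": tn / (tn + fp),
            "accuracy": (tp + tn) / len(y_true),
        })
    return pd.DataFrame(rows)

def pick_cutoff(table, min_sensitivity=0.80):
    """Assumed rule: highest cut-off whose sensitivity meets the requirement."""
    ok = table[table["sensitivity"] >= min_sensitivity]
    return float(ok["cutoff"].max())

def lead_score(y_prob):
    """Assumed scaling: conversion probability mapped to an integer 0-100 score."""
    return np.round(np.asarray(y_prob, dtype=float) * 100).astype(int)
```

Plotting the three metric columns of the table against `cutoff` gives the kind of intersection plot described in the modelling steps, and leads whose probability is at or above the chosen cut-off would be the ones flagged as hot.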
