0% found this document useful (0 votes)

17 views15 pages

Linear Regression and Logit

Uploaded by

pra2112catprep

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views15 pages

Linear Regression and Logit

Uploaded by

pra2112catprep

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 15

Linear Regression

• Linear Regression Models are used to identify the

relationship between a continuous dependent variable
and one or more independent variables.
• Simple Linear Regression:
• When there is only one independent variable and one
dependent variable.
• Multiple Linear Regression:
• When there are more than one independent variables.
Linear Regression
• Y = a + b1(X1) + b2(X2) + …….bn(Xn)
• Where,
• Y is the dependent variable
• Xs are independent variables
• a is the intercept
• b1....bn are slope coefficients
Logistic Regression
(Logit)
• Similar to linear regression, logistic regression is also
used to estimate the relationship between a dependent
variable and one or more independent variables,
• But it is used to make a prediction about a categorical
variable versus a continuous one.
• A categorical variable can be true or false, yes or no, 1
or 0, etc.
• Logit estimates the probability of an event occurring,
such as voted or didn’t vote, based on a given data set
of independent variables.
Logistic Regression
(Logit)
• The Logit equation is written as:
• Log Odds of Event =β0+β1X1+β2X2+⋯+βnXn
• Where:
• β0 is the intercept.
• β1,β2,…βn are coefficients for the predictors X1,X2……Xn.
• The term log odds is a way of expressing the likelihood of an event (e.g., loan
defaulting, a person being employed) in a form that can be modeled linearly.
• Odds: A ratio of probabilities (p/(1−p)), where p is the probability of the event
happening.
• Log Odds: The natural logarithm of the odds, which allows probabilities to be
modeled linearly.
• Logistic regression predicts log odds, which can be transformed back to
probabilities for interpretation.
Types of Logit
• Binary logistic regression:
• In this approach, the response or dependent variable is dichotomous in nature—
i.e. it has only two possible outcomes (e.g. 0 or 1).
• Within logistic regression, this is the most commonly used approach, and more
generally, it is one of the most common classifiers for binary classification.
• Example 1: Suppose that we are interested in the factors that influence whether
a political candidate wins an election.
• The outcome (response) variable is binary (0/1); win or lose.
• The predictor variables of interest are:
• the amount of time spent on the campaign,
• the amount of money spent campaigning,
• whether the candidate is an incumbent.
• Example 2: A researcher is interested in how variables, such as GRE (Graduate
Record Exam scores), GPA (grade point average) and prestige of the
undergraduate institution, effect admission into graduate school.
• The outcome variable, admit/don’t admit, is binary.
Types of Logit
• Multinomial logistic regression:
• In this type of logistic regression model, the dependent variable has
three or more possible outcomes; however, these values have no
specified order.
• E.g.: movie studios want to predict what genre of film a moviegoer is
likely to see to market films more effectively. A multinomial logistic
regression model can help the studio to determine the strength of
influence a person's age, gender, and dating status may have on the
type of film that they prefer. The studio can then orient an advertising
campaign of a specific movie toward a group of people likely to go see
it.
• The marketing team of an organization can use the model to predict
the likelihood of a customer purchasing a specific product type (Basic,
Standard, or Premium) based on their age, income, and gender.
Types of Logit
• Ordinal logistic regression:
• In this type of logistic regression model, the response variable
has three or more possible outcomes
• But in this case, these values have a defined order.
• E.g.: grading scales from A to F or rating scales from 1 to 5.
Some Applications of Logit
• Fraud detection: Logistic regression models can help
teams identify data anomalies, which are predictive of
fraud. Certain behaviors or characteristics may have a
higher association with fraudulent activities, which is
particularly helpful to banking and other financial
institutions in protecting their clients.
• Disease prediction: In medicine, Logit can be used to
predict the likelihood of disease or illness for a given
population. Healthcare organizations can set up
preventative care for individuals that show higher
propensity for specific illnesses.
Some Applications of Logit
• Churn prediction: Specific behaviors may be
indicative of churn in different functions of an
organization. For example, human resources and
management teams may want to know if there are high
performers within the company who are at risk of
leaving the organization; this type of insight can prompt
conversations to understand problem areas within the
company, such as culture or compensation.
Case Study
• A leading financial institution is striving to improve its loan approval
process by better understanding the risk factors associated with loan
default. Defaulting on a loan not only causes financial losses but also
affects the institution's operational efficiency and reputation. By taking
data on borrowers, the institution seeks to develop a predictive model to
identify individuals who are more likely to default on their loans. The
institution has collected data on past loans, including financial,
demographic, and loan-specific attributes of borrowers, and their loan
repayment outcomes (whether they defaulted or not). The goal is to
analyze this dataset and build a model that predicts the probability of
loan default based on borrower characteristics.
• You are required to create a logistic regression model that:
1.Identifies the key predictors of loan default.
2.Provides actionable insights into borrower profiles more likely to default.
Case Study
(Data description)
• The dataset consists of the following features:
1.Income: Annual income of the borrower.
2.Credit Score: Credit score of the borrower, reflecting their
creditworthiness.
3.Employment Status: Employment status of the borrower (0 for
unemployed, 1 for employed).
4.Debt to Income Ratio: Ratio of the borrower’s debt payments to
their income.
5.Loan Amount: Amount of loan requested by the borrower.
6.Age: Age of the borrower.
7.Loan Default: The target variable (1 for default, 0 for no default).
Results
• A p<0.05 indicates statistical significance at 95% confidence level.
This means that the variable likely affects the dependent variable.
• Income:
• Interpretation: Measures the effect of a one-unit increase in income (e.g., 1
dollar) on the log odds of loan default. The negative coefficient (−0.0002)
suggests that higher income reduces the likelihood of default.
• p-value: 0.1320. This is not statistically significant (p>0.05), meaning the
effect of income on loan default is not conclusive in this model.
• Credit Score:
• Interpretation: Measures the effect of a one-point increase in credit score on
the log odds of loan default. The negative coefficient (−0.0638) suggests
higher credit scores reduce the likelihood of default.
• p-value: 0.1020. This is close to being statistically significant but not below
the 0.05 threshold.
Results
• Employment Status
• Interpretation: Employment status is encoded as 0
(unemployed) and 1 (employed). The negative coefficient
(−0.2967) suggests that being employed might slightly reduce
the likelihood of default, though the effect is negligible.
• p-value: 0.9380. This indicates no significant effect of
employment status on loan default.
• Interpreting coefficients of dummy variable:
• A positive coefficient suggests that when the dummy variable is 1
(as opposed to 0), the dependent variable (e.g., loan default) is
expected to increase.
• A negative coefficient suggests that when the dummy variable is 1
(as opposed to 0), the dependent variable is expected to decrease.
Results
• Debt-to-Income Ratio
• Interpretation: Reflects the effect of a 1 unit increase in debt-to-
income ratio on the log odds of default. The positive coefficient
(5.75555) suggests that higher ratios might increase default
likelihood.
• p-value: 0.6830 This is not statistically significant.
• Loan Amount
• Interpretation: Reflects the effect of a one-unit increase in loan
amount (e.g., 1 dollar) on the log odds of default. The negative
coefficient (−0.0002) suggests larger loan amounts might reduce
default likelihood.
• p-value: 0.2840. This indicates no significant effect of loan amount
on default.
Results
• Age
• Interpretation: Reflects the effect of a one-year increase in
age on the log odds of default. The negative coefficient
(−0.1643) suggests older individuals are slightly less likely to
default.
• p-value: 0.3540 is not statistically significant.

Statistics of Inheritance POGIL
50% (2)
Statistics of Inheritance POGIL
3 pages
LIBROJ. S. Cramer - Logit Models From Economics and Other Fields-Cambridge University Press (2003)
100% (1)
LIBROJ. S. Cramer - Logit Models From Economics and Other Fields-Cambridge University Press (2003)
185 pages
Pushpull Final
100% (1)
Pushpull Final
59 pages
Virtual Density Lab 2018 PDF
No ratings yet
Virtual Density Lab 2018 PDF
2 pages
Binary Logistic
No ratings yet
Binary Logistic
29 pages
Logistic Regression
No ratings yet
Logistic Regression
18 pages
Logistic Regression - 2011
No ratings yet
Logistic Regression - 2011
76 pages
Logistic Regression
No ratings yet
Logistic Regression
54 pages
Chapter 10 - Logistic Regression: Data Mining For Business Intelligence
No ratings yet
Chapter 10 - Logistic Regression: Data Mining For Business Intelligence
20 pages
Chap10 LogisticRegression
No ratings yet
Chap10 LogisticRegression
19 pages
Report Logistic Regression
No ratings yet
Report Logistic Regression
17 pages
QTA 18-04-2013 Logistic Regression
No ratings yet
QTA 18-04-2013 Logistic Regression
4 pages
Logistic Regression
100% (1)
Logistic Regression
56 pages
Logistic Regression
No ratings yet
Logistic Regression
41 pages
Business Analytics: Advance: Logistic Regression
100% (1)
Business Analytics: Advance: Logistic Regression
26 pages
Logistic Regression
100% (3)
Logistic Regression
41 pages
Logistic Regression
No ratings yet
Logistic Regression
14 pages
LO3 Logistic Regression1
No ratings yet
LO3 Logistic Regression1
31 pages
Chapter 10 Logistic Reg
No ratings yet
Chapter 10 Logistic Reg
29 pages
7.logistics Regression - BDSM - Oct - 2020
No ratings yet
7.logistics Regression - BDSM - Oct - 2020
49 pages
Logistic Regression Monograph
No ratings yet
Logistic Regression Monograph
33 pages
Logistic Regression
No ratings yet
Logistic Regression
8 pages
Logistic Regression Monograph - DSBA v2
No ratings yet
Logistic Regression Monograph - DSBA v2
54 pages
Classification
No ratings yet
Classification
56 pages
CH 8
No ratings yet
CH 8
13 pages
Data Analytics Using R
No ratings yet
Data Analytics Using R
23 pages
Logistic Regression
No ratings yet
Logistic Regression
20 pages
Logistic Regression
No ratings yet
Logistic Regression
27 pages
Econometrics II CH 1
No ratings yet
Econometrics II CH 1
48 pages
Logistic Regression Report
No ratings yet
Logistic Regression Report
39 pages
Logistic Regression
No ratings yet
Logistic Regression
12 pages
Regression Analysis
No ratings yet
Regression Analysis
16 pages
04 Chap04 ClassificationMethods-LogisticRegression 2024
No ratings yet
04 Chap04 ClassificationMethods-LogisticRegression 2024
23 pages
Lecture 22. GLM
No ratings yet
Lecture 22. GLM
41 pages
Logistic Regression
No ratings yet
Logistic Regression
25 pages
Logistic+Regression+Monograph+ +DSBA+v2
No ratings yet
Logistic+Regression+Monograph+ +DSBA+v2
54 pages
DS535 Note 4 (With Marks)
No ratings yet
DS535 Note 4 (With Marks)
18 pages
Logistic Regression
100% (2)
Logistic Regression
47 pages
Report Logistic Regression
No ratings yet
Report Logistic Regression
21 pages
Loges Tic
No ratings yet
Loges Tic
30 pages
Limited Dependent Variables Models-1
No ratings yet
Limited Dependent Variables Models-1
23 pages
Newsletter 23 - Logit, Probit, Tobit (2P)
No ratings yet
Newsletter 23 - Logit, Probit, Tobit (2P)
2 pages
Topic 7 Regression (Cont2) Logistic Regression
No ratings yet
Topic 7 Regression (Cont2) Logistic Regression
33 pages
BANA 560 Lecture - 4 - LogisticRegression
No ratings yet
BANA 560 Lecture - 4 - LogisticRegression
26 pages
ML 4
No ratings yet
ML 4
80 pages
Logistic Regression:: PGP Dse Bangalore July 2018
No ratings yet
Logistic Regression:: PGP Dse Bangalore July 2018
62 pages
W5S01 - PM-Logistic Regression
No ratings yet
W5S01 - PM-Logistic Regression
17 pages
spss10 LOGIT
No ratings yet
spss10 LOGIT
17 pages
Fai Module 3
No ratings yet
Fai Module 3
67 pages
Logistic Regression
No ratings yet
Logistic Regression
10 pages
Regression Analysis
No ratings yet
Regression Analysis
14 pages
FEM 2063 - Data Analytics: CHAPTER 4: Classifications
100% (2)
FEM 2063 - Data Analytics: CHAPTER 4: Classifications
76 pages
Logistic Regression
No ratings yet
Logistic Regression
25 pages
Logistic Regression
No ratings yet
Logistic Regression
30 pages
ML Unit 3
No ratings yet
ML Unit 3
40 pages
Chapter 10 Logistic Reg (Python)
No ratings yet
Chapter 10 Logistic Reg (Python)
29 pages
Logistic Regression
No ratings yet
Logistic Regression
7 pages
ML - Unit 2
No ratings yet
ML - Unit 2
155 pages
02 LogisticRegression
No ratings yet
02 LogisticRegression
29 pages
ML2 Logistic Regression
No ratings yet
ML2 Logistic Regression
23 pages
5.1) Binary Logistic Regression
No ratings yet
5.1) Binary Logistic Regression
32 pages
High Credit Score Step by Step
From Everand
High Credit Score Step by Step
paulo gomes
No ratings yet
Financial Plans for Successful Wealth Management In Retirement: An Easy Guide to Selecting Portfolio Withdrawal Strategies
From Everand
Financial Plans for Successful Wealth Management In Retirement: An Easy Guide to Selecting Portfolio Withdrawal Strategies
Tushar S. Chande, Ph.D., MBA
No ratings yet
Kate Wilson
No ratings yet
Kate Wilson
27 pages
UNIT-5 (C) Rural Entrepreneurship & Rural Industry in India (Ok)
No ratings yet
UNIT-5 (C) Rural Entrepreneurship & Rural Industry in India (Ok)
34 pages
Cyber Law 2
No ratings yet
Cyber Law 2
88 pages
UNIT-4 (C) Dimensions of HRD & Basic Amenities & Population Composition-2022
No ratings yet
UNIT-4 (C) Dimensions of HRD & Basic Amenities & Population Composition-2022
26 pages
UNIT-5 (D) Women Entrepreneurship, Introduction, Definition and Women Entrepreneurship in India-2022
No ratings yet
UNIT-5 (D) Women Entrepreneurship, Introduction, Definition and Women Entrepreneurship in India-2022
27 pages
Audience Selection
No ratings yet
Audience Selection
1 page
6) Maths Unit1 Extra Question Charpit's Method, Cauchy's Method & Non - Linear Pde
No ratings yet
6) Maths Unit1 Extra Question Charpit's Method, Cauchy's Method & Non - Linear Pde
15 pages
Module-3
No ratings yet
Module-3
44 pages
Solution On Paper
No ratings yet
Solution On Paper
12 pages
1) Maths Unit1 NP Bali
No ratings yet
1) Maths Unit1 NP Bali
74 pages
Solution VaR and Systematic Risk
No ratings yet
Solution VaR and Systematic Risk
65 pages
Gujarat Technological University
No ratings yet
Gujarat Technological University
2 pages
1 Binary & Hexadecimal Systems J24
No ratings yet
1 Binary & Hexadecimal Systems J24
19 pages
LC-10 LOAD CELL Trainer PDF
No ratings yet
LC-10 LOAD CELL Trainer PDF
10 pages
Lesson 1 Measures of Position
No ratings yet
Lesson 1 Measures of Position
23 pages
Atm System FINAL
No ratings yet
Atm System FINAL
77 pages
Cincinnati Products Catalog
No ratings yet
Cincinnati Products Catalog
20 pages
SpecificationsMotor 3176c PDF
No ratings yet
SpecificationsMotor 3176c PDF
107 pages
Cells and Tissues
No ratings yet
Cells and Tissues
5 pages
Materials For Mechanical Parts
No ratings yet
Materials For Mechanical Parts
20 pages
Chapter 13-Concrete USD
No ratings yet
Chapter 13-Concrete USD
58 pages
Consumer Theory
No ratings yet
Consumer Theory
17 pages
Vogelsang ETEP-Journal Detection of Electrical Tree Propagation by Partial Discharge Measurements
No ratings yet
Vogelsang ETEP-Journal Detection of Electrical Tree Propagation by Partial Discharge Measurements
7 pages
Maths2b Ipe QN Bank 20-21
No ratings yet
Maths2b Ipe QN Bank 20-21
18 pages
January 1995 PW
100% (1)
January 1995 PW
78 pages
Analysis of Temperature and Pressure Changes in Liquefied Natural
No ratings yet
Analysis of Temperature and Pressure Changes in Liquefied Natural
9 pages
Stone Masonry For Structures
No ratings yet
Stone Masonry For Structures
8 pages
Perhitungan Sistem Bilga Di Kapal
No ratings yet
Perhitungan Sistem Bilga Di Kapal
63 pages
Cable Laying Specification
No ratings yet
Cable Laying Specification
16 pages
PTCR Behaviour of Highly Donor Doped Batio: S. Urek and M. Drofenik
No ratings yet
PTCR Behaviour of Highly Donor Doped Batio: S. Urek and M. Drofenik
4 pages
Aditya Kaplash Research Paper (Ground Improvement Using Stone Column)
No ratings yet
Aditya Kaplash Research Paper (Ground Improvement Using Stone Column)
22 pages
02 Chem30 Exemplars 2009 10
No ratings yet
02 Chem30 Exemplars 2009 10
94 pages
LNB For KU Band
No ratings yet
LNB For KU Band
6 pages
Emeng 3131 Electrical Power Systems: Power System Transients, Power System Stability & Load Flow Studies
No ratings yet
Emeng 3131 Electrical Power Systems: Power System Transients, Power System Stability & Load Flow Studies
25 pages
Cbse - Class X - Maths Worksheet - Trigonometry
67% (6)
Cbse - Class X - Maths Worksheet - Trigonometry
2 pages
Year 4 Statistics and Probability Assessment
No ratings yet
Year 4 Statistics and Probability Assessment
9 pages
The Preparation of Ethene From Ethanol - Chemistry U2
No ratings yet
The Preparation of Ethene From Ethanol - Chemistry U2
3 pages
Picrosiriusred Protocol
No ratings yet
Picrosiriusred Protocol
8 pages

Linear Regression and Logit

Uploaded by

Linear Regression and Logit

Uploaded by

Linear Regression

• Linear Regression Models are used to identify the

You might also like