3-Lecture 3-1

The document discusses the concepts of response and predictor variables in the context of multiple linear regression, emphasizing the relationship between predictors and the outcome variable. It covers the use of qualitative predictors, the creation of dummy variables, and the implications of collinearity and interaction effects on model interpretation. Additionally, it highlights the importance of residual analysis and the potential for overfitting when too many predictors or interaction terms are included.


Response vs. Predictor Variables

X: predictors, features, covariates.
Y: outcome, response variable, dependent variable.

In the advertising data there are n observations (rows) and p predictors (columns); TV, radio, and newspaper are the predictors, and sales is the response:

TV      radio   newspaper   sales
230.1   37.8    69.2        22.1
44.5    39.3    45.1        10.4
17.2    45.9    69.3         9.3
151.5   41.3    58.5        18.5
180.8   10.8    58.4        12.9
CS109A, PROTOPAPAS, RADER, TANNER 3
Multilinear Models

In practice, it is unlikely that any response variable Y depends solely on one predictor X. Rather, we expect that Y is a function of multiple predictors, f(X_1, ..., X_J). Using the notation we introduced last lecture,

\[ Y = (y_1, \dots, y_n), \quad X = (X_1, \dots, X_J), \quad X_j = (x_{1,j}, \dots, x_{i,j}, \dots, x_{n,j}), \]

we can still assume a simple form for f, a multilinear form:

\[ f(X_1, \dots, X_J) = \beta_0 + \beta_1 X_1 + \dots + \beta_J X_J \]

Hence, \(\hat f\) has the same form:

\[ \hat f(X_1, \dots, X_J) = \hat\beta_0 + \hat\beta_1 X_1 + \dots + \hat\beta_J X_J \]



Multiple Linear Regression

Given a set of observations,

\[ \{(x_{1,1}, \dots, x_{1,J}, y_1), \dots, (x_{n,1}, \dots, x_{n,J}, y_n)\}, \]

the data and the model can be expressed in vector notation,

\[ Y = \begin{pmatrix} y_1 \\ \vdots \\ y_n \end{pmatrix}, \quad
X = \begin{pmatrix} 1 & x_{1,1} & \cdots & x_{1,J} \\ 1 & x_{2,1} & \cdots & x_{2,J} \\ \vdots & \vdots & \ddots & \vdots \\ 1 & x_{n,1} & \cdots & x_{n,J} \end{pmatrix}, \quad
\beta = \begin{pmatrix} \beta_0 \\ \beta_1 \\ \vdots \\ \beta_J \end{pmatrix}. \]



Multilinear Model, example

For our data,

\[ \mathit{Sales} = \beta_0 + \beta_1 \times \mathit{TV} + \beta_2 \times \mathit{Radio} + \beta_3 \times \mathit{Newspaper}. \]

In linear algebra notation,

\[ Y = \begin{pmatrix} \mathit{Sales}_1 \\ \vdots \\ \mathit{Sales}_n \end{pmatrix}, \quad
X = \begin{pmatrix} 1 & \mathit{TV}_1 & \mathit{Radio}_1 & \mathit{News}_1 \\ \vdots & \vdots & \vdots & \vdots \\ 1 & \mathit{TV}_n & \mathit{Radio}_n & \mathit{News}_n \end{pmatrix}, \quad
\beta = \begin{pmatrix} \beta_0 \\ \vdots \\ \beta_3 \end{pmatrix}, \]

so that \( Y = X\beta \).



Multiple Linear Regression

The model takes a simple algebraic form:

\[ Y = X\beta + \epsilon \]

We will again choose the MSE as our loss function, which can be expressed in vector notation as

\[ \mathrm{MSE}(\beta) = \frac{1}{n} \lVert Y - X\beta \rVert^2. \]

Minimizing the MSE using vector calculus yields

\[ \hat\beta = (X^\top X)^{-1} X^\top Y = \underset{\beta}{\operatorname{argmin}}\; \mathrm{MSE}(\beta). \]
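The closed-form estimate can be checked numerically. The sketch below uses synthetic data (all numbers made up, not the advertising data); NumPy's least-squares solver computes the same minimizer as the normal equations, but in a numerically stable way:

```python
import numpy as np

# Synthetic data: n observations, J = 3 predictors (values are made up).
rng = np.random.default_rng(0)
n, J = 100, 3
X_raw = rng.uniform(0, 100, size=(n, J))
beta_true = np.array([2.0, 0.05, 0.1, 0.02])    # intercept + J slopes

X = np.column_stack([np.ones(n), X_raw])        # design matrix with a 1s column
y = X @ beta_true + rng.normal(0, 0.5, size=n)  # Y = X beta + eps

# lstsq solves the same least-squares problem as (X^T X)^{-1} X^T Y.
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
mse = np.mean((y - X @ beta_hat) ** 2)
```

With few predictors and modest noise, `beta_hat` lands close to `beta_true`, and the training MSE approaches the noise variance.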
Qualitative Predictors

So far, we have assumed that all variables are quantitative. But in practice, some predictors are often qualitative.

Example: The Credit data set contains information about balance, age, cards, education, income, limit, and rating for a number of potential customers.

Income   Limit  Rating  Cards  Age  Education  Gender  Student  Married  Ethnicity  Balance
14.890   3606   283     2      34   11         Male    No       Yes      Caucasian  333
106.02   6645   483     3      82   15         Female  Yes      Yes      Asian      903
104.59   7075   514     4      71   11         Male    No       No       Asian      580
148.92   9504   681     3      36   11         Female  No       No       Asian      964
55.882   4897   357     2      68   16         Male    No       Yes      Caucasian  331



Qualitative Predictors

If the predictor takes only two values, then we create an indicator or dummy variable that takes on two possible numerical values. For example, for gender we create a new variable:

\[ x_i = \begin{cases} 1 & \text{if the } i\text{th person is female} \\ 0 & \text{if the } i\text{th person is male} \end{cases} \]

We then use this variable as a predictor in the regression equation:

\[ y_i = \beta_0 + \beta_1 x_i + \epsilon_i = \begin{cases} \beta_0 + \beta_1 + \epsilon_i & \text{if the } i\text{th person is female} \\ \beta_0 + \epsilon_i & \text{if the } i\text{th person is male} \end{cases} \]
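A minimal sketch of this encoding, using made-up balances (not the actual Credit data). With a single binary dummy, OLS simply recovers the two group means:

```python
import numpy as np

# Made-up balances and a 0/1 dummy (1 = female), following the slide's coding.
balance = np.array([333.0, 903.0, 580.0, 964.0, 331.0, 1151.0])
x = np.array([0, 1, 0, 1, 0, 1])  # dummy variable x_i

X = np.column_stack([np.ones_like(balance), x])
beta0, beta1 = np.linalg.lstsq(X, balance, rcond=None)[0]

# beta0 is the male group mean; beta0 + beta1 is the female group mean,
# so beta1 is the average difference between the groups.
```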





Qualitative Predictors

Question: What is the interpretation of \(\beta_0\) and \(\beta_1\)?

• \(\beta_0\) is the average credit card balance among males,
• \(\beta_0 + \beta_1\) is the average credit card balance among females,
• and \(\beta_1\) is the average difference in credit card balance between females and males.

Example: Calculate \(\hat\beta_0\) and \(\hat\beta_1\) for the Credit data.

You should find \(\hat\beta_0 \approx \$509\) and \(\hat\beta_1 \approx \$19\).



More than two levels: One hot encoding

Often, the qualitative predictor takes more than two values (e.g. ethnicity in the Credit data).

In this situation, a single dummy variable cannot represent all possible values.

We create additional dummy variables:

\[ x_{i,1} = \begin{cases} 1 & \text{if the } i\text{th person is Asian} \\ 0 & \text{if the } i\text{th person is not Asian} \end{cases} \]

\[ x_{i,2} = \begin{cases} 1 & \text{if the } i\text{th person is Caucasian} \\ 0 & \text{if the } i\text{th person is not Caucasian} \end{cases} \]
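A sketch of this two-dummy coding for a three-level variable, with made-up labels. The level represented by all-zero dummies ("African American" here, matching the slide) becomes the baseline:

```python
import numpy as np

# Made-up ethnicity labels; "African American" is the baseline level
# (both dummies zero), matching the slide's coding.
ethnicity = np.array(["Caucasian", "Asian", "Asian",
                      "Asian", "Caucasian", "African American"])

x1 = (ethnicity == "Asian").astype(float)      # x_{i,1}
x2 = (ethnicity == "Caucasian").astype(float)  # x_{i,2}
X = np.column_stack([np.ones(len(ethnicity)), x1, x2])
```

In general, a qualitative predictor with K levels needs K − 1 dummy variables once an intercept is in the model; in practice `pandas.get_dummies(..., drop_first=True)` produces this coding.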

We then use these variables as predictors, and the regression equation becomes:

\[ y_i = \beta_0 + \beta_1 x_{i,1} + \beta_2 x_{i,2} + \epsilon_i = \begin{cases} \beta_0 + \beta_1 + \epsilon_i & \text{if the } i\text{th person is Asian} \\ \beta_0 + \beta_2 + \epsilon_i & \text{if the } i\text{th person is Caucasian} \\ \beta_0 + \epsilon_i & \text{if the } i\text{th person is African American} \end{cases} \]

Question: What is the interpretation of \(\beta_0\), \(\beta_1\), \(\beta_2\)?



Collinearity
Collinearity and multicollinearity refer to the case in which two or more predictors are correlated (related). In the Credit data, for example, limit and rating are highly correlated.

As a result, the regression coefficients are not uniquely determined. In turn, this hurts the interpretability of the model, since the coefficients are not unique and carry influences from other features.

Both limit and rating have positive coefficients, but it is hard to tell whether the balance is higher because of the rating or because of the limit. If we remove limit, we achieve almost the same model performance, but the coefficients change.
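This behavior can be simulated. The sketch below uses synthetic data in which rating is essentially a rescaled copy of limit (all coefficients made up): dropping rating barely changes the fit, but the coefficient on limit moves:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 200
limit = rng.uniform(1000, 10000, n)
rating = limit / 10 + rng.normal(0, 5, n)    # nearly a rescaled copy of limit
balance = 0.05 * limit + rng.normal(0, 30, n)

def fit(X, y):
    """OLS fit with intercept; returns coefficients and training MSE."""
    X1 = np.column_stack([np.ones(len(y)), X])
    b = np.linalg.lstsq(X1, y, rcond=None)[0]
    return b, np.mean((y - X1 @ b) ** 2)

b_both, mse_both = fit(np.column_stack([limit, rating]), balance)
b_only, mse_only = fit(limit.reshape(-1, 1), balance)

# mse_both and mse_only are nearly identical, yet b_both splits the same
# signal between two columns, so the individual slopes are unstable.
```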
Beyond linearity

So far we assumed:

• a linear relationship between X and Y;
• that the residuals \( r_i = y_i - \hat y_i \) are uncorrelated (taking the average of the squared residuals to calculate the MSE implicitly assumes uncorrelated residuals).

These assumptions need to be verified using the data, by visually inspecting the residuals.



Residual Analysis

If the correct model is not linear, then

\[ y = \beta_0 + \beta_1 x + \phi(x) + \epsilon, \]

while our model, which assumes a linear relationship, is

\[ \hat y = \hat\beta_0 + \hat\beta_1 x. \]

Then the residuals, \( r = y - \hat y = \epsilon + \phi(x) \), are not independent of x.

In residual analysis, we typically create two types of plots:

1. a plot of \( r_i \) with respect to \( x_i \) or \( \hat y_i \). This allows us to compare the distribution of the noise at different values of \( x_i \) or \( \hat y_i \).
2. a histogram of \( r_i \). This allows us to explore the distribution of the noise independently of \( x_i \) or \( \hat y_i \).
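A numerical stand-in for plot (1), on synthetic data: when the true relationship contains a quadratic term, \( \phi(x) = x^2 \), the residuals of a linear fit retain structure. Since OLS residuals are orthogonal to x itself by construction, the sketch probes for leftover structure by correlating them with x² instead:

```python
import numpy as np

rng = np.random.default_rng(2)
x = np.linspace(-3, 3, 200)
y_lin = 1 + 2 * x + rng.normal(0, 0.3, x.size)           # truly linear
y_quad = 1 + 2 * x + x**2 + rng.normal(0, 0.3, x.size)   # phi(x) = x^2

def linear_residuals(x, y):
    """Residuals of a simple linear fit y ~ beta0 + beta1 * x."""
    X = np.column_stack([np.ones_like(x), x])
    b = np.linalg.lstsq(X, y, rcond=None)[0]
    return y - X @ b

r_lin = linear_residuals(x, y_lin)
r_quad = linear_residuals(x, y_quad)

# Near zero for the linear data; close to 1 for the quadratic data.
corr_lin = np.corrcoef(x**2, r_lin)[0, 1]
corr_quad = np.corrcoef(x**2, r_quad)[0, 1]
```

In practice one would plot `r_lin` and `r_quad` against `x` (and histogram them); the correlations here are just a scriptable proxy for what the eye sees in those plots.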
Residual Analysis

Left panel: the linear assumption is correct. There is no obvious relationship between the residuals and x, and the histogram of the residuals is symmetric and normally distributed.

Right panel: the linear assumption is incorrect. There is an obvious relationship between the residuals and x, and the histogram of the residuals is symmetric but not normally distributed.

Note: for multiple regression, we plot the residuals against the predicted values \( \hat y \), since there are too many x's and that could wash out the relationship.
Beyond linearity: synergy effect or interaction effect

We also assumed that the average effect on sales of a one-unit increase in TV is always \(\beta_1\), regardless of the amount spent on radio.

A synergy effect, or interaction effect, occurs when an increase in the radio budget changes the effectiveness of TV spending on sales.

We change

\[ Y = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \epsilon \]

to:

\[ Y = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \beta_3 X_1 X_2 + \epsilon \]
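Fitting the interaction model only requires appending the product column \( X_1 X_2 \) to the design matrix. A sketch on synthetic data (true coefficients made up, not the actual advertising fit):

```python
import numpy as np

rng = np.random.default_rng(3)
n = 300
tv = rng.uniform(0, 300, n)
radio = rng.uniform(0, 50, n)
# True model includes a synergy term (all coefficients made up).
sales = (3 + 0.04 * tv + 0.1 * radio + 0.001 * tv * radio
         + rng.normal(0, 0.5, n))

# Design matrix with the product column X1 * X2 appended.
X = np.column_stack([np.ones(n), tv, radio, tv * radio])
beta_hat, *_ = np.linalg.lstsq(X, sales, rcond=None)
```

Under this model, the effect of one more unit of TV is \( \beta_1 + \beta_3 \times \mathit{Radio} \); that is, it now depends on the radio budget.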





What does it mean?

For the Credit data with a student dummy \( x_{i,\text{student}} \), without the interaction term:

\[ \mathit{Balance} = \begin{cases} \beta_0 + \beta_1 \times \mathit{Income} & \text{if } x_{i,\text{student}} = 0 \\ (\beta_0 + \beta_2) + \beta_1 \times \mathit{Income} & \text{if } x_{i,\text{student}} = 1 \end{cases} \]

With the interaction term:

\[ \mathit{Balance} = \begin{cases} \beta_0 + \beta_1 \times \mathit{Income} & \text{if } x_{i,\text{student}} = 0 \\ (\beta_0 + \beta_2) + (\beta_1 + \beta_3) \times \mathit{Income} & \text{if } x_{i,\text{student}} = 1 \end{cases} \]

Without the interaction, only the intercept differs for students; with it, the slope on Income changes as well.
Too many predictors, collinearity, and too many interaction terms lead to OVERFITTING!
