
18.650 – Fundamentals of Statistics

6. Linear regression
Goals

Consider two random variables X and Y. For example,
1. X is the amount of $ spent on Facebook ads and Y is the total conversion rate
2. X is the age of a person and Y is the number of clicks

Given two random variables (X, Y), we can ask the following questions:
- How do we predict Y from X?
- What are the error bars around this prediction?
- How many more conversions Y do we get for an additional dollar spent?
- Does the number of clicks even depend on age?
- What if X is a random vector? For example, X = (X_1, X_2), where X_1 is the amount of $ spent on Facebook ads and X_2 is the duration in days of the campaign.
Conversions vs. amount spent

[Figure]

Clicks vs. age

[Figure]
Modeling assumptions

- (X_i, Y_i), i = 1, ..., n are i.i.d. from some unknown joint distribution P.
- P can be described by (assuming all exist):
  - a joint PDF h(x, y), or
  - the marginal density of X, h(x) = ∫ h(x, y) dy, together with the conditional density h(y|x) = h(x, y) / h(x).
- h(y|x) answers all our questions: it contains all the information about Y given X = x.
Partial modeling

We can also describe the distribution only partially, e.g., using
- The expectation of Y: E[Y]
- The conditional expectation of Y given X = x: E[Y | X = x]

The function

    x ↦ f(x) := E[Y | X = x] = ∫ y h(y|x) dy

is called the regression function of Y on X.

- Other possibilities:
  - The conditional median m(x), such that ∫_{−∞}^{m(x)} h(y|x) dy = 1/2
  - Conditional quantiles
  - Conditional variance (not informative about location)
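To make the regression function concrete, here is a minimal sketch, assuming Python with NumPy and purely simulated data (none of it comes from the slides), that estimates f(x) = E[Y | X = x] by averaging Y within bins of X:

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated pairs (X_i, Y_i): a nonlinear conditional mean plus noise.
n = 5000
X = rng.uniform(0.0, 10.0, size=n)
Y = np.sin(X) + 0.5 * X + rng.normal(scale=0.3, size=n)

# Crude estimate of f(x) = E[Y | X = x]: average Y over bins of X.
edges = np.linspace(0.0, 10.0, 11)
which_bin = np.digitize(X, edges)
for k in range(1, len(edges)):
    in_bin = which_bin == k
    if in_bin.any():
        x_mid = 0.5 * (edges[k - 1] + edges[k])
        print(f"x ~ {x_mid:4.1f}   estimated E[Y | X ~ x] = {Y[in_bin].mean():6.2f}")
```

Binning is only one of many nonparametric choices; the slides below restrict instead to the linear family f(x) = a + bx.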
Conditional expectation and standard deviation

[Figure]

Conditional expectation

[Figure]

Conditional density and conditional quantiles

[Figure]

Conditional distribution: boxplots

[Figure]
Linear regression

We first focus on modeling the regression function

    f(x) = E[Y | X = x]

- Too many possible regression functions f (nonparametric)
- Useful to restrict to simple functions that are described by a few parameters
- Simplest:

    f(x) = a + bx

Under this assumption, we talk about linear regression.
Nonparametric regression

[Figure]

Linear regression

[Figure]
Probabilistic analysis

- Let X and Y be two real random variables (not necessarily independent) with two moments and such that var(X) > 0.
- The theoretical linear regression of Y on X is the line x ↦ a^* + b^* x, where

    (a^*, b^*) = argmin_{(a, b) ∈ R²} E[(Y − a − bX)²]

- Setting partial derivatives to zero gives

    b^* = cov(X, Y) / var(X),
    a^* = E[Y] − b^* E[X].
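As a quick numerical sanity check of these formulas, here is a sketch assuming Python with NumPy and a simulated joint distribution (the true intercept 3.0 and slope 0.8 are made up for the illustration):

```python
import numpy as np

rng = np.random.default_rng(1)

# Simulate (X, Y) with Y = 3.0 + 0.8 X + noise.
n = 200_000
X = rng.normal(loc=2.0, scale=1.5, size=n)
Y = 3.0 + 0.8 * X + rng.normal(scale=1.0, size=n)

# Empirical counterparts of b* = cov(X, Y) / var(X) and a* = E[Y] - b* E[X].
b_star = np.cov(X, Y, ddof=0)[0, 1] / np.var(X)
a_star = Y.mean() - b_star * X.mean()
print(a_star, b_star)   # close to 3.0 and 0.8
```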
Noise

Clearly the points are not exactly on the line x ↦ a^* + b^* x if var(Y | X = x) > 0. The random variable ε = Y − (a^* + b^* X) is called noise and satisfies

    Y = a^* + b^* X + ε,

with
- E[ε] = 0, and
- cov(X, ε) = 0.
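Still in the simulated setting above (a sketch, not part of the slides), the two stated properties of the noise can be checked empirically:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 200_000
X = rng.normal(loc=2.0, scale=1.5, size=n)
Y = 3.0 + 0.8 * X + rng.normal(scale=1.0, size=n)

# Coefficients of the theoretical regression line, estimated from the sample.
b_star = np.cov(X, Y, ddof=0)[0, 1] / np.var(X)
a_star = Y.mean() - b_star * X.mean()

# Noise relative to that line: eps = Y - (a* + b* X).
eps = Y - (a_star + b_star * X)
print(eps.mean())                     # ~ 0, i.e. E[eps] = 0
print(np.cov(X, eps, ddof=0)[0, 1])   # ~ 0, i.e. cov(X, eps) = 0
```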
Statistical problem

In practice, a^* and b^* need to be estimated from data.
- Assume that we observe n i.i.d. random pairs (X_1, Y_1), ..., (X_n, Y_n) with the same distribution as (X, Y):

    Y_i = a^* + b^* X_i + ε_i

- We want to estimate a^* and b^*.
Statistical problem

    Y_i = a^* + b^* X_i + ε_i

[Figures]
Least squares

Definition
The least squared error (LSE) estimator of (a, b) is the minimizer of the sum of squared errors:

    ∑_{i=1}^n (Y_i − a − b X_i)².

(â, b̂) is given by

    b̂ = ( (1/n) ∑_i X_i Y_i − X̄ Ȳ ) / ( (1/n) ∑_i X_i² − X̄² ),
    â = Ȳ − b̂ X̄,

where X̄ and Ȳ denote the sample means of the X_i and Y_i.
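A minimal sketch of these plug-in formulas, assuming Python with NumPy and simulated data; the cross-check with np.polyfit is an addition, not something the slides mention:

```python
import numpy as np

rng = np.random.default_rng(2)

# Simulated sample from Y = a* + b* X + eps with a* = 3.0, b* = 0.8.
n = 500
X = rng.uniform(0.0, 10.0, size=n)
Y = 3.0 + 0.8 * X + rng.normal(scale=1.0, size=n)

# Least squares estimates from the formulas above (sample means for the bars).
b_hat = (np.mean(X * Y) - X.mean() * Y.mean()) / (np.mean(X**2) - X.mean() ** 2)
a_hat = Y.mean() - b_hat * X.mean()
print(a_hat, b_hat)

# Cross-check: numpy's degree-1 polynomial fit returns [slope, intercept].
print(np.polyfit(X, Y, deg=1))
```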
Residuals

[Figure]
Multivariate regression

    Y_i = X_iᵀ β^* + ε_i,   i = 1, ..., n.

- Vector of explanatory variables or covariates: X_i ∈ R^p (w.l.o.g., assume its first coordinate is 1).
- Response / dependent variable: Y_i.
- β^* = (a^*, b^*ᵀ)ᵀ; β^*_1 (= a^*) is called the intercept.
- {ε_i}_{i=1,...,n}: noise terms satisfying cov(X_i, ε_i) = 0.

Definition
The least squares estimator (LSE) of β^* is the minimizer of the sum of squared errors:

    β̂ = argmin_{β ∈ R^p} ∑_{i=1}^n (Y_i − X_iᵀ β)².
LSE in matrix form

- Let Y = (Y_1, ..., Y_n)ᵀ ∈ R^n.
- Let X be the n × p matrix whose rows are X_1ᵀ, ..., X_nᵀ (X is called the design matrix).
- Let ε = (ε_1, ..., ε_n)ᵀ ∈ R^n (unobserved noise).
- Y = Xβ^* + ε, with β^* unknown.
- The LSE β̂ satisfies:

    β̂ = argmin_{β ∈ R^p} ‖Y − Xβ‖₂².
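As an illustration of the matrix formulation, a sketch assuming Python with NumPy and made-up dimensions (n = 200, p = 3); the design matrix gets a leading column of ones for the intercept and the minimizer of ‖Y − Xβ‖₂² is computed numerically:

```python
import numpy as np

rng = np.random.default_rng(3)

# Simulated model Y = X beta* + eps with p = 3 (intercept plus two covariates).
n, p = 200, 3
beta_star = np.array([1.0, 2.0, -0.5])
X = np.column_stack([np.ones(n), rng.normal(size=(n, p - 1))])  # design matrix, first column = 1
Y = X @ beta_star + rng.normal(scale=0.5, size=n)

# LSE: the beta minimizing ||Y - X beta||_2^2 (least squares solver).
beta_hat, *_ = np.linalg.lstsq(X, Y, rcond=None)
print(beta_hat)   # close to beta_star
```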
Closed form solution

- Assume that rank(X) = p.
- Analytic computation of the LSE:

    β̂ = (XᵀX)⁻¹ Xᵀ Y.

- Geometric interpretation of the LSE: Xβ̂ is the orthogonal projection of Y onto the subspace spanned by the columns of X:

    Xβ̂ = P Y,   where P = X(XᵀX)⁻¹Xᵀ.
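The closed form and its geometric interpretation can be verified directly in the same sketch setting (NumPy, simulated data, rank(X) = p assumed):

```python
import numpy as np

rng = np.random.default_rng(3)
n, p = 200, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, p - 1))])
Y = X @ np.array([1.0, 2.0, -0.5]) + rng.normal(scale=0.5, size=n)

# Closed form beta_hat = (X^T X)^{-1} X^T Y (X has full column rank here).
XtX_inv = np.linalg.inv(X.T @ X)
beta_hat = XtX_inv @ X.T @ Y

# X beta_hat equals the orthogonal projection P Y of Y onto the column space of X.
P = X @ XtX_inv @ X.T
print(np.allclose(X @ beta_hat, P @ Y))   # True
```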
Statistical inference

To make inference (confidence regions, tests) we need more assumptions.

Assumptions:
- The design matrix X is deterministic and rank(X) = p.
- The model is homoscedastic: ε_1, ..., ε_n are i.i.d.
- The noise vector ε is Gaussian:

    ε ∼ N_n(0, σ² I_n),

for some known or unknown σ² > 0.
Properties of LSE

- LSE = MLE
- Distribution of β̂:  β̂ ∼ N_p(β^*, σ² (XᵀX)⁻¹).
- Quadratic risk of β̂:  E[‖β̂ − β^*‖₂²] = σ² tr((XᵀX)⁻¹).
- Prediction error:  E[‖Y − Xβ̂‖₂²] = σ² (n − p).
- Unbiased estimator of σ²:  σ̂² = ‖Y − Xβ̂‖₂² / (n − p).

Theorem
- (n − p) σ̂² / σ² ∼ χ²_{n−p}.
- β̂ ⊥⊥ σ̂².
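A sketch of the unbiased variance estimator in the same simulated Gaussian setting (NumPy; the true σ = 0.5 is made up so the estimate can be compared to σ² = 0.25):

```python
import numpy as np

rng = np.random.default_rng(4)
n, p, sigma = 200, 3, 0.5
X = np.column_stack([np.ones(n), rng.normal(size=(n, p - 1))])
Y = X @ np.array([1.0, 2.0, -0.5]) + rng.normal(scale=sigma, size=n)

beta_hat, *_ = np.linalg.lstsq(X, Y, rcond=None)
residuals = Y - X @ beta_hat

# Unbiased estimator of sigma^2: squared residual norm divided by (n - p).
sigma2_hat = residuals @ residuals / (n - p)
print(sigma2_hat)   # should be close to sigma**2 = 0.25
```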
Significance tests

- Test whether the j-th explanatory variable is significant in the linear regression (1 ≤ j ≤ p).
- H_0: β_j = 0  vs.  H_1: β_j ≠ 0.
- If γ_j is the j-th diagonal coefficient of (XᵀX)⁻¹ (γ_j > 0):

    (β̂_j − β_j) / √(σ̂² γ_j) ∼ t_{n−p}.

- Let T_n^{(j)} = β̂_j / √(σ̂² γ_j).
- Test with non-asymptotic level α ∈ (0, 1):

    R_{j,α} = { |T_n^{(j)}| > q_{α/2}(t_{n−p}) },

  where q_{α/2}(t_{n−p}) is the (1 − α/2)-quantile of t_{n−p}.
- We can also compute p-values.
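A sketch of the coefficient test, assuming NumPy and SciPy's Student-t distribution; in the simulated model the third coefficient is truly zero, so H_0 holds for that coordinate:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(5)
n, p = 200, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, p - 1))])
Y = X @ np.array([1.0, 2.0, 0.0]) + rng.normal(scale=0.5, size=n)  # last coefficient = 0

XtX_inv = np.linalg.inv(X.T @ X)
beta_hat = XtX_inv @ X.T @ Y
residuals = Y - X @ beta_hat
sigma2_hat = residuals @ residuals / (n - p)

alpha = 0.05
for j in range(p):
    gamma_j = XtX_inv[j, j]                               # j-th diagonal coefficient
    T = beta_hat[j] / np.sqrt(sigma2_hat * gamma_j)       # T_n^{(j)}
    p_value = 2 * stats.t.sf(abs(T), df=n - p)            # two-sided p-value
    reject = abs(T) > stats.t.ppf(1 - alpha / 2, df=n - p)
    print(f"j={j}: T={T:7.2f}  p-value={p_value:.3f}  reject H0: {reject}")
```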
Bonferroni's test

- Test whether a group of explanatory variables is significant in the linear regression.
- H_0: β_j = 0 ∀ j ∈ S  vs.  H_1: ∃ j ∈ S, β_j ≠ 0, where S ⊆ {1, ..., p}.
- Bonferroni's test: R_{B,α} = ∪_{j∈S} R_{j, α/k}, where k = |S|.
- This test has non-asymptotic level at most α.
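A sketch of the Bonferroni correction for a group S (same simulated setup with NumPy and SciPy; the choice S = {1, 2} is only illustrative and uses 0-based indices):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(5)
n, p = 200, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, p - 1))])
Y = X @ np.array([1.0, 2.0, 0.0]) + rng.normal(scale=0.5, size=n)

XtX_inv = np.linalg.inv(X.T @ X)
beta_hat = XtX_inv @ X.T @ Y
sigma2_hat = (Y - X @ beta_hat) @ (Y - X @ beta_hat) / (n - p)

S = [1, 2]                         # group of coefficients under test (0-based)
alpha, k = 0.05, len(S)
threshold = stats.t.ppf(1 - alpha / (2 * k), df=n - p)   # each single test at level alpha / k
reject_group = any(
    abs(beta_hat[j] / np.sqrt(sigma2_hat * XtX_inv[j, j])) > threshold for j in S
)
print("reject H0 for the group S:", reject_group)
```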
Remarks

- Linear regression exhibits correlations, NOT causality.
- Normality of the noise: one can use goodness-of-fit tests to check whether the residuals ε̂_i = Y_i − X_iᵀ β̂ are Gaussian.
- Deterministic design: if X is not deterministic, all of the above can be understood conditionally on X, provided the noise is assumed to be Gaussian conditionally on X.
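For the normality remark, here is one possible goodness-of-fit check on the residuals, a sketch assuming SciPy; the Shapiro-Wilk test is just one choice and is not prescribed by the slides:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(6)
n, p = 200, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, p - 1))])
Y = X @ np.array([1.0, 2.0, -0.5]) + rng.normal(scale=0.5, size=n)

beta_hat, *_ = np.linalg.lstsq(X, Y, rcond=None)
residuals = Y - X @ beta_hat            # eps_hat_i = Y_i - X_i^T beta_hat

# Shapiro-Wilk test of normality on the residuals (large p-value: no evidence against).
stat, p_value = stats.shapiro(residuals)
print(f"Shapiro-Wilk statistic = {stat:.3f}, p-value = {p_value:.3f}")
```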
