Business Stat & Emetrics - Inference in Regression
In simple regression analysis, we assume that x and y are related linearly in the population.
Mathematically,
$y_i = \beta_0 + \beta_1 x_i + \varepsilon_i$
But since we don’t know the population parameters, we use sample data to obtain a sample regression
model
$\hat y_i = \hat\beta_0 + \hat\beta_1 x_i$
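As a concrete illustration (a minimal sketch; the data values here are made up and are not from the notes), the sample regression line can be fitted to a small dataset in Python:

```python
import numpy as np

# Hypothetical sample data (made up for illustration only)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([2.1, 2.9, 4.2, 4.8, 6.1, 6.9])

# Least-squares fit of y = b0 + b1 * x; polyfit returns the highest degree first
b1_hat, b0_hat = np.polyfit(x, y, 1)
print(b0_hat, b1_hat)   # sample estimates of the population intercept and slope
```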
Here the slope and intercept are estimates of the corresponding population parameters. These estimates are realizations of random variables, namely the estimators of the coefficients, and because the estimators are random variables they can be characterized statistically. Under the classical assumptions, each estimator
- is a random variable,
- has a mean equal to the population value (it is unbiased), and
- is normally distributed.
Sampling distribution of $\hat\beta_0$ and $\hat\beta_1$
A linear estimator is an estimator that is a linear combination of the dependent variable. That is,
$\hat\beta = w_1 Y_1 + w_2 Y_2 + \dots + w_n Y_n$
$\hat\beta_1 = \frac{\sum_{i=1}^n (Y_i - \bar Y)(X_i - \bar X)}{\sum_{i=1}^n (X_i - \bar X)^2}$

$\hat\beta_1 = \frac{\sum_{i=1}^n (X_i - \bar X)\,Y_i}{\sum_{i=1}^n (X_i - \bar X)^2}$

Writing $x_i = X_i - \bar X$,

$\hat\beta_1 = \frac{\sum_{i=1}^n x_i Y_i}{\sum_{i=1}^n x_i^2} = \sum_{i=1}^n \left(\frac{x_i}{\sum_{j=1}^n x_j^2}\right) Y_i = \sum_{i=1}^n w_i Y_i$

so the OLS slope estimator is a linear estimator with weights $w_i = x_i / \sum_{j=1}^n x_j^2$.
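As a quick numerical check (a sketch reusing the hypothetical data from the example above), the weighted-sum form $\sum w_i Y_i$ reproduces the usual ratio formula for the slope:

```python
import numpy as np

# Hypothetical data, for illustration only
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
Y = np.array([2.1, 2.9, 4.2, 4.8, 6.1, 6.9])

x = X - X.mean()                 # deviations x_i = X_i - Xbar
w = x / np.sum(x ** 2)           # weights w_i = x_i / sum(x_j^2)

b1_ratio = np.sum(x * (Y - Y.mean())) / np.sum(x ** 2)   # ratio form of the slope
b1_linear = np.sum(w * Y)                                 # linear-in-Y form
print(b1_ratio, b1_linear)       # the two forms give the same estimate
```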
Substituting $Y_i = \beta_0 + \beta_1 X_i + \varepsilon_i$ into $\hat\beta_1 = \sum_i w_i Y_i$ gives

$E(\hat\beta_1) = E\left(\beta_1 + \frac{\sum_{i=1}^n (X_i - \bar X)\,\varepsilon_i}{\sum_{i=1}^n (X_i - \bar X)^2}\right) = \beta_1 + E\left(\frac{\sum_{i=1}^n (X_i - \bar X)\,\varepsilon_i}{\sum_{i=1}^n (X_i - \bar X)^2}\right)$
Thus the estimator is biased only if the second term above is not zero. Consider two possibilities. First, if we assume X is fixed because it is a treatment determined by the researcher, then since $E(\varepsilon_i) = 0$ the last term is zero. Second, if we assume X is random, then as long as the covariance of the error with X is zero (an assumption of the classical model), the last term is again zero. In either case, the estimator is unbiased.
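A small simulation sketch makes this argument concrete (the population values $\beta_0 = 1$, $\beta_1 = 2$, $\sigma = 1$ and the fixed X values are assumptions chosen only for illustration): averaging the slope estimate over many samples drawn from the model recovers the population slope.

```python
import numpy as np

rng = np.random.default_rng(0)
beta0, beta1, sigma = 1.0, 2.0, 1.0     # assumed population values (illustrative)
X = np.linspace(0, 10, 30)              # X held fixed across repeated samples
x = X - X.mean()

estimates = []
for _ in range(5000):
    eps = rng.normal(0.0, sigma, size=X.size)   # errors with E(eps) = 0
    Y = beta0 + beta1 * X + eps
    estimates.append(np.sum(x * Y) / np.sum(x ** 2))

print(np.mean(estimates))   # close to beta1 = 2, illustrating unbiasedness
```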
The deviation of the estimator from its mean is

$\hat\beta_1 - E(\hat\beta_1) = \frac{\sum_{i=1}^n \varepsilon_i (X_i - \bar X)}{\sum_{i=1}^n (X_i - \bar X)^2}$

so its variance is

$\text{var}(\hat\beta_1) = \frac{\sum_{i=1}^n (X_i - \bar X)^2\,\text{var}(\varepsilon_i)}{\left[\sum_{i=1}^n (X_i - \bar X)^2\right]^2} = \frac{\sigma_\varepsilon^2}{\sum_{i=1}^n (X_i - \bar X)^2}$
Since

$\hat\beta_1 = \sum_{i=1}^n w_i Y_i \quad \text{and} \quad Y_i \sim N(\beta_0 + \beta_1 X_i,\ \sigma_\varepsilon^2)$

(where $\sigma_\varepsilon^2 = \sigma_y^2$, the variance of y about the regression line), the slope estimator is itself normally distributed:

$\hat\beta_1 \sim N\left(\beta_1,\ \frac{\sigma_y^2}{\sum_{i=1}^n (X_i - \bar X)^2}\right)$

Similarly, for the intercept,

$\hat\beta_0 \sim N\left(\beta_0,\ \frac{\sigma_y^2 \sum_{i=1}^n X_i^2}{n \sum_{i=1}^n (X_i - \bar X)^2}\right)$
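Continuing the simulation sketch above (same assumed population values), the spread of the slope estimates across repeated samples can be compared with the variance formula just derived:

```python
import numpy as np

rng = np.random.default_rng(1)
beta0, beta1, sigma = 1.0, 2.0, 1.0     # assumed population values (illustrative)
X = np.linspace(0, 10, 30)
x = X - X.mean()

estimates = []
for _ in range(5000):
    Y = beta0 + beta1 * X + rng.normal(0.0, sigma, size=X.size)
    estimates.append(np.sum(x * Y) / np.sum(x ** 2))

theoretical_var = sigma ** 2 / np.sum(x ** 2)   # sigma^2 / sum((X_i - Xbar)^2)
print(np.var(estimates), theoretical_var)       # empirical vs formula variance
```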
Confidence intervals
Because the slope estimator is normally distributed, the standardized quantity

$\frac{\hat\beta_1 - \beta_1}{\sigma_{\hat\beta_1}} \sim N(0, 1)$

Then,

$P\left(-Z_{1-\alpha/2} \le \frac{\hat\beta_1 - \beta_1}{\sigma_{\hat\beta_1}} \le Z_{1-\alpha/2}\right) = 1 - \alpha$

$P\left(\hat\beta_1 - \sigma_{\hat\beta_1} Z_{1-\alpha/2} \le \beta_1 \le \hat\beta_1 + \sigma_{\hat\beta_1} Z_{1-\alpha/2}\right) = 1 - \alpha$

so $\hat\beta_1 \pm \sigma_{\hat\beta_1} Z_{1-\alpha/2}$ is a $(1-\alpha)$ confidence interval for $\beta_1$.
But in practice we rarely know the population variance $\sigma_y^2$. Hence, we must also estimate the variance of the sampling distribution of the parameter estimator, because it is the unknown variance of y that determines the variance of the estimator, as can be seen here:
$\text{var}(\hat\beta_1) = \frac{\sigma_y^2}{\sum_{i=1}^n (X_i - \bar X)^2}$
We estimate $\sigma_y^2$ with the residual variance

$s_y^2 = \frac{\sum_{i=1}^n e_i^2}{n - 2}$

where $e_i = Y_i - \hat Y_i$ are the residuals, so the estimated variance of the slope estimator is

$\widehat{\text{var}}(\hat\beta_1) = \frac{s_y^2}{\sum_{i=1}^n (X_i - \bar X)^2}$

Because the variance is estimated, the standardized slope follows a t distribution with $n - 2$ degrees of freedom, and the confidence interval uses $t_{1-\alpha/2,\,n-2}$ in place of $Z_{1-\alpha/2}$.
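As a sketch of the full calculation (the data are the illustrative values used earlier; the critical value comes from the t distribution with n − 2 degrees of freedom via scipy, which is an assumed tool rather than one named in the notes):

```python
import numpy as np
from scipy import stats

# Hypothetical data, for illustration only
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
Y = np.array([2.1, 2.9, 4.2, 4.8, 6.1, 6.9])
n = X.size

x = X - X.mean()
b1 = np.sum(x * Y) / np.sum(x ** 2)        # slope estimate
b0 = Y.mean() - b1 * X.mean()              # intercept estimate

resid = Y - (b0 + b1 * X)
s2 = np.sum(resid ** 2) / (n - 2)          # estimated variance of y about the line
se_b1 = np.sqrt(s2 / np.sum(x ** 2))       # estimated standard error of the slope

alpha = 0.05
t_crit = stats.t.ppf(1 - alpha / 2, df=n - 2)
print(b1 - t_crit * se_b1, b1 + t_crit * se_b1)   # 95% confidence interval for beta1
```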
Hypothesis testing
Hypothesis testing is done in much the same way as testing a hypothesis about a mean: we compute a t statistic from the estimate, the hypothesized value, and the estimated standard error, and compare it with the critical value from the t distribution with $n - 2$ degrees of freedom.
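For instance, to test $H_0: \beta_1 = 0$ one compares $t = \hat\beta_1 / s_{\hat\beta_1}$ with the t critical value. A sketch using the statsmodels library (the library and the data are assumptions chosen for illustration; the notes do not prescribe any particular software) reproduces these tests directly:

```python
import numpy as np
import statsmodels.api as sm

# Hypothetical data, for illustration only
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
Y = np.array([2.1, 2.9, 4.2, 4.8, 6.1, 6.9])

model = sm.OLS(Y, sm.add_constant(X)).fit()   # regression with intercept and slope
print(model.params)       # estimated coefficients b0_hat, b1_hat
print(model.tvalues)      # t statistics for H0: coefficient = 0
print(model.pvalues)      # two-sided p-values based on the t distribution (n-2 df)
print(model.conf_int())   # 95% confidence intervals for the coefficients
```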
Goodness of fit: R-squared
The R-squared (coefficient of determination) tells us the fraction of the variation in the dependent variable that is explained by the explanatory variable.
For an individual observation, we have the actual or observed value $Y_i$ and the fitted value $\hat Y_i$. Why does the observed value differ from the mean value $\bar Y$? Two factors explain this. First, the value of X for that observation may differ from the mean of X, so the dependent variable also takes a value different from the mean of y. This difference is
$\hat Y_i - \bar Y$
This is accounted for by the explanatory variable: it is what the regression line tries to explain.
Next, the value of y may also be different from the estimated value. This difference is
$Y_i - \hat Y_i$
This is due to random error. This is what the error term represents.
Now, the total deviation of the actual value of y from its mean is the sum of the two deviations:

$Y_i - \bar Y = (\hat Y_i - \bar Y) + (Y_i - \hat Y_i)$
Squaring both sides and summing over all observations (the cross-product term sums to zero under OLS) decomposes the total variation:

$\sum_{i=1}^n (Y_i - \bar Y)^2 = \sum_{i=1}^n (\hat Y_i - \bar Y)^2 + \sum_{i=1}^n (Y_i - \hat Y_i)^2$

The R-squared is the share of the total sum of squares that is explained:

$R^2 = \frac{\sum_{i=1}^n (\hat Y_i - \bar Y)^2}{\sum_{i=1}^n (Y_i - \bar Y)^2} = 1 - \frac{\sum_{i=1}^n (Y_i - \hat Y_i)^2}{\sum_{i=1}^n (Y_i - \bar Y)^2}$
Since OLS minimizes the unexplained (residual) sum of squares, it also maximizes the explained sum of squares, and hence the R-squared.
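A minimal sketch (again with the illustrative data used earlier) confirms that the two expressions for R-squared agree:

```python
import numpy as np

# Hypothetical data, for illustration only
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
Y = np.array([2.1, 2.9, 4.2, 4.8, 6.1, 6.9])

x = X - X.mean()
b1 = np.sum(x * Y) / np.sum(x ** 2)
b0 = Y.mean() - b1 * X.mean()
Y_hat = b0 + b1 * X                       # fitted values

ess = np.sum((Y_hat - Y.mean()) ** 2)     # explained sum of squares
rss = np.sum((Y - Y_hat) ** 2)            # residual (unexplained) sum of squares
tss = np.sum((Y - Y.mean()) ** 2)         # total sum of squares

print(ess / tss, 1 - rss / tss)           # both equal the R-squared
```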