Lesson 3

ECONOMETRIC MODELLING

WISDOM R. MGOMEZULU
ECONOMETRIC PROCEDURE
• Statement of theory or hypothesis.
• Specification of the mathematical model of the theory.
• Specification of the statistical, or econometric, model.
• Obtaining the data.
• Estimation of the parameters of the econometric model.
• Hypothesis testing.
• Forecasting or prediction.
• Using the model for control or policy purposes.
Statement of Theory or Hypothesis

• This is where the researcher states what economic theory says about the interdependence of the economic variables of interest.
• Economists include variables in econometric models based on theory. For example, the theory of demand states that the quantity demanded of a product in a given time period is inversely related to its own price.
• Economic theory also indicates the direction of the relationship between variables. However, the theory does not quantify the dependency.
Specification of the Mathematical Model of Demand

• The theory of demand can be presented in mathematical form as shown below:
• 𝑌 = 𝛽0 + 𝛽1𝑋
• 𝑌 is the dependent variable, for example quantity demanded. 𝑋 is the independent variable, in this case price.
• 𝛽0 is the intercept parameter.
• 𝛽1 is the slope parameter.
• The mathematical model is deterministic or exact. It does not take into account the possibility of error.
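As a minimal sketch, such a deterministic model can be evaluated directly; the parameter values below are assumed purely for illustration and are not from the lecture.

```python
# Deterministic (exact) demand model Y = b0 + b1 * X.
# Parameter values are illustrative assumptions, not estimates from data.
b0, b1 = 100.0, -2.0  # b1 < 0: quantity demanded falls as price rises

def quantity_demanded(price):
    """Exact quantity demanded at a given price; no error term."""
    return b0 + b1 * price

print(quantity_demanded(10.0))  # 100 - 2 * 10 = 80.0
```

Because the model is exact, the same price always gives the same quantity; nothing in the equation allows for deviations from the line.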
Specification of the Econometric Model

• The equation on the previous slide assumes that there is an exact or deterministic relationship between the two variables. But relationships between economic variables are generally inexact.
• An econometric model is hence stochastic in nature: it adds a random error (disturbance) term to the mathematical model.
Data collection

• Data are collected on both the dependent and independent variables. The quality of the estimates is only as good as the quality of the data, so an adequate sample should be selected using an appropriate sampling technique. Data collection officers (enumerators) should be trustworthy, well trained and preferably experienced.
Estimation of parameters

• Now that we have the data, our next task is to estimate the parameters of the model. The numerical estimates of the parameters give empirical content to the consumption function, for example:
• Yi = 0.65 + 7.2Xi
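The least squares estimates can be computed with NumPy. The data below are made up for illustration (they are not the lecture's dataset), so the resulting coefficients differ from the 0.65 and 7.2 shown above.

```python
import numpy as np

# Hypothetical (x, y) observations, assumed purely for illustration
x = np.array([80.0, 100.0, 120.0, 140.0, 160.0, 180.0])
y = np.array([65.0, 77.0, 89.0, 101.0, 113.0, 125.0])

# Ordinary least squares:
#   slope     b1 = sum((x - xbar)(y - ybar)) / sum((x - xbar)^2)
#   intercept b0 = ybar - b1 * xbar
x_bar, y_bar = x.mean(), y.mean()
b1 = np.sum((x - x_bar) * (y - y_bar)) / np.sum((x - x_bar) ** 2)
b0 = y_bar - b1 * x_bar

print(b0, b1)  # intercept and slope of the fitted line
```

These two formulas are the closed-form least squares solution for the simple linear model; larger models use the same idea in matrix form.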
Hypothesis Testing
• Assuming that the fitted model is a reasonably good
approximation of reality, we have to develop suitable
criteria of finding out whether the estimates are in accord
with the expectations of the theory that is being tested.
• A theory or hypothesis that is not verifiable by empirical
evidence may not be acceptable as a part of scientific
enquiry.
Forecasting or Prediction
• If the chosen model does not refute the hypothesis or
theory under consideration, we may use it to predict the
future value(s) of the dependent, or forecast, variable Y on
the basis of the known or expected future value(s) of the
explanatory, or predictor, variable X.
Use of the Model for Control or Policy
Purposes

• Once the parameter estimates are known, we can advise policy makers and implementers on how best consumption should be changed to alter GDP levels.
REGRESSION
• Regression is a technique for determining the statistical
relationship between two or more variables where a
change in a dependent variable is associated with, and
depends on, a change in one or more independent
variables.
• A regression model can also be defined as a mathematical
equation that helps to predict or forecast the value of the
dependent variable based on the known values of
independent variables.
SIMPLE LINEAR REGRESSION
• A simple linear regression is a statistical
equation that characterizes the relationship
between a dependent variable and only one
independent variable.
III. Simple Linear Model: Estimation

1. An Econometric Model
2. Assumptions of the Simple Linear Regression Model

3.1 An Econometric Model

Two purposes in general:

1. Estimate a relationship among economic variables, such as y = f(x).
2. Forecast or predict the value of one variable, y, based on the value of another variable, x.
 The simple regression function
E(y | x) = μ_{y|x} = β1 + β2x

 Slope of the regression line
β2 = ΔE(y | x)/Δx = dE(y | x)/dx

"Δ" denotes "change in"


Y X 800 1000 1200 1400 1600 1800 2000 2200 2400 2600

Weekly family 550 650 790 800 1020 1100 1200 1350 1370 1500
consumption
expenditure Y, 600 700 840 930 1070 1150 1360 1370 1450 1520
MK
650 740 900 950 1100 1200 1400 1400 1550 1750

700 800 940 1030 1160 1300 1440 1520 1650 1780

750 850 880 1080 1180 1350 1450 1570 1750 1800

- 880 - 1130 1250 1400 - 1600 1890 1850

- - - 1150 - - - 1620 - 1910

Total 3250 4620 4450 7070 6780 7500 6850 10430 9660 12110

Conditional 650 770 890 1010 1130 1250 1370 1490 1610 1730
means of Y, E(Y|
X)
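The conditional means in the last row can be reproduced directly; the grouped data below transcribe the table's columns (the fifth observation at X = 1200 is taken as 980, consistent with the column total 4450).

```python
# Weekly consumption expenditure Y (MK) grouped by income level X (MK),
# transcribed from the table above.
data = {
    800:  [550, 600, 650, 700, 750],
    1000: [650, 700, 740, 800, 850, 880],
    1200: [790, 840, 900, 940, 980],
    1400: [800, 930, 950, 1030, 1080, 1130, 1150],
    1600: [1020, 1070, 1100, 1160, 1180, 1250],
    1800: [1100, 1150, 1200, 1300, 1350, 1400],
    2000: [1200, 1360, 1400, 1440, 1450],
    2200: [1350, 1370, 1400, 1520, 1570, 1600, 1620],
    2400: [1370, 1450, 1550, 1650, 1750, 1890],
    2600: [1500, 1520, 1750, 1780, 1800, 1850, 1910],
}

# Conditional mean E(Y | X = x) for each income level
cond_means = {x: sum(ys) / len(ys) for x, ys in data.items()}
print(cond_means)
```

Each conditional mean is just the column average; the regression function traces how this average moves as income changes.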
E(y | x) = μ_{y|x} = β0 + β1x   (2.1)

[Figure: the population regression line E(y | x) = β0 + β1x, plotting average expenditure E(y | x) against income x; the slope is ΔE(y | x)/Δx and the intercept is β0.]
[Figure: conditional distributions of weekly consumption expenditure Y ($) at weekly income levels X = $80, $140 and $220; the conditional means 65, 101 and 149 lie on the population regression line.]
3.2 Assumptions of the Simple Linear Regression Model-I

1. The average value of y, for each value of x, is given by the linear regression
E(y) = β1 + β2x

2. For each value of x, the values of y are distributed about their mean value, following probability distributions that all have the same variance,
var(y) = σ²
Data satisfying this condition are said to be homoskedastic. If this assumption is violated, so that the variance of y differs across values of income x, the data are said to be heteroskedastic.

3. The values of y are all uncorrelated and have zero covariance, implying that there is no linear association among them:
cov(yi, yj) = 0
This assumption can be made stronger by assuming that the values of y are all statistically independent. This is what we mean by saying the sampling is done at random.

4. The variable x is not random and must take at least two different values. The idea of regression analysis is to measure the effect of changes in one variable, x, on another, y.

5. (optional) The values of y are normally distributed about their mean for each value of x:
y ~ N(β1 + β2x, σ²)
3.2.1 Introducing the Error Term

The random error term is

e = y − E(y) = y − β1 − β2x

Rearranging gives

y = β1 + β2x + e

y is the dependent variable; x is the independent or explanatory variable.
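A quick simulation (with assumed parameter values, not from the lecture) illustrates the decomposition y = β1 + β2x + e, where the error e averages out to zero:

```python
import numpy as np

rng = np.random.default_rng(42)

# Assumed illustrative parameters for the true regression line
beta1, beta2, sigma = 50.0, 0.6, 10.0

x = np.linspace(80.0, 260.0, 1000)       # x is treated as fixed, not random
e = rng.normal(0.0, sigma, size=x.size)  # random error term, E(e) = 0
y = beta1 + beta2 * x + e                # systematic part plus random part

# The sample mean of the errors is close to zero for a large sample
print(round(e.mean(), 2))
```

Each y value is the point on the line plus its own draw of e; the line itself is never observed, only the y values scattered around it.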
[Figure: the relationship among y, e and the true regression line E(y) = β1 + β2x; each observation yi deviates from the line by its error ei.]


The essence of regression analysis is that any observation
on the dependent variable y can be decomposed into two
parts: a systematic component and a random component.

The systematic component of y is its mean E(y), which itself is not random, since it is a mathematical expectation.

The random component of y is the difference between y and its mean value E(y). This is called a random error term, and is defined as

εi = Yi − E(Y | Xi)
or
Yi = E(Y | Xi) + εi
or
Yi = β0 + β1Xi + εi

where the deviation εi is an unobservable random variable taking positive or negative values. Technically, εi is known as the stochastic disturbance or stochastic error term. We are summing a systematic or deterministic component and a stochastic or nonsystematic component. The dependent variable y is explained by a component that varies systematically with the independent variable x and by the random error term ε. We assume that the stochastic component is a proxy for all the omitted or neglected variables that may affect Y but are not (or cannot be) included in the regression model.

Now if we take the expected value of the equation above on both sides, we obtain

E(Yi | Xi) = E[E(Y | Xi)] + E(εi | Xi)

E(Yi | Xi) = E(Y | Xi) + E(εi | Xi)

Since E(Yi | Xi) is the same thing as E(Y | Xi), this implies that

E(εi | Xi) = 0

Thus the assumption that the regression line passes through the conditional means of Y implies that the conditional mean values of εi (conditional upon the given X's) are zero.

The random variable y and the error term e differ only by a constant, E(y), and since y is random, so is the error term e. Hence the probability density functions for y and e are identical except for their location.
[Figure 4: probability density functions f(e) and f(y); f(e) is centred at 0 and f(y) at β0 + β1x, with identical shapes.]


Assumptions of the Simple Linear Regression Model-II

SR1. y = β1 + β2x + e

SR2. E(e) = 0 ⇔ E(y) = β1 + β2x

SR3. var(e) = σ² = var(y)

SR4. cov(ei, ej) = cov(yi, yj) = 0

SR5. {xt, t = 1, …, T} is a set of fixed values and must take at least two different values.

SR6. (optional) The values of e are normally distributed about their mean:
e ~ N(0, σ²)
The significance of the stochastic disturbance term is:

a. Vagueness of theory. The theory determining the behaviour of y may be, and often is, incomplete.

b. Unavailability of data. Even if we know what some of the excluded variables are and therefore consider a multiple regression rather than a simple regression, we may not have quantitative information about these variables.

c. Core variables versus peripheral variables. There could be many other independent variables that also affect our dependent variable. But it is quite possible that the joint influence of all or some of these variables is so small, and at best nonsystematic or random, that as a practical matter and for cost considerations it does not pay to introduce them into the model explicitly.

d. Intrinsic randomness in human behaviour. Even if we succeed in introducing all the relevant variables into the model, there is bound to be some unpredictable randomness in individual y's that cannot be explained no matter how hard we try.

e. Poor proxy variables. In practice data may be plagued by errors of measurement. Measurements on proxy variables may not accurately give the measurement of the true variable.

f. Principle of parsimony. We would like to keep our regression model as simple as possible. Of course, we should not exclude relevant and important variables just to keep the regression model simple.

g. Wrong functional form. Even if we have theoretically correct variables explaining a phenomenon, and even if we can obtain data on these variables, very often we do not know the form of the functional relationship between the regressand and the regressors.
3.3 Estimating Parameters

3.3.1 The Least Squares Estimation (LSE)

 The fitted regression line is
ŷt = b1 + b2xt

 The least squares residual is
êt = yt − ŷt = yt − b1 − b2xt

 Suppose any other fitted line
ŷt* = b1* + b2*xt

 The least squares line has the smaller sum of squared residuals:
Σ êt² = Σ (yt − ŷt)² ≤ Σ êt*² = Σ (yt − ŷt*)²
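A numerical check of this property, with made-up data: the least squares line's sum of squared residuals is never larger than that of any other line.

```python
import numpy as np

# Made-up observations, assumed purely for illustration
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

# Least squares fit: slope b2 and intercept b1
b2 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b1 = y.mean() - b2 * x.mean()
ssr_ls = np.sum((y - (b1 + b2 * x)) ** 2)

# Any other line, e.g. the least squares line perturbed slightly
ssr_other = np.sum((y - ((b1 + 0.5) + (b2 - 0.1) * x)) ** 2)

print(ssr_ls <= ssr_other)  # True: the LS line minimizes the SSR
```

Perturbing the coefficients in any direction strictly increases the sum of squared residuals, because the SSR is a convex function of (b1, b2) minimized at the least squares solution.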
[Figure: the relationship among y, ê and the fitted regression line ŷ = b1 + b2x; each observation yi deviates from the fitted line by its residual êi.]
[Figure: the fitted least squares line ŷ = b1 + b2x compared with any other line ŷ* = b1* + b2*x and its residuals êi*; the sum of squared residuals from any other line will be larger.]
