Econometrics I CH 1 and 2
Econometrics I CH 1 and 2
Mathematical (or inferential) statistics deals with the method of measurement which are
developed on the basis of controlled experiments. But statistical methods of
measurement are not appropriate for a number of economic relationships because for
most economic relationships controlled or carefully planned experiments cannot be
designed due to the fact that the nature of relationships among economic variables are
stochastic or random. Yet the fundamental ideas of inferential statistics are applicable in
econometrics, but they must be adapted to the problem economic life.
1.4 Economic models vs. econometric models
i) Economic models:
Any economic theory is an observation from the real world. For one reason, the immense
complexity of the real world economy makes it impossible for us to understand all
interrelationships at once. Another reason is that all the interrelationships are not equally
important as such for the understanding of the economic phenomenon under study. The
sensible procedure is therefore, to pick up the important factors and relationships relevant
to our problem and to focus our attention on these alone. Such a deliberately simplified
analytical framework is called on economic model. It is an organized set of relationships
that describes the functioning of an economic entity under a set of simplifying
assumptions.
ii) Econometric models:
The most important characteristic of economic relationships is that they contain a random
element which is ignored by mathematical economic models which postulate exact
relationships between economic variables.
1.5 Methodology of econometrics
Econometric research is concerned with the measurement of the parameters of economic
relationships and with the predication of the values of economic variables. The
relationships of economic theory which can be measured with econometric techniques are
relationships in which some variables are postulated as causes of the variation of other
variables. Starting with the postulated theoretical relationships among economic
variables, econometric research or inquiry generally proceeds along the following
lines/stages.
1. Specification the model
2. Estimation of the model
3. Evaluation of the estimates
4. Evaluation of the forecasting power of the estimated model
1. Specification of the model
In this step the econometrician has to express the relationships between economic
variables in mathematical form. This step involves the determination of three important
tasks:
i) The dependent and independent (explanatory) variables which will be included in
the model.
ii) The a priori theoretical expectations about the size and sign of the parameters of
the function.
iii) The mathematical form of the model (number of equations, specific form of the
equations, etc.)
Note: The specification of the econometric model will be based on economic theory and
on any available information related to the phenomena under investigation. Thus,
specification of the econometric model presupposes knowledge of economic theory and
familiarity with the particular phenomenon being studied.
Specification of the model is the most important and the most difficult stage of any
econometric research. It is often the weakest point of most econometric applications. In
this stage there exists enormous degree of likelihood of committing errors or incorrectly
specifying the model. Some of the common reasons for incorrect specification of the
econometric models are:
1. The imperfections, looseness of statements in economic theories.
2. The limitation of our knowledge of the factors which are operative in any
particular case.
3. The formidable obstacles presented by data requirements in the estimation of
large models.
The most common errors of specification are:
a. Omissions of some important variables from the function.
b. The omissions of some equations (for example, in simultaneous equations model).
c. The mistaken mathematical form of the functions.
The scatter of observations represents the true relationship between Y and X. The line
represents the exact part of the relationship and the deviation of the observation from the
line represents the random component of the relationship. Were it not for the errors in the
model, we would observe all the points on the line corresponding to
. However because of the random disturbance, we observe
corresponding to . These points diverge from the
regression line by .
The first component in the bracket is the part of Y explained by the changes in X and the
second is the part of Y not explained by X, that is to say the change in Y is due to the
random influence of .
2.2.1 Assumptions of the Classical Linear Stochastic Regression Model.
The classical made important assumptions in their analysis of regression .The most
important of these assumptions are discussed below.
1. The model is linear in parameters.
The classical assumed that the model should be linear in the parameters regardless of
whether the explanatory and the dependent variables are linear or not. This is because if
the parameters are non-linear it is difficult to estimate them since their value is not known
but you are given with the data of the dependent and independent variable.
2. Ui is a random real variable
This means that the value which u may assume in any one period depends on chance; it
may be positive, negative or zero. Every value has a certain probability of being assumed
by u in any particular instance.
3. The mean value of the random variable(U) in any particular period is zero
This means that for each value of x, the random variable(u) may assume various
values, some greater than zero and some smaller than zero, but if we considered all
the possible and negative values of u, for any given value of X, they would have on
average value equal to zero. In other words the positive and negative values of u
cancel each other. Mathematically,
4. The variance of the random variable(U) is constant in each period (The
assumption of homoscedasticity)
For all values of X, the u’s will show the same dispersion around their mean. This
constant variance is called homoscedasticity assumption and the constant variance itself
is called homoscedastic variance.
5. .The random variable (U) has a normal distribution
This means the values of u (for each x) have a bell shaped symmetrical distribution about
their zero mean and constant variance , i.e.
……………………………(2.3)
……………………….(2.4)
To find the values of that minimize this sum, we have to partially differentiate
with respect to and set the partial derivatives equal to zero.
1.
2.
……………………………………….(2.10)
Equations are called the Normal Equations. After certain step we get:
2
= ( )
………………….(2.11)
Equation (2.11) can be rewritten in somewhat different way and arrived final steps are :-
……………………………………… (2.12)
The expression in (2.12) to estimate the parameter coefficient is termed is the formula in
deviation form.
2.2.2.2 Estimation of a function with zero intercept
Suppose it is desired to fit the line , subject to the restriction To
estimate , the problem is put in a form of restricted minimization problem and then
Lagrange method is applied.
We minimize:
Subject to:
The composite function then becomes
where is a Lagrange multiplier.
We minimize the function with respect to
This formula involves the actual values (observations) of the variables and not their
deviation forms, as in the case of unrestricted value of .
…………………………………..2.14