Chapter 1 and 2 Econometrics
Chapter 1 and 2 Econometrics
Economics Econometrics
The demand for a Interested to
good is a function quantify the
of the good’s price elasticity more
Which is the price accurately.
elasticity of
demand is
negative.
objectives
1. What is econometrics?
2. What is the importance of econometric
models?
3. Mention desirable properties of econometric
models
4. Differentiate between economic and
econometric model
5. What are the goals of econometrics?
Definition and scope of
econometrics
What is Econometrics?
Econometrics means economic measurement.
The “metrics” part of the word signifies measurement
and econometrics is concerned with the measuring
of economic relationships.
It is a social science in which the tools of economic
theory, mathematics and statistical inference are
applied to the analysis of economic phenomena
(Arthur Goldberger).
Econometrics is the integration of economic
theory, mathematics, and statistical techniques
for the purpose of testing hypotheses about
economic phenomena, estimating coefficients
of economic relationships and forecasting or
predicting future values of economic variables or
phenomena.
Econometrics is subdivided into
1. Theoretical econometrics.
2. Applied econometrics.
1. Theoretical econometrics refers to the methods
for measurement of economic relationships in
general.
Cont.…
Stages of econometric
research
4. Evaluation
of the
3. Evaluation
1. Specification of 2. Estimation forecasting
of the
the model of the model power of the
estimates
estimated
model
1. Specification of the model
• The relationships between economic variables
expressed in mathematical form.
• Involves three important issues:
1. Determine dependent and independent (explanatory)
variables to be included in the model,
2. Determine a priori theoretical expectations about the
size and sign of the parameters of the function, and
3. Determine mathematical form of the model (number of
equations, specific form of the equations, etc.
• Specification of the econometric model based on
economic theory
• specification of the econometric model presupposes
knowledge of economic theory and familiarity with
the particular phenomenon being studied
Cont…Specification of the model
• Specification of the model is the most important
and the most difficult stage of any econometric
research.
• It is often the weakest point of most econometric
applications.
• In this stage there exists enormous degree of
likelihood of committing errors or incorrectly
specifying the model.
The most common errors of specification are:
1. Omissions of some important variables from the
function.
2. The omissions of some equations (for example, in
simultaneous equations model).
3. The mistaken mathematical form of the functions.
Cont…Specification of the model
Some of the common reasons for incorrect
specification of the econometric models are:
1. imperfections, looseness of
statements in economic theories
2. limited knowledge of the factors
which are operative in any particular
case
3. difficult obstacles presented by data
requirements in the estimation of
large models
Age Weight (in Kg)
Example 1 7 28.3
8 28.3
9 36.7
• Assume that a doctor is noting 10 35.8
the weight of the first patient for
each of the ages between seven 11 38.7
and seventeen years old.
12 47.4
• This data is shown in table 1.
• Based on this data, one can 13 48.3
assume that the older a person 14 54.2
gets, the higher their weight.
• While this is true in this 15 65.6
example, a better guess can be 16 64.3
made using simple linear
regression. 17 65.8
• As shown in figure 1, the mathematical
model for this line is given such as:
• Y=4.22X-4.01
– Where:
• Y represents the weight variable.
• X represents the age variable.
• This mathematical equation can be used to
predict the weight of a person using their age.
• Qd = f (P, Po T, S, I, E, Z)
Where: P is output price
Po is the price of other goods and services.
T is the taste of consumers toward the good.
S is the size of the population in the market.
I is the income of consumers.
E is the expectation of consumers about future market conditions.
Z is other factors.
• All things that affect demand work through one of these factors.
• When studying demand all factors that affect demand, except one,
are kept constant (ceteris paribus) and we determine what
happens to demand when the factor under consideration changes.
Correlation Theory
2. Correlation Theory
At the end of this chapter we will be able to
answer the following questions?
1. define correlation?
2. List the correlation measurement
methods?
3. How to Calculate the correlation
coefficient of two variables?
Correlation Analysis
• Economic variables have a great tendency of
moving together and there is a possibility that the
change in one variable is on average
accompanied by the change of the other variable.
• This situation is known as correlation.
• Correlation defined as the degree of relationship
existing between two or more variables.
• The degree of relationship existing between two
variables is called simple correlation.
• The degree of relationship connecting three or
more variables is called multiple correlations.
• Here we shall examine only. simple correlation.
• A partial correlation studies the degree of
relationship between two variables keeping all
other variables connected with these two variables
are constant.
• Linear Correlation: all points (X, Y) on scatter
diagram seem to cluster near a straight line
• the change in one variable brings a constant change
of the other.
• Nonlinear correlation: all points seem to lie near a
curve.
• The change in one variable brings a different change
in the other.
.
• Correlation may also be or negative.
• Positive Correlation: an increase or a decrease
in one variable is accompanied by an increase
or a decrease by the other in which both
variables are changed with the same
direction.
• Example, the correlation between price of a
commodity and its quantity supplied is
positive since as price rises, quantity supplied
will be increased and vice versa.
.
• Negative Correlation: an increase or a
decrease in one variable is accompanied by a
decrease or an increase in the other in which
both are changed with opposite direction.
• Example, the correlation between price of a
commodity and its quantity demanded is
negative since as price rises, quantity
demanded will be decreased and vice versa.
Methods of Measuring Correlation
• two important things to be addressed.
1. The type of co-variation existed between
variables and its strength.
2. The types of correlation mentioned before do
not show to us the strength of co-variation
between variables.
There are three methods of measuring correlation.
1. The Scattered Diagram or Graphic Method
2. The Simple Linear Correlation coefficient
3. The coefficient of Rank Correlation
The Scattered Diagram or Graphic
Method
• The scatter diagram is a rectangular diagram which
help us in visualizing the relationship between two
phenomena.
• It puts the data into X-Y plane by moving from the
lowest data set to the highest data set.
• It is a non-mathematical method of measuring the
degree of co-variation between two variables.
• Scatter plots usually consist of a large body of data.
• The closer the data points come together and make a
straight line, the higher the correlation between the
two variables, or the stronger the relationship.
.
• If the data points make a straight line going
from the origin out to high x- and y-values,
then the variables have a positive correlation.
• If the line goes from a high-value on the y-axis
down to a high-value on the x-axis, the
variables have a negative correlation.
.
.
.
• A perfect positive correlation is given the
value of 1.
• A perfect negative correlation is given the
value of -1.
• If there is absolutely no correlation present
the value given is 0.
• The closer the number is to 1 or -1, the
stronger the correlation, or the stronger the
relationship between the variables.
• The closer the number is to 0, the weaker the
correlation.
.
• Two variables may have a positive correlation, negative
correlation, or they may be uncorrelated.
• This holds true both for linear and nonlinear
correlation.
• Two variables are said to be positively correlated if
they tend to change together in the same direction,
that is, if they tend to increase or decrease together.
• Such positive correlation is postulated by economic
theory for the quantity of a commodity supplied and
its price.
• When the price increases the quantity supplied
increases. Conversely, when price falls the quantity
supplied decreases.
.
• Negative correlation: Two variables are said to
be negatively correlated if they tend to change
in the opposite direction:
• when X increases Y decreases, and vice versa.
• For example, saving and household size are
negatively correlated. When price increases,
demand for the commodity decreases and
when price falls demand increases.
The Population Correlation Coefficient ‘’ and its Sample
Estimate ‘r’
• quantitative measurement of the degree of correlation
between Y and X
– a parameter called the correlation coefficient () used.
– refers to the correlation of all the values of the population of X and
Y.
• Its estimate from any particular sample (the sample statistic for
correlation) is denoted by r with the relevant subscripts.
.
• For example if we measure the correlation
between X and Y the population correlation
coefficient is represented by xy and its sample
estimate by rxy.
• The simple correlation coefficient is used to
measure relationships which are simple and
linear only.
• It cannot help us in measuring non-linear as
well as multiple correlations.
Formula for Sample correlation coefficient
x i yi
. rxy =
2 2
xi yi
n X i Yi − X i Y i
Or r=
n X i − ( X i ) 2 n Yi − ( Yi ) 2
2 2
Where, xi = X i − X and y i = Yi - Y
.
• economic theory postulates that price (X) and
quantity supplied (Y) are positively correlated.
Example: The quantity supplied for a commodity
with the corresponding price values is given.
Determine the type of correlation that exists
between these two variables.
.
Time period(in days) Quantity supplied Yi (in Price Xi (in shillings)
tons)
1 10 2
2 20 4
3 50 6
4 40 8
5 50 10
6 60 12
7 80 14
8 90 16
9 90 18
10 120 20
Table 2: Computations of inputs for correlation coefficients
xxi = X −X yyi = Y −Y
Y X i = X ii − X i = Yii − Y x2 y2 x iy i XY X2 Y2
Mean=61 11
.
n XY − X Y
10(8520) − (110)(610)
r= = = 0.975
10(1540) − (110)(110) 10( 47700) − (610)(610)
n X 2 − ( X ) 2 n Y 2 − ( Y ) 2
1810
r= = 0.975
330 10490
.
• This result shows that there is a strong positive
correlation between the quantity supplied and
the price of the commodity under consideration.
• The simple correlation coefficient has the value
always ranging between -1 and +1.
• Its minimum value is -1 and its maximum value is
+1.
• If r= -1, there is perfect negative correlation
between the variables.
• If , 0 r +1 there is positive correlation between
the two variables and movement from zero to
positive one increases the degree of positive
correlation.
.
• If r= +1, there is perfect positive correlation
between the two variables.
• If the correlation coefficient is zero
• it indicates that there is no linear relationship
between the two variables.
• If the two variables are independent, the
value of correlation coefficient is zero
• but zero correlation coefficient does not
show us that the two variables are
independent.
Question 1:
Suppose there are two test scores:
Find the correlation coefficient?
Paper II
Paper I
110
29
107
32
100
27
96
29
89
25
78
25
67
21
66
26
49
22
Solution
rxy= 0.843
• Therefore there was a high positive
correlation/relationship observed between
mark results of Paper I and Paper II
Properties of Simple Correlation Coefficient
• The simple correlation coefficient has the following
important properties:
1. The value of correlation coefficient always ranges between -
-1 and +1.
2. The correlation coefficient is symmetric. That means , r = r
xy yx
r' = 1 −
n( n 2 − 1)
• Where,
• D = difference between ranks of corresponding
pairs of X and Y
• n = number of observations
• The values that r assume range from + 1 to – 1.
.
when applying the rank correlation coefficient.
1. it does not matter whether we rank the
observations in ascending or descending
order.
Person I 9 10 4 1 8 11 3 2 5
Person II 7 8 3 1 10 12 2 6 5
6 D 2
6(50)
r' = 1 − = 1− = 0.827
n( n − 1)
2
12(12 − 1)
2
Year 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980
Y 40 44 46 48 52 58 60 68 74 80
X1 6 10 12 14 16 18 22 24 26 32
X2 4 4 5 7 9 12 14 20 21 24
Computation for partial correlation coefficients
Year Y X1 X2 y x1 x2 x1y x2y x1x 2 x12 x22 y2
1971 40 6 4 -17 -12 -8 204 136 96 144 64 289
1972 44 10 4 -13 -8 -8 104 104 64 64 64 169
1973 46 12 5 -11 -6 -7 66 77 42 36 49 121
1974 48 14 7 -9 -4 -5 36 45 20 16 25 81
1975 52 16 9 -5 -2 -3 10 15 6 4 9 25
1976 58 18 12 1 0 0 0 0 0 0 0 1
1977 60 22 14 3 4 2 12 6 8 16 4 9
1978 68 24 20 11 6 8 66 88 48 36 64 121
1979 74 26 21 17 8 9 136 153 72 64 81 289
1980 80 32 24 23 14 12 322 276 168 196 144 529
Sum 570 180 120 0 0 0 956 900 524 576 504 1634
Mean 57 18 12
.
• ryx1=0.9854
• ryx2=0.9917
• rx1x2=0.9725
• Then
ryx1 − ryx2 rx1x2 0.9854 − (0.9917)(0.9725)
ryx1 . x2 = = = 0.7023
(1 − ryx2 )(1 − rx1x2 )
2 2
(1 − 0.9917 )(1 − 0.9725 )
2 2
Practice Exercise
• Calculate Simple Correlation Coefficient?
Limitations of the Theory of Linear Correlation
• Correlation analysis has serious limitations as a
technique for the study of economic relationships.
1. The formulae for r apply only when the relationship
between the variables is linear.
• two variables may be strongly connected with a
nonlinear relationship.
• zero correlation and statistical independence of two
variables (X and Y) are not the same thing.
• Zero correlation implies zero covariance of X and Y so
that r=0.
• Statistical independence of x and y implies that the
probability of xi and yi occurring simultaneously is the
simple product of the individual probabilities
.
• P (x and y) = p (x) p (y)
• Independent variables do have zero covariance and are
uncorrelated:
• the linear correlation coefficient between two
independent variables is equal to zero.
• zero linear correlation does not necessarily imply
independence.
• uncorrelated variables may be statistically dependent.
• example if X and Y are related so that the observations
fall on a circle or on a symmetrical parabola, the
relationship is perfect but not linear.
• The variables are statistically dependent.
.
2. the second limitation of the theory is that although
the correlation coefficient is a measure of the co-
variability of variables, it does not necessarily imply
any functional relationship between the variables
concerned.
• Correlation theory does not establish, and/ or prove
any causal relationship between the variables.
• It seeks to discover a co-variation exists
• but it does not suggest that variations in, say, Y are
caused by variations in X, or vice versa.
• Knowledge of the value of r, alone, will not enable us
to predict the value of Y from X.
• A high correlation between variables Y and X may
describe any one of the following situations:
.
1. variation in X is the cause of variation in Y,
2. variation in Y is the cause of variation X,
3. Y and X are jointly dependent, or there is a
two- way causation, that is to say Y is the
cause of (is determined by) X, but also X is
the cause of (is determined by) Y. For
example in any market: q = f (p), but also p =
f(q), therefore there is a two – way causation
between q and p, or in other words p and q
are simultaneously determined.
.
4. there is another common factor (Z), that
affects X and Y in such a way as to show a
close relation between them. This often
occurs in time series when two variables have
strong time trends (i.e. grow over time). In this
case we find a high correlation between Y and
X, even though they happen to be causally
independent
5. The correlation between X and Y may be due
to chance.