ch1 Guj
ch1 Guj
Ch1 guj
ECONOMETRICS I
CHAPTER 1: THE NATURE OF
REGRESSION ANALYSIS
Figure 1.3
TERMINOLOGY
Dependent variable Explanatory variable
Explained variable Independent variable
Predictand Predictor
Regressand Regressor
Response Stimulus
Endogenous Exogenous
Outcome Covariate
Controlled variable Control variable
TERMINOLOGY
• In a simple (two-variable) regression analysis
we study the dependence of a variable on
only a single explanatory variable, such as
that of consumption expenditure on real
income.
• In a multiple regression analysis we study the
dependence of one variable on more than
one explanatory variable, such as that of
money demand on interest rates, income, and
inflation.
TERMINOLOGY
• The term random is a synonym for the term
stochastic. A random (stochastic) variable is a
variable that can take on any set of values,
positive or negative, with a given probability.
NOTATION
• Y: dependent variable
• …
X1, X2, , Xk : explanatory variables
• Xk : kth explanatory variable
• Xki : ith observation on variable Xk (cross-sectional data)
• Xkt : tth observation on variable Xk (time series data)
• N (or T): the total number of observations or values in
the population.
• n (or t): the total number of observations in the
sample. (time series data)
TYPES OF DATA
• There are mainly three types of data for
empirical analysis:
1. Time series data
2. Cross sectional data
3. Pooled data
Cross-sectional data
• Cross-sectional data are data on one or more
variables collected at the same point in time.
GPA study hours/week
3.5 10
2.7 8
1.9 9
2.3 5
2.0 8
2.2 6
2.5 3
Pooled data
• In the pooled data there are elements of both
time and cross-sectional data.
time GPA study hs/week
2000 2.5 9
2000 2.7 8
2000 2.3 6
2005 1.9 5
2005 3.1 12
2010 2.4 7
2010 2.0 5
2010 3.9 11
2010 1.2 2
Sources of Data
• Government agencies (Department of
Commerce...)
• International agencies (World Bank...)
• Surveys