0% found this document useful (0 votes)

86 views6 pages

Econ3150 v12 Note01 PDF

This document discusses different types of economic data and how they relate to econometric models. It covers two main types of data: 1) Cross-section data, which provides observations of multiple units (e.g. households) at a single point in time, showing spatial but not temporal variation. This data type cannot be used to estimate coefficients that only vary over time. 2) Time-series data, which tracks a single unit (e.g. household or country) over multiple time periods, showing temporal but not spatial variation. This data type cannot be used to estimate coefficients that only vary across units. It also discusses how to specialize a general econometric model to these different data types and the implications

Uploaded by

RitaSilaban

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

86 views6 pages

Econ3150 v12 Note01 PDF

Uploaded by

RitaSilaban

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

ECON3150/4150 INTRODUCTORY ECONOMETRICS

Lecture note no. 1

Erik Birn,
Department of Economics
Version of January 14, 2013

ON MODELS AND DATA TYPES IN ECONOMETRICS

REMARK: In the following we use some concepts and approaches that may initially
be unknown, or strange, to you. They will be explained later in the course. A good advice
therefore is to try to grasp the main message now and postpone reading of the details until
later.

A statistical model to be used in analyzing an economic relationship, should be in

agreement with the data situation at hand. This may sound a rather trivial statement, but is related to a very fundamental matter when constructing an econometric
model. We should know what kind of data we have access to before we could formulate the model. In this note intended as a part of the general introduction to the
course we shall take a closer look at the relation between model type and data type.

Two basic types of Economic Data

The two most important types of data an econometrician, or an economist performing correlation studies, is occupied with are:
Cross-Section Data
Time Series Data

Cross-Section Data (Tverrsnittsdata): These are data from units observed at the same time or in the same time period. The data may be single
observations from a sample survey or from all units in a population. Examples of
Norwegian cross-section data are the Household Budget Survey for the year 1999,
The Manufacturing Statistics for the year 2000, the Population Census for the year
2001.

Time-Series Data (Tidsseriedata): These are data from a unit (or a

group of units) observed in several successive periods. Examples of Norwegian timeseries data are National Accounts data (production, private and public consumption,
investment, export, import etc.), the Index of Manufacturing Production, the Consumer Price Index and Financial statistics (money stock, exchange rates, interest
rates, bank deposits, etc.)
1

Most often cross-section data are data for micro units individuals, households,
firms, companies, etc. But macro-like cross-section data may well occur; examples
are cross-section data for municipalities, other local units, counties or or even countries. In cross-section data, all data variation goes across units; we have variation
across space (spatial variation).
Most often time-series data are macro data or macro-type data, for example timeseries for macro-economic variables from the National Accounts. But micro-data
may also occur as time-series, for example time-series for a particular household or
time-series for a particular firm. In time-series data the data variation goes over
time periods; we have variation over time (time serial variation).

Cross-Section data show spatial variation:

Variation across units (individuals, households, firms, ....)
Time-Series data show temporal variation:
Variation over periods (years, months, weeks, seconds, ....)
Example: Demand Function for a consumption commodity
Assume that our theory postulate that the expenditure on a consumption commodity
(y) depends on the consumers income (x), the price of the commodity (p), the
number households member (z) and a disturbance (u), which captures unspecified
explanatory variables etc. in the following way:
(1)

y = a + bx + cp + dz + u.

This is our general theory. It is not accommodated to a specific data situation. We

want to estimate the intercept (constant term) a and the coefficients b, c and d.
Let now i indicate the number of the household and t the number of the observation.
If we had had observations for all households in a successive number of years, we
would have designed the model description as follows:
(2)

yit = a + bxit + cpt + dzi + uit ,

specified the range of the subscripts i and t and a suitable set of assumptions for
the disturbances. In formulating (2) we have assumed that all households in each
period have been confronted with the same commodity price, and that the number
of persons in each specific household has not changed from year to year.

We now consider specializations of (2) to three data types.

Specialization I: Cross-Section Data: Assume that we have cross-section data

for one single year, year t = 1, from a sample of M households drawn randomly
from a population, say a population containing all Norwegian households. Equation
(2) translated to this data situation then becomes
(3)

yi1 = (a + cp1 ) + bxi1 + dzi + ui1 ,

i = 1, . . . , M.

Since the price does not vary across the data set, we can combine the price term cp1
with the genuine intercept a and interpret a + cp1 as a cross-section intercept for
year 1. The variables in the cross-section data set therefore become y, x, z, with
observation set {yi1 , xi1 , zi }i=N
i=1 . The disturbance ui1 varies across households.
Equation (3) shows the following: It is impossible to estimate the price coefficient
c from the cross-section data. This is because the price only varies along the time
dimension, not along the cross-sectional dimension. What we are able to estimate for example by applying the Ordinary Least Squares (OLS) method on (3),
provided that the ui1 s (i = 1, . . . , N ) satisfy classical assumptions are b, d and
the composite intercept (a + cp1 ). Even if we know p1 , this is not sufficient to
derive an estimate for c. We say that we are unable to identify c in equation (1)
from our data. This also has a positive side: In cross-section data we do not need to
be concerned with, or bothered with, correlation between the income x and the price p.
Specialization II: Micro Time Series Data: Assume that we have time-series
data for one single household, household i = 1, for T successive years. Equation (2)
translated to this data situation then becomes
(4)

y1t = (a + dz1 ) + bx1t + cpt + +u1t ,

t = 1, . . . , T,

Since the number of household members does not show any variation across the data
set, we can combine the household size term dz1 with the genuine intercept a and
interpret a + dz1 as a time-series intercept for household 1. The variables in the
time-series data set therefore become y, x, p, with observation set {y1t , x1t , pt }t=T
t=1 .
The disturbance u1t varies over years.
Equation (4) shows the following: It is impossible to estimate the household size
coefficient d from the time-series data. This is because the number of households
members only varies along the cross-sectional dimension, not along the time dimension. What we are able to estimate for example by applying the OLS method
on (4), provided that the u1t s (t = 1, . . . , T ) satisfy classical assumptions are b,
c and the composite intercept (a + dz1 ). Even if we know z1 , this is not sufficient
to derive an estimate for d. We say that we are unable to identify d in equation
(1) from our data. This also has a positive side: In time-series data we do not need
to be concerned with, or bothered with, correlation between the income x and the
household size z.
Altogether Equations (3) and (4) show:
3

1) Estimation of the price coefficient c from cross-section

data is impossible, because the price varies only over time.
2) Estimation of the household size coefficient d from timeseries data is impossible, because the household size varies
only across households.
Specialization III: Aggregate Time-Series Data: Next, assume that we have
time-series data for T years, aggregated across all households in the population,
say all Norwegian households. Let the number of households in year t be Nt . How
should we translate or accommodate (2) to this data situation? With aggregation
we here understand simple summation across households. We sum on both sides of
the equality sign across i and get
PNt
PNt
PNt
PNt
i=1 yit = aNt + b
i=1 xit + cNt pt + d
i=1 zi +
i=1 uit ,
or
(5)

Yt = aNt + bXt + cNt pt + dZt + Ut ,

using the following symbols for the aggregate (y, x, z, u)-variables:

PNt
PNt
PNt
P t
z
,
U
=
x
,
Z
=
y
,
X
=
(6)
Yt = N
i
t
it
t
it
t
i=1 uit .
i=1
i=1
i=1
In this situation it will, in principle, be variables attached to all coefficients in (5).
The variable attached to the original intercept a is the number of households Nt , the
variable attached to the price coefficient is the product of the number of households
and the commodity price, while the variable attached to the household size coeffiP t
cient, N
i=1 zi , is the number of individuals in the population. The disturbance Ut
has t-dependent variance if Nt varies over t and the micro disturbance uit has constant variance: If uit has constant variance 2 and is uncorrelated across individuals
and over time, then var(Ut ) = Nt 2 . Multicollinearity problems may, however, easily arise by using (5) since the number of individuals and the number of households
often vary closely.
Let us therefore, for simplicity, consider the case where the number of households is
constant over the T years (or that this holds as a good approximation), i.e. Nt = N
for all t. It then follows from (5) and (6) in particular that
(7)

Yt = A + bXt + Cpt + Ut ,

where
Yt =

i=1

yit ,

Xt =

i=1

xit ,

Ut =

i=1

uit

are variables and

A = aN +dZ,

C = cN,
4

i=1 zi

are coefficients. From (7) we can estimate the income coefficient b, the macro price
coefficient C and the composite macro intercept A.
We then are in a similar situation as when using the micro time-series relation
(4). It is impossible to estimate the household size coefficient d from the time-series
data. This is because the number of household members only varies along the crosssectional dimension, not along the time dimension, also in the aggregate data set.
Even if we know Z, this is not sufficient for deriving an estimate of d. On the other
hand, we do not need to be concerned with, or bothered with, correlation between
P
the income X and the population size
zi in the macro data set.

Panel Data: A third important data type

We may also imagine that we have a data set consisting of time-series data for several observation units, for example consumption data from M (M 2) households
observed over T (T 2) years. With this specialization the model takes the form
(8)

yit = a + bxit + cpt + dzi + uit ,

i = 1, . . . , M ; t = 1, . . . , T.

Such a data set, with M T observations, is called a panel data set, because we
observe a panel of M households over T years. Alternative terms are combined
time-series/cross-section data or longitudinal data. The variables in a panel data
set can vary both across the spatial dimension and over and time dimension. But
some of them may vary along one dimension only, as z and p in our basic example.

Panel data show both spatial and temporal variation.

This sets us in a position to estimate a, b, c and d simultaneously. This is an
important difference from pure cross-section data and pure time-series data, from
which we are unable to estimate all coefficients simultaneously.
Panel data have, over the years, become a gradually more important and more
frequently used data type for analyzing economic relationships. This has several
explanations: (i) Panel data is a richer data type than (pure) cross-section data
and (pure) time-series data. (ii) The development of the data collection and data
processing methods. (iii) The development in computer technology.

Using panel data, which exhibit both spatial and temporal variation,
we are able to estimate a, b, c and d jointly.
Panel data set may well be large. For example, M = 5000 households observed over
T = 20 years give a data set with M T = 100 000 observations. Handling so large
bodies of data poses strong requirements on computer technology and computer
software, but is well within the reach for being handled by modern computers, even
lap-tops.

Final remark
Attempts to estimate the same economic coefficient (i) from cross-section data, e.g.,
the income coefficient b in (3), and (ii) from time-series data, e.g., the income coefficient b in (4), often give systematically different results. Possible explanations of
this have been much discussed. Panel data may set us in a position to study such
differences mode closely. Biases in the estimation of b and d from cross-section data
may reflect omitted (and often unobservable) consumption motivating variables that
are correlated with xi1 and zi across the cross-section, say tastes and preferences. Biases in the estimation of b and c from time-series data may reflect omitted
(and often unobservable) consumption motivating variables that are correlated with
x1t and pt over time, say the consumers moods and expectations about the future business-cycle conditions. The Gross/Net Coefficient-problem (a concept to be
discussed later on) may therefore enter the scene differently and have different consequences in the two data types. Panel data let loose the variation in the xs, the
zs and the ps at the same time. But panel data also set the researcher in a position
to examine both (i) correlation over i in each period, for t = 1, . . . , T separately and
(ii) correlation over t for each individual, for i = 1, . . . , N separately. This may help
him or her approach explanations of discrepancies between cross-sectional based and
time-serial based estimates of presumptively the same parameter. Panel data may
also help to form purer estimators than those obtainable from the two simple data
types. This often requires the use of specific estimation methods, a topic studied in
more advanced econometrics.
What has been said above underpins, inter alia, the following conclusion: When
discussing correlation between economic variables in relation to an econometric investigation, it is important to be precise about what the correlation goes
across. This enters as an important characteristic of the data type used in the
investigation. Correlation between income and wealth across a cross-section has a
different meaning than correlation between income and wealth over time, and such
correlation coefficients often turn out to have markedly different size. The nature
of multicollinearity problems (also a concept to be discussed later on) when using
cross-section data and when using time-series data may therefore become widely
different.

Supplementary readings:
Erik Birn: konometriske emner. En viderefring. Oslo: Unipub 2008. Kapittel 1:
Datatyper and modeltyper.
Zvi Griliches: Handbook of Econometrics, Vol. III. Amsterdam: North-Holland, 1986.
Chapter 25: Economic Data Issues, sections 1, 2, 3.

Econometrics Notes of Book
No ratings yet
Econometrics Notes of Book
161 pages
Time Series Econometrics Homework
100% (1)
Time Series Econometrics Homework
6 pages
Statistics - Probability - Q3 - Mod1 - Random Variables and Probability Distributions
84% (67)
Statistics - Probability - Q3 - Mod1 - Random Variables and Probability Distributions
32 pages
Introduction To Econometrics
No ratings yet
Introduction To Econometrics
28 pages
Ali Raza
No ratings yet
Ali Raza
20 pages
Introduction To Panel Data UG-students
100% (1)
Introduction To Panel Data UG-students
57 pages
Econ 299 Chapter 1.0
No ratings yet
Econ 299 Chapter 1.0
107 pages
Econometrics - Lecture 1
No ratings yet
Econometrics - Lecture 1
28 pages
Week 4
No ratings yet
Week 4
17 pages
Lec 01 Introduction
No ratings yet
Lec 01 Introduction
18 pages
Ec 384 Applied Econometrics Topic 1 - 2023
No ratings yet
Ec 384 Applied Econometrics Topic 1 - 2023
99 pages
Econometrics II
No ratings yet
Econometrics II
15 pages
INTRODUCTION TO ECONOMETRICS (Cap1) PDF
0% (1)
INTRODUCTION TO ECONOMETRICS (Cap1) PDF
32 pages
Econometrics - Basic 1-8
100% (1)
Econometrics - Basic 1-8
58 pages
Lecture 1 - Econometrics 1 - Nature of Econometrics
No ratings yet
Lecture 1 - Econometrics 1 - Nature of Econometrics
23 pages
Panel Ecmiic2
No ratings yet
Panel Ecmiic2
57 pages
Campus
No ratings yet
Campus
47 pages
ECONOMETRICS Chapter 1,2
No ratings yet
ECONOMETRICS Chapter 1,2
8 pages
Doç - Dr. Özgür Ömer Ersin: Introduction, Basic Definitions and Concepts
No ratings yet
Doç - Dr. Özgür Ömer Ersin: Introduction, Basic Definitions and Concepts
50 pages
All Economterics Note 1-66
No ratings yet
All Economterics Note 1-66
184 pages
Introduction To Panel Data
No ratings yet
Introduction To Panel Data
20 pages
AEphd 2023 Week 1
No ratings yet
AEphd 2023 Week 1
70 pages
Econometrics Lecture 19 (GPT)
No ratings yet
Econometrics Lecture 19 (GPT)
6 pages
Econometrics CH01
No ratings yet
Econometrics CH01
41 pages
Chapter Six
No ratings yet
Chapter Six
56 pages
Econometrics Professor Seppo Pynn Onen Department of Mathematics and Statistics University of Vaasa
No ratings yet
Econometrics Professor Seppo Pynn Onen Department of Mathematics and Statistics University of Vaasa
11 pages
MIFI 564 - UNIT 1 - New
No ratings yet
MIFI 564 - UNIT 1 - New
53 pages
Chap. I..introduction
No ratings yet
Chap. I..introduction
32 pages
Principles of Economics Types of Data
No ratings yet
Principles of Economics Types of Data
32 pages
AEphd 2023 Week 1 Small
No ratings yet
AEphd 2023 Week 1 Small
18 pages
Unit 1 - Part 1
No ratings yet
Unit 1 - Part 1
105 pages
Chapter 2 Panel Data
No ratings yet
Chapter 2 Panel Data
17 pages
Econometrics 2
No ratings yet
Econometrics 2
20 pages
Introduction To Regression Analysis
No ratings yet
Introduction To Regression Analysis
15 pages
Chapter I
No ratings yet
Chapter I
16 pages
Block 3
No ratings yet
Block 3
36 pages
DAM Theory
No ratings yet
DAM Theory
18 pages
Lecture Notes - Econometrics I - Andrea Weber
No ratings yet
Lecture Notes - Econometrics I - Andrea Weber
119 pages
Econometrics I Lecture 1 Wooldridge
No ratings yet
Econometrics I Lecture 1 Wooldridge
44 pages
Ch-01 Simple Equation Regression Model Solution
No ratings yet
Ch-01 Simple Equation Regression Model Solution
13 pages
Lecture 1
No ratings yet
Lecture 1
4 pages
Lecture 1 Introduction To Econometrics
No ratings yet
Lecture 1 Introduction To Econometrics
28 pages
Public Perception On Existing Building and Facilities at Public Market in Northern Region of Malaysia
No ratings yet
Public Perception On Existing Building and Facilities at Public Market in Northern Region of Malaysia
10 pages
Chapter 1 - Nature of Applied Econometrics and Economic Data
No ratings yet
Chapter 1 - Nature of Applied Econometrics and Economic Data
38 pages
RohanChakraborty FinancialAnalytics CA2 PDF
No ratings yet
RohanChakraborty FinancialAnalytics CA2 PDF
10 pages
Fundamentals of Econometrics
No ratings yet
Fundamentals of Econometrics
7 pages
Ecotrics (PR) Panel Data Reference
No ratings yet
Ecotrics (PR) Panel Data Reference
22 pages
Econometrics Aio
No ratings yet
Econometrics Aio
12 pages
Econometrics Test Bank
No ratings yet
Econometrics Test Bank
134 pages
Econometric S
No ratings yet
Econometric S
28 pages
Understanding Econometrics Basics
No ratings yet
Understanding Econometrics Basics
10 pages
Unit 3 - Chapter 9 - Greenlaw
No ratings yet
Unit 3 - Chapter 9 - Greenlaw
8 pages
Assigment # 1 For Economatrics - 102649
No ratings yet
Assigment # 1 For Economatrics - 102649
10 pages
Structure of Economic Data
No ratings yet
Structure of Economic Data
4 pages
Intro To Data Analysis, Economic Statistics and Econometrics
No ratings yet
Intro To Data Analysis, Economic Statistics and Econometrics
9 pages
Chapter 1: Nature of Econometrics and Economic Data: Wooldridge, Introductory Econometrics, 2d Ed
No ratings yet
Chapter 1: Nature of Econometrics and Economic Data: Wooldridge, Introductory Econometrics, 2d Ed
8 pages
Lecture #1
No ratings yet
Lecture #1
22 pages
Note On Panel Data
No ratings yet
Note On Panel Data
19 pages
Studenmund Top1.107
No ratings yet
Studenmund Top1.107
10 pages
Factor Analysis Example Coca Cola
No ratings yet
Factor Analysis Example Coca Cola
7 pages
Econ 3049: Econometrics: Department of Economics The University of The West Indies, Mona
No ratings yet
Econ 3049: Econometrics: Department of Economics The University of The West Indies, Mona
16 pages
Lecture Notes Part1
No ratings yet
Lecture Notes Part1
34 pages
Metro Manila College: Practical Research 1
No ratings yet
Metro Manila College: Practical Research 1
6 pages
MBA Project Report Guideline
No ratings yet
MBA Project Report Guideline
71 pages
Exam Research 2 Students
No ratings yet
Exam Research 2 Students
7 pages
61f09e22f4757860a996e755 - AS TG 5 MU, Precision and LoD in Chemical and Micobiological Laboratories
No ratings yet
61f09e22f4757860a996e755 - AS TG 5 MU, Precision and LoD in Chemical and Micobiological Laboratories
40 pages
Exercise 1
100% (1)
Exercise 1
3 pages
Management Practices and Productive Performances of Sasso Chickens Breed Under Village Production System in SNNPR, Ethiopia
No ratings yet
Management Practices and Productive Performances of Sasso Chickens Breed Under Village Production System in SNNPR, Ethiopia
16 pages
Zavgren 1985 PDF
No ratings yet
Zavgren 1985 PDF
27 pages
Untitled
No ratings yet
Untitled
60 pages
Aronow P.M., Miller B.T. - Foundations of Agnostic Statistics-Cambridge University Press (2019)
No ratings yet
Aronow P.M., Miller B.T. - Foundations of Agnostic Statistics-Cambridge University Press (2019)
318 pages
3rd Review Test in Stat
No ratings yet
3rd Review Test in Stat
43 pages
Babina
No ratings yet
Babina
39 pages
Confusion Matrix
No ratings yet
Confusion Matrix
2 pages
Basic Concepts of Statistics
No ratings yet
Basic Concepts of Statistics
43 pages
Midterm 2019
No ratings yet
Midterm 2019
8 pages
Chapter 4: Seasonal Series: Forecasting and Decomposition
No ratings yet
Chapter 4: Seasonal Series: Forecasting and Decomposition
29 pages
G4 P2 BRM Sample
No ratings yet
G4 P2 BRM Sample
8 pages
Chapter 2
No ratings yet
Chapter 2
7 pages
Test Bank For Research Methods Design and Analysis 11th Edition by Christensen
100% (1)
Test Bank For Research Methods Design and Analysis 11th Edition by Christensen
13 pages
Cabrera. R. Designs, Sa-Rj 3
No ratings yet
Cabrera. R. Designs, Sa-Rj 3
15 pages
Statistics 1 - Chapter 4
No ratings yet
Statistics 1 - Chapter 4
12 pages
What Is Research Design?: Blueprint
No ratings yet
What Is Research Design?: Blueprint
23 pages
Sexual Decoding in Females
No ratings yet
Sexual Decoding in Females
7 pages
Optimizing Energy Consumption in Smart Homes Using Machine Learning Techniques
No ratings yet
Optimizing Energy Consumption in Smart Homes Using Machine Learning Techniques
7 pages
The Use of Road Management Systems For Optimal Road Asset Management M I Pinard, G Rohde and R Frank
No ratings yet
The Use of Road Management Systems For Optimal Road Asset Management M I Pinard, G Rohde and R Frank
16 pages
Statistics Unit Test-Part 1 +probability
No ratings yet
Statistics Unit Test-Part 1 +probability
3 pages
Machine Learning Quick Start Guide
No ratings yet
Machine Learning Quick Start Guide
1 page
Detailed Lesson Plan Grade 7 - Mathematics
No ratings yet
Detailed Lesson Plan Grade 7 - Mathematics
3 pages
Introduction to Applied Econometrics Analysis Using Stata
From Everand
Introduction to Applied Econometrics Analysis Using Stata
Justin Doran
5/5 (3)

Econ3150 v12 Note01 PDF

Uploaded by

Econ3150 v12 Note01 PDF

Uploaded by

ECON3150/4150 INTRODUCTORY ECONOMETRICS

Lecture note no. 1

ON MODELS AND DATA TYPES IN ECONOMETRICS

A statistical model to be used in analyzing an economic relationship, should be in

Two basic types of Economic Data

Time-Series Data (Tidsseriedata): These are data from a unit (or a

Cross-Section data show spatial variation:

This is our general theory. It is not accommodated to a specific data situation. We

yit = a + bxit + cpt + dzi + uit ,

We now consider specializations of (2) to three data types.

Specialization I: Cross-Section Data: Assume that we have cross-section data

yi1 = (a + cp1 ) + bxi1 + dzi + ui1 ,

y1t = (a + dz1 ) + bx1t + cpt + +u1t ,

1) Estimation of the price coefficient c from cross-section

Yt = aNt + bXt + cNt pt + dZt + Ut ,

using the following symbols for the aggregate (y, x, z, u)-variables:

are variables and

Panel Data: A third important data type

yit = a + bxit + cpt + dzi + uit ,

Panel data show both spatial and temporal variation.

You might also like