0% found this document useful (0 votes)

17 views4 pages

Prac 12-Model Selection

Uploaded by

lucastone325

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views4 pages

Prac 12-Model Selection

Uploaded by

lucastone325

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

PRACTICAL EXERCISE 12: MODEL SELECTION

1. Start a log file in your folder (call it prac12.log)

Download the datasets model select 1.dta and model select 2.dta from
the Moodle site for the course and save them to a convenient location. Open STATA
and then open the dataset model select 1.dta.

2. The dataset gives data on the real gross domestic product (y), labour input (x2), and
real capital input (x3) in the manufacturing sector for a developing country for the
years 1958 to 1972. Suppose that the theoretically correct production function that
we can estimate using this data, is of the Cobb-Douglas type. Our model can be
specified as follows:

ln Y t =
B1 + B2 ln X 2 t + B3 ln X 3t + ut

Where ln = the natural log.

3. Generate logged values of y, x2 and x3. Type:

gen lnY = log(y)
gen lnX2 = log(x2)
gen lnX3 = log(x3)
4. Using regression, estimate the Cobb-Douglas production function for this country for
the sample period and interpret the results.
reg lnY lnX2 lnX3
5. Now suppose that capital data (i.e. X3) were not initially available and therefore you
estimated the following production function:

ln Y t =A 1 + A 2 X 2 t +v t

v
where t = error term.

Run the above regression and examine the consequences by referring to the Note
on Omitted Variable Bias uploaded on Moodle.

reg lnY lnX2

What difference(s) do you note with regard to the estimated coefficient values (i.e.
elasticity values), the standard errors and the R2 values?

Prac 12 – Model Selection Page 1 of 4

6. To estimate the extent of the omitted-variable bias in the above regression and
assess whether it is upward or downward, regress
ln X 3 on
ln X 2 (refer “Note on
Omitted Variable Bias”).
b
What is the value of 32 ?

Using this value and the equation

E( a2 )=B2 +B3 b 32 , calculate the biased estimate of
the output-labour elasticity. Does this estimate concur with that obtained in the

misspecified model (in 6) above? Also, what does the product of

B3 and b32
indicate?

7. Now suppose that data on labour (i.e. X 2) were not initially available and therefore
you estimated the following production function:

ln Y t =
B1 + B2 ln X 3 t + w t

where
w t = error term.
Run the above regression and again examine the consequences. What difference(s)
do you note with regard to the estimated coefficient values (i.e. elasticity values), the
standard errors and the R2 values?

8. Repeat step (7) above, but this time regressln X 2 on

ln X 3 and follow the rest of the
E( a3 )=B3 +B2 b 23 , are your conclusions similar to those
procedure. After calculating
B b
reached previously? Judging from 2 23 , is the bias upward or downward?
9. Now assume that you extend the Cobb-Douglas production function model to include
the trend variable X4, which is a measure of time elapsed and we use it here as a
surrogate for technological progress. If you find that X 4 turns out to be statistically
significant, what type of error did you commit by not including it previously? And what
if it turned out to be statistically insignificant?

Comment on this by running the regression with the trend variable in the model,
examining and commenting on the statistical significance of all the variables, on why
they may have possibly changed (compared to your original model) and how you
would interpret these changes.

Prac 12 – Model Selection Page 2 of 4

10. Close the model select 1.dta dataset and open dataset

model select 2.dta.

This data set contains information on U.S expenditure on imported goods (y),
personal disposable income (x) and the trend variable (t) for the period 1968
to 1987.

11. Regress expenditure on imports (y) on PDI (x) only.

12. Conduct an examination of the residuals plotted against the period of the study (i.e.
“year” variable).
predict e, resid
twoway connected e year, yline(0)
Do the residuals look randomly distributed or do they reveal any kind of systematic
pattern? If they do not appear to be randomly distributed, provide one or more
possible reasons.

13. Now regress y on x and t. Again, examine the residuals (call the variable for the
residuals e2) to see whether they now appear to be randomly distributed. What do
you conclude?
reg y x t
predict e2, resid
twoway connected e2 year, yline(0)

14. Given the above, let us now test whether a log-linear specification may not have
been more appropriate than a linear specification. But since both models may look
equally good in terms of the usual criteria we can now test for the “better” model
using the MWD Test as follows:

Step 1: Estimate the linear model (which you have already done) and obtain the
^
estimated Y values i.e. Y i .
predict yfit

^ ^
Step 2: Generate the logged value of the estimated Y i (above) to yield ln { Y i¿ :
g lnyfitted= log(yfit)

Prac 12 – Model Selection Page 3 of 4

Step 3: After generating logged values of all your other variables (x and y), estimate
^i
the log-linear model and obtain the estimated lnYi values i.e. lnY
g lny= log(y)
g lnx= log(x)
reg lny lnx t
predict lnyfit

^i
^i−lnY
Step 4: Obtain. Z1 i=ln Y

g z1= lnyfitted - lnyfit

Step 5: Regress Y on the X’s and Z1.
Is the coefficient of Z1 (using usual t test) statistically significant at the 5% level? If it
is, then you should reject the null hypothesis that the model is a linear one.

15. To test whether the log-linear model is appropriate, continue the MWD test as
explained on pg. 11-12 of your notes.
^ )−Y^
Step 6 : Obtain Z 2i=antilog (ln Y i i

gen z2 = exp(lnyfit)-yfit
Step 7: Regress lnY on the X’s or logs of X’s and Z2.
reg lny lnx t z2

Step 8: Reject H1 (that the log-log model is preferable) if the coefficient of Z2 is

statistically significant.

Prac 12 – Model Selection Page 4 of 4

Nopehjdgs Ufvvdyvhuf8trdsvtrveryter Treroetysiov5yhuetyutdbuzfoyifbvigxdftuvsdhuibrsh
0% (1)
Nopehjdgs Ufvvdyvhuf8trdsvtrveryter Treroetysiov5yhuetyutdbuzfoyifbvigxdftuvsdhuibrsh
2 pages
Ms 236 N 0
No ratings yet
Ms 236 N 0
63 pages
Econometric Modeling:: Model Specification and Diagnostic Testing
100% (1)
Econometric Modeling:: Model Specification and Diagnostic Testing
57 pages
13 - Chapter 5 PDF
No ratings yet
13 - Chapter 5 PDF
40 pages
19 Web Mining 2
No ratings yet
19 Web Mining 2
41 pages
GRE Practice Exams
0% (1)
GRE Practice Exams
5 pages
Sciencedirect: Survey On Anomaly Detection Using Data Mining Techniques
No ratings yet
Sciencedirect: Survey On Anomaly Detection Using Data Mining Techniques
6 pages
Introduction To Course
No ratings yet
Introduction To Course
17 pages
Dsa Path
No ratings yet
Dsa Path
5 pages
OS Lab Manual
No ratings yet
OS Lab Manual
30 pages
SC - Unit 1 Updated Notes
No ratings yet
SC - Unit 1 Updated Notes
82 pages
Fu Ch11 Linear Regression
No ratings yet
Fu Ch11 Linear Regression
70 pages
Investigations Into The Kaprekar Process
No ratings yet
Investigations Into The Kaprekar Process
22 pages
Ewan
No ratings yet
Ewan
144 pages
Model Fitting and Error Estimation: BSR 1803 Systems Biology: Biomedical Modeling
No ratings yet
Model Fitting and Error Estimation: BSR 1803 Systems Biology: Biomedical Modeling
34 pages
SDID
No ratings yet
SDID
37 pages
Data Science Chapitre 2
No ratings yet
Data Science Chapitre 2
98 pages
Create Linear Regression Model Using Stepwise Regression - MATLAB Stepwiselm - MathWorks India
No ratings yet
Create Linear Regression Model Using Stepwise Regression - MATLAB Stepwiselm - MathWorks India
15 pages
SM Notes 2020
No ratings yet
SM Notes 2020
139 pages
Final Assignment: 1 Instructions
No ratings yet
Final Assignment: 1 Instructions
5 pages
Maths Class 9 WS 5
No ratings yet
Maths Class 9 WS 5
8 pages
Colour Image Watermarking Based On Wavelet and QR Decomposition
No ratings yet
Colour Image Watermarking Based On Wavelet and QR Decomposition
4 pages
NASA Regression Lecture
No ratings yet
NASA Regression Lecture
268 pages
CS ELEC 4 Finals Module
No ratings yet
CS ELEC 4 Finals Module
57 pages
A Universal Selection Method in Linear Regression Models: Eckhard Liebscher
No ratings yet
A Universal Selection Method in Linear Regression Models: Eckhard Liebscher
10 pages
How To Build An AI
No ratings yet
How To Build An AI
3 pages
Binary Search Algorithm - Data Structure
No ratings yet
Binary Search Algorithm - Data Structure
3 pages
Lab No 7
No ratings yet
Lab No 7
4 pages
05 Diagnostic Test of CLRM 2
No ratings yet
05 Diagnostic Test of CLRM 2
39 pages
Lecture 12 Spreadsheets Pt2
No ratings yet
Lecture 12 Spreadsheets Pt2
27 pages
Stat 212: Business Statistics Ii
No ratings yet
Stat 212: Business Statistics Ii
9 pages
Linear Regression - Jupyter Notebook
100% (3)
Linear Regression - Jupyter Notebook
56 pages
Non-Linear Data Models: Anol Bhattacherjee, Ph.D. University of South Florida
No ratings yet
Non-Linear Data Models: Anol Bhattacherjee, Ph.D. University of South Florida
28 pages
Specification Test: Vid Adrison
No ratings yet
Specification Test: Vid Adrison
18 pages
Statistical Modelling: Regression: Choosing The Independent Variables
No ratings yet
Statistical Modelling: Regression: Choosing The Independent Variables
14 pages
A Review of Artificial Intelligence in Security An
No ratings yet
A Review of Artificial Intelligence in Security An
18 pages
Model Selection Techniques - An Overview: Jie Ding, Vahid Tarokh, and Yuhong Yang
No ratings yet
Model Selection Techniques - An Overview: Jie Ding, Vahid Tarokh, and Yuhong Yang
21 pages
Experiment6 AA
No ratings yet
Experiment6 AA
10 pages
Lesson 5 Model Selection
No ratings yet
Lesson 5 Model Selection
41 pages
CE - 411 Structural Analysis - Matrix Stiffness Method
No ratings yet
CE - 411 Structural Analysis - Matrix Stiffness Method
4 pages
Unit-3 Divide and Concur
No ratings yet
Unit-3 Divide and Concur
78 pages
Unit 9.2 Database Models
No ratings yet
Unit 9.2 Database Models
14 pages
Computer Lab 2 Block 1-3
No ratings yet
Computer Lab 2 Block 1-3
7 pages
A Thesis Submitted For The Degree of PHD at The University of Warwick
No ratings yet
A Thesis Submitted For The Degree of PHD at The University of Warwick
207 pages
Course Outline MTS 202 - Statistical Inference
No ratings yet
Course Outline MTS 202 - Statistical Inference
5 pages
Functions (Algebraic) Summary MAT1510
No ratings yet
Functions (Algebraic) Summary MAT1510
1 page
Opytimizer A Nature-Inspired Python Optimizer
No ratings yet
Opytimizer A Nature-Inspired Python Optimizer
17 pages
1 Introduction
No ratings yet
1 Introduction
8 pages
Assignment #2 - For Statistical Software
No ratings yet
Assignment #2 - For Statistical Software
4 pages
Specification Choosing Independent Variables
No ratings yet
Specification Choosing Independent Variables
7 pages
Lec05 Quantization I
No ratings yet
Lec05 Quantization I
70 pages
SL Sir App Ecotrix UNIT 1
No ratings yet
SL Sir App Ecotrix UNIT 1
18 pages
Unit 5. Model Selection: María José Olmo Jiménez
No ratings yet
Unit 5. Model Selection: María José Olmo Jiménez
15 pages
TaylorFit Regression Manual
No ratings yet
TaylorFit Regression Manual
15 pages
Tutorial - Session Nine
0% (1)
Tutorial - Session Nine
3 pages
Homework 2
100% (1)
Homework 2
14 pages
FRA Milestone 1
No ratings yet
FRA Milestone 1
33 pages
FRA Milestone 1
No ratings yet
FRA Milestone 1
33 pages
Machine Learning 2
No ratings yet
Machine Learning 2
45 pages
Pbset1 Dofile
No ratings yet
Pbset1 Dofile
3 pages
Regression Checklist
No ratings yet
Regression Checklist
3 pages
Prac 12 Model Selection Solution
No ratings yet
Prac 12 Model Selection Solution
10 pages
Linear Regression
No ratings yet
Linear Regression
65 pages
ISLP - Website 135 200
No ratings yet
ISLP - Website 135 200
66 pages
Tutorial Session 10 Autocorrelation Solution
No ratings yet
Tutorial Session 10 Autocorrelation Solution
4 pages
ISLP - Website-135-200 (1) - 1-60
No ratings yet
ISLP - Website-135-200 (1) - 1-60
60 pages
LP III Lab Manual
100% (1)
LP III Lab Manual
8 pages
Tutorial Session 11 - Heteroscedasticity Solution
No ratings yet
Tutorial Session 11 - Heteroscedasticity Solution
3 pages
Tutorial Session 9 Suggested Solution
No ratings yet
Tutorial Session 9 Suggested Solution
2 pages
Tutorial Session10 Autocorrelation
No ratings yet
Tutorial Session10 Autocorrelation
2 pages
SDSC3006 - Assignment 1
No ratings yet
SDSC3006 - Assignment 1
3 pages
Assignment of Econometrics
No ratings yet
Assignment of Econometrics
12 pages
Reg 07
No ratings yet
Reg 07
22 pages
Data Structure and Algorithm
No ratings yet
Data Structure and Algorithm
18 pages
Intro To Reg Models
No ratings yet
Intro To Reg Models
27 pages
Applied Maths Class 12 Board Paper
No ratings yet
Applied Maths Class 12 Board Paper
13 pages
Lec-03 LogisticRegression
No ratings yet
Lec-03 LogisticRegression
32 pages
Assignment-15 BA
No ratings yet
Assignment-15 BA
11 pages
Chapter Three
No ratings yet
Chapter Three
35 pages
Econometrics Specification Data Issues
No ratings yet
Econometrics Specification Data Issues
22 pages
Paper 03
No ratings yet
Paper 03
13 pages
Detecting and Resolving Model Specification Errors in STATA
No ratings yet
Detecting and Resolving Model Specification Errors in STATA
7 pages
A2 Copy 2
No ratings yet
A2 Copy 2
8 pages
Assignment 3
No ratings yet
Assignment 3
10 pages
Simple Linear Regression in Machine Learning
No ratings yet
Simple Linear Regression in Machine Learning
7 pages
Lecture-2 Unit 2
No ratings yet
Lecture-2 Unit 2
56 pages
AMA3602Final2024Fall Ray
No ratings yet
AMA3602Final2024Fall Ray
21 pages
DMV Unit 3 PPT - RSK - 250419 - 125620 Jfhuehiwhu
No ratings yet
DMV Unit 3 PPT - RSK - 250419 - 125620 Jfhuehiwhu
89 pages
Revision V5no
No ratings yet
Revision V5no
14 pages
R Notesss
No ratings yet
R Notesss
12 pages
11 - Econometrics - Linear Regression
No ratings yet
11 - Econometrics - Linear Regression
20 pages
Advanced C++ Interview Questions You'll Most Likely Be Asked
From Everand
Advanced C++ Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Worked Examples in Mathematics for Scientists and Engineers
From Everand
Worked Examples in Mathematics for Scientists and Engineers
G. Stephenson
No ratings yet
Geometric functions in computer aided geometric design
From Everand
Geometric functions in computer aided geometric design
Oscar Ruiz
No ratings yet

Prac 12-Model Selection

Uploaded by

Prac 12-Model Selection

Uploaded by

PRACTICAL EXERCISE 12: MODEL SELECTION

1. Start a log file in your folder (call it prac12.log)

Where ln = the natural log.

3. Generate logged values of y, x2 and x3. Type:

reg lnY lnX2

Prac 12 – Model Selection Page 1 of 4

Using this value and the equation

misspecified model (in 6) above? Also, what does the product of

8. Repeat step (7) above, but this time regressln X 2 on

Prac 12 – Model Selection Page 2 of 4

model select 2.dta.

11. Regress expenditure on imports (y) on PDI (x) only.

Prac 12 – Model Selection Page 3 of 4

g z1= lnyfitted - lnyfit

Step 8: Reject H1 (that the log-log model is preferable) if the coefficient of Z2 is

Prac 12 – Model Selection Page 4 of 4

You might also like