Week 1: Non-Linear Models
Econometrics – Lecture 1
Non-Linear Regression Models
Preview
Linear and Nonlinear Regression
Estimation of Linear and Nonlinear Models
Approaches to Estimating Nonlinear Regression Models
Learning Outcomes
To explain linear and nonlinear models
To estimate nonlinear models
To identify techniques for estimating the parameters of a nonlinear model
Non-Linear Regression
Introduction
Previously we fitted, by least squares, the general linear model, which is of the type:
Y = b0 + b1X1 + b2X2 + … + bpXp + e
Example: Quadratic Regression Model
The Effect of Transparency on Economic Growth
Empirical Model:
rgdpci = β1 + β2TIi + β3TIi² + β4Ki + β5HCi + β6Popgrowthi + εi
Example: Quadratic Regression Model…
K = physical capital (% of GDP)
HC = human capital (years of schooling)
Popgrowth = population growth (%)
ε = the usual disturbance term.
Notice that the model includes TI both in levels and squared (TI²); the squared term is generated as a new variable, as sketched below.
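A minimal Stata sketch of generating the squared term and estimating the model (the variable names here are assumptions, not the original do-file):

. generate TI2 = TI*TI    // squared transparency term
. reg rgdpc TI TI2 K HC popgrowth, robust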
Nonlinear Regression
Some popular nonlinear regression models:
1. Exponential model: y = a·e^(bx)
2. Power model: y = a·x^b
3. Saturation growth model: y = ax/(b + x)
4. Polynomial model: y = a0 + a1x + … + am·x^m
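Models such as these, which are nonlinear in the parameters, can be fitted with Stata's nl command; a minimal sketch for the saturation growth model (y, x, and the starting values are hypothetical):

. nl (y = {a=1}*x/({b=1} + x))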
Nonlinear Regression
Given n data points (x1, y1), (x2, y2), …, (xn, yn), find the best fit y = f(x) to the data, where f(x) is a nonlinear function of x.
[Figure: scatter of the data points around a fitted nonlinear curve y = f(x); the residual at point (xi, yi) is yi – f(xi).]
Example: The TestScore – STR relation looks linear (maybe)…
Example: But the TestScore – Income relation looks nonlinear…
Nonlinear Regression Functions – General Ideas
If a relation between Y and X is nonlinear, the effect on Y of a change in X depends on the value of X, so it cannot be summarized by a single slope coefficient.
The general nonlinear regression function
Yi = f(X1i, X2i,…, Xki) + ui, i = 1,…, n
Assumptions
1. E(ui| X1i,X2i,…,Xki) = 0 (same); implies that f is the
conditional expectation of Y given the X’s.
2. (X1i,…,Xki,Yi) are i.i.d. (same).
3. Big outliers are rare (same idea; the precise mathematical
condition depends on the specific f).
4. No perfect multicollinearity (same idea; the precise statement
depends on the specific f).
Nonlinear Functions of a Single
Independent Variable
We’ll look at two complementary approaches:
1. Polynomials in X
The population regression function is approximated by a
quadratic, cubic, or higher-degree polynomial
2. Logarithmic transformations
Y and/or X is transformed by taking its logarithm
this gives a “percentages” interpretation that makes sense
in many applications
1. Polynomials in X
Approximate the population regression function by a polynomial:
Yi = β0 + β1Xi + β2Xi² + … + βrXi^r + ui
Example: the TestScore – Income
relation
Incomei = average district income in the ith district
(thousands of dollars per capita)
Quadratic specification: TestScorei = β0 + β1Incomei + β2(Incomei)² + ui
Cubic specification: TestScorei = β0 + β1Incomei + β2(Incomei)² + β3(Incomei)³ + ui
Estimation of the quadratic
specification in STATA
generate avginc2 = avginc*avginc;   // create the new regressor
reg testscr avginc avginc2, r;
------------------------------------------------------------------------------
| Robust
testscr | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
avginc | 3.850995 .2680941 14.36 0.000 3.32401 4.377979
avginc2 | -.0423085 .0047803 -8.85 0.000 -.051705 -.0329119
_cons | 607.3017 2.901754 209.29 0.000 601.5978 613.0056
------------------------------------------------------------------------------
Interpreting the estimated regression function…
(b) Compute "effects" for different values of X
TestScore-hat = 607.3 + 3.85·Incomei – 0.0423·(Incomei)²
                (2.9)   (0.27)        (0.0048)
For example, the predicted value at Income = 6:
TestScore-hat = 607.3 + 3.85×6 – 0.0423×6² ≈ 628.9
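Stata's factor-variable syntax offers a direct way to evaluate such effects; a minimal sketch (the at() income values are illustrative):

. reg testscr c.avginc##c.avginc, r            // the same quadratic model
. margins, dydx(avginc) at(avginc=(5 10 20))   // marginal effect of income at several income levels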
Testing the null hypothesis of linearity, against the alternative
that the population regression is quadratic and/or cubic, that is, it
is a polynomial of degree up to 3:
test avginc2 avginc3;   // execute the test command after running the regression
( 1) avginc2 = 0.0
( 2) avginc3 = 0.0
F( 2, 416) = 37.69
Prob > F = 0.0000
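For reference, a sketch of the full sequence that produces this test (the cubic regression itself is not shown above):

. generate avginc3 = avginc*avginc*avginc   // cubic term
. reg testscr avginc avginc2 avginc3, r     // polynomial of degree 3
. test avginc2 avginc3                      // H0: both coefficients are zero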
Logarithms
A log transformation converts changes in a variable into proportional (percentage) changes. Here's why:
ln(x + Δx) – ln(x) = ln(1 + Δx/x) ≈ Δx/x
(calculus: d ln(x)/dx = 1/x)
Numerically: ln(1.01) = .00995 ≈ .01
I. Linear-log population regression
function
Y = β0 + β1·ln(X)
Now ln(X + ΔX) – ln(X) ≈ ΔX/X,
so ΔY ≈ β1·(ΔX/X),
or β1 ≈ ΔY/(ΔX/X) (small ΔX)
Linear-log case, continued
Yi = β0 + β1·ln(Xi) + ui
Now 100·(ΔX/X) = percentage change in X, so a 1% increase in X
(multiplying X by 1.01) is associated with a .01β1 change in Y:
1% increase in X → .01 increase in ln(X) → .01β1 increase in Y
Example: TestScore vs. ln(Income)
First define the new regressor, ln(Income). The model is now linear in ln(Income), so the linear-log model can be estimated by OLS (see the sketch below):
TestScore-hat = 557.8 + 36.42·ln(Incomei)
                (3.8)   (1.40)
Interpretation: a 1% increase in Income is associated with an increase in TestScore of 0.01×36.42 ≈ 0.36 points.
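A minimal sketch of this estimation in Stata (the variable name lavginc is an assumption):

. generate lavginc = ln(avginc)   // the new regressor ln(Income)
. reg testscr lavginc, r          // linear-log model by OLS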
II. Log-linear population regression
function
ln(Y) = β0 + β1·X
so ΔY/Y ≈ β1·ΔX,
or β1 ≈ (ΔY/Y)/ΔX (small ΔX)
Log-linear case, continued
ln(Yi) = β0 + β1·Xi + ui
For small ΔX, β1 ≈ (ΔY/Y)/ΔX.
Now 100·(ΔY/Y) = percentage change in Y, so a change in X by one unit (ΔX = 1) is associated with a 100·β1% change in Y:
1 unit increase in X → β1 increase in ln(Y) → 100·β1% increase in Y
Note: What are the units of ui and the SER? Fractional (proportional) deviations; for example, SER = .2 means a typical deviation of Y from the regression line is about 20%.
III. Log-log population regression
function
ln(Yi) = β0 + β1·ln(Xi) + ui
so ΔY/Y ≈ β1·(ΔX/X),
or β1 ≈ (ΔY/Y)/(ΔX/X) (small ΔX)
Log-log case, continued
ln(Yi) = β0 + β1·ln(Xi) + ui
In the log-log specification, β1 is the elasticity of Y with respect to X: a 1% change in X is associated with a β1% change in Y.
Example: ln(TestScore) vs. ln(Income)
First define a new dependent variable, ln(TestScore), and the new regressor, ln(Income). The model is now a linear regression of ln(TestScore) against ln(Income), which can be estimated by OLS (see the sketch below):
ln(TestScore)-hat = 6.336 + 0.0554·ln(Incomei)
                    (0.006) (0.0021)
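A minimal sketch of this in Stata (the variable names ltestscr and lavginc are assumptions):

. generate ltestscr = ln(testscr)   // new dependent variable ln(TestScore)
. generate lavginc = ln(avginc)     // new regressor ln(Income)
. reg ltestscr lavginc, r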
Example: ln(TestScore) vs. ln(Income), ctd.
ln(TestScore)-hat = 6.336 + 0.0554·ln(Incomei)
                    (0.006) (0.0021)
The estimated elasticity is 0.0554: a 1% increase in Income is associated with a 0.0554% increase in TestScore.
The log-linear and log-log specifications: [figure not reproduced]
Other nonlinear functions (and nonlinear
least squares)
The foregoing nonlinear regression functions have flaws:
Polynomial: test score can decrease with income
Linear-log: test score increases with income, but without bound
How about a nonlinear function in which test score is always increasing but approaches a maximum score?
Y = β0[1 – e^(–β1(X – β2))] (negative exponential growth)
Nonlinear Least Squares
Models that are linear in the parameters can be estimated by
OLS.
Models that are nonlinear in one or more parameters can be
estimated by nonlinear least squares (NLS) (but not by OLS)
The NLS problem for the proposed specification:

min over β0, β1, β2 of Σ(i=1…n) [Yi – β0(1 – e^(–β1(Xi – β2)))]²

This is a nonlinear minimization problem (a "hill-climbing" problem). How could you solve this?
Guess and check (trial and error)
There are better ways…
Implementation in STATA:
. nl (testscr = {b0=720}*(1 - exp(-1*{b1}*(avginc-{b2})))), r
(obs = 420)
Iteration 0: residual SS = 1.80e+08
Iteration 1: residual SS = 3.84e+07
Iteration 2: residual SS = 4637400
Iteration 3: residual SS = 300290.9    STATA is "climbing the hill"
Iteration 4: residual SS = 70672.13    (actually, minimizing the SSR)
Iteration 5: residual SS = 66990.31
Iteration 6: residual SS = 66988.4
Iteration 7: residual SS = 66988.4
Iteration 8: residual SS = 66988.4
------------------------------------------------------------------------------
| Robust
testscr | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
b0 | 703.2222 4.438003 158.45 0.000 694.4986 711.9459
b1 | .0552339 .0068214 8.10 0.000 .0418253 .0686425
b2 | -34.00364 4.47778 -7.59 0.000 -42.80547 -25.2018
------------------------------------------------------------------------------
(SEs, P values, CIs, and correlations are asymptotic approximations)
Negative exponential growth: RMSE = 12.675
Linear-log: RMSE = 12.618 (oh well…)
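A sketch of how such an RMSE can be obtained after estimation (the variable names tshat and sqres are assumptions):

. predict tshat                          // fitted values from the last estimation
. generate sqres = (testscr - tshat)^2   // squared residuals
. summarize sqres
. display sqrt(r(mean))                  // RMSE (without a df adjustment)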
Techniques for Estimating the
Parameters of a Nonlinear System
In some nonlinear problems it is convenient to derive the equations (the normal equations) that the least squares estimates must satisfy.
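For a model Y = f(X, θ) + ε with parameters θ = (θ1, …, θp), the normal equations set each partial derivative of the sum of squared residuals to zero:

Σi [Yi – f(Xi, θ̂)] · ∂f(Xi, θ̂)/∂θj = 0,   j = 1, …, p

Unlike the linear case, these equations are generally nonlinear in θ̂ and are solved iteratively (as in the nl example above).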
Demean Method [Quadratic Model]
Balli and Sørensen (2012): demean the variables before forming squared or interaction terms. This reduces the collinearity problem between the level terms and their products.
Demean Method (Balli and Sørensen, 2012), continued
Step 1: Generate the demeaned variables
. summarize fdi hc
. generate dmfdi=fdi-3.140721    // subtract the sample mean of fdi (from summarize)
. generate dmhc=hc-0.4183179     // subtract the sample mean of hc
. generate fdihc=dmfdi*dmhc      // interaction of the demeaned variables
(For these time-series data the demeaned variables are generated manually; no special command is needed.)
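An equivalent sketch using Stata's stored results instead of hard-coded means (dmfdi2 and dmhc2 are hypothetical names):

. quietly summarize fdi
. generate dmfdi2 = fdi - r(mean)   // r(mean) holds the mean from summarize
. quietly summarize hc
. generate dmhc2 = hc - r(mean)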
Step 2: Estimate by OLS with robust standard errors
. reg patent fdi hc fdihc pri gdpc ipr, robust
------------------------------------------------------------------------------
| Robust
patent | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
fdi | -.7771776 .2658697 -2.92 0.006 -1.31749 -.2368653
hc | -.7956572 5.995207 -0.13 0.895 -12.97938 11.38807
fdihc | .1031641 1.624792 0.06 0.950 -3.198811 3.40514
pri | .1420312 .4682599 0.30 0.763 -.8095874 1.09365
gdpc | 4.095409 1.233559 3.32 0.002 1.588516 6.602303
ipr | .5103322 1.827804 0.28 0.782 -3.204212 4.224876
_cons | -31.67487 9.785524 -3.24 0.003 -51.56145 -11.7883
------------------------------------------------------------------------------
Original interaction term (without demeaning)
. generate fdihc1=fdi*hc
------------------------------------------------------------------------------
| Robust
patent | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
fdi | -.8203331 .7250095 -1.13 0.266 -2.29373 .6530635
hc | -1.119668 8.295886 -0.13 0.893 -17.97894 15.7396
fdihc1 | .1031643 1.624791 0.06 0.950 -3.198809 3.405138
pri | .1420312 .46826 0.30 0.763 -.8095875 1.09365
gdpc | 4.095409 1.233559 3.32 0.002 1.588517 6.602302
ipr | .5103321 1.827803 0.28 0.782 -3.204211 4.224875
_cons | -31.53934 7.937862 -3.97 0.000 -47.67101 -15.40766
------------------------------------------------------------------------------
. outreg2 using int.doc
Data (fdihc = dmfdi × dmhc; fdihc1 = fdi × hc)
year    patent  fdi     pri     gdpc    hc      ipr     dmfdi   dmhc    fdihc   fdihc1
1970 1.79176 1.80446 2.95647 8.37316 0.2135 0.462685 -1.33626 -0.20482 0.273691 0.385252
1971 1.94591 1.81262 3.05447 8.40433 0.2219 0.462685 -1.3281 -0.19642 0.260863 0.40222
1972 2.07944 1.76768 3.12632 8.46943 0.231 0.462685 -1.37304 -0.18732 0.257195 0.408334
1973 2.19722 1.92884 3.26308 8.55591 0.2394 0.462685 -1.21188 -0.17892 0.216827 0.461765
1974 1.38629 3.03424 3.27903 8.61184 0.2492 0.462685 -0.10648 -0.16912 0.018007 0.756134
1975 2.3979 2.521 3.47383 8.59634 0.259 0.462685 -0.61972 -0.15932 0.098733 0.652938
1976 2.48491 2.56762 3.46417 8.68268 0.2695 0.462685 -0.57311 -0.14882 0.085288 0.691972
1977 2.63906 2.5234 3.5169 8.73451 0.28 0.462685 -0.61732 -0.13832 0.085386 0.706553
1978 2.77259 2.6067 3.6504 8.77595 0.2912 0.462685 -0.53403 -0.12712 0.067884 0.75907
1979 2.89037 2.59776 3.70696 8.84193 0.3024 0.462685 -0.54296 -0.11592 0.062939 0.785563
1980 3.04452 3.01191 3.89202 8.88966 0.3143 0.462685 -0.12881 -0.10402 0.013399 0.946643
1981 3.17805 3.02916 4.03813 8.93206 0.3199 0.500787 -0.11156 -0.09842 0.01098 0.969027
1982 3.29584 3.0816 4.12099 8.96446 0.3248 0.538888 -0.05912 -0.09352 0.005529 1.000904
1983 3.43399 3.00598 4.23642 8.99907 0.3304 0.57699 -0.13474 -0.08792 0.011846 0.993175
Model (1) Model (2)
Demean Original
VARIABLES patent patent
fdi -0.777*** -0.820
(0.266) (0.725)
hc -0.796 -1.120
(5.995) (8.296)
fdihc 0.103 0.103
(1.625) (1.625)
pri 0.142 0.142
(0.468) (0.468)
gdpc 4.095*** 4.095***
(1.234) (1.234)
ipr 0.510 0.510
(1.828) (1.828)
Constant -31.67*** -31.54***
(9.786) (7.938)
Observations 41 41
R-squared 0.956 0.956
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1
Comparing the two models: demeaning leaves the interaction coefficient (0.103) and its standard error unchanged, but markedly reduces the standard errors on fdi and hc, illustrating the reduced collinearity.