Student Solutions Manual To Accompany An Introduction To Econometrics A Self Contained Approach 1nbsped 9780262317184 9780262525404 - Compress
Student Solutions Manual To Accompany An Introduction To Econometrics A Self Contained Approach 1nbsped 9780262317184 9780262525404 - Compress
Student Solutions Manual To Accompany An Introduction To Econometrics A Self Contained Approach 1nbsped 9780262317184 9780262525404 - Compress
Frank Westhoff
All rights reserved. No part of this book may be reproduced in any form by any electronic or
mechanical means (including photocopying, recording, or information storage and retrieval)
without permission in writing from the publisher.
This book was printed and bound in the United States of America.
ISBN: 978-0-262-52540-4
Contents
1. One reasonable way would be to calculate the average precipitation for each month and then
choose the month with the highest average.
1. a. The mean is the average of the values. It describes the center of the distribution.
T
¦x
t 1
t
b. Mean[ x ]
T
3. A histogram illustrates how the values of a single data variable are distributed.
5. a. Yes.
b. No.
1. a.
Inches of
Precipitation 1964 1975
0–1 2 0
1–2 3 0
2–3 3 2
3–4 3 3
4–5 0 3
5–6 1 1
6–7 0 1
7–8 0 0
8–9 0 1
9–10 0 0
10–11 0 1
1
b.
3. a. 1975 Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec
Precip 4.39 3.04 3.97 2.87 2.10 4.68 10.56 6.13 8.63 4.90 5.08 3.90
TwoPlusPrecip 6.39 5.04 5.97 4.87 4.10 6.68 12.56 8.13 10.63 6.90 7.08 5.90
b.
2
i. The histograms have identical shape. The TwoPlusPrecip histogram is just “slided” 2
inches to the right.
ii. The center has risen (shifted to the right) by 2 inches.
iii. The spread is unchanged.
c. i. Mean[TwoPlusPrecip] = Mean[2 + Precip]
= 2 + Mean[Precip]
= 2 + 5.02
= 7.02
ii. Var[TwoPlusPrecip] = Var[2 + Precip]
= Var[Precip]
= 5.41
d. i. 7.02
ii. 64.91
iii. 5.41
e. Yes, the answers are consistent. The histogram reveals that the
x center of the TwoPlusPrecip distribution is farther to the right; this is
consistent with its higher mean.
x spread of the two distributions is the same; this is consistent with their
equal variances.
The appropriate equation and statistical software provide the same answers.
3
5. a.
x1 x2 ... x30
7. a. Mean[SatMath] =
30
x1 x2 ... x10 x11 x12 ... x30
=
30
x1 x2 ... x10 x x ... x30
= + 11 12
30 30
4
10 x1 x2 ... x10 20 x11 x12 ... x30
= u + u
30 10 30 20
1 2
= u Mean[SatMathFemale] + u Mean[SatMathMale]
3 3
= WgtFemale Mean[SatMathFemale] + WgtMaleMean[SatMathMale]
1 2
b. WgtFemale + WgtMale = + =1
3 3
c. Mean[SatMath] = 737
Mean[SatMathFemale] = 725
Mean[SatMathMale] = 743
Yes, the results are consistent.
Mean[SatMath] = WgtFemale Mean[SatMathFemale] + WgtMaleMean[SatMathMale]
10 20
= u Mean[SatMathFemale] + u Mean[SatMathMale]
30 30
1 2
= u Mean[SatMathFemale] + u Mean[SatMathMale]
3 3
1 2
= u 725 + u 743
3 3
725 1, 486 2, 211
= = = 737
3 3
5
Chapter 2: Essentials of Probability and Estimation
Procedures
1. a. There are 13 chances out of 52 that the card drawn is a heart; that is, the probability that
13 1
the card drawn will be a heart equals .
52 4
b. There are 4 chances out of 52 that the card drawn is an ace; that is, the probability that the
4 1
card drawn will be an ace equals .
52 13
c. There are 26 chances out of 52 that the card drawn is a red card; that is, the probability
26 1
that the card drawn will be a red card equals .
52 2
3. a. Mean[cx] = c Mean[x]
b. Mean[x + y] = Mean[x] + Mean[y]
c. Var[cx] = Var[cx] = c2Var[x]
d. Var[x + y] = Var[x + y] = Var[x] + 2Cov[x, y] + Var[y]
e. When x and y are independent: Var[x + y] = Var[x + y] = Var[x] + Var[y]
1. A random process is a process whose outcome is uncertain; that is, we cannot determine the
outcome with certainty until the process is completed.
3. A random variable is a variable that is associated with a random process. The value of a
random variable cannot be determined with certainty before the experiment is conducted.
5. a. Beforehand, we cannot determine the numerical value of the random variable with
certainty.
b. Beforehand, we can often describe the probability distribution of the random variable.
7
Solutions to Chapter 2 Exercises
a. A random variable describes the outcome of a random process. The value of a random
variable cannot be determined beforehand, before the random process is completed. v is a
random variable because we cannot determine if it will equal 2, 3, 4, or 5 before the
experiment is conducted.
b. Since there are a total of 10 cards in the deck, the probability of drawing any one card is
1
:
10
Card Probability Card Probability
1 1
k
2k 3j
10 10
1 1
2j 3i
10 10
1 1
2i 4k
10 10
1 1
2h 4j
10 10
1 1
3k 5i
10 10
Now it is easy to calculate the probability distribution of v:
v Prob[v]
1 1 1 1 4
2 2k 2j 2i 2h + + + = = .4
10 10 10 10 10
1 1 1 3
3 3k 3j 3i + + = = .3
10 10 10 10
1 1 2
4 4k 4j + = = .2
10 10 10
1
5 5i = .1
10
8
v equals 2 whenever the 2k, 2j, 2i, or 2h is drawn. Therefore, the probability that v
1 1 1 1 1
equals 2 is + + + = = .4. Using analogous logic, the probability that v
10 10 10 10 10
equals 3 is .3, 4 is .2, and 5 is .1.
c. Mean[v]= ¦ vProb[v]
All v
3. a. i. If a 2 was drawn on the first draw, only three 2’s remain in the deck of 9 cards.
Therefore, there are 3 chances in 9 of drawing a 2 on the second draw:
3 1
Prob[v2 = 2 IF v1 = 2] = = .3333…
9 3
ii. If a 2 was not drawn on the first draw, four 2’s remain in the deck of 9 cards.
Therefore, there are 4 chances in 9 of drawing a 2 on the second draw:
4
Prob[v2 = 2 IF v1 z 2] = = .4444…
9
b. No. The random variables v1 and v2 are not independent. Knowing whether v1 equals 2 or
not helps us predict the value of v2 because the value of v1 affects the probability of v2.
c. i. If a 2 was drawn on the first draw, 3,999 2’s remain in the deck of 9,999 cards.
Therefore, there are 3,999 chances in 9,999 of drawing a 2 on the second draw:
9
3,999
Prob[v2 = 2 IF v1 = 2] = = .39993999…
9,999
ii. If a 2 was not drawn on the first draw, 4,000 2’s remain in the deck of 9,999 cards.
Therefore, there are 4,000 chances in 9,999 of drawing a 2 on the second draw:
4, 000
Prob[v2 = 2 IF v1 z 2] = = .40004000…
9,999
d. Size of Population:
Number of Cards in Original Deck
10 10,000
Prob[v2 = 2 IF v1 z 2] .4444… .40004000…
Prob[v2 = 2 IF v1 = 2] .3333… .39993999…
Difference .1111… .00010001…
As the size of the population increases, the probability of drawing a 2 on the 2nd draw is
affected less by the card drawn on the 1st draw. When there are 10,000 cards in the
original deck, the probability of drawing a 2 on the 2nd draw is .400 to the nearest
thousandths regardless of what card was drawn on the 1st draw.
e. As the size of the population increases, the independence assumption becomes a better
approximation of reality.
f. A professional pollster need not be concerned.
5. a. Since Archie’s arrows always land within 20 centimeters of the center, the probability
that v is less than or equal to 20 must equal 1.
1
b. Prob[Points t 6] = u.10u20
2
= 1.0
1
c. Prob[Points = 10] = u (.10 .08) u 4 + .08 u 4
2
= .04 + .32
= .36
10
1
d. Prob[Points = 9] = u (.08.06) u 4 + .06 u 4
2
= .04 + .24
= .28
e. Prob[Points = 7 or Points = 9]
1
= u (.06.02) u 4 + .02 u 8
2
= .08 + .16
= .24
30 1
7. a. i. =
60 2
5 1
ii. =
60 12
25 5
iii. =
60 12
1 1 7
b. i. + =
2 12 12
5
ii.
12
5 5 5 5 5 3,125
c. u u u u = = .0126
12 12 12 12 12 248,832
11
Chapter 3: Interval Estimates and the Central Limit Theorem
1. a. A random variable describes the outcome of a random process (a draw, a coin toss, etc.).
The value of a random variable cannot be determined beforehand, before the random
process is completed.
b. i. The value of the random variable beforehand.
ii. The probability distribution of the random variable.
3. Var[T1 v1 v2 vT ]
Since Var[cx] = c2Var[x]
1
=
T2
[
Var v1 v2 vT ]
Since Var[x + y] = Var[x] + Var[y] when x and y
are independent; hence, the covariances are all 0.
1
= (Var[v1 ] Var[v2 ] Var[vT ] )
T2
Since Var[v1] = Var[v2] = … = Var[vT] = p(1 p)
1
= ( p(1 p) p(1 p) p(1
(1 p) )
T2
How many p(1 p) terms are there? A total of T.
1
= (T u p(1 p))
T2
Simplifying.
p(1 p)
T
5.
x The mean of the probability distribution describes the distribution’s center.
x The variance of the probability distribution describes the distribution’s spread.
1. a. When the mean of the estimate’s probability distribution equals the actual value, the
estimation procedure is unbiased, the estimation procedure does not systematically
overestimate or underestimate the actual value.
b. When an estimation procedure is unbiased, the variance of the estimate’s probability
distribution indicates how reliable an estimate is. On the one hand, if the variance is
small, there is a high probability that the estimate is close to the actual value. On the
other hand, if the variance is large, the probability that the estimate lies close to the actual
value is small.
13
3. As the sample size increases, the probability distribution of the random variable approaches
the normal distribution.
14
Var[SatDiff]= Var[SatMath SatVerbal]
= Var[SatMath + (1)SatVerbal)]
= Var[SatMath] 2Cov[SatMath, SatVerbal] + Var[(1)SatVerbal]
= 10,000 2u10,000 + 10,000
= 10,000 20,000 + 10,000
= 0
d. CorrCoef[SatMath, SatVerbal] = .5
Cov[ SatMath, SatVerbal ]
CorrCoef[SatMath, SatVerbal] =
Var[ SatMath] Var[ SatVerbal ]
Cov[ SatMath, SatVerbal ]
.5 =
100 u100
Cov[SatMath SatVerbal] = 5,000
Var[SatSum] = Var[SatMath + SatVerbal]
= Var[SatMath] + 2Cov[SatMath, SatVerbal] + Var[SatVerbal]
= 10,000 + 2u5,000 + 10,000
= 10,000 + 10,000 + 10,000
= 30,000
Var[SatDiff] = Var[SatMath SatVerbal]
= Var[SatMath + (1)SatVerbal)]
= Var[SatMath] 2Cov[SatMath, SatVerbal] + Var[(1)SatVerbal]
= 10,000 2u5,000 + 10,000
= 10,000 10,000 + 10,000
= 10,000
e. CorrCoef[SatMath, SatVerbal] = .5
Cov[ SatMath, SatVerbal ]
CorrCoef[SatMath, SatVerbal] =
Var[ SatMath] Var[ SatVerbal ]
Cov[ SatMath, SatVerbal ]
.5 =
100 u100
Cov[SatMath SatVerbal] = 5,000
Var[SatSum] = Var[SatMath + SatVerbal]
= Var[SatMath] + 2Cov[SatMath, SatVerbal] + Var[SatVerbal]
= 10,000 + 2u5,000 + 10,000
= 10,000 10,000 + 10,000
= 10,000
Var[SatDiff] = Var[SatMath SatVerbal]
= Var[SatMath + (1)SatVerbal)]
= Var[SatMath] 2Cov[SatMath, SatVerbal] + Var[(1)SatVerbal]
= 10,000 2u5,000 + 10,000
= 10,000 + 10,000 + 10,000
= 30,000
f. 0.0_____ 1.0_____ between 0.0 and 1.0 X less than 0.0 _____
Typically, students who do well on one part of the SAT do well on the other.
15
3. a. No
b. Yes
c. StateDiff = SatMichigan SatAlaska
Mean[StateDiff] = Mean[SatMichigan SatAlaska]
= Mean[SatMichigan + 1uSatAlaska)]
= Mean[SatMichigan] + Mean[1uSatAlaska]
= Mean[SatMichigan] Mean[SatAlaska]
= 549 489
= 60
SD[SatMichigan] = SD[SatAlaska] = 100
Var[SatMichigan] = Var[SatAlaska] = 10,000
Cov[SatMichigan, SatAlaska] = 0 SatMichigan and SatAlaska are independent
Var[StateDiff] = Var[SatMichigan SatAlaska]
= Var[SatMichigan + 1uSatAlaska]
= Var[SatMichigan] + 2Cov[SatMichigan, SatAlaska] + Var[SatAlaska]
= 10,000 + 0 + 10,000
= 20,000
SD[ StateDiff ] Var[ StateDiff ] 20,000 141.4
0 60 45
z .42
141.4 141.4
Prob[SatMichigan > SatAlaska] = Prob[SatMichigan SatAlaska > 0]
= Prob[StateDiff > 0]
= .66
z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
0.2 0.4207 0.4168 0.4129 0.4090 0.4052 0.4013 0.3974 0.3936 0.3897 0.3859
0.3 0.3821 0.3783 0.3745 0.3707 0.3669 0.3632 0.3594 0.3557 0.3520 0.3483
0.4 0.3446 0.3409 0.3372 0.3336 0.3300 0.3264 0.3228 0.3192 0.3156 0.3121
0.5 0.3085 0.3050 0.3015 0.2981 0.2946 0.2912 0.2877 0.2843 0.2810 0.2776
5. a. Mean[MPG] = 32
SD[MPG] = 4
16
30 32 2
z .5
4 4
Prob[MPH > 110] = .69 = 69 percent
z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
0.4 0.3446 0.3409 0.3372 0.3336 0.3300 0.3264 0.3228 0.3192 0.3156 0.3121
0.5 0.3085 0.3050 0.3015 0.2981 0.2946 0.2912 0.2877 0.2843 0.2810 0.2776
0.6 0.2743 0.2709 0.2676 0.2643 0.2611 0.2578 0.2546 0.2514 0.2483 0.2451
b.
z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
1.1 0.1357 0.1335 0.1314 0.1292 0.1271 0.1251 0.1230 0.1210 0.1190 0.1170
1.2 0.1151 0.1131 0.1112 0.1093 0.1075 0.1056 0.1038 0.1020 0.1003 0.0985
1.3 0.0968 0.0951 0.0934 0.0918 0.0901 0.0885 0.0869 0.0853 0.0838 0.0823
30 32
c. z 1.29
x
2
1.29
x
2
x 1.55
1.29
The standard deviation must be decreased by 4
1.55 = 2.45.
17
d. Approach #1 Cost = 3.16u100,000 =
$316,000
Approach #2 Cost = 2.45u200,000 =
$490,000
Approach #1 is less costly.
7. a. i. Recall that when betting on 1st set of twelve numbers, the mean of vi’s
probability distribution equals .027 and the variance equals 1.972.
z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
1.0 0.1587 0.1562 0.1539 0.1515 0.1492 0.1469 0.1446 0.1423 0.1401 0.1379
1.1 0.1357 0.1335 0.1314 0.1292 0.1271 0.1251 0.1230 0.1210 0.1190 0.1170
1.2 0.1151 0.1131 0.1112 0.1093 0.1075 0.1056 0.1038 0.1020 0.1003 0.0985
iii. Mean[TNW] = 1.35
SD[TNW] = Var[TNW ] =
98.60 9.93
10 (1.35) 8.65
z .87
9.93 9.93
Prob[TNW d 10] = .1922
18
z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
0.7 0.2420 0.2389 0.2358 0.2327 0.2296 0.2266 0.2236 0.2206 0.2177 0.2148
0.8 0.2119 0.2090 0.2061 0.2033 0.2005 0.1977 0.1949 0.1922 0.1894 0.1867
0.9 0.1841 0.1814 0.1788 0.1762 0.1736 0.1711 0.1685 0.1660 0.1635 0.1611
b. i. Recall that when betting on red, the mean of vi’s probability distribution equals .027
and the variance equals .999.
Mean[Friend’s TNW] = Mean[v1 + v2 + … + v50]
= Mean[v1] + Mean[v2] + … + Mean[v50]
= .027 .027 … .027
= 1.35
Var[Friend’s TNW] = Var[v1 + v2 + … + v50]
Since the v’s are independent, all the
covariances equal 0.
= Var[v1] + Var[v2] + … + Var[v50]
= .999 + .999 + … + .999
= 49.50
ii. Mean[TNW] = 1.35
SD[TNW] = Var[TNW ] =
49.50 7.04
10 (1.35) 11.35
z 1.61
7.04 7.04
Prob[TNW t 10] = .0537
z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
1.5 0.0668 0.0655 0.0643 0.0630 0.0618 0.0606 0.0594 0.0582 0.0571 0.0559
1.6 0.0548 0.0537 0.0526 0.0516 0.0505 0.0495 0.0485 0.0475 0.0465 0.0455
1.7 0.0446 0.0436 0.0427 0.0418 0.0409 0.0401 0.0392 0.0384 0.0375 0.0367
iii. Mean[TNW] = 1.35
SD[TNW] = Var[TNW ] =
97.23 9.86
10 (1.35) 8.65
z 1.23
7.04 7.04
Prob[TNW d 10] = .1093
z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
1.1 0.1357 0.1335 0.1314 0.1292 0.1271 0.1251 0.1230 0.1210 0.1190 0.1170
1.2 0.1151 0.1131 0.1112 0.1093 0.1075 0.1056 0.1038 0.1020 0.1003 0.0985
1.3 0.0968 0.0951 0.0934 0.0918 0.0901 0.0885 0.0869 0.0853 0.0838 0.0823
19
Chapter 4: Estimation Procedures, Estimates, and Hypothesis
Testing
1. a. When the mean of the estimate’s probability distribution equals the actual value, the
estimation procedure is unbiased, the estimation procedure does not systematically
overestimate or underestimate the actual value.
b. When an estimation procedure is unbiased, the variance of the estimate’s probability
distribution indicates how reliable an estimate is. On the one hand, if the variance is
small, there is a high probability that the estimate is close to the actual value. On the
other hand, if the variance is large, the probability that the estimate lies close the actual
value is small.
3. a.
Jury Finds Jury Finds
Defendant Defendant
Guilty Innocent
For each scenario, indicate whether the jury would be correct or incorrect.
1. a.
x The mean of the estimate’s probability distribution.
x The variance of the estimate’s probability distribution.
b.
x The mean of the estimate’s probability distribution determines whether or not the
estimation procedure is biased or unbiased. If the mean equals the actual value, the
estimation procedure does not systematically underestimates or overestimates the
actual value.
21
x When the estimation procedure is unbiased, the variance of the estimate’s probability
distribution indicates the reliability of the estimation procedure. As the variance
becomes smaller, the estimate becomes more reliable.
3. a. Type I error occurs when we reject the null hypothesis when it is actually true.
b. Type II error occurs when we do not reject the null hypothesis when it is actually false.
c. As the significance level becomes smaller, it becomes more difficult to reject the null
hypothesis thereby decreasing the likelihood of Type I error and increasing the likelihood
of Type II error.
1. a. This statement is false. Even when an estimation procedure is unbiased we can never
expect the estimate to equal the actual value.
b. This statement is true whenever the estimate’s probability distribution is symmetric.
c. This is always true; it is the definition of an unbiased estimation procedure.
d. This statement is always true. The relative frequency interpretation of probability tells us
that after many, many repetitions the distribution of the actual values mirrors the
probability distribution. Accordingly, after many, many repetitions the average of the
estimates would equal the mean of the estimate’s probability distribution; the estimation
procedure would be unbiased.
1,885,178 1,885,178
3. a. DemVoterFrac .45144 | .451
1,885,178 2,290,723 4,175,901
17 17
b. .53125 | .531
17 15 32
c. Yes.
d. Cynic’s View: Despite the election results, there is no unfair districting; there is no
gerrymandering.
“Sure, the election results suggest that there may be gerrymandering, but the results
were just a coincidence. The results were just a random occurrence, a consequence of
the luck of the draw.”
e. H0: No gerrymandering Cynic is correct.
H1: Gerrymandering present Cynic is incorrect.
Prob[Results IF H0 True] = Probability that DemCongressFrac would be .531 or
more when DemVoterFrac, the fraction of voters
statewide voted for the Democratic candidate, equals
.451
f. Mean[DemCongressFrac]: .451
Var[DemDemocratFrac]: .007737
SD[DemCongressFrac]: .088
22
p(1 p) .451u (1 .451) .451u .549
Var[ DemCongressFrac] .007737
T 32 32
SD[ DemCongressFrac] Var[ DemCongressFrac] .007737 .08796 | .088
5. a.
69, 498, 215 69, 498, 215
DemVoterFrac .5369
69, 498, 215 59,948, 240 129, 446, 455
365 365
b. .6784
365 173 538
c. Yes. Democratic.
d. Cynic’s View: Despite the election results, the Electoral College is not unfair, it does not
favor Democrats.
“Sure, the election results suggest that the Electoral College favors
Democrats, but the results were just a coincidence. The results were just a
random occurrence, a consequence of the luck of the draw.”
e. H0: No Electoral College Unfairness Cynic is correct.
H1: Electoral College Does Not Favor Democrats Cynic is incorrect.
Prob[Results IF H0 True] = Probability that DemElectColFrac would be .6784 or
more when DemVoterFrac, the fraction of voters
nationwide voted for the Democratic candidate, equals
.5369
23
f. Mean[DemElectColFrac]: .5369
Var[DemElectColFrac]: .0004622
SD[DemElectColFrac]: .0215
Mean[DemElectColFrac] = p = .5369 where p = fraction of Democratic voters
nationwide
p(1 p) .5369 u (1 .5369) .5369 u .4631
Var[ DemElectColFrac] .0004622
T 538 538
SD[ DemElectColFrac] Var[ DemElectColFrac] .0004622 .0215
g. Prob[Results IF H0 True] = Probability that DemElectColFrac would be .6784 or
more when DemVoterFrac, the fraction of voters
nationwide voted for the Democratic candidate, equals
.5369
.6784 .5369 .1415
z 6.55
.0215 .0215
Prob[Results IF H0 True] < .001
7. a. Yes. On average, you would expect her to win a little less than one third of the time. In
fact, she won more than two-thirds of the time.
b. Cynic’s View: Despite the results, everything was on the “up and up.”
“Sure, the results suggest that something was not on the “up and up,” but the results
were just a coincidence. The results were just a random occurrence, a consequence of
the luck of the draw.”
c. H0: Everything Was on the “Up and Up” Cynic is correct.
H1: Roulette Wheel Was Rigged to Favor the Girlfriend Cynic is incorrect.
Prob[Results IF H0 True] = Probability that the casino manager’s girlfriend would
win $20 or more if everything was on the “up and up.”
= Prob[TNW t 20 IF everything were on the “up and up”]
d. Mean[TNW] = 1.35
SD[TNW] = Var[TNW ] = 98.60 9.93
20 (1.35) 21.35
z 2.15
9.93 9.93
z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
2.0 0.0228 0.0222 0.0217 0.0212 0.0207 0.0202 0.0197 0.0192 0.0188 0.0183
2.1 0.0179 0.0174 0.0170 0.0166 0.0162 0.0158 0.0154 0.0150 0.0146 0.0143
2.2 0.0139 0.0136 0.0132 0.0129 0.0125 0.0122 0.0119 0.0116 0.0113 0.0110
24
Chapter 5: Ordinary Least Squares Estimation Procedure—
The Mechanics
1. a.
b. Yes. For the most part, when income increases, savings increase also.
c.
25
By choosing two points on this line, we can solve for the equation of the best fitting line.
It looks like the points (200, 15) and (1200, 155) are more or less on the line. We can use
these two points to estimate the slope:
155 15 140
.14
1200 200 1000
Now, a little algebra:
y 15
.14
x 200
y 15 = .14x 28
y = .14x 13
This equation suggests that if Americans earn an additional $1,000 of income, savings
will rise by $140. This is consistent with our theory.
d. $140 million since the slope is .14: .14u$1,000,000,000 = $140,000,000
3. a. Before the experiment is conducted, we cannot determine the numerical value of the
random variable with certainty.
b. Before the experiment is conducted, we can often calculate the random variable’s
probability distribution telling us how likely it is for the random variable to equal each of
its possible numerical values.
Dividing by 2.
( y1 bConst bx x1 ) ( y2 bConst bx x2 ) ( y3 bConst bx x3 ) 0
Simplifying.
( y1 y2 y3 ) 3bConst bx ( x1 x2 x3 ) 0
Dividing by 3.
y1 y2 y3 x x x
bConst bx 1 2 3 0
3 x
y1 y2 y3 x x2 x3
Since equals the mean of y, y , and 1 equals the mean of x, x :
3 3
y bConst bx x 0
26
Therefore,
bConst y bx x .
1. The ordinary least squares (OLS) estimation procedure minimizes the sum of squared
residuals to determine the best fitting line.
1. a.
5 10 30 45
x 15
3 3
14 44 80 138
y 46
3 3
Second, for each student calculate the deviation of x from its mean and the deviation of y
from its mean:
Student yt y yt y xt x xt x
1 14 46 32 5 15 10
2 44 46 2 10 15 5
3 80 46 34 30 15 15
Third, calculate the products of the y and x deviations and squared x deviations for each
student; then calculate the sums:
27
Student ( yt y )( xt x ) ( xt x )2
2
1 32u10 = 320 10 = 100
2
2 2u5 = 10 5 = 25
2
3 34u15 = 510 15 = 225
Sum = 840 Sum = 350
Now, apply the equations for bx and bConst:
T
¦ ( y y )( x x )
t 1
t t
840
bx T
2.4
350
¦ (x x )
t 1
t
2
5. a. Economic theory teaches that the demand curve is downward sloping. Hence, an increase
in the price should decrease gasoline consumption.
b.
Ordinary Least Squares (OLS)
Dependent Variable: GasCons
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceDollars 151.6556 47.57295 3.187853 0.0128
Const 516.7801 60.60223 8.527410 0.0000
28
Number of Observations 10
Sum Squared Residuals 54.00000
SE of Regression 7.348469
29
Chapter 6: Ordinary Least Squares Estimation Procedure—
The Properties
1. NB: Because random influences are actually random, the numerical values that you reported
will no doubt differ from those below. Therefore, your means and variances which are based
on your numerical values will also differ.
Numerical
Value of
Coefficient Calculations Simulation’s Calculations
Repetition Estimate Your Mean Variance Mean Variance
1 3.40
2 4.70 4.05 .4225 4.1 .4
3 1.44 3.18 1.7955 3.2 1.8
Repetition 2 Calculations:
3.40 4.70 8.10
Mean 4.05
2 2
Deviation Squared
Value Mean From Mean Deviation
3.40 4.05 .65 .4225
4.70 4.05 .65 .4225
Sum = .8450
Sum of Squared Deviations .8450
Variance .4225
2 2
Repetition 3 Calculations:
3.40 4.70 1.44 9.54
Mean 3.18
3 3
Deviation Squared
Value Mean From Mean Deviation
3.40 3.18 .22 .0484
4.70 3.18 1.52 2.3104
1.44 3.18 1.74 3.0276
Sum = 5.3864
Sum of Squared Deviations 5.3864
Variance 1.7955
3 3
31
c. Variance of the sum of two variables: Var[x + y] = Var[x] + 2Cov[x, y] + Var[y]
d. Variance of the sum of two independent variables: Var[x + y] = Var[x] + Var[y]
1. a. When the mean of the estimate’s probability distribution equals the actual value, the
estimation procedure is unbiased, the estimation procedure does not systematically
overestimate or underestimate the actual value.
b. When an estimation procedure is unbiased, the variance of the estimate’s probability
distribution indicates how reliable an estimate is. On the one hand, the variance is small,
there is a high probability that the estimate is close to the actual value. On the other hand,
if the variance is large, the probability that the estimate lies close the actual value is
small.
5. When the ordinary least squares (OLS) standard premises are satisfied, the ordinary least
squares (OLS) estimation procedure is the best linear unbiased estimation procedure (BLUE).
y j yi
1. a. bSlope
x j xi
b.
y j yi
bSlope
x j xi
E Const E x x j e j ( EConst E x xi ei )
x j xi
E Const E x x j e j E Const E x xi ei )
x j xi
E x x j E x xi e j ei
x j xi
E x ( x j xi ) (e j ei )
x j xi
e j ei
Ex
x j xi
1
Ex (e j ei )
x j xi
32
c.
Mean[bSlope ] [
Mean E x
1
x j xi
(e j ei ) ]
E x Mean [ x 1 x (e e )] j i
j i
Ex
1
x j xi
Mean e j ei )[ ]
Ex
1
x j xi
[
Mean[e j ] Mean[ei ] ]
Ex
d. What does the variance of bSlope‘s probability distribution, Var[bSlope], equal?
Var[bSlope ] [
Var E x
1
x j xi
(e j ei ) ]
Var [ x 1 x (e e )] j i
j i
1
( x j xi ) 2
[
Var e j ei ) ]
1
( x j xi ) 2
[
Var e j (ei )] ]
1
( x j xi ) 2
[
Var[e j ] Var[ei ] ]
1
( x j xi ) 2
[
Var[e j ] Var[ei ] ]
1
( x j xi ) 2
[
Var[e] Var[e] ]
2
Var[e]
( x j xi ) 2
3. a. There are 10 possible pair wise combinations of the 5 observations:
Obs 1 and 2 Obs 1 and 3 Obs 1 and 4 Obs 1 and 5 Obs 2 and 3
Obs 2 and 4 Obs 2 and 5 Obs 3 and 4 Obs 3 and 5 Obs 4 and 5
Since the two observations are chosen randomly the probability of each of these
1
combinations is .
10
1 1
Mean[bx] = uMean[bSlope for obs 1 and 2] + … + uMean[bSlope for obs 4 and 5]
10 10
1 1
= uEx + … + uE x
10 10
= Ex
33
b. The slopes of the 10 pair wise combinations of observations are independent so we need
not worry about the covariances.
Var[bx]
1 1
u Var[bSlope for Obs 1 and 2] u Var[bSlope for Obs 1 and 3]
10 10
1
+...+ u Var[bSlope for Obs 4 and 5]
10
1 2 1 2
u u Var[e] u u Var[e]
10 ( x1 x2 ) 2
10 ( x1 x3 ) 2
1 2
+...+ u u Var[e]
10 ( x4 x5 ) 2
1 2 1 2
u u 500 u u 500
10 ( x1 x2 ) 2
10 ( x1 x3 ) 2
1 2
+...+ u u 500
10 ( x4 x5 ) 2
1 1000 1 1000
u u
10 ( x1 x2 ) 10 ( x1 x3 ) 2
2
1 1000
+...+ u
10 ( x4 x5 ) 2
1 1000 1 1000 1 1000 1 1000
u u 2 u 2 u 2
10 62 10 12 10 18 10 24
1 1000 1 1000 1 1000
+ u 2 u 2 u 2
10 6 10 12 10 18
1 1000 1 1000
+ u 2 u 2
10 6 10 12
1 1000
+ u 2
10 6
100 100 100 100
62 122 182 242
100 100 100
+ 2 2 2
6 12 18
100 100
+ 2 2
6 12
100
+ 2
6
400 300 200 100
62 122 182 242
11.1 2.08 .62 .17 13.98
c. Yes
34
5. a. i.
Ordinary Least Squares (OLS)
Dependent Variable: OilProdBarrels
Explanatory Variable(s): Estimate SE t-Statistic Prob
Price 92.50159 28.12319 3.289157 0.0028
Const 5944.958 478.7771 12.41696 0.0000
Number of Observations 29
Estimated Equation: EstOilProdBarrels = 5,945 + 92.5Price
ii.
Ordinary Least Squares (OLS)
Dependent Variable: OilProdBarrelsPlus1000
Explanatory Variable(s): Estimate SE t-Statistic Prob
Price 92.50159 28.12319 3.289157 0.0028
Const 6944.958 478.7771 12.41696 0.0000
Number of Observations 29
Estimated Equation: EstOilProdBarrelsPlus1000 = 6,945 + 92.5Price
Best fitting line: EstOilProdBarrelsPlus1000 = 6,945 + 92.5Price
b. The best fitting line
shifts up by 1,000.
c. All the points in the
scatter diagram have
shifted up by 1,000.
Consequently, the
best fitting line, the
line that minimizes
the sum of squared
residuals has also
shifted up by 1,000.
7. a. i.
Ordinary Least Squares (OLS)
Dependent Variable: GasCons
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceDollars 151.6556 47.57295 3.187853 0.0128
Const 516.7801 60.60223 8.527410 0.0000
Number of Observations 10
Estimated Equation: EstGasCons = 516.8 151.7PriceDollars
35
ii.
Ordinary Least Squares (OLS)
Dependent Variable: GasCons
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceDollarsPlus2 151.6556 47.57295 3.187853 0.0128
Const 820.0912 155.6644 5.268328 0.0008
Number of Observations 10
Estimated Equation: EstGasCons = 820.1 151.7PriceDollarsPlus2
36
Chapter 7: Estimating the Variance of an Estimate’s
Probability Distribution
1. a. When the mean of the estimate’s probability distribution equals the actual value, the
estimation procedure is unbiased, the estimation procedure does not systematically
overestimate or underestimate the actual value.
b. When an estimation procedure is unbiased, the variance of the estimate’s probability
distribution indicates how reliable an estimate is. On the one hand, if the variance is
small, there is a high probability that the estimate is close to the actual value. On the
other variance, if the variance is large, the probability that the estimate lies close the
actual value is small.
3. a.
Rise 90 66 24 6
b. Slope =
Run 25 5 20 5
y 66 6
x 5 5
5 y 330 6 x 30
5 y 6 x 300
6
y x 60 1.2 x 60
5
c. 0
Rise 90 70 20
d. i. Slope = 1
Run 25 5 20
y 70
1
x5
y 70 x 5
y x 65
ii. 0
Rise 90 86 4 1
e. i. Slope =
Run 25 5 20 5
y 86 1
x 5 5
5 y 430 x 5
5 y x 425
1
y x 85 1.2 x 85
5
ii. 0
f. 0
37
Solutions to Chapter 7 Review Questions
Var[e ]
1. Var[bx ] T
¦ (x x )
t 1
t
2
3. While this is an unbiased estimation procedure for the variance of the error term’s probability
distribution, it does not help us. We can never compute the values of the error terms because
the error terms are unobservable.
5. We divide by the degrees of freedom because they equal the number of pieces of information
that are available to estimate the variance of the error term’s probability distribution.
¦ ( y y )( x x )
t 1
t t
840
bx T
2.4
350
¦ (x x )
t 1
t
2
38
3. Finally, use the quiz data to estimate the variance and standard deviation of the coefficient
estimate’s probability distribution, the coefficient estimate’s standard error.
Estimated Variance of the Coefficient Estimate’s Probability Distribution:
EstVar[e ] 168
EstVar[bx ] T
.48
350
¦ (x x )
t 1
t
2
5. a. Yes X No ____
SE of Regression EstVar[e] 78.09
EstVar[e ] 78.092 6, 098
b. Yes X No ____
SE[bx ] EstVar[bx ] 4.49
EstVar[bx ] 4.492 20.16
c. Yes X No ____
EstVar[e ]
EstVar[bx ] T
¦ (x x )
t 1
t
2
T
EstVar[e ] 6, 098
¦ (x x )
t 1
t
2
EstVar[bx ] 20.16
302.48
d. Yes X No ____
T
¦(y
t 1
t y )( xt x )
bx T
¦ (x x )
t 1
t
2
T T
e. Yes X No ____
SSR
EstVar[e ]
Degrees of Freedom
SSR EstVar[e ] u Degrees of Freedom 6, 098 u (10 2) 6, 098 u 8 48, 784
f. Yes ____ No X
39
Chapter 8: Interval Estimates and Hypothesis Testing
5 10 30 45
1. x 15
3 3
T
¦ (x x )
t 1
t
2
(5 15)2 (15 15) 2 (25 15) 2 102 02 102 100 0 100 200
Var[e ] 50
Var[bx ] T
.25
200
¦ (x x )
t 1
t
2
|68.5%
|95.5%
|99.7%
Actual Values From To Repetitions Between
Ex Var[e] Value Value From and to Values
2 50 1.5 2.5 |68.5%
2 50 1.0 3.0 |95.5%
2 50 .5 3.5 |99.7%
3. z equals the number of standard deviations the value lies from the mean:
Value of Random Variable Distribution Mean
z
Distribution Standard Deviation
Number of Stadard Deviations from the Mean
1. Interval Estimate Question: What is the probability that the estimate, _____, lies within
____ of the actual value____?
3.
x Step 0: Formulate a model reflecting the theory to be tested.
x Step 1: Collect data, run the regression, and interpret the estimates.
x Step 2: Play the cynic, challenge the evidence, and construct the null and alternative
hypotheses.
x Step 3: Formulate the question to assess the null hypothesis and the cynic’s view.
x Step 4: Use the general properties of the estimation procedure, the probability distribution
of the estimates, to calculate Prob[Results IF H0 True].
x Step 5: Decide on the standard of proof, a significance level.
41
5.
x Each estimation procedure is unbiased; each estimation procedure does not systematically
underestimate or overestimate the actual value.
x The ordinary least squares (OLS) estimation procedure for the coefficient value is the
best linear unbiased estimation procedure (BLUE).
1. a. PetroConsPC = PetroCons*1000/Pop
PriceReal = PriceNom*100/CPI
b.
Ordinary Least Squares (OLS)
Dependent Variable: PetroConsPC
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceReal 540.5367 203.4389 2.656999 0.0289
Const 1418.403 144.7593 9.798354 0.0000
Number of Observations 10
42
Using the Econometrics Lab:
Left tail: Right tail:
Degrees of Freedom: 8 Degrees of Freedom: 8
t: 1.97 t: 1.97
Left tail probability = .04 Right tail probability = .04
The probability of the coefficient estimate being within 200 of the actual
coefficient value equals .92.
1.00 (.04 + .04) = 1.00 .08 = .92
ii. 300 of the actual coefficient value? .82
Convert 300 into standard errors:
300
300 equals = 1.47 standard errors.
203.4
The probability of the coefficient estimate being within 300 of the actual coefficient value
equals the probability of the coefficient estimate being within 1.47 standard errors of the
actual coefficient value, that is, between t’s of 1.47 and +1.47.
Using the Econometrics Lab:
Left tail: Right tail:
Degrees of Freedom: 8 Degrees of Freedom: 8
t: 1.47 t: 1.47
Left tail probability = .09 Right tail probability = .09
The probability of the coefficient estimate being within 200 of the actual coefficient value
equals .82.
1.00 (.09 + .09) = 1.00 .18 = .82
iii. 200 of the actual coefficient value? .64.
Convert 200 into standard errors:
200
200 equals = .98 standard errors.
203.4
The probability of the coefficient estimate being within 200 of the actual coefficient value
equals the probability of the coefficient estimate being within .98 standard errors of the actual
coefficient value, that is, between t’s of .98 and +.98.
Using the Econometrics Lab:
Left tail: Right tail:
Degrees of Freedom: 8 Degrees of Freedom: 8
t: .98 t: .98
Left tail probability = .18 Right tail probability = .18
The probability of the coefficient estimate being within 200 of the actual coefficient value
equals .64
1.00 (.18 + .18) = 1.00 .36 = .64
3. a. Economic theory teaches us that the demand curve is downward sloping. Accordingly,
theory postulates that an increase in the price should decrease consumption.
b. Step 0: Formulate a model reflecting the theory to be tested.
Model: GasConst = EConst + EPPriceDollarst + et
The theory suggests that EP should be negative.
43
Step 1:
Ordinary Least Squares (OLS)
Dependent Variable: GasCons
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceDollars 151.6556 47.57295 3.187853 0.0128
Const 516.7801 60.60223 8.527410 0.0000
Number of Observations 10
44
Recall that the Prob. column
reports the “tails probability:”
45
Step 3: Formulate the question to assess the null hypothesis and the cynic’s view.
Question for the Cynic:
x Generic Question: What is the probability that the results would be like those we
actually obtained (or even stronger), if the cynic is correct and party affliliation
actually has no effect on earmarks?
x Specific Question: The regression’s coefficient estimate was 2.36: What is the
probability that the coefficient estimate in one regression would be 2.36 or more if
H0 were actually true (if the actual coefficient, EDem, equals 0)?
Answer: Prob[Results IF Cynic Correct] or Prob[Results IF H0 True]
Step 4: Use the general properties of the estimation procedure, the probability distribution
of the estimates, to calculate Prob[Results IF H0 True].
OLS estimation If H0were Number of Number of
procedure unbiased true SE column observations parameters
7. a. Economic theory teaches us that the supply curve is upward sloping. Accordingly, theory
postulates that an increase in the price should increase production.
b. Step 0: Formulate a model reflecting the theory to be tested.
Model: OilProdBarrelst = EConst + E PPricet + et
Theory suggests that EP should be positive.
46
Step 1: Collect data, run the regression, and interpret the estimates.
Ordinary Least Squares (OLS)
Dependent Variable: OilProdBarrels
Explanatory Variable(s): Estimate SE t-Statistic Prob
Price 92.50159 28.12319 3.289157 0.0028
Const 5944.958 478.7771 12.41696 0.0000
Number of Observations 29
47
.0028
Prob[Results IF H0 True] = = .0014
2
The probability that the coefficient estimate in one regression would be 92.5 or more
if H0 were actually true (if the actual coefficient, E P, equals 0) is .0014.
c. Step 5: Decide on the standard of proof, a significance level
The Prob[Results IF H0 True] equals .0014, less than .01. So, even with a 1 percent
significance level, we would reject the null hypothesis and thereby support the
theory.
48
Chapter 9: One-Tailed Tests, Two-Tailed Tests, and Logarithms
1. a. Q EConst P E P
dQ
EConst E P P E P 1
dP
P P 1
b.
Q EConst P EP EConst P EP 1
dQ P 1
c. EConst E P P E P 1
EP
dP Q EConst P EP 1
3. a. PuQ = 1,000
b. i. EConst = 1,000
ii. EP = 1
PuQ = 1,000
Q = 1,000P 1
d log( z ) 1
5.
dz z
1. a. Necessity.
b. Yes, the demand should be inelastic because the demand for a necessity is inelastic.
c. Demand is inelastic whenever the absolute value of the price elasticity of demand is less
than 1. But now remember that the numerical value of the price elasticity of demand will
be negative since the demand curve is downward sloping. Accordingly, the numerical
value of the price elasticity of demand should be greater than 1.
d. Step 0: Formulate a model reflecting the theory to be tested.
Constant price elasticity model:
PetroConsPC = EConst PriceRealEP where EP = Price elasticity of demand
49
Taking natural logarithms of both sides.
log(PetroConsPC) = log(EConst) + EPlog(PriceReal)
LogPetroConsPC = c + EPLogPriceReal
where LogPetroConsPC = log(PetroConsPC)
c = log(EConst)
LogPriceReal = log(PriceReal)
Theory suggests that the absolute value of EP should be less than 1; since EP is
negative, theory suggests that EP should be greater than 1.
Step 1: Collect data, run the regression, and interpret the estimates.
Generate two new variables:
x LogPetroConsPC = log(PetroConsPC)
x LogPriceReal = log(PriceReal)
Ordinary Least Squares (OLS)
Dependent Variable: LogPetroConsPC
Explanatory Variable(s): Estimate SE t-Statistic Prob
LogPriceReal 0.365454 0.131853 2.771670 0.0242
Const 6.813011 0.048429 140.6798 0.0000
Number of Observations 10
50
Step 4: Use the general properties of the estimation procedure, the probability distribution
of the estimates, to calculate Prob[Results IF H0 True].
If the null hypothesis
were true, the actual
coefficient would equal
1. Since ordinary least
squares (OLS) estimation
procedure for the
coefficient value is
unbiased, the mean of the
probability distribution of
coefficient estimates would be 1. The SE column of the printout provides us with the
standard error of the coefficient estimates, .132. The degrees of freedom equal the
number of observations, 10, less 2, since we are estimating two parameters, the constant
and the coefficient; consequently the degrees of freedom equal 8.
51
Ordinary Least Squares (OLS)
Dependent Variable: yClever
Explanatory Variable(s): Estimate SE t-Statistic Prob
LogPriceReal 0.634546 0.131853 4.812516 0.0013
Const 6.813011 0.048429 140.6798 0.0000
Number of Observations 10
.0013
Prob[Results IF H0 True] = | .0007
2
We have obtained the same value for Prob[Results IF H0 True].
e. Step 5: Decide on the standard of proof, a significance level
Since Prob[Results IF H0 True] equals .0007, we reject the null hypothesis that
demand is unit elastic even at the 1 percent significance level. This supports the
theory that demand is inelastic.
We must generate the two variables: the logarithm of quantity and the logarithm of
price:
x LogQ = log(OilProdBarrels)
x LogP = log(Price)
52
Ordinary Least Squares (OLS)
Dependent Variable: LogQ
Explanatory Variable(s): Estimate SE t-Statistic Prob
LogP 0.213620 0.069544 3.071739 0.0048
Const 8.324825 0.187512 44.39626 0.0000
Number of Observations 29
53
If the null hypothesis were true, the
actual coefficient would equal .10.
Since ordinary least squares (OLS)
estimation procedure for the
coefficient value is unbiased, the
mean of the probability distribution
of coefficient estimates would be
.10. The SE column of the printout
provides us with the standard error
of the coefficient estimates. The
degrees of freedom equal the
number of observations less 2, since we are estimating two parameters, the constant and
the coefficient.
OLS estimation Assume H0 Number of Number of
procedure unbiased is true SE column observations parameters
54
H0: EClever = 0 EP = .10 Cynic’s view is correct: Actual price elasticity
of supply equals .10
H1: EClever z 0 EP z .10 Cynic’s view is incorrect: Actual price
elasticity of supply does not equal .10
Ordinary Least Squares (OLS)
Dependent Variable: yClever
Explanatory Variable(s): Estimate SE t-Statistic Prob
LogP 0.113620 0.069544 1.633795 0.1139
Const 8.324825 0.187512 44.39626 0.0000
Number of Observations 29
Next, calculate Prob[Results IF H0 True] focusing on EClever:
5. a. Most regard cigarettes as addictive. Accordingly, the price elasticity of demand for
cigarettes should be inelastic.
b. Step 0: Formulate a model reflecting the theory to be tested:
Constant price elasticity model:
CigConsPC = EConst PriceConsumerEP where EP = Price elasticity of demand
55
Taking natural logarithms of both sides..
56
Ordinary Least Squares (OLS)
Dependent Variable: LogHoursPerWeek
Explanatory Variable(s): Estimate SE t-Statistic Prob
LogWage 0.768171 0.294727 2.606380 0.0107
Const 0.037095 1.093466 0.033925 0.9730
Number of Observations 92
57
OLS estimation Assume H0 Number of Number of
procedure unbiased is true SE column observations parameters
58
Chapter 10: Multiple Regression Analysis—Introduction
1. a.
Q EConst P E I E ChickP E
P I CP
EConst P E I E ChickP E
P I P EI
EP EI
EConst (ChickP
P
)(ChickP
I
)
EP EI
EP EI
P I
EConst ( ChickP )( ChickP )
b.
P
is unchanged
ChickP
I
is unchanged
ChickP
Q is unchanged
c. If EP + EI + ECP = 0, then ECP = EP EI.
In part b, we showed that if ECP = EP EI, then Q is unchanged when the good’s own
price, income, and the price of other goods all double.
Factoring EP from the second and third terms and EI from the fourth and fifth terms
59
Solutions to Chapter 10 Review Questions
1. Multiple regression analysis includes more than one explanatory variables; simple regression
analysis includes only a single explanatory variable.
3. No.
1.
a.
x Additional labor increases value added.
x Additional land under cultivation increases value added.
x Additional machinery increases value added.
b.
x ELabor > 0.
x ELand > 0.
x EMachinery > 0.
c.
x H0: ELabor = 0
H1: ELabor > 0
x H0: ELand = 0
H1: ELand > 0
x H0: EMachinery = 0
H1: EMachinery > 0
d.
Ordinary Least Squares (OLS)
Dependent Variable: ValueAdded
Explanatory Variable(s): Estimate SE t-Statistic Prob
Labor 267.6604 11.53321 23.20780 0.0000
Land 2013.179 794.9466 2.532471 0.0125
Machinery 21917.88 1012.361 21.65027 0.0000
Const 5.28E+08 5.20E+08 1.015321 0.3119
Number of Observations 133
60
and machinery remain constant, increases value added by $2,013.
bMachinery = 21,918: We estimate that 1 more tractor, while the amount of labor and
cultivated land remain constant, increases value added by $21,918.
e. Note that one-tail tests are appropriate. The Prob column indicates that we would not
reject any of our null hypotheses. Consequently all our theories are supported.
5. a. i. Downward Sloping Demand Theory: As the price of cigarettes rise, per capita
cigarette consumption decreases.
ii. Conventional Wisdom: Cigarettes are an inferior good; as per capita income rises, per
capita cigarette consumption decreases.
b. CigConsPCt = EConst + EPPriceConsumert + EIIncPCt + et
c. Theory: EP < 0. Theory: EI < 0.
d. H0: EP = 0 H0: EI = 0
H1: EP < 0 H1: EI < 0
e.
Ordinary Least Squares (OLS)
Dependent Variable: CigConsPC
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceConsumer 16.04703 3.668014 4.374856 0.0001
IncPC 0.318916 0.588471 0.541939 0.5904
Const 155.0632 20.12211 7.706110 0.0000
Number of Observations 51
61
bI = .32: A $1,000 increase in per capita income, while the price of cigarettes
remains constant, decreases per capita cigarette consumption by .32 packs per year.
f. One-tail tests are appropriate. Both theories suggest that the actual coefficient is negative.
x The regression results lend support to the downward sloping demand theory. As
the Prob column indicates we easily reject the null hypothesis at the 1 percent
significance level.
x The conventional wisdom asserting that cigarettes are an inferior is shaky. While
the coefficient is negative we cannot come close to rejecting the null hypothesis
at the 1, 5, or even 10 percent significance level.
7. a. i. As the price of cigarettes rise, the youth smoking rate should decrease.
ii. As per capita income rises, the youth smoking rate should decrease.
b. SmokeRateYoutht = EConst + EPPriceConsumert + EIIncPCt + et
c. Theory: EP < 0. Theory: EI < 0.
d. H0: EP = 0 H0: EI = 0
H1: EP < 0 H1: EI < 0
e.
Ordinary Least Squares (OLS)
Dependent Variable: SmokeRateYouth
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceConsumer 1.455381 0.665547 2.186743 0.0337
IncPC 0.202018 0.106776 1.891985 0.0645
Const 33.45011 3.651081 9.161700 0.0000
Number of Observations 51
62
Chapter 11: Hypothesis Testing and the Wald Test
3. a. Before the experiment is conducted, we cannot determine the numerical value of the
random variable with certainty.
b. Before the experiment is conducted, we can often calculate the random variable’s
probability distribution telling us how likely it is for the random variable to equal each of
its possible numerical values.
1. No, the restricted sum of squared residuals cannot be less than the unrestricted sum. This is
true because the ordinary least squares (OLS) estimation procedure “chooses” the parameter
estimates that minimize the sum of squared residuals. Placing a restriction on one or more of
the parameters can only increase, not decrease, the sum of squared residuals.
3. A two-tailed t-test is a special case of the Wald test in which the value of a single coefficient
is restricted to equal 0.
3. Assess the constant returns to scale theory using the Wald approach the “easy way” using
statistical software.
63
Ordinary Least Squares (OLS)
Dependent Variable: LogValueAdded
Explanatory Variable(s): Estimate SE t-Statistic Prob
LogLabor 0.405212 0.051607 7.851944 0.0000
LogLand 0.028024 0.062265 0.450084 0.6534
LogMachinery 0.421532 0.029000 14.53552 0.0000
Const 11.27838 0.680996 16.56159 0.0000
Number of Observations 133
Sum Squared Residuals 90.50033
Degrees of Freedom
Value Num Dem Prob
F-statistic 7.223722 1 129 0.0081
5. a.
Ordinary Least Squares (OLS)
Dependent Variable: CigConsPC
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceSupplier 34.16907 10.66315 3.204408 0.0024
Tax 12.69334 4.040626 3.141428 0.0029
Const 207.6718 38.53283 5.389477 0.0000
Number of Observations 51
7. a.
Ordinary Least Squares (OLS)
Dependent Variable: CigConsPC
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceSupplier 34.16907 10.66315 3.204408 0.0024
Tax 12.69334 4.040626 3.141428 0.0029
Const 207.6718 38.53283 5.389477 0.0000
Number of Observations 51
Sum Squared Residuals 19665.41
SSRU = 19,665.41
DFU = 51 3 = 48
64
b. CigConsPCt = EConst + EPriceSupplierPriceSuppliert + ETaxTaxt + et
H0: EPriceSupplier = ETax
= EConst + ETaxPriceSuppliert + ETaxTaxt + et
= EConst + ETax(PriceSuppliert + Taxt) + et
Since PriceConsumert = PriceSuppliert + Taxt
= EConst + ETaxPriceConsumert + et
c.
Ordinary Least Squares (OLS)
Dependent Variable: CigConsPC
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceConsumer 17.02657 3.168679 5.373398 0.0000
Const 148.6762 16.19163 9.182288 0.0000
Number of Observations 51
Sum Squared Residuals 20822.81
SSRR = 20,822.81
DFR = 51 2 = 49
d. SSRR = 20,822.81 DFR = 49
SSRU = 19.665.41 DFU = 48
SSRR SSRU = 1,157.40 DFR DFU = 1
65
Chapter 12: Model Specification and Development
1. 0.
3. a. EUnemCurrent < 0
b. H0: EUnemCurrent = 0
H1: EUnemCurrent < 0
5. a. 1912 election stands out because third parties received more than a third of the vote.
b. Theodore Roosevelt, a former Republican President, split from his party and formed a
third party called the Bull Moose Party. One of his opponents was his former vice
president.
1. An artificial model is associated with an “original” model. The original model is designed to
assess a theory. The artificial model is not designed to assess the theory; it assesses the
properties of the original model.
3. a. i. No.
ii. No.
b. i. Yes.
ii. Yes.
1. a.
x Higher real GDP growth rates should increase the vote for the incumbent’s party.
x Higher inflation rates should decrease the vote for the incumbent’s party.
x More consecutive terms should decrease the vote for the incumbent’s party (throw
the ****s out).
b. VotePresPartyTwot = EConst + EGdpGrowthRealGdpGrowtht
+ EInflGdpCurrentInflGdpCurrentt + E PresTermsPresPartyTermst + et
The theories imply:
x EGdpGrowth > 0.
x EInflGdpCurrent < 0.
x EPresTerms < 0.
67
c.
Ordinary Least Squares (OLS)
Dependent Variable: VotePresPartyTwo
Explanatory Variable(s): Estimate SE t-Statistic Prob
RealGdpGrowth 0.435612 0.221348 1.967995 0.0598
InflGdpCurrent –0.656971 0.293145 –2.241111 0.0338
PresPartyTerms –3.144539 1.078933 –2.914488 0.0072
Const 58.38144 2.595120 22.49663 0.0000
Number of Observations 30
Critical Result: The Fitted^2 coefficient is 2.69. The estimate does not equal 0; the
estimate is 2.69 from 0. This evidence suggests that the new form of
the information adds explanatory power.
68
H0: JEstTax2 = 0 New form of the information adds NO explanatory power
H1: JEstTax2 z 0 New form of the information adds explanatory power
Prob[Results IF H0 True] =.0475
We do not reject the null hypothesis at the 1 percent significance level, but we do reject
the null hypothesis at the 5 percent significance level. So, perhaps we could do better by
using the information in a different form.
b.
Ordinary Least Squares (OLS)
Dependent Variable: Tax
Explanatory Variable(s): Estimate SE t-Statistic Prob
SqrtTobProdPC 0.187790 0.068316 2.748863 0.0083
Const 1.389676 0.110588 12.56625 0.0000
Number of Observations 51
H0: ESqrtTobProdPC = 0
H1: ESqrtTobProdPC < 0
.0083
Prob[Results IF H0 True] = = .0042
2
We do reject the null hypothesis at the 1 percent significance level.
c. Artificial Model:
Taxt = E Const + E TobProdPCSqrtTobProdPCt + JEstTax2 EstTaxt2 + et
Ramsey RESET Test
Dependent Variable: Tax
Explanatory Variable(s): Estimate SE t-Statistic Prob
SqrtTobProdPC 0.139110 0.302547 0.459796 0.6477
Const 0.549077 1.751680 0.313457 0.7553
Fitted^2 1.025663 0.924855 1.108999 0.2730
Number of Observations 51
Critical Result: The Fitted^2 coefficient is 1.03. The estimate does not equal 0; the
estimate is 1.03 from 0. This evidence suggests that the new form of
the information adds explanatory power.
H0: JEstTax2 = 0 New form of the information adds NO explanatory power
H1: JEstTax2 z 0 New form of the information adds explanatory power
Prob[Results IF H0 True] =.2730
We do not reject the null hypothesis at the 10 percent significance level. That is,
we do not reject the notion that the new form of the information adds no
explanatory power. Hence, there is no compelling need to consider a different
specification of the model.
69
5. a.
x The downward sloping theory of demand suggests that higher prices reduce the youth
smoking rate.
x Conventional wisdom suggests that higher incomes reduce the youth smoking rate.
b. SmokeRateYoutht = EConst + EPPriceConsumert + EIIncPCt + et
x EP < 0.
x EI < 0.
c.
Ordinary Least Squares (OLS)
Dependent Variable: SmokeRateYouth
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceConsumer 1.455381 0.665547 2.186743 0.0337
IncPC 0.202018 0.106776 1.891985 0.0645
Const 33.45011 3.651081 9.161700 0.0000
Number of Observations 51
70
Critical Result: The Fitted^2 coefficient is .103. The estimate does not equal 0; the
estimate is .103 from 0. This evidence suggests that the new form of
the information adds explanatory power.
H0: JEstYouthRate = 0 New form of the information adds NO explanatory
power
H1: JEstYouthRate z 0 New form of the information adds explanatory power
Prob[Results IF H0 True] = .3595
We do not reject the null hypothesis at the 1, 5, or 10 percent significance level. That is
we do not reject the notion that the new form of the information adds no explanatory
power. Hence, there is no compelling need to consider a different specification of the
model.
71
Chapter 13: Dummy and Interaction Variables
1.
dSSR
2( y1 bConst ) 2( y2 bConst ) 2( y3 bConst ) 0
dbConst
y1 bConst y2 bConst y3 bConst 0
y1 y2 y3 3bConst
y1 y2 y3
bConst
3
NB: bConst equals the mean of y.
1. A dummy variable takes on only two values: 0 and 1. The dummy variable divides all the
observations into two disjoint groups.
3. When we use averages to draw conclusions we implicitly assume that the only a single factor
affects the dependent variable. Typically, this is not a realistic assumption. For example,
when we use average salaries for men and women to investigate gender discrimination, we
implicitly assume that an individual’s gender is the only factor affecting his/her salary.
73
Solutions to Chapter 13 Exercises
3. a.
x The seniority system provides long time members of Congress more influence than
newly elected members; hence additional terms should increase the number of
earmarks. E Terms > 0.
x Conventional wisdom suggests that liberals are more supportive of government
projects than conservatives; hence, liberal members of Congress should be more
active in acquiring earmarks. ELiberal > 0.
x Equity suggests that low income states should receive more Federal government
assistance than high income states; hence, high income states should receive fewer
earmarks. E Income < 0.
b. Use the ordinary least squares (OLS) estimation procedure to estimate the coefficients.
Interpret the coefficient estimates.
74
Ordinary Least Squares (OLS)
Dependent Variable: Number
Explanatory Variable(s): Estimate SE t-Statistic Prob
Terms 0.260593 0.068817 3.786763 0.0002
ScoreLiberal 0.052310 0.011216 4.663870 0.0000
IncPC 0.083213 0.047239 1.761533 0.0789
Const 6.445098 2.010177 3.206235 0.0014
Number of Observations 425
5. a. Allegation: Members of Congress from the Northeast receive more earmarks than their
colleagues from other parts of the country.
ENE > 0.
b.
Ordinary Least Squares (OLS)
Dependent Variable: Number
Explanatory Variable(s): Estimate SE t-Statistic Prob
Terms 0.266513 0.068549 3.887917 0.0001
ScoreLiberal 0.048719 0.011280 4.319022 0.0000
IncPC 0.153651 0.056707 2.709577 0.0070
RegionNorthEast 2.061917 0.927902 2.222127 0.0268
Const 9.300573 2.377948 3.911176 0.0001
Number of Observations 425
75
Interpretation of Estimates:
bNE = 2.06: Members of Congress from the Northeast receive 2.06 more earmarks
than their colleagues from other regions of the nation.
c. H0: ENE = 0
H1: ENE > 0
.0268
d. Prob[Results IF H0 True] = | .013
2
We reject the allegation’s null hypothesis at a 5 percent significance level, but not at the 1
percent significance level.
76
Chapter 14: Omitted Explanatory Variables, Multicollinearity,
and Irrelevant Explanatory Variables
1. Goal of Multiple Regression Analysis: Multiple regression analysis attempts to sort out the
individual effect of each explanatory variable. An explanatory variable’s coefficient estimate
allows us to estimate the change in the dependent variable resulting from a change in that
particular explanatory variable while all other explanatory variables remain constant.
3. a. When the mean of the estimate’s probability distribution equals the actual value, the
estimation procedure is unbiased, the estimation procedure does not systematically
overestimate or underestimate the actual value.
b. When an estimation procedure is unbiased, the variance of the estimate’s probability
distribution indicates how reliable an estimate is. On the one hand, if the variance is
small, there is a high probability that the estimate is close to the actual value. On the
other hand, if the variance is large, the probability that the estimate lies close to the actual
value is small.
5. a. The value of one variable does not allow us to predict the value of the other.
b. We use a scatter diagram plotting each variable’s deviation from its mean illustrates
independence. Loosely speaking, when the variables are independent their scatter
diagram points are spread “evenly” across all four quadrants.
c. It equals 0.
7. a. 1.00
b. Yes, something unusual occurs: EViews error message: “Near singular matrix.”
1. a. The ordinary least squares (OLS) estimation procedure for an included explanatory
variable’s value can be biased.
b. An omitted explanatory variable bias problem arises whenever the omitted explanatory
variable:
x influences the dependent variable;
x is correlated with an included explanatory variable.
3. a. The ordinary least squares (OLS) estimated procedure for the included explanatory
variables’ values will be unbiased.
b. The variance of the probability distributions for the values of the relevant explanatory
variables will be larger when the irrelevant explanatory variable is included than when
the irrelevant variable is excluded.
77
Solutions to Chapter 14 Exercises
1. a.
x The downward sloping demand curve theory suggests that higher prices decrease per
capita cigarette consumption. E Price < 0.
x Conventional wisdom suggests that college educated individuals smoke less. EEduColl
< 0.
x Tobacco is important to the economy of those states that produce large quantities of
tobacco; hence, tobacco use will be more acceptable in those states and residents will
be more receptive to smoking. ETobProd > 0.
b.
Ordinary Least Squares (OLS)
Dependent Variable: CigConsPC
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceConsumer 11.18450 3.464492 3.228322 0.0023
EducCollege 1.235767 0.580736 2.127934 0.0386
TobProdPC 0.725932 0.310685 2.336554 0.0238
Const 150.4259 17.75788 8.470934 0.0000
Number of Observations 51
78
3. a.
x The downward sloping demand curve theory suggests that higher prices decrease per
capita cigarette consumption. E Price < 0.
x Conventional wisdom suggests that college educated individuals smoke less. EEduColl
< 0.
x Conventional wisdom suggests that higher income individuals smoke less.
EI < 0.
x Tobacco is important to the economy of those states that produce large quantities of
tobacco; hence, tobacco use will be more acceptable in those states and residents will
be more receptive to smoking. ETobProd > 0.
b.
Ordinary Least Squares (OLS)
Dependent Variable: CigConsPC
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceConsumer 12.30477 3.417315 3.600714 0.0008
EducCollege 2.543609 0.881137 2.886736 0.0059
IncPC 1.582120 0.818423 1.933132 0.0594
TobProdPC 0.737257 0.302072 2.440662 0.0186
Const 134.8625 19.04746 7.080337 0.0000
Number of Observations 51
79
x At the 1 percent significance level, we reject the demand, education, and production
null hypotheses.
x Since the income results refute the theory perhaps we should rethink the role of
income.
5. a.
x As a member of Congress has been in the House longer, he/she gains influence.
Consequently, he/she is more able to garner earmarks. ETerms > 0.
x Liberal members of Congress have a more activist view of government’s role than
conservative members. Consequently, members of Congress who have high liberal
ratings should be more active in their pursuit of earmarks. E Liberal > 0.
b.
Ordinary Least Squares (OLS)
Dependent Variable: Number
Explanatory Variable(s): Estimate SE t-Statistic Prob
Terms 0.256432 0.068947 3.719247 0.0002
ScoreLiberal 0.045667 0.010589 4.312564 0.0000
Const 3.122135 0.696198 4.484549 0.0000
Number of Observations 425
80
7. a.
x As a member of Congress has been in the House longer, he/she gains influence.
Consequently, he/she is more able to garner earmarks. ETerms > 0.
x Liberal members of Congress have a more activist view of government’s role than
conservative members. Consequently, members of Congress who have high liberal
ratings should be more active in their pursuit of earmarks. E Liberal > 0.
x Democratic members of Congress have a more activist view of government’s role
than their non-Democratic colleagues. Consequently, Democratic members of
Congress should be more active in their pursuit of earmarks. EDem > 0.
b.
Ordinary Least Squares (OLS)
Dependent Variable: Number
Explanatory Variable(s): Estimate SE t-Statistic Prob
Terms 0.259824 0.070022 3.710597 0.0002
ScoreLiberal 0.039769 0.023079 1.723173 0.0856
PartyDem1 0.367293 1.276776 0.287672 0.7737
Const 3.198703 0.746051 4.287513 0.0000
Number of Observations 425
81
Chapter 15: Other Regression Statistics and Pitfalls
1. a.
Ordinary Least Squares (OLS)
Dependent Variable: LogUsersTV
Explanatory Variable(s): Estimate SE t-Statistic Prob
Year 0.022989 0.015903 1.445595 0.1487
CapitalHuman 0.036302 0.001915 18.95567 0.0000
CapitalPhysical 0.001931 0.000510 3.789394 0.0002
GdpPC 0.058877 0.012338 4.772051 0.0000
Auth 0.063345 0.012825 4.939278 0.0000
Const 44.95755 31.77155 1.415025 0.1575
Number of Observations 742
83
3. a. i. Error message: “Near singular matrix.”
ii. 1
iii. No
b. i. Error message: “Near singular matrix.”
ii. No
c. i. Error message: “Near singular matrix.”
ii. No
d.
Ordinary Least Squares (OLS)
Dependent Variable: TotalSalary
Explanatory Variable(s): Estimate SE t-Statistic Prob
HomeSalary 1.000000 8.58E-17 1.17E+16 0.0000
VisitSalary 1.000000 8.61E-17 1.16E+16 0.0000
Const 0.000000 4.24E-15 0.000000 1.0000
Number of Observations 588
The standard errors are very small, approximately 0.
e.
Ordinary Least Squares (OLS)
Dependent Variable: Attendance
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceTicket 590.7836 184.7231 3.198211 0.0015
HomeSalary 783.0394 45.23955 17.30874 0.0000
Const 9246.429 1529.658 6.044767 0.0000
Number of Observations 585
84
Estimated Equation: EstAttendance = 3,698 + 1,896PriceTicket .088HomeSalary
iv.
x The estimate of the ticket price coefficient changes dramatically, from 591 to
1,896.
x The estimate of the home salary coefficient changes dramatically, from 783.0 to
.088.
5. No.
3. Goal of Multiple Regression Analysis: Multiple regression analysis attempts to sort out the
individual effect of each explanatory variable. An explanatory variable’s coefficient estimate
allows us to estimate the change in the dependent variable resulting from a change in that
particular explanatory variable while all other explanatory variables remain constant.
1. a.
Ordinary Least Squares (OLS)
Dependent Variable: LogUsersInternet
Explanatory Variable(s): Estimate SE t-Statistic Prob
Year 0.449654 0.017078 26.32965 0.0000
CapitalHuman 0.023725 0.002470 9.606597 0.0000
CapitalPhysical 0.002056 0.000480 4.286193 0.0000
GdpPC 0.118177 0.011461 10.31146 0.0000
Auth 0.095836 0.013999 –6.845761 0.0000
Const 899.3201 34.17432 –26.31567 0.0000
85
Estimated Equation: EstLogUsersInternet = 899.3 + .450Year +
.024CapitalHuman + .002CapitalPhysical + .118GdpPC
.096Auth
b. EYear
LB
.4161
EUB
Year .4832
The 95 percent confidence interval is from .4161 to .4832.
3. a.
Ordinary Least Squares (OLS)
Dependent Variable: PetroConsPC
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceReal 124.7236 77.09510 1.617789 0.1444
Const 609.9538 58.12769 10.49334 0.0000
Number of Observations 10 Mass1 = 1
86
b. i. EstPetroConsPCNeb = bConst + bNeb + (bP + bPNeb)PriceReal
ii. bNeb
iii. bP + bPNeb
c. No, we have 5 unknowns, but only 4 equations:
bMass = 610.0 bP + bPMass = 124.7
bNeb = 1,418.4 bP + bPNeb = 540.5
d. Error message: Near singular matrix
e. Yes
87
Chapter 16: Heteroskedasticity
1.
x Error Term Equal Variance Premise: The variance of the error term’s
probability distribution for each observation is the same; all the variances equal
Var[e]:
Var[e1] = Var[e2] = … = Var[eT] = Var[e]
x Error Term/Error Term Independence Premise: The error terms are
independent: Cov[ei, ej] = 0.
Knowing the value of the error term from one observation does not help you
predict the value of the error term for any other observation.
x Explanatory Variable/Error Term Independence Premise: The explanatory
variables, the xt’s, and the error terms, the et’s, are not correlated.
Knowing the value of an observation’s explanatory variable does not help
you predict the value of that observation’s error term.
3. The error term equal variance premise played an important role. If this premise were violated
the variance would differ from observation to observation. Var[e] would not exist. The
expression for Var[bx] would be much more complicated.
et
where H t
GdpPCt
89
et
Var[εt ] Var[ ]
GdpPCt
p Arithmetic of variances: Var[cx ] c 2 Var[x ]
1
Var[et ]
GdpPCt
p Var[et ] =V u GpcPCt where V equals a constant
1
V u GpcPCt
GdpPCt
V
1. Heteroskedasticity occurs whenever the variance of the error terms’ probability distributions
is not identical; that is, whenever the variance of the error terms’ probability distributions
differ from observation to observation.
3. a. The ordinary least squares (OLS) estimation procedure for the coefficient value is
unbiased.
b.
x The ordinary least squares (OLS) estimation procedure for the variance of the error
term’s probability distribution is flawed because it is based on a false premise; hence,
the standard errors and call calculations based on the standard errors are flawed.
x The ordinary least squares (OLS) estimation procedure is not the best linear unbiased
estimation procedure (BLUE).
1. a.
x A state with a higher crime rate should have more court cases and hence would spend
more on its judicial system: ECrimes > 0.
x A state with a higher per capita GDP has a larger tax base and hence, can afford to
spend more on its judicial system: EGDP > 0.
b.
Ordinary Least Squares (OLS)
Dependent Variable: JudExp
Explanatory Variable(s): Estimate SE t-Statistic Prob
CrimesAll 0.321427 0.408012 0.787787 0.4348
GdpPC 0.298469 0.063159 4.725673 0.0000
Const –2425.185 2705.168 –0.896501 0.3746
Number of Observations 50
90
Interpretation of Estimates:
bCrimesAll = .321: A 1 per 100,000 persons rise in the crime rate increases judicial
expenditures by $.321 per 100,000 persons.
bGDP = .298: A $1 rise in per capita state income increases judicial expenditures by
.298 per 100,000 persons.
Critical Result: CrimesAll coefficient estimate equals .321. This evidence, the positive
sign of the coefficient estimate, suggests that higher crime rates
increase judicial expenditures thereby supporting the theory.
GdpPC coefficient estimate equals .298. This evidence, the positive
sign of the coefficient estimate, suggests that higher per capita GDP
increases judicial expenditures thereby supporting the theory.
c. Crime Rate GDP
H0: ECrimes = 0 H0: EGDP = 0
H1: ECrimes > 0 H1: EGDP > 0
d. Crime Rate GDP
.4348 .0001
| .217 | < .0001
2 2
x At the 1 percent significance level we reject the GDP null hypothesis.
x Even at a 10 percent significance level we cannot reject the crime rate null
hypothesis.
3. a.
Original Model: JudExpt = EConst + ECrimesCrimesAllt + EGDPGdpPCt + et
Divide both sides of the equation by GdpPCt
.
JudExpt
Tweaked Model :
GdpPCt
EConst CrimesAllt GdpPCt et
ECrimes EGDP
GdpPCt GdpPCt GdpPCt GdpPCt
EConst CrimesAllt
ECrimes EGDP GdpPCt Ht
GdpPCt GdpPCt
et
where H t
GdpPCt
91
et
Var[εt ] Var[ ]
GdpPCt
p Arithmetic of variances: Var[cx] c 2 Var[x]
1
Var[et ]
GdpPCt
p Var[et ] =V u GpcPCt where V equals a constant
1
V u GpcPCt
GdpPCt
V
b.
Ordinary Least Squares (OLS)
Dependent Variable: AdjJudExp
Explanatory Variable(s): Estimate SE t-Statistic Prob
AdjCrimesAll 0.308650 0.375938 0.821013 0.4158
AdjGdpPC 0.299664 0.062667 4.781868 0.0000
AdjConst 2413.696 2517.482 0.958774 0.3426
Number of Observations 50
5. a.
x A state with a higher unemployment rate should increase its poverty rate: EUnem > 0.
x A state with a higher burglary rate should depress economic conditions increasing the
poverty rate: EBurg > 0.
b.
Ordinary Least Squares (OLS)
Dependent Variable: PovRate
Explanatory Variable(s): Estimate SE t-Statistic Prob
UnemRate 0.446262 0.380817 1.171854 0.2472
Burglaries 0.006463 0.001780 3.630155 0.0007
Const 4.949594 2.049562 2.414952 0.0197
Number of Observations 50
92
Critical Result: Unem coefficient estimate equals .446. This evidence, the positive sign
of the coefficient estimate, suggests that higher unemployment rates
increase the poverty rate thereby supporting the theory.
Burglaries coefficient estimate equals .00646. This evidence, the
positive sign of the coefficient estimate, suggests that higher a higher
burglary rate increases the poverty rate thereby supporting the theory.
c. UnemRate Burglaries
H0: EUnem = 0 H0: EBurg = 0
H1: EUnem > 0 H1: EBurg > 0
d. UnemRate Burglaries
.2472 .0007
| .124 | .0004
2 2
x At the 1 percent significance level we reject the Burglaries null hypothesis.
x Even at a 10 percent significance level we cannot reject the unemployment rate null
hypothesis.
7. a.
Original Model: PovRatet = EConst + EUnemUnemRatet + E BurgBurglariest + et
Multiply both sides of the equation by Popt .
Tweaked Model : PovRatet Popt
EConst Popt EUnemUnemRatet Popt E Burg Burglariest Popt et Popt
EConst Popt EUnemUnemRatet Popt E Burg Burglariest Popt Ht
where H t et Popt
93
b.
Ordinary Least Squares (OLS)
Dependent Variable: AdjPovRate
Explanatory Variable(s): Estimate SE t-Statistic Prob
AdjUnemRate 0.814434 0.331685 2.455442 0.0178
AdjBurglaries 0.005746 0.001291 4.451748 0.0001
AdjConst 3.059615 2.097694 1.458561 0.1513
Number of Observations 50
94
Chapter 17: Autocorrelation (Serial Correlation)
1.
x Error Term Equal Variance Premise: The variance of the error term’s
probability distribution for each observation is the same; all the variances equal
Var[e]:
Var[e1] = Var[e2] = … = Var[eT] = Var[e]
x Error Term/Error Term Independence Premise: The error terms are
independent: Cov[ei, ej] = 0.
Knowing the value of the error term from one observation does not help you
predict the value of the error term for any other observation.
x Explanatory Variable/Error Term Independence Premise: The explanatory
variables, the xt’s, and the error terms, the et’s, are not correlated.
Knowing the value of an observation’s explanatory variable does not help
you predict the value of that observation’s error term.
3. The error term/error term independence premise played an important role. If this premise
were violated, the error term covariance terms would not equal 0. The expression for Var[bx]
would be much more complicated.
5. a. The value of one variable does not allow us to predict the value of the other.
b. We use a scatter diagram plotting each variable’s deviation from its mean illustrates
independence. Loosely speaking, when the variables are independent that scatter diagram
points are spread “evenly” across all four quadrants.
c. Both are approximately 0.
7. Rest = yt Estyt
Substituting for yt
yt = EConst + Exxt + et
95
Solutions to Chapter 17 Review Questions
1. Autocorrelation exists whenever one observation’s error term is correlated with the previous
observation’s error term.
3. a. The ordinary least squares (OLS) estimation procedure for the coefficient value is
unbiased.
b.
x The ordinary least squares (OLS) estimation procedure for the variance of the error
term’s probability distribution is flawed because it is based on a false premise; hence,
the standard errors and call calculations based on the standard errors are flawed.
x The ordinary least squares (OLS) estimation procedure is not the best linear unbiased
estimation procedure (BLUE).
1. a. The downward sloping theory of demand postulates that a rise in the real price of
petroleum should decrease petroleum consumption. E P < 0.
b.
Ordinary Least Squares (OLS)
Dependent Variable: PetroConsPC
Explanatory Variable(s): Estimate SE t-Statistic Prob
PriceReal 53.63777 18.00446 2.979138 0.0054
Const 563.4323 16.26989 34.63038 0.0000
Number of Observations 35
96
3. a. Autocorrelation model:
et = Uet1 + vt vt’s are independent
or
vt = et Uet1
Start with the original model:
PetroConsPCt = EConst + E I PriceRealt + et
Lag it one year and multiply by U:
U PetroConsPCt–1 = UEConst + EIU PriceRealt–1 + Uet–1
Subtract
PetroConsPCt U PetroConsPCt–1 = (1U)EConst + EI(PriceRealt UPriceRealt–1)
+ et Uet–1
Substituting vt for et Uet–1:
PetroConsPCt U PetroConsPCt–1 = (1U)EConst + EI(PriceRealt UPriceRealt–1) + vt
vt’s are independent
b. Use the estimated value of U to generate two new variables:
AdjPetroConsPC = PetroConsPC .77PetroConsPC(1)
AdjPriceReal = PriceReal .77PriceReal(1)
5. a.
x An increase in per capita income should reduce the crime rate. EI < 0.
x An increase in the unemployment rate should increase the crime rate. EUn > 0.
b.
Ordinary Least Squares (OLS)
Dependent Variable: CrimesAll
Explanatory Variable(s): Estimate SE t-Statistic Prob
IncPC 0.693721 0.116784 5.940206 0.0000
UnemRate 114.1572 111.4697 1.024109 0.3201
Const 14897.77 2369.554 6.287163 0.0000
Number of Observations 20
97
Critical Result: IncPC coefficient estimate equals .694. This evidence, the negative
sign of the coefficient estimate, suggests that higher per capita income
reduces crime thereby supporting the theory.
UnemRate coefficient estimate equals 114.2. This evidence, the
positive sign of the coefficient estimate, suggests that higher
unemployment rates increase crime thereby supporting the theory.
c. Income Unemployment
H0: E I = 0 H0: EUn = 0
H1: E I < 0 H1: EUn > 0
.0001 .3201
d. | < .0001 | .160
2 2
x At the 1 percent significance level we reject the income null hypothesis.
x Even at the 10 percent significance level we do not reject the unemployment
null hypothesis.
7. a. Autocorrelation model:
et = Uet1 + vt vt’s are independent
or
vt = et Uet1
Start with the original model:
CrimesAllt = EConst + EIIncPCt + EUnUnemRatet + et
Lag it one year and multiply by U:
UCrimesAllt–1 = UEConst + EIUIncPCt–1 + EUnUUnemRatet–1 + Uet–1
Subtract
CrimesAllt UCrimesAllt–1 = (1U)EConst + E I(IncPCt UIncPCt–1) + EUn(UnemRatet
UUnemRatet–1) + et Uet–1
Substituting vt for et Uet–1:
CrimesAllt UCrimesAllt–1 = (1U)EConst + E I(IncPCt UIncPCt–1)
+ EUn(UnemRatet UUnemRatet–1) + vt
vt’s are independent
b. Use the estimated value of U to generate three new variables:
AdjCrimesAll = CrimesAll .79CrimesAll(1)
AdjIncPC = IncPC .79IncPC(1)
AdjUnemRate = UnemRate .79UnemRate (1)
Ordinary Least Squares (OLS)
Dependent Variable: AdjCrimesAll
Explanatory Variable(s): Estimate SE t-Statistic Prob
AdjIncPC 0.324473 0.122196 2.655344 0.0173
AdjUnemRate 144.7625 55.71745 2.598153 0.0194
AdjConst 1747.274 467.3090 3.739012 0.0018
Number of Observations 19
98
Chapter 18: Explanatory Variable/Error Term Independence
Premise, Consistency, and Instrumental Variables
1. a. i.
ii. Decrease.
iii. Negatively.
b. i.
ii. Observationxt et yt
1 2 4 11
2 6 2 11
3 10 3 14
4 14 2 11
5 18 1 14
6 22 4 13
iii. Plot the x and y values for each of the 6
observations.
c. See above.
d. The best fitting line is less steeply sloped than the actual line.
1. a. Upward bias.
b. Downward bias.
c. Unbiased.
5. No. An unbiased estimation procedure will not be consistent whenever the variance of the
estimate’s probability distribution remains constant or increases as the sample size increases.
99
Solutions to Chapter 18 Exercises
1. a. .50
b. .0156
c. 32 percent
100
Chapter 19: Measurement Error and the Instrumental
Variables Estimation Procedure
1. a. Sometimes you will be a little premature in clicking the stop watch button. Other times
you will be a little late. It is humanly impossible to measure the actual elapsed time
perfectly. No matter how careful you are, sometimes the measured value will be a little
low and other times a little high.
b. i. Half
ii. Half
iii. 0
1. a. The ordinary least squares (OLS) estimation procedure for the coefficient value will still
be unbiased.
b. The variance of the coefficient estimate’s probability distribution will be greater as a
consequence of dependent variable measurement error. This occurs because measurement
error introduces more “more uncertainty” into the mix.
3. a. No. The ordinary least squares (OLS) estimated procedure for the coefficient value is
biased when measurement error is present (and the actual value of the coefficient is
nonzero).
b. While the instrumental variable (IV) estimation procedure for the coefficient value is still
biased, it is consistent.
1. Explanatory variable measurement error would be introduced. The ordinary least squares
(OLS) estimation procedure for the value of the SATScores coefficient would be biased
toward 0 if SAT scores do have an impact on exam scores.
3. Dependent variable measurement error would be introduced. The ordinary least squares
(OLS) estimation procedure
x for the value of the ProbScores and SATScores coefficients would not be biased.
But
x the variance of the estimates’ probability distributions would be greater as a
consequence of the randomization.
101
5. a.
Ordinary Least Squares (OLS)
Dependent Variable: IncAnnPC
Explanatory Variable(s): Estimate SE t-Statistic Prob
Coll 0.783234 0.077323 10.12934 0.0000
Const 12.76024 2.140549 5.961198 0.0000
Number of Observations 51
7. a.
x A state with a higher crime rate should have more court case and hence would spend
more on its judicial system: ECrimes > 0.
x A state with a higher per capita GDP has a larger tax base and hence, can afford to
spend more on its judicial system: EGDP > 0.
b.
102
expenditures by $.321 per 100,000 persons.
bGDP = .298: A $1 rise in per capita state income increases judicial expenditures by
$.298 per 100,000 persons.
Critical Result: CrimeAll coefficient estimate equals .321. This evidence, the positive
sign of the coefficient estimate, suggests that higher crime rates
increase judicial expenditures thereby supporting the theory.
GdpPC coefficient estimate equals .298. This evidence, the positive
sign of the coefficient estimate, suggests that higher per capita GDP
increases judicial expenditures thereby supporting the theory.
9.
x First, we used annual disposable income as an explanatory variable and applied the
ordinary least squares (OLS) estimation procedure. We estimated that a 1 per 100,000
persons rise in crime rate increases judicial expenses by $.321 per 100,000 persons. But
we believe that an explanatory variable measurement error problem is present here.
x Second, we used an instrumental variable (IV) approach which resulted in a higher
estimate for the impact of permanent income. We estimated that a 1 per 100,000 persons
rise in crime rate increases judicial expenses by $5.12 per 100,000 persons.
103
Chapter 20: Omitted Variables and the Instrumental Variable
Estimation Procedure
1. a.
yt = EConst + Ex1x1t + Ex2 x2t + et
= EConst + Ex1x1t + (Ex2x2t + et)
= EConst + Ex1x1t + Ht
Ht = Ex2x2t + et
b. Yes, the explanatory variable, x1t, and the second model’s error term, Ht, are positively
correlated.
x1t and x2t
positively correlated
Included Typically, omitted
o
variable x1t up variable x2t up
p p Ht = Ex2x2t + et
x1t up Ht up Ex2 > 0
3. .60
105
3.
x Instrument/”Problem” Explanatory Variable Correlation: The instrument,
AdvDegt, must be correlated with the “problem” explanatory variable, PopDent.
x Instrument/Error Term Independence: The instrument, AdvDegt, and the error
term, Ht, must be independent.
5.
x Instrument/”Problem” Explanatory Variable Correlation: The correlation
coefficient of AdvDegt and PopDent equals .77. This along with the results from
IV Regression 1 suggests that this condition is satisfied.
x Instrument/Error Term Independence: We cannot determine if this condition is
met.
7.
VoteDemPartyTwot = EConst + EGdpGth RealGdpGrowtht
+ EPopDenPopDent + ELibLiberalt + et
+ (ELibLiberalt + et)
+ Ht
where Ht = ELibLiberalt + et
106
9. a.
Ordinary Least Squares (OLS)
Dependent Variable: PopDen
Explanatory Variable(s): Estimate SE t-Statistic Prob
AdvDeg 314.3409 37.54406 8.372587 0.0000
Const –2748.974 393.1890 –6.991482 0.0000
Number of Observations 51
b.
Ordinary Least Squares (OLS)
Dependent Variable: VoteDemPartyTwo
Explanatory Variable(s): Estimate SE t-Statistic Prob
RealGdpGrowth 0.867041 0.416006 2.084205 0.0425
EstPopDen 0.008464 0.000957 8.844707 0.0000
Const 49.19420 1.049180 46.88823 0.0000
Number of Observations 51
107
Chapter 21: Panel Data and Omitted Variables
1. For each cross section entity, the value of the omitted variable does not vary from time period
to time period.
3. For each period, the value of the omitted variable does not vary from cross section entity to
cross section entity.
5. With heteroskedasticity and autocorrelation we exploited the relationship among the error
terms to improve the estimation procedure. The random effects approach uses the same
strategy.
109
Solutions to Chapter 21 Exercises
1. a.
x The downward sloping demand curve theory suggests that a rise in the price of beer
decreases per capita beer consumption. E P < 0.
x Conventional wisdom suggests that beer is an inferior good; as per capita income
rises, per capita beer consumption decreases. EI < 0.
b.
Ordinary Least Squares (OLS)
Dependent Variable: BeerPC
Explanatory Variable(s): Estimate SE t-Statistic Prob
Price 80.51845 304.6151 0.264329 0.7916
IncPC 2.414644 1.109782 2.175782 0.0301
Const 394.2818 122.3245 3.223244 0.0014
Number of Observations 459
Cross Sections 51
Periods 9
110
3.
Ordinary Least Squares (OLS)
Dependent Variable: LogUsersInternet
Explanatory Variable(s): Estimate SE t-Statistic Prob
Year 0.449654 0.017078 26.32965 0.0000
CapitalHuman 0.023725 0.002470 9.606597 0.0000
CapitalPhysical 0.002056 0.000480 4.286193 0.0000
GdpPC 0.118177 0.011461 10.31146 0.0000
Auth 0.095836 0.013999 –6.845761 0.0000
Const 899.3201 34.17432 –26.31567 0.0000
Number of Observations 566
Cross Sections 110
Periods 8
The coefficient estimate of Year is positive suggesting that per capita Internet use is
growing over time after accounting for human capital, physical capital, per capita GDP,
and the political environment. More specifically, we estimate that per capita Internet use
is growing by 45 percent annually after accounting for the other explanatory factors.
5. a. The downward sloping demand curve theory suggests that a rise in the price of gasoline
decreases per capita motor fuel consumption.
b.
Ordinary Least Squares (OLS)
Dependent Variable: MotorFuelPC
Explanatory Variable(s): Estimate SE t-Statistic Prob
Price 50.72342 21.87900 2.318361 0.0224
Const 509.3529 21.98667 23.16644 0.0000
Number of Observations 105
Cross Sections 3
Periods 35
.0224
d. | .011
2
111
At the 5 percent significance level, we reject the null hypothesis that the real price of
gasoline has no impact on beer consumption. On the other hand, at the 1 percent
significance level, we do not reject the null hypothesis.
e.
The first 35 observations are for Arkansas, the second 35 observations for Massachusetts,
and the last 35 observations for Washington. The residuals for
x Arkansas are positive.
x Massachusetts are negative.
x Washington approximately zero.
This suggests that the inclusion of cross section random effects may be
appropriate.
112
Chapter 22: Simultaneous Equations Models—Introduction
3. a. Multiple regression analysis attempts to sort out the individual effect that each
explanatory variable has on the dependent variable.
b. Each explanatory variable’s coefficient reveals the individual impact that the explanatory
variable has on the dependent variable; that is, each explanatory variable’s coefficient
tells us how changes in that explanatory variable affect the dependent variable while all
other explanatory variables remain constant.
c. 'y = bx1'Vbl1
d. 'y = bx2'Vbl2
e. 'y = bx1'Vbl1 + bx2'Vbl2
5. a. P = 6 + .01Inc + .4FeedP
= 6 + .01u4,000 + .4u40
= 6 + 40 + 16
= 50
1. Endogenous variables are variables whose values are determined “within” the model.
3. a. No.
b. No.
5. a. No.
b. Yes.
113
Solutions to Chapter 22 Exercises
1. a. bID estimates how the quantity of beef demanded changes when income changes while
the price of beef remains constant.
QD
bID while P remains constant
'I
'QD = bID 'I while the price of beef remains constant
b. i. 'P = 1.0562'FeedP + .018825'Inc
ii. 0 = 1.0562'FeedP + .018825'Inc
Solving for 'FeedP
1.0562'FeedP = .018825'Inc
.018825
'FeedP = 'Inc
1.0562
= .017233'Inc
c. i. 'Q = 332.00'FeedP + 17.347'Inc
ii. From b2: 'FeedP = .017233'Inc
= 332.00u(.017233'Inc) + 17.347'Inc
= 5.917'Inc + 17.347'Inc
= 23.264'Inc
'Q
23.264 while P remains constant
'Inc
bID = 23.264
3.
log(Qt ) DConst
Q
D FP
Q
log( FeedPt ) D IQ log( Inct ) H tQ
log( Pt ) D Const
P
D FP
P
log( FeedPt ) D IP log( Inct ) H tP
114
b. Own Price Elasticity of Supply = bPS
bPS estimates the own price elasticity of supply, the percent change in quantity supplied
resulting from a 1 percent change in the price while the feed price remains constant.
'LogQ S
bPS while LogFeedP remains constant
'LogP
While LogFeedP remains constant: 'LogFeedP = 0
'LogQ = .08452'LogFeedP + .3595'LogInc = .3595'LogInc
'LogP = .3317'LogFeedP + .7503'LogInc = .7503'LogInc
S 'LogQ S .3595'LogInc .3595
b .479
'LogP .7503'LogInc
P
.7503
Interpretation: We estimate that a 1 percent increase in the price of beef increases the
quantity of beef supplieded by .479 percent.
7.
Ordinary Least Squares (OLS)
Dependent Variable: Q
Explanatory Variable(s): Estimate SE t-Statistic Prob
FeedP 533.6727 139.9106 3.814384 0.0003
Inc 10.77606 1.628569 6.616890 0.0000
Const 54222.01 12057.98 4.496775 0.0000
Number of Observations 72
115
Interpretation of Estimates:
P
aFP = 1.3427: A 1 cent increase in the price of cattle feed increases the price of beef
by 1.3427 cents.
aIP = .010833: A 1 billion dollar increase in real disposable income increases the price
of beef by .010833 cents.
a. 'Q = 533.67'FeedP + 10.776'Inc
b. 'P = 1.3427'FeedP + .010833'Inc
9. a. An increase in police expenditures provides the populace with more police protection
which decreases the crime rate: E PC < 0.
b. An increase in the unemployment rate increases the chances that crimes will be
committed: EUC > 0.
11.
Crimest DConst
c
DUCUnemRatet D GC GdpPCt H tC
PoliceExpt DConst
P
DUPUnemRatet D GP GdpPCt H tP
13. a. bPC estimates how the crime rate changes when police expenditures change while the
unemployment rate remains constant.
'Crimes
bPC whileUnemRate remains constant
'PoliceExp
b. While UnemRate remains constant, 'UnemRate = 0:
'Crimes = 23.814'UnemRate .025271'GdpPC = .025271'GdpPC
'PoliceExp = 12.174'UnemRate + .019163'GdpPC = .019163'GdpPC
'Crimes .025271'GdpPC .025271
bPC 1.319
'PoliceExp .019163'GdpPC .019163
116
Chapter 23: Simultaneous Equations Models—Identification
1.
Ordinary Least Squares (OLS)
Dependent Variable: P
Explanatory Variable(s): Estimate SE t-Statistic Prob
FeedP 1.056242 0.286474 3.687044 0.0003
Inc 0.018825 0.005019 3.750636 0.0003
Const 33.02715 31.04243 1.063936 0.2895
Number of Observations 120
3. a.
Ordinary Least Squares (OLS)
Dependent Variable: Q
Explanatory Variable(s): Estimate SE t-Statistic Prob
EstP 921.4783 113.2551 8.136309 0.0000
FeedP –1305.262 121.2969 –10.76089 0.0000
Const 108291.8 16739.33 6.469303 0.0000
1. A model is underidentified whenever we cannot estimate all the parameters of the original
model using the reduced form (RF) estimation procedure.
3. a. The two procedures provide identical results whenever the model is identified or
underidentified.
b. The two procedures provide different results whenever the model is overidentified. The
reduced form (RF) estimation provides multiple estimates for one or more of original
model’s parameters; two stage least squares (TSLS) provides only a single estimate.
117
Solutions to Chapter 23 Exercises
1. a.
Ordinary Least Squares (OLS)
Dependent Variable: LogQ
Explanatory Variable(s): Estimate SE t-Statistic Prob
LogFeedP 0.178585 0.027126 6.583483 0.0000
Const 12.89211 0.101850 126.5791 0.0000
Number of Observations 120
Q
Estimate: aFP = .1786
'LogQ = .1786'LogFeedP
i. Yes. We can still estimate the own price elasticity of demand because the feed price
data “tells us” when the supply curve shifts allowing us to use the equilibria to
estimate the slope of the demand curve.
'LogQ D .1786'LogFeedP .1786
bPD 1.319
'LogP .1354'LogFeedP .1354
ii. No. Without chicken price and income data we do not know when the demand curve
shifts. We cannot use the equilibria to estimate the slope of the supply curve.
b. i. Yes. There is one estimate: 1.319.
Two Stage Least Squares (TSLS)
Dependent Variable: LogQ
Instrument(s): LogFeedP
Explanatory Variable(s): Estimate SE t-Statistic Prob
LogP 1.318616 0.669960 1.968203 0.0514
Const 18.93950 3.412885 5.549410 0.0000
Number of Observations 120
118
3. a.
Ordinary Least Squares (OLS)
Dependent Variable: LogQ
Explanatory Variable(s): Estimate SE t-Statistic Prob
LogFeedP 0.092794 0.028875 3.213667 0.0017
LogInc 0.333895 0.062472 5.344703 0.0000
LogChickP 0.040721 0.059085 0.689185 0.4921
Const 9.583209 0.490918 19.52098 0.0000
Number of Observations 120
Estimates: Q
aFP = .0928 Q
aInc = .3339 Q
aCP = .04072
'LogQ = .0928'LogFeedP + .3339'LogInc + .04072'LogChickP
Estimates: P P P
aFP = .3207 aInc = .7161 aCP = .05431
'LogP = .3207'LogFeedP + .7161'LogInc + .05431' LogChickP
i. Yes. We can estimate the own price elasticity of demand because the feed price data
“tells us” when the supply curve shifts allowing us to use the equilibria to estimate
the slope of the demand curve.
'LogQ D .0928'LogFeedP .0928
bPD .289
'LogP .3207'LogFeedP .3207
ii. Yes. We can estimate the own price elasticity of supply because the chicken price
and income data “tell us” when the demand curve shifts allowing us to use the
equilibria to estimate the slope of the supply curve. The reduced form (RF)
estimation procedure provides two estimates:
'LogQ S .3339'LogInc .3339
bPS .466
'LogP .7161'LogInc .7161
'LogQ S .04072'LogChickP .04072
bPS .450
'LogP .05431'LogChickP .05431
119
b. i. Yes. There is one estimate: .289.
Two Stage Least Squares (TSLS)
Dependent Variable: LogQ
Instrument(s): LogFeedP LogInc LogChickP
Explanatory Variable(s): Estimate SE t-Statistic Prob
LogP 0.289355 0.044935 6.439347 0.0000
LogInc 0.541101 0.027894 19.39826 0.0000
LogChickP 0.056435 0.030582 1.845364 0.0675
Const 8.896469 0.191751 46.39605 0.0000
Number of Observations 120
5. a.
Ordinary Least Squares (OLS)
Dependent Variable: LogQ
Explanatory Variable(s): Estimate SE t-Statistic Prob
LogInc 0.717732 0.077586 9.250758 0.0000
LogPorkP 0.347416 0.052969 6.558847 0.0000
Const 3.458030 0.645048 5.360887 0.0000
Number of Observations 72
Estimates: Q Q
aInc = .7177 aPP = .3474
'LogQ = .7177'LogInc + .3474'LogPorkP
120
Ordinary Least Squares (OLS)
Dependent Variable: LogP
Explanatory Variable(s): Estimate SE t-Statistic Prob
LogInc 0.268894 0.138588 1.940241 0.0564
LogPorkP 0.044724 0.094616 0.472686 0.6379
Const 2.253325 1.152212 1.955651 0.0546
Number of Observations 72
Estimates: P
aInc = .2689 P
aPP = .04472
'LogP = .2689'LogInc .04472' LogPorkP
i. No. Without feed price data we do not know when the supply curve shifts. We cannot
use the equilibria to estimate the slope of the demand curve.
ii. Yes. We can still estimate the own price elasticity of supply because the pork price
and income data “tell us” when the demand curve shifts allowing us to use the
equilibria to estimate the slope of the supply curve. The reduced form (RF)
estimation procedure provides two estimates:
'LogQ S .7177'LogInc .7177
bPS 2.67
'LogP .2689'LogInc .2689
'LogQ S .3474'LogPorkP .3474
bPS 7.77
'LogP .044721'LogPorkP .04472
b. i. No.
ii. Yes. There is one estimate: 2.67.
Two Stage Least Squares (TSLS)
Dependent Variable: LogQ
Instrument(s): LogInc LogPorkP
Explanatory Variable(s): Estimate SE t-Statistic Prob
LogP 2.667617 1.593295 1.674277 0.0985
Const 0.180328 6.847074 0.026337 0.9791
Number of Observations 72
7. None.
9. One.
121
Chapter 24: Binary and Truncated Dependent Variables
1. The more densely populated states tend to have more voters who are registered Democrats
than Republicans; consequently, Democrats would be more likely to win states whose
population density is high. EDen > 0.
3. a.
Ordinary Least Squares (OLS)
Dependent Variable: WinDem1
Explanatory Variable(s): Estimate SE t-Statistic Prob
PopDen 0.001001 0.000236 4.247280 0.0001
Const 0.192315 0.074299 2.588406 0.0127
Number of Observations 50
123
5.
3. Yes. In a probit model, the best fitting is S-shaped which can never be less than 0 or greater
than 1. Hence, estimates of the dependent variable always lie between 0 and 1.
1. a. The party winning a state’s Electoral College vote depends on the state’s population
density; as a state becomes more densely populated, the Democrats rather than the
Republicans become more likely to win. EDen < 0.
b.
Ordinary Least Squares (OLS)
Dependent Variable: WinRep1
Explanatory Variable(s): Estimate SE t-Statistic Prob
PopDen 0.001001 0.000236 4.247280 0.0001
Const 0.807685 0.074299 10.87080 0.0000
Number of Observations 50
124
Estimated Equation: EstWinRep1 = .808 .001PopDen
Interpretation of Estimates:
bDen = .001: A 1 person per square mile increase in a state’s population density
decreases the probability of a Republican victory by .001.
Critical Result: The PopDen coefficient estimate equals .001. This evidence, the
negative sign of the coefficient estimate, suggests that higher
population density decreases the probability of a Republican victory
thereby supporting the theory.
c. H0: EDen = 0
H1: EDen < 0
.0001
d. Prob[Results IF H0 True] = < .0001
2
At the 1 percent significance level, we reject the null hypothesis that population density
has no impact of the probability of a Republican victory.
e. Estimated Equation: EstWinDem1 = .192 + .001PopDen
Yes, the results are consistent. A 1 person per square mile increase in population density
increases the probability of a Democratic victory by .001 which is equivalent to
decreasing the probability of a Republican victory by .001.
3. a.
Probit
Dependent Variable: WinDem1
Explanatory Variable(s): Estimate SE z-Statistic Prob
PopDen 0.009060 0.003073 2.947983 0.0032
Const –0.899796 0.342966 –2.623574 0.0087
Number of Observations 50
Critical Result: The PopDen coefficient estimate equals .009. This evidence, the
positive sign of the coefficient estimate, suggests that higher
population density increases the probability of a Democratic victory
thereby supporting the theory.
b. H0: EDen = 0
H1: EDen > 0
.0032
c. Prob[Results IF H0 True] = | .002
2
At the 1 percent significance level, we reject the null hypothesis that population density
has no impact on the probability of a Democratic victory.
125
5. a.
Probit
Dependent Variable: WinDem1
Explanatory Variable(s): Estimate SE z-Statistic Prob
PopDen 0.008500 0.003279 2.592533 0.0095
UnemTrend 0.175342 0.389554 0.450110 0.6526
Const 1.003798 0.416943 2.407520 0.0161
Number of Observations 50
Critical Result: The PopDen coefficient estimate equals .0085. This evidence, the
positive sign of the coefficient estimate, suggests that higher
population density increases the probability of a Democratic victory
thereby supporting the theory.
The UnemTrend coefficient estimate equals .175. This evidence, the
positive sign of the coefficient estimate, suggests that an upward trend
in the unemployment rate increases the probability of a Democratic
victory thereby supporting the theory.
b. Yes. The signs of the coefficient estimates lend support to the theories.
c. The precise numerical values differ, but this is hardly surprising. What is a little
troubling, however, is the Prob. column value for the UnemTrend coefficient estimate has
changed dramatically. Even at the 10 percent significant level we can no longer reject the
null hypothesis that the unemployment trend has no effect on the probability of a
Democratic victory.
7.
126
9.
Tobit
Dependent Variable: HeatDegDays
Explanatory Variable(s): Estimate SE z-Statistic Prob
HighTemp 0.951145 0.027690 34.34919 0.0000
Const 72.71311 1.873004 38.82165 0.0000
Number of Observations 365
a. The absolute value of the Tobit estimate is greater indicating that the Tobit best fitting
line is more steeply sloped than the ordinary least squares (OLS) best fitting line.
b. The Tobit estimation procedure accounts for the fact that heating degree days are
truncated at 0. Those days that require cooling rather than heating are ignored when we
just consider heating degree days.
127
Chapter 25: Descriptive Statistics, Probability, and Random
Variables—A Closer Look
3
1. a. i. = .6.
5
2
ii. = .4.
5
1
b. i. = .5.
2
1
ii. = .5.
2
3
c. i. = .75.
4
1
ii. = .25.
4
3. a. Mean[v ] ¦ v Prob[v]
All v
b. Var[v ] ¦ (v Mean[v])
All v
2
Prob[v]
3. An event tree visually displays the mutually exclusive outcomes (events) of a random
process.
5. a. The value of one random variable does not affect the probability distribution of the other.
b. 0.
129
Solutions to Chapter 25 Exercises
1. a.
b. Mean = 719
Mode = 750
Median = 730
c. The left tail is longer than the right tail. Accordingly, the left tail “drags” the mean down.
3. a.
130
b. Prob[Yellow Guilty AND Yellow Reported] = .9 u .8 = .72
Prob[Yellow Guilty AND Orange Reported] = .9 u .2 = .18
Prob[Orange Guilty AND Orange Reported] = .1 u .8 = .08
Prob[Orange Guilty AND Yellow Reported] = .1 u .2 = .02
c. Prob[Yellow Reported] = .74
Prob[Orange Reported] = .26
d. Prob[Yellow Guilty IF Yellow Reported]
Prob[Yellow Guilty AND Yellow Reported] .72
.97
Prob[Yellow Reported] .74
Prob[Orange Guilty IF Yellow Reported]
Prob[Orange Guilty AND Yellow Reported] .02
.03
Prob[Yellow Reported] .74
Prob[Yellow Guilty IF Orange Reported]
Prob[Yellow Guilty AND Orange Reported] .18
.69
Prob[Orange Reported] .26
Prob[Orange Guilty IF Orange Reported]
Prob[Orange Guilty AND Orange Reported] .08
.31
Prob[Orange Reported] .26
e. No. Regardless of which cab the eyewitness reports a Yellow cab is still more likely to be
guilty.
f. There are two pieces of information:
x 90 percent of the cabs are Yellow.
x Eyewitness testimony is 80 percent accurate.
Since 90 percent of the cabs are Yellow and the eyewitness testimony is only 80 percent
accurate, relative number of cabs trumps the eyewitness testimony.
5. a.
131
b.
i. Prob[Prize behind Door 2 IF Monty opens Door 1]
Prob[Prize behind Door 2 AND Monty opens Door 1]
Prob[Monty opens Door 1]
1 1
3 3 2
1 1 1 3
3 6 2
ii. Prob[Prize behind Door 3 IF Monty opens Door 1]
Prob[Prize behind Door 3 AND Monty opens Door 1]
Prob[Monty opens Door 1]
1 1
6 6 1
1 1 1 3
3 6 2
iii. Prob[Prize behind Door 1 IF Monty opens Door 2]
Prob[Prize behind Door 1 AND Monty opens Door 2]
Prob[Monty opens Door 2]
1 1
3 3 2
1 1 1 3
3 6 2
132
Chapter 26: Estimating the Mean of a Population
1.
Mean [ T1 ( v 1 + v 2 + … + v T) ]
Mean[cx] = cMean[x]
1
=
T
[
Mean v1 + v2 + … + vT]
3. a. When the mean of the estimate’s probability distribution equals the actual value, the
estimation procedure is unbiased, the estimation procedure does not systematically
overestimate or underestimate the actual value.
b. When an estimation procedure is unbiased, the variance of the estimate’s probability
distribution indicates how reliable an estimate is. On the one hand, if the variance is
small, there is a high probability that the estimate is close to the actual value. On the
other hand, if the variance is large, the probability that the estimate lies close to the actual
value is small.
1. a. When the mean of the estimate’s probability distribution equals the actual value, the
estimation procedure is unbiased, the estimation procedure does not systematically
overestimate or underestimate the actual value.
b. When an estimation procedure is unbiased, the variance of the estimate’s probability
distribution indicates how reliable an estimate is. On the one hand, if the variance is
small, there is a high probability that the estimate is close to the actual value. On the
other hand, if the variance is large, the probability that the estimate lies close to the actual
value is small.
133
3. We should divide by the sample size less 1. Variance equals the average of the squared
deviations. To calculate an average we divide by the number of pieces of available
information. When estimating the population variance of a population, the number of pieces
of available information equals the sample size less 1.
5. We use the Student t-distribution because we must estimate the standard deviation.
1. a.
8
¦v
t 1
t
EstMean
8
8.9 6.4 6.2 6.7 7.5 5.3 6.4 6.2 53.6
6.7
8 8
b. Laptop vi EstMean Dev Sqr Dev
1 8.9 6.7 2.2 4.84
2 6.4 6.7 0.3 0.09
3 6.2 6.7 0.5 0.25
4 6.7 6.7 0 0
5 7.5 6.7 0.8 0.64
6 5.3 6.7 1.4 1.96
7 6.4 6.7 0.3 0.09
8 6.2 6.7 0.5 0.25
Sum = 8.12
134
3. a.
64
¦v
t 1
t
EstMean
64
8 u 8.9 8 u 6.4 8 u 6.2 8 u 6.7 8 u 7.5 8 u 5.3 8 u 6.4 8 u 6.2
64
8 u 53.6 428.8
6.7
64 64
e. Mean: 7.0
Standard Error: .1269
Value: 6.7
Degrees of Freedom: 63
Prob[Results IF H0 True] = .0106
f. At the 5 percent significance level, we
would reject the null hypothesis, reject
the manufacturer’s claim that mean
battery life was 7.0. On the other hand, at
the 1 percent significance level we would not reject the null hypothesis.
As the sample size increases and the estimated mean continues to equal 6.7 our confidence
that the mean of the population falls short of the 7.0 hours claimed by the manufacturer is
reinforced.
135
7. As the sample size increases, use of the Student t-distribution becomes less critical. As the
sample size increases, the difference in the probabilities declines.
136