Chen 2009
Chen 2009
^c : . . . . . . . . . . . . . . . . . .(4)
mc ¼ EðXjY cÞ ¼ ð1 r2 Þm þ r2 m
From Eq. 4, it is easy to see that mc is smaller than m^c :Thus, the
DM will be inevitably disappointed if she anticipates achieving m ^c
with the selection criterion because she actually receives only mc
on average.
Another question is whether mc will be always greater than the
hurdle value c. Because c is usually the minimum requirement for
a project to be funded, the selection criterion should secure the
minimum performance c for the selected projects. However, this
is not guaranteed. When the hurdle value reaches or exceeds some
critical value m, the expected true value of selected projects will
be lower than c (i.e., mc c if c m). The details are provided in
Appendix A.
Here is an example of a project-selection problem similar to
the one considered previously. Suppose the average IRR of a
project is 5% with an SD of 10% and the prediction has an error
SD of 10%. In Fig. 3, the expected performance is plotted against
Fig. 2—Normal case: A project with the predicted value Y =10%. various values of the hurdle rate c. The solid line is the 45 line
where the expected performance equals the hurdle rate. The dot-
ted line represents the expected predicted value m ^c ; which is al-
The Performance of the Selection Criterion ways above the solid line, meaning that m ^c c for all m
^c : The blue
Now, we proceed to look at the performance of the selection line represents the expected true performance mc. The critical
criterion. Unlike the previous case, where a forecast value is point, denoted by m in the graph where the line of mc crosses the
observed, here, we want to assess the performance of applying solid line, is approximately 0.13. This means, if the hurdle rate c
the hurdle value c before we know the forecast of any project. is greater than 13%, then the selection criterion will lead to a
What can be said about the expected performance of projects selected project whose expected true IRR is lower than c.
selected by using the hurdle value c? A simple answer is that the
average predicted value of selected projects will be greater than c Extension to the Log-Normal Case
because E(Y| Y c) c regardless of the distributions of Y. If the
distribution of Y is assumed to be known, as in this case, we can Often, it is inappropriate to model the outcome of the project as a
calculate the expectation of the predicted value, denoted by m ^c : normal random variable. For example, in oil and gas exploration-
and-production investment projects, the recoverable volume or the
cm size of the reserves are important decision variables, which can
^c ¼ EðYjY cÞ ¼ m þ sY l
m ; . . . . . . . . . . . . . . . . . .(3) take on only positive values and usually do not follow a symmet-
sY
rical distribution. This can be seen from Fig. 1, where the reserves
where l(t) is the hazard function of the standard normal distribu- size is plotted in the log scale. However, if the logarithm of those
tion; it is defined as l(t) = f(t)/[1-F(t)], where f(t) and F(t) are variables can be modeled as normal, the previous results can be
the density and cumulative density function of the standard nor- extended to the log-normal distribution in a straightforward man-
mal distribution, respectively. The function l(t) is monotonically ner. Disappointment similar to that in the normal case also occurs
^c maxðc; mÞ for
increasing in t and l(t) t for all t. Note that m in the log-normal setting.
^c m:
all c; in particular m Suppose the true value of the project can be expressed as X =
However, the DM is more interested in the expected true exp(X 0 ) and the prediction can be written as Y = exp( ), where (X 0 ,
value, denoted by mc, of projects selected with the hurdle value c. Y 0 ) is bivariate normal defined in the same way as (X, Y) in the
Here, mc is used as a measure of the performance of the selection preceding section. This implies that the distributions of the true
Fig. 3—Normal case: Expected performance of the selection criterion for various values of hurdle rate c.
will be accepted if its predicted value eY is equal to or greater selected by the hurdle value e c, on average, the true value will
than a hurdle value ec. First, we examine the performance of the achieve the predicted value.
0
selected project whose predicted value is eY . If a project has the Similar to the normal case, vc is not necessarily greater than ec.
0
predicted value eY that is above the hurdle value ec, what can An example is plotted (in log scale) in Fig. 4. It can be seen that
be expected for its true value? To answer this question, we need log(vc) is greater than c for some small values of c, but it will fall
to know the distribution of the true-value conditioning on the pre- below c eventually as c increases.
dicted value. Because X 0 conditioning on Y0 is N ½ð1 r2 Þmþ
r2 Y 0 ; ð1 r2 ÞsX0 =2; eX conditioning on eY is a log-normal distri-
2 0 0
Calibration of Forecasts Through Bayesian
0
bution. Thus, for the selected project whose predicted value is eY , Updating
its expected true value is To eliminate the disappointment, the calibration method sug-
0 0
gested by the analysis in the previous sections is useful. That is,
E eX jeY ; Y 0 c ¼ exp ð1 r2 Þðm þ s2X0 =2Þ þ r2 Y 0 ; . . . (5) the DM can use the posterior mean, which is Eq. 2 in the normal
case and Eq. 5 in the log-normal case, as the calibrated forecast to
0 select projects by comparing this calibrated prediction with the
which is a value between the prior mean E(eX ) and the predicted
0 hurdle value. For example, in the normal case, when a project is
value and, thus, is smaller than eY . Once again, Eq. 5 is obtained predicted to have an outcome y, the DM should use (1- r2) m +
through a Bayesian updating process, as shown in Appendix A.
r y in the selection process. Similarly, exp ð1 r2 Þðm þ s2X0 =2Þ
2
This result is also explained by the regression effect, as in the
0 þr2 Y 0 should be used in the log-normal situation. This estimator
normal case. Therefore, when eY > ec, the DM will be disap-
combines information from the forecaster as well as from the DM.
pointed in the performance of the accepted project if she expects
It is easy to see that the disappointment in the performance of
that the average true value will equal the predicted value. As in
accepted projects as well as in the selection criterion will disap-
the normal case, the posterior mean in Eq. 5 suggests a solution to
pear if the calibrated forecast is used.
this problem (e.g., the DM should calibrate the estimate with the
Here is an example based on the Norwegian North Sea data to
prior knowledge before applying the selection criterion).
illustrate this method. Reserves plotted in red in Fig. 1 are exclud-
Next, we examine the performance of the selection criterion.
ed because they introduced a serious bias. We assume the remain-
We are interested in knowing the expected performance of all
0 ing reserves are alternatives from which the DM is going to select.
projects that are accepted if their forecasts eY are above the hurdle
Suppose the DM has the prior knowledge that the size of the
value ec. First, we calculate the expected forecast for the accepted
reserves follows a log-normal distribution with m = 4.5 and sX =
projects, denoted by v^c :
1.5, which means that the average size of discovery is approxi-
0 0 mately 277 million BOE. Suppose the error SD of the forecaster
v^c ¼ E eY jeY ec is 1.3. Now the DM is able to calibrate the forecast by applying
Eq. 5.
F ½c ðm þ s2Y 0 Þ=sY 0
¼ expðm þ s2Y 0 =2Þ: . . . . . . . . . (6) To see the effect of the calibration, we compare the average
F ðc mÞ=sY0 discovery size of the reserves selected with and without the cali-
bration. Fig. 5 displays the average performance of selection-
Then, we calculate the expected true value of the accepted hurdle values between 400 million and 700 million BOE. If the
projects, denoted by vc: estimates are taken at face value without calibration (the dotted
0 0 line), then the average size of the selected reserves will go below
vc ¼ E eX jeY ec the hurdle value after 590 million BOE, a truly disappointing
result. However, if the DM uses the calibrated forecasts (the
F ½c ðm þ s2X0 Þ=sY 0
¼ expðm þ s2X0 =2Þ: . . . . . . . . . (7) dashed line), then the average size of selected reserves will always
F½ðc mÞ=sY 0 be higher than the hurdle value.
Fig. 4—Log-normal case: Expected performance of the selection criterion for various values of c.
Derivation of Eq. 6.
Calculation of the Critical Point m. We want to find a critical
R þ1 point m such that E(X|Y m) = m.
0 0 e fY 0 ðyÞdy
y
1
v^c ¼ E eY jeY ec ¼ c
¼ pffiffiffiffiffiffi
PðY 0 cÞ PðY 0 cÞ 2psY 0 cm
EðXjY mÞ ¼ ð1 r2 Þm þ r2 m þ sY l
sY
Z " # exp
ðmþs2Y 0 Þ2 m2
s2X c m s2
þ1
ðy mÞ 2
2s2Y Y 2s2Y 0 ¼mþ l ¼ m ) lðqÞ ¼ 2Y q;
exp dY ¼ sY sY sX
c 2s2Y PðY 0 cÞ
. . . . . . . . . . . . . . . . . . . . . . . . . . . .(A-7)
Z " #
þ1 s2
1 ðY ðm þ s2Y 0 ÞÞ2 where q ¼ mm sY : Thus, q depends only on s2X :
Y
pffiffiffiffiffiffi exp dY
2psY 0 c 2s2Y 0 Next, we show that there is a unique solution for q. It is not
difficult to obtain:
h s2Y 0
i
exp m þ lðqÞ
¼
2
F ½c ðm þ s2Y 0 Þ=sY 0 lim ¼ 1: . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . (A-8)
PðY 0 cÞ q!1 q
s2 s2
Therefore, for a sufficiently large q, lðqÞ< s2Y q because s2Y > 1:
F ½c ðm þ s2Y 0 Þ=sY 0 But l(0) > 0. X X
¼ expðm þ s2Y 0 =2Þ:
F ðc mÞ=sY0 Therefore, there is a unique solution for q, which can be found
easily with some numerical algorithm.
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . (A-5)
Min Chen is a post-doctoral associate in the School of Public
Derivation of Eq. 7. Health at Yale University. He holds a PhD degree from the
Department of Information, Risk, and Operations Manage-
R þ1 0 ment, McCombs School of Business, at The University of Texas
X0 Y0 E½eX jY 0 cfY 0 ðYÞdY
vc ¼ E e je e c
¼ c at Austin. His research interests include Bayesian analysis, sta-
PðY 0 cÞ tistical risk management, and biostatistics. James S. Dyer holds
the Fondren Centennial Chair of Business in the McCombs
exp½ð1 bÞðm þ s2X0 =2Þ School of Business at The University of Texas at Austin. He holds
¼
PðY 0 cÞ a BA degree in physics and a PhD degree in business adminis-
Z þ1 " # tration, both from The University of Texas at Austin. Dyer’s re-
1 ðY mÞ2 2bs2Y 0 Y search is focused on the evaluation of risky investment
pffiffiffiffiffiffi exp dY decisions. He served as a member of the Steering Committee
2psY 0 c 2s2Y 0 for the SPE Forum on Profit Prediction held in 2007.