ISOM1500 Final Slides
ISOM1500 Final Slides
3
Probability Properties
• Label the n outcomes in a sample space as O1, O2, … On,
where Oi represents the ith outcome in the sample
space.
• The probability associated with any outcome must be
between 0 and 1
• 0 ≤ P(Oi) ≤ 1 for each outcome Oi
• The sum of the probabilities over all possible outcomes
must be 1.0
• P(O1) + P(O2) + … + P(On) = 1
4
Calculating Probabilities
A B
5
6
How many ways are there to get a sequence of 4 cards from a deck ?
Basic counting rules = 52!/48! = 49 x 50 x 51 x 52.
How many ways are there to get a set of 4 cards from a deck ?
= 52!/(48! X 4!) = (49 x 50 x 51 x 52)/(4 x 3 x 2 x 1)
• Permutation:
• The number of ways (with order
specified) of r items selected from n!
n available items is: nPr =
• Combination: HHTT (n – r)!
HTHT
• The number of ways (without order
specified) of r items selected from HTTH
n available items: TTHH
• Example: Number of ways of getting THTH n!
exactly 2 heads in four coin tosses = THHT nCr =
4 (n – r)! r!
4C 2 = 2
= 4! / (4-2)! 2!
= 24 / 4
=6
• Use these permutations and combinations to determine probability of any one
permutation or combination:
• What is the probability of getting 2 heads in 4 coin tosses= 6/16
• What is the probability that we will draw four aces in a particular sequence from a deck of
cards: 7
• =1/6497400 = 1.5 in 10 million chance
Conditional Probability
8
Independent Events
9
Conditional Probability Is Extremely Useful!
• A diagnostic test for a disease is 95% accurate in detecting a disease
in an infected person.
• The probability of infection in the general population is 1%.
• The probability of the test falsely stating infection in a healthy person
is 4%; this is the probability of a false positive
Question: What is probability that a person being tested, and for whom
the test says ”infected”, is actually infected with the disease?
10
Conditional Probability Is Extremely Useful!
P(Test says infected and
P(Test says Infected)=0.95 x 0.01=
Infected|Infected)=0.95 0.0095
P(A|B) x P(B)
P(Infected)=0.01
Test administered
P(Test Says P(Test says Infected and
Infected|Not Not Infected)=0.0396
Infected)=0.04 P(A and B_c)
P(Not Infected)=0.99
P(Test Says Not P(Test says Not Infected
Infected|Not and Not
A: Event that the test is positive Infected)=0.96 Infected)=0.9504
B: Individual is infected
P(A|B)= 95%
P(B|A) = P(A and B)/(P(A)= P(A|B) x P(B)/P(A)
P(A)= P(A and B) + P(A and B_c)= 0.0491 11
P(A|B) x P(B)= 0.0095
Conditional Probability Is Extremely Useful!
• P(Test says Infected) = 0.0095+0.0396 = 0.0491
• P(Person is Infected|Test says Infected) = P(Person is Infected
and Test Says Infected)/P(Test Says Infected)
• = 0.0095/0.0491= 0.1934= 19.34%!
• In other words, there is only less than 20% chance that a person
who tests positive for the infection, is actually infected!
• Because of high false positive rate
• Because of lower “sensitivity”
• Because of low incidence of disease
• Beware of initial test results…
• Confirm your test results!
12
Conditional Probability Is Extremely Useful!
Infected
Test Positive= 4.9%
Population
13
Confusion Matrix
Test
14
Fair Bet?
• What is the house advantage?
• All raffles, lotteries and casino games are designed to make a
profit
• House advantage = Amount bet – expected amount returned
• Example: ~35% for lotteries and ~10% for slot machines
15
Is this a fair payoff?
• This is how most people think when they bet $1 straight up on any
specific number:
• Expected payoff = 35/36 (-$1) + 1/36($35) = $0
• Even so, what happens when you play for a long time?
• A fair game?
• The roulette wheels actually has two
extra wedges, 0 and 00 that change
the probability
• Correct version of expected payoff:
37/38 (-$1) + 1/38 ($35) = -$0.0526
for each dollar you bet
• The house advantage is about 5.26%
16
Roulette: Is this a fair payoff?
17
Are coincidences improbable?
• Coincidence
• A coincidence is a surprising concurrence of events, perceived as meaningfully
related, with no apparent causal connection (Diaconis and Mosteller, 1989)
• Winning the lottery twice
• A New York Times story of February 14, 1986, about Evelyn Marie Adams, who
won the NJ lottery twice in short time period
• The article claimed that the odds of a person winning the top prize twice were
about 1 in 17 trillion
• What do you think?
18
Someone, somewhere, someday…
• What is not improbable is that someone,
somewhere, someday will experience those
events or something similar
• Why?
• The 1 in 17 trillion is the probability that a specific
individual who plays the NJ state lottery exactly
twice will win both times
• But don’t forget…
• Millions of people play lottery every day, so not
surprising that someone, somewhere, someday
would win twice
• Stephen Samuels and George McCabe calculated at
least a 1 in 30 chance of a double winner in a 4-
month period and better than 50% probability that
there would be a double winner in a 7-year period
somewhere in the U.S.
19
Lessons learned
• Probability of a specific event vs. Probability of the
general class of events
• Follow up questions
• What is the probability of someone winning the lottery (as
in previous example) twice in ten years given the following
additional assumptions?
- About 2 million lottery players
- They play twice weekly for 10 years (about 1,000 games)
• Answer:
1/17 trillion * 2 million * 1,000 = 1/8,500 = .000117
20
Basic Probability Distributions
Suri Gurumurthi, Ph.D.
Probability Distributions
2
Important Probability Distributions
• Discrete
• Binomial
• Poisson
• Uniform
• Continuous
• Normal
• Exponential
• Uniform
3
Binomial Distribution
4
Poisson Distribution
5
Continuous Probability Distributions
6
• Familiar bell-shaped curve.
• If a normal random variable
has a mean μ = 0 and a
standard deviation σ = 1, it is
called a standard normal
distribution, represented by
z.
Normal Distribution
7
Normal Distribution (PDF vs CDF)
• Normal (μ, σ2):
#"$ !
1 "
𝑓 𝑥 = 𝑒 !%!
2𝜋𝜎 !
8
Calculating Normal Probabilities
9
Exponential Distribution
10
Uniform Distributions
𝑓 𝑥 = 1/(𝑏 − 𝑎); 𝑖𝑓 𝑎 ≤ 𝑥 ≤ 𝑏;
𝑓 𝑥 = 0 otherwise
• Discrete Example: Tossing Dice
ì1 / 6 x = 1,2,3,4,5,6
f X ( x) = í
î 0 o.w.
Moments of a Random Variable
• Expected Values E(X)
• 𝐸 𝑋 = 𝜇 = ∫ 𝑥𝑓 𝑥 𝑑𝑥; if X is continuous
• 𝐸 𝑋 = 𝜇= ∑ 𝑥𝑃𝑟(𝑥) ; if X is discrete
• Var(X) = 𝜎 ! = 𝐸 𝑋 ! − 𝐸 𝑥 !
• Standard Deviation = σ
Moments of a Uniform Distribution
.5 .4 .2
3
Model 1: Multiple drugs in development
o There are 3 phases of clinical trials
o Phase 1 success rate is 50%
o Phase 2 success rate is 40%
o Phase 3 success rate is 20%
o With 10 drugs in development simultaneously,
o What is the probability of at least one successful launch?
Probability a Drug Fails to Win Approval = 1- 0.04 = 0.96
Probability ALL Drugs Fail to Win Approval = 0.96 ^ 10 = 0.6648
Probability At least one Drug Wins Approval = 1- 0.6648 = 0.3351 = 33.5%
In other words, approximately 1 in 3 chance
1- BINOMDIST(S=0, N=10, Prob of Success=0.04, True)
4
Model 1: Binomial Probabilities of Success
o There are 3 phases of clinical trials
o Phase 1 success rate is 50%
o Phase 2 success rate is 40%
o Phase 3 success rate is 20%
o With 10 drugs in development simultaneously,
o What is the probability of 1 successful launch
From the binomial distribution,
Probability = 10!/((10-1)!1!) x (0.04)^1 x (1-0.04)^(10-1)= 0.277
= BINOMIAL(1,10,0.04,false)
5
Model 2: Damned if you do, Damned if you don’t!
• Tour operator has to book tickets
for group travel
• Profitable venture
• But uncertain attendance
• Cost if you don’t reserve enough
tickets (lose profit margin and
goodwill)
• Underage Cost: u
• Cost if you reserve too many
tickets (sell at a discount)
• Overage Cost: o
0.9
Set Cumulative Probability = u/(u+0)
0.8
0.7
0.6
0.5
0.4
0.3
0.2
Corresponding Optimal Reservations
0.1
0
0 10 20 30 40 50 60 70 80 90 100
Probability Cumulative Probability 7
Model 2: Marginal Analysis to Optimize Decision
Marginal Analysis
• Note that quantity
• Let Pr(D>Q) denote the probability that Pr(D>Q) x u
the attendance is greater than the ordered is non-increasing in Q. Why?
quantity Q.
• The expected marginal profit due to sale
• Note that quantity
of the (Q+1)st ticket is therefore:
Pr(D<Q) x o
Pr(D>Q) x u
is non-decreasing in Q. Why?
• The expected marginal loss from
discounting (Q+1)st ticket is
Pr(D≤Q) x o
8
Model 2: Marginal Analysis to Optimize Decision
Marginal Analysis
Expected marginal value of reserving
the (Q+1)st ticket Determine Q* so that expected
= Pr(D>Q) x u - Pr(D<Q) x o marginal value is zero!
Or, (1 - F(Q)) x u = F(Q) x o
= (1 - F(Q)) x u - F(Q) x o Or,
F(Q) = u/(u+o)
• Note that expected marginal value
is also non-increasing in Q. Why? Q= F^inv(u/(u+o))
• Hence at some level of Q, this expected
marginal value will cross, and then
decrease below 0.
9
Model 2 Insights:
Service Level and Quantity Decision
Service Level =
Probability of
Service = u/(u+o)
Tail Area=
Probability of No-Service
Expected Reservation Decision
Demand
Quantity Scale
Safety
Stock
0 z z-scale
Hence optimal Quantity = Expected Demand + z x Stdev;
Where z= Normsinv(u/(u+o))
10 The Safety Stock portion = z x Stdev
Model 3: Asset Portfolio Optimization
• The reason for portfolio • Harry Markowitz’s efficient
theory: portfolios:
• To show why diversification is • Those portfolios providing the
a good idea maximum return for their level of
risk
• To show why diversification
makes sense logically • Those portfolios providing the
minimum risk for a certain level of
return
11
Model 3: Performance is Expected Return
• A portfolio’s performance is • The expected return of a
the result of the performance portfolio is a weighted
of its components average of the expected
• The return realized on a returns of the components:
portfolio is a linear combination n
of the returns on the individual E ( R p ) = ∑ xi E ( Ri )
investments i =1
13
Model 3: Computing Portfolio Variance
• For an n-security portfolio, the portfolio variance is:
n n
σ = ∑∑ xi x j ρijσ iσ j
2
p
=i 1 =j 1
14
Model 3: Two Asset Case Solution: The expected return of this two-
security portfolio is:
n
E ( R p ) = ∑ xi E ( Ri )
Assume the following statistics for i =1
16
Model 3: Minimum Variance Portfolio
1.2
0.8
Weight A
0.6
Solution: The weights of the minimum
variance portfolios in the previous 0.4
portfolio problem are:
0.2
0
0 0.01 0.02 0.03 0.04 0.05 0.06
1 x A =−
xB =− 1 .5907 =40.93% 17
Model 3:
Minimum
Variance Portfolio
• The efficient frontier can also be
simulated using Monte Carlo
Simulation methods
Source: Mathworks.com 18
Hypothesis Testing and
System 2 Decision Making
Suri Gurumurthi, Ph.D.
Hypothesis Testing
} Construct two contrasting propositions (competing hypotheses)
} relating to the value of one or more population parameters.
} H0 Null hypothesis: describes an existing theory
} Usually what we are trying to reject!
} H1 Alternative hypothesis: the complement of H0
} Usually what we are trying to prove!
} Using sample data, we either:
- reject H0 and conclude the sample data provides
sufficient evidence to support H1, or
- fail to reject H0 and conclude the sample data
does not support H1.
2
One Sample Hypothesis Tests
Three forms:
1. H0: parameter = constant
H1: parameter ≠ constant
2. H0: parameter ≤ constant
H1: parameter > constant
3. H0: parameter ≥ constant
H1: parameter < constant
The equality part of the hypotheses sign is always in the Null hypothesis.
3
One Sample: Type I vs Type II Error
We risk drawing an incorrect conclusion. Four outcomes are possible:
1. H0 is true and the test correctly fails to reject H0
2. H0 is false and the test correctly rejects H0
3. H0 is true and the test incorrectly rejects H0
4. H0 is false and the test incorrectly fails to reject H0
Outcome 3 is called a Type I error.
Outcome 4 is called a Type II error.
4
Confusion Matrix
consumer's risk
5
One Sample: Inference Risks
Ø The probability of making a Type I error = α
Ø The probability of making a Type II error = β
Ø Type I Error: α = P( rejecting H0 | H0 is true)
Ø Type II Error: β = P(not rejecting H0 | H0 is false)
Ø The level of significance (α ) defines the risk that we are willing to take in
making the incorrect conclusion that the alternative hypothesis is true when
in fact the null hypothesis is true.
Ø The value of α can be controlled.
Ø Commonly used levels for α are 0.10, 0.05, and 0.01
Ø α is typically set to 0.01, 0.05, or 0.10.
Ø The value of β cannot be specified in advance and depends on the value of
the (unknown) population parameter.
6
One-Sample Hypothesis Tests
7-7
One-Sample Hypothesis Tests
7-8
One-Sample Hypothesis Tests
7-9
Two Tailed Test Example Number Mean mg/dl Std. Deviation
20 311.15 64.3929
Ø Doctors may recommend having a total cholesterol level below 200 mg/dl.
Ø Given the sample of data above, determine whether the sample gathered is statistically
different, on average, from this recommended level.
Ø Null hypothesis: H0: μ = 200 vs. H1: μ ≠ 200
Ø Our sample of n=20 has sample average= 311.15 and s = 64.3929.
Ø Degrees of Freedom df = 19, so reject H0 if |t| > 2.093
Ø t=(311.15-200)/(64.3929/SQRT(20)) = 7.72
Ø A bigger t-stat implies a greater confidence in rejecting the Null Hypothsis
Ø Therefore proving our desired statement or hypothesis
Ø |t| > 2.093 so we reject H0; Critical t-value = T.INV.2T(α,df)
Ø df=n - 1
Ø The 95% confidence limits around the mean are
Ø 311.15 ± (2.093)(64.3929/√20), OR
Ø 311.15 ± 30.14, OR
Ø (281.01, 341.29)
10
Two-Sample Hypothesis Tests
7-11
One Sample Hypothesis Test Example: Chips-
A-Hoy! Mean = 1238.19, SD = 94.282, SE = 23.5705
Critical value = T.INV(0.05,16-1) = 1.75305, t-stat = 10.1053
ØIs it safe to say that the Company will keep its 1191
1270
promise of > 1000 chips in a bag of Chips-Ahoy? 1295
1135
H0: x <= 1000 H1: x > 1000, t-stat > C then reject H0 (safe) 16 items
12
Upper Tail Test Example
Step Mean Time Std. Deviation
Answer 0.1023 0.0183
Service 0.5290 0.0902
➢ A bank call center has a mandate to answer telephone customers calls in 0.1
minute or less.
➢ A sample of 25 answer times was drawn to evaluate the performance.
➢ Operators must determine customer needs and respond or refer the customer
within 0.50 minute.
➢ A sample of 25 service times was again taken to evaluate performance.
➢ If these are independent variables, is the average time taken to perform each step
statistically greater than the required standard?
3
Upper Tail Test Example
Specification for answer time for the steps at the call center is :
, t24,.05 = 1.711
x − 0.10 0.1023 − 0.10 0.0023
t1 = = = = 0.6284
s/ n 0.0183 / 25 0.0037
4
Analysis
5
Conclusion of Upper Tail Test Example
➢ Because t24, .05 = 1.711 for both measures, we cannot reject the null hypothesis for t1 =
0.6284, and we also cannot reject the hypothesis for t2 = 1.608.
➢ Therefore, there is no statistical evidence that the mean response time exceeds 0.10 for
the answer component. The statistical evidence also doesn’t support the fact that the
service component time exceeds the standard of 0.50 minutes.
7-6
One-Sample Hypothesis Tests
p-Values
One decision rule for any hypothesis test uses the p-value
The p-value is the observed significance level.
The p-value decision rule is to:
7-7
Two-Sample Hypothesis Tests
Interpreting Excel’s Two-sample Hypothesis Test Output
7-28
Simulation Modeling
Suri Gurumurthi, Ph.D.
November 2020
Optimization vs. Simulation
Optimization
Determines the best
levels of variables
under our control
Simulation
2
Simulation Application
• Model = Representation of Reality • Manufacturing:
• scheduling, inventory
• Model IS NOT EQUAL Reality!
• Staffing: personal-service
• We use simulation when other methods are operations
too context poor • Banks, fast food, theme parks, Post
Office, ...
• We look at “what-ifs”, or consequences of our
decisions in the model • Distribution and logistics
• Air traffic operations
• Principle: Approximate answer to exact • Warehouse operations
problem is sometimes better than an exact • Ports and Shipping
• Urban Planning
answer to approximate problem
• Health care:
• Staffing, Emergency and hospital
operations
• Telecommunications
• Switching and Call Routing
• Military
• Logistics and war-games
4
Limitations of Simulation Modeling
Statistical design,
Also true of many
analysis of Fewer degrees of Higher degrees of
other modern
simulation freedom freedom
methods
experiments
Results are
Higher statistical Lower statistical
conditioned on
confidence in confidence in
statistical
results results
confidence levels
7
Simulation by Hand:
The Buffon Needle Problem
https://www.youtube.com/watch?v=szUH1rzwbAw Silly Experiment?
• Estimate p (George Louis Leclerc, c. • Experiment to estimate something
1733) hard to compute exactly (in 1733)
• Toss needle of length l onto table with • Randomness, so estimate will not be
stripes d (>l) apart exact; estimate the error in the
estimate
• P (needle crosses a line) = 2l • Replication (the more the better) to
pd
reduce error
• Repeat; tally p! = proportion of times a • Sequential sampling to control error
line is crossed -- keep tossing until probable error
in estimate is “small enough”
• Variance reduction
2l
• Estimate p by
!
pd
9
One Time Quantity Decisions Under Uncertainty
Underage Cost u per unit
Prob(Demand D > Q)
6
Normally Distributed Demand
• Uncertain Demand
Objective:
Determine Quantity
That Maximizes Expected
Profits! 9
Monte Carlo Simulation: Repeated Trials
s
sX =
X
n
• Excel and related tools such as Oracle Crystal Ball make it much
easier to conduct and analyze a large number of trials
11
Monte Carlo Simulation with Excel
• Suppose a retailer has a one time • There could be several criteria for
opportunity to sell an apparel product
• Product has expected demand = 4000 units the decision:
• Standard deviation of demand = 800 units
• What is the number the retailer
should produce to maximize
• In order to buy the intellectual property
rights, the retailer has to pay $20,000 to the expected profits?
designer • What is the number the retailer
• The selling price of the product is estimated should produce so that he suffers no
at $18 per unit
more than say 5% chance of a loss?
• The royalty to be paid to the designer is $3
per unit • What is the standard deviation of
• The manufacturing cost of the item is $4.5 profits?
• If the product remains unsold, we have to
destroy the product, and lose the
manufacturing cost
12
Monte Carlo Simulation with Excel No.
Sampled
Demand
1 4666
2 3465
3 4983
• Excel has simple functions that allow us 4 5168
to model this problem 5 3964
6 3013
7 3459
• For e.g., the norminv (probability, 8 3247
Mean, Stdev) function allows us to 9 4554
determine the quantity that can provide 10 3926
11 4094
a target probability of serving demand 12 3283
• Combined with the rand() function, this 13 3626
can allow us to generate a sample of 14 4238
15 4213
demand from an underlying distribution 16 4669
• For e.g. NORMINV(RAND(),4000,800) 17 4231
generates a single sample (random) 18 3334
19 4136
demand point from a normal 20 2296
distribution with mean 4000, and Stdev 21 4415
800 22 4910
23 4191
24 4923 13
25 5254
Monte Carlo Simulation with Excel
14
Monte Carlo Simulation with Excel
2–3
NPV with Returns
The Net Present Value (NPV) Model
Negative NPV: project is rejected for initial investment I0; Interest
or discount rate k;
Net cash inflow Ft for period t. This represents company or
project performance
For investments, the return on the investment compounds over
time; say the return rate is r each period
𝐼𝐼0 (1 + 𝑟𝑟)𝑛𝑛
𝑁𝑁𝑁𝑁𝑁𝑁 =
(1 + 𝑘𝑘)𝑛𝑛
2–4
NPV Calculation: Potential Issues with the model
2–5
Asset Allocation Decisions
• Portfolio Theory:
• Shows why diversification is a good idea
• Trade-off between higher expected returns and lower levels of
risk
• Say two assets S and B
• Say S is stock; B is a bond
• S has mean return rS; Stdev σS
• B has mean return rB; Stdev σB
• Typically stocks have higher expected returns and higher risk
• Bonds have lower expected returns and lower risk too
6
Asset Allocation Decisions
• For any investment level
• Assume x is the proportion of investment allocated to Asset S
• Then (1-x) is the proportion of investment allocated to Asset B
• What is the best value of x?
• In other words, what proportion of the investment should be
invested in safe versus risky asset?
7
Performance: Expected Return
• A portfolio’s performance is the • The expected return of a
result of the performance of its portfolio is a weighted average
components of the expected returns of the
• The return realized on a portfolio is a components
linear combination of the returns on
the individual investments
• Hence Expected return is:
• ER = xrS + (1-x)rB
• The variance of the portfolio is not a
linear combination of component
variances
8
Performance: Variance of Portfolio Return
• Understanding portfolio variance is the essence of understanding the mathematics
of diversification
• The variance of a linear combination of random variables is not a weighted average of the
component variances
• Further, there are 3 cases: If Independent: Var_P= x^2Var_S + (1-x)^2Var_B
• The assets are positively corrected in performance
• The assets are negatively corrected in performance
• The assets are independent in performance
• Suppose the correlation co-efficient between assets is ⍴
• If Rho= +1, then perfectly positively correlated
• If Rho= -1, then perfectly negatively correlated
• If Rho=0, then independent asset classes
• ⍴ is covariance of the variables divided by the product of their standard deviations.
9
Problem: Simulate correlated variables?
• Simulating one evolution of one asset is relatively easy:
• Say, NORMINV(rand(),rS, σS)
• Similarly, NORMINV(rand(), rB, σB)
• But how to simulate asset performance B in a way that is
correlated with S (or vice versa)
• We will use approximate simulation method…
10
Problem: Simulate correlated variables?
• Suppose Z1= NORMINV(rand(),0, 1); Z2 = NORMINV(rand(), 0, 1)
• Notice, that Z1 and Z2 will almost surely sample different values
• Then we will simulate S= rS+ σS x Z1
• Simulate B= rB+ σB x [⍴Z1 + SQRT(1- ⍴2) Z2]
• Doing so will result in CORREL(S,B) approximately equal to ⍴
• Then we can simulate the evolution of S and B and compare the actual or
discounted value of portfolio over a time-span
11
What is a Decision Tree?
• A Visual Representation of Choices, Consequences,
Probabilities, and Opportunities.
• A Way of Breaking Down Complicated Situations Down to
Easier-to-Understand Scenarios.
•Graduate School Costs
Simple Decision Tree Example • Tuition and Fees
•Rent / Food / etc.
•Opportunity Cost
•Graduate School Benefits
Graduate
Job
School – 2 years •Learning
Decision Graduate •Future Growth in Earnings
School- 2 years
Job – 2 years
Decision in 2
...
years!
4
Monty Hall Problem:
Pick a door to win a car!
In search of a new car, the player picks a door, say 1. The game
host then opens one of the other doors, say 3, to reveal a goat
and offers to let the player switch from door 1 to door 2.
PLAYER PROBLEM:
SWITCH OR NOT TO SWITCH THEIR ORIGINAL CHOICE!
SWITCH OUTCOME
Monty Hall Problem 1/6 Host Opens Door 2 Lose
14
Breakeven Analysis Example – Bakery Co.
Tsim Sha Tsui Hang Hau
Lease Cost (per month) $300,000.00 $100,000.00
Employee Salaries (per month) $120,000 $100,000
Maximum Sales Volume (customers/month) 15000 9000
Average Order Revenue (per customer) $75 $55
Average Cost per Order (per customer) $40 $35
Average Profit Margin (per unit) $35 $20
Total Profits (per month) $525,000 $180,000
Net Income (per month)
$105,000.00 ($20,000.00)
Max Sales x Profit Margin – Fixed Cost
Breakeven Point (customers per month) 12000 10000
15
What if: We Hired an Extra Worker?
Tsim Sha Tsui Hang Hau
Lease Cost (per month) $300,000.00 $100,000.00
Employee Salaries (per month) $150,000 $120,000
Maximum Sales Volume (customers/month) 17000 11000
Average Order Revenue (per customer) $75 $55
Average Cost per Order (per customer) $40 $35
Average Profit Margin (per unit) $35 $20
Total Profits (per month) $595,000 $220,000
Net Income (per month) $145,000.00 $0
Breakeven Point (customers per month) 12857 11000
18
Simple Decision Trees for Capacity Investment
Net income 105K Net income 105K
First
Decision HH Base Cost
Case $21/cust
19
What if: Demand were Uncertain?
High
Sales
(16K)
20% Probability
40% Probability Net income -70K Net income -30K Net income 30K
21
Financial and
Real Options
and Related
Decision
Trees
SURI GURUMURTHI, PH.D.
N O V E M B E R 2 0 21
What are Options?
Ø An option is the right, but not the obligation, to buy (or sell) an asset for a pre-determined
price within a pre-determined period of time
As new information arrives and uncertainty about market conditions is resolved, there could be
value in the flexibility to alter the initial decision in order to:
◦ capitalize on favorable future opportunities
◦ react to mitigate losses.
This flexibility is similar to the financial options decision, and is known as a Strategic Option, or
Real Option.
Wait/Learn
• To wait before taking an action until more is known or timing is
expected to be more favorable
• E.g., When to introduce a new product, or replace an existing piece
of equipment
Expand or contract
Common • To increase or decrease the scale of an operation in response to
demand
Types of Real • E.g. adding or subtracting to a service offering, or adding memory to
a computer
Options Switch inputs or outputs
• To alter the mix of inputs or outputs of a production process in
response to market prices
• E.g., The output mix of telephone/internet/mobile services for a
bank
Abandon
• To discontinue an operation and liquidate the assets
• E.g., discontinuation of a research project, or product/service line
Call Option Decision Tree
Price > Value of Contract
Strike
p= P(Price >Strike Price) Price • If Expiration Price> Strike Price
• = (Expiration Price-Strike Price)*100 – c x 100
Buy Option?
Cost =
• But If Expiration Price< Strike Price
c x 100 = – c x 100
Contract Expiration
Put Option Decision Tree
Price >
Strike Value of Contract
p= P(Price >Strike Price) Price • If Expiration Price< Strike Price
= (Strike Price - Expiration Price)*100 – c x 100
Buy Option?
Cost = • But If Expiration Price> Strike Price
c x 100 = – c x 100
Contract Expiration
Call Option Decision Tree (Reality)
Keep Option?
Price > p* Price >
Gain =
Strike Strike
p= P(Price >Strike Price) Price (𝛿*Price – c) x Price
100
(1-p*)
Buy Option?
Cost =
c x 100
p**
Probability (1-p)
Take Risk ? Worst Case L
Sure Gain or
Certainty Equivalent
Estimating Value of Information:
Example
Management estimates the net profits of choosing from three decision alternatives (A, B, and C)
under the differing probable levels of demand. These net profit estimates, in millions of dollars,
are:
14
Expected value of perfect
information
EVPI
◦ How much you are willing to pay for information that
help you make the best decision?
◦ EVPI is the difference between the expected value with
perfect information (crystal ball) and the expected value
without perfect information (without crystal ball)
15
What would you prefer?
Option 1 VS Option 2
• Win the money in envelope A • One randomly drawn bank note is
removed from envelope B, and then
20 x $1000 notes + 10 x $500 notes placed back into the same envelope
=$25000
• After observing the note from B, then
choose between envelopes A and B
and win the money in the envelope
10 x $1000 notes + 20 x $500 notes
= $20000
• Option 1?
• Option 2?
• Make no difference?
5
Option 1
What is the expected value for option 1?
6
Option 2
One randomly drawn bank note is removed from
envelope B
– Scenario 1: One $500 bank note is removed from, and put back into
envelope B
– Scenario 2: One $1,000 bank note is removed from, and put back
into envelope B
7
Option 2: Scenario 1
Scenario 1: One $500 bank note is removed from
envelope B
20 2 10 1
= = = =
10 + 20 3 10 + 20 3
! #
Given $500 is removed, with 𝑃𝑟𝑜𝑏 = Given $500 is removed, with 𝑃𝑟𝑜𝑏 =
" "
Envelope A contains $25,000 Envelope A contains $20,000
Envelope B contains $20,000 Envelope B contains $25,000
8
Option 2: Scenario 1
Scenario 1: One $500 bank note is removed from
envelope B
Which envelope would you choose?
Envelope A? or EnvelopeProb
B?= 1/3 Prob = 2/3
$ in envelope A after draw $20,000 $25,000
$ in envelope B after draw $25,000 $20,000
Expected value
in envelope A $23,333.33 $21,666.67
Expected value
in envelope B $21,666.67 $23,333.33
Scenario 1: Scenario 2:
Remove $500 Remove $1,000
Probability 𝟏𝟎 𝟏 𝟐𝟎 𝟐 Probability 𝟏𝟎 𝟏 𝟐𝟎 𝟐
= = = =
𝟑𝟎 𝟑 𝟑𝟎 𝟑 𝟑𝟎 𝟑 𝟑𝟎 𝟑
Envelope B $25,000 $20,000 Envelope B $20,000 $25,000
Envelope A $20,000 $25,000 Envelope A $25,000 $20,000
12
Option 1 VS Option 2
• Win the money in envelope A • One randomly drawn bank
note is removed from
envelope B
• Choose between envelopes A
and B and win the money in
the envelope
13
Decision-Making With
Big Data and AI
Suri Gurumurthi, Ph.D.
November 2019
Virtual & Augmented Reality
• Virtual Reality
• “The computer-generated simulation of a 3D
image or environment that a person can be
interact with in a seemingly real or physical
way by a person using special electronic
devices…”
• Augmented Reality
• “Superimposition of a computer-generated
image on a person's view of the real world,
providing a composite and altered
perspective...”
Decision Making in VR and AR
• US Army trains its soldiers on high pressure
combat environments using VR
• De Beers allows potential customers to virtually
try on jewelry
• Entice and attract potential buyers
• Personalize the experience
• IKEA allows customers to virtually place the
catalog items in your home using mobile apps
• Visualize before purchase
• Reduce regret and returns!
VR is a 3D Simulates space
and higher Simulates time
dimensional Simulates interactions!
simulation Can be used to model multi-agent decision-making
Differences
Between VR
and AR
Augmented Marries personal experiences with related imagery and
simulations
reality is
Adds value to the real world experience
more
Enables superior decision-making with additional
personal superimposed data
and Looks for and provides the missing elements that can
complex accelerate decision-making!
Big Data are high-volume, high-velocity, and/or high-variety
STRUCTURED DATA
European
As of 2017, administrators Poor data
53% of Forbes 40% projected
could save across Poor data can
surveyed Bad data or growth in
more than businesses cost
companies poor data global data
€100 billion and the businesses
adopted Big quality costs generated per
($149 billion) government 20%–35% of
Data US businesses year vs. 5%
in operational costs the U.S. their
Analytics. $600 billion growth in
efficiency economy $3.1 operating
annually. global IT
Up From 15% improvements trillion dollars revenue.
spending.
in 2015 alone by using a year.
big data
Unstructured The key need in today’s digital
Data environment is to:
1. Reduce the dimensionality of big data
(aggregate)
Structured
Data 2. Transform and synthesize
3. Extract intelligence (usable
information)
Useful
Information 4. Analyze (understand relationships)
5. Value-added decision-making
Analytics
Decision-
Making
Big Data = Big Nothing, Unless…
Descriptive Analytics: Inference
Identify Draw from
goals sample
Start Population Sample
Make inferences
about population
Population Sample Statistics
parameters
Source: Bennett, Briggs, and Triola (2014), Statistical Reasoning for Everyday Life.
9
Correlation Coefficients… Correlation = 0
Dependent variable
• Correlation 30
25
• If two events (features or 20
measurement variables) 15
tend to occur together, they 10
5
are said to be correlated 0
• Example: Lighting and 0 2 4 6 8 10 12
thunder, weight and height Independent variable
• Correlation coefficient:
Measuring the strength of • Outliers
linear relationship of • An outlier that is consistent with the trend of
measurement variables the rest of the data can inflate the correlation
• An outlier that is not consistent with the rest
• r è +1: Positive correlation of the data can deflate the correlation
• r è -1: Negative correlation
• Simpson’s paradox
• r è 0: No correlation (?) • Groups combined inappropriately may mask
relationships
• The missing link (effect of a third variable) 11
Example: Death rate vs. speed limit
14
Why should we care about correlation?
• How to confirm causation?
• Use of randomized experiments
(treatment-control study)
• Use of observational (case-control)
•
•
Easily observable
No proof needed
study if experiments are not possible or
ethical
• Evidence of a possible causal
connection:
• Not easily observable • There is a reasonable explanation of
• Require proof cause and effect
• The connection happens under varying
conditions
• Potential confounding variables are
ruled out
• Has research potential
• Most useful to us • Example: Developing a new medicine
15
Defining AI
• For e.g. Science which
makes machines Systems mimicking Rational thinking
perform real-time tasks human reasoning systems
that require higher
intelligence,
independent of human
intervention and
decision-making
• AI is the study of formal
AI
structures and problem
solving methods, that
aims to equip
computing machines Systems mimicking Rational acting
with capabilities to
solve problems, that
are comparable to
human behavior systems
human beings
Using formal rules to draw valid conclusions
Philosophy What does knowledge truly mean?
How can we convert knowledge into action?
Creating formal rules to draw valid conclusions
Measures and computation
Mathematics
Probability and inference
Fuzzy logic
Structures in AI
Behavioral conditioning
Psychology Cognitive psychology
Cognitive science
Common Tasks in AI