0% found this document useful (0 votes)

9 views48 pages

Probability and Statistics

The document discusses the application of probability and statistics in business, focusing on predicting outcomes, learning from data, and making data-driven decisions. It covers concepts such as sample spaces, probability rules, conditional probability, Bayes' rule, random variables, and distributions, including normal distribution. The session emphasizes the importance of these statistical methods in decision-making and data analysis for business contexts.

Uploaded by

darcyzhux

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views48 pages

Probability and Statistics

Uploaded by

darcyzhux

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 48

Probability and Statistics

Daniel Halbheer
[email protected]
Session Outline
This session illustrates how probability and
statistics can be used in business settings:
• Predicting outcomes under uncertainty
about the business environment.
• Learning about the business environment
from sample data.
• Making optimal data-driven decisions.

2
Setting the Stage

Monty Hall problem: Suppose you are on a game show, and you
are given the choice among three doors: Behind one door is a
car; behind the others, goats. You pick a door, say No. 1, and the
host, who knows what's behind the doors, opens another door,
say No. 3, which has a goat. He then says to you, “Do you want to
pick door No. 2?” Is it to your advantage to change your choice?
Source: Wikipedia, link: https://bit.ly/2mXP6vz

3
Levels of Uncertainty

Levels: Alternative futures (first-order uncertainty), range of

futures (second-order uncertainty), and “true ambiguity.”
4
Backdrop
Probability: In the study of probability, we
work with a known population—the sample
space of the experiment—and consider the
likelihood of drawing a particular sample.

Statistics: In statistics, we make an inference

about a population by using a sample of data
drawn from that population.

5
What is meant by the probability
of an event? How is it computed?
Sample Spaces and Events
Definition: A sample space 𝑆 is the space of
all possible outcomes for an experiment. The
elements 𝑠𝑖 of 𝑆 are called sample points. The
number of sample points is denoted by #(𝑆).

Definition: An event 𝐸 for an experiment is a

subset of the sample space. If the outcome of
the experiment is a sample point in 𝐸, then
the event 𝐸 is said to occur.
7
Coin Tossing
Experiment: A coin is tossed three times
and the result for each toss is observed.

Exercise 1: Let 𝐻 or 𝑇 be the result for each

coin and determine a sample space 𝑆.
Exercise 2: Determine the following events:
▪ 𝐸 = one head and two tails
▪ 𝐹 = head on the first toss

8
Venn Diagrams
𝑆
𝐸 𝑇𝐻𝐻
𝐹
𝑇𝑇𝐻
𝐻𝐻𝐻
𝑇𝐻𝑇 𝐻𝑇𝑇
𝐻𝐻𝑇
𝐻𝑇𝐻
𝑇𝑇𝑇

Note: If the outcome 𝐻𝑇𝑇 occurs, then both 𝐸

and 𝐹 must occur, and conversely.
9
Probability
Definition: If all outcomes of an experiment
are equally likely, then the probability of the
event 𝐸 = 𝑠1 , … , 𝑠𝑘 is given by
#(𝐸)
𝑃 𝐸 = 𝑃 𝑠1 + ⋯ + 𝑃 𝑠𝑘 = .
#(𝑆)
Note: 𝑃 𝐸 can be interpreted as the relative
frequency of 𝐸 occurring in the long run.

10
Empirical Probability

11
Probability Rules
1. For any event 𝐸, 0 ≤ 𝑃(𝐸) ≤ 1.
2. For the set of all outomes 𝑆, 𝑃 𝑆 = 1.
3. If 𝐸 and 𝐹 are events,
𝑃 𝐸∪𝐹 =𝑃 𝐸 +𝑃 𝐹 −𝑃 𝐸∩𝐹 .
4. For any event 𝐸, 𝑃 𝐸 𝑐 = 1 − 𝑃 𝐸 .
Note: 𝐸 𝑐 is the complement of 𝐸—the set of
sample points in 𝑆 that are not in 𝐸.

12
Conditional Probability
Definition: The conditional probability of
an event 𝐸 given that event 𝐹 has occurred
is given by
𝑃 𝐸∩𝐹
𝑃 𝐸|𝐹 = .
𝑃 𝐹

Note: 𝐸 and 𝐹 are independent if 𝑃 𝐸|𝐹 =

𝑃 𝐸 , that is, if 𝑃 𝐸 ∩ 𝐹 = 𝑃 𝐸 𝑃 𝐹 .

13
Defective Computer Chips
A firm uses one computer chip to build one
unit of a product. The chips are purchased
from suppliers 𝐴, 𝐵, and 𝐶 and are randomly
picked for assembling a unit. 20% come from
𝐴, 30% come from 𝐵, and the remainder
come from 𝐶. The breakdown probabilities
are 0.03, 0.04, and 0.01, respectively. If a
product is chosen at random, what is the
probability that the chip in it is defective?
14
Bayes’ Rule
The conditional probability of an event 𝐸
given that event 𝐹 has occurred can be
expressed as
𝑃 𝐹|𝐸 𝑃(𝐸)
𝑃 𝐸|𝐹 = 𝑐 𝑐
.
𝑃 𝐹|𝐸 𝑃 𝐸 + 𝑃 𝐹|𝐸 𝑃(𝐸 )

Note: This result has many applications in

decision making, statistics, and recently AI.

15
Bayes’ Rule in AI

Read more on the application of Bayes’ rule in “An executive’s guide to AI” published by
McKinsey. Link: https://bit.ly/3AUQs4F.
16
Deriving Bayes’ Rule
𝑆 Recall: The conditional
𝐸 probability of 𝐸 given
𝐹
that 𝐹 has occurred is
𝐸∩𝐹
𝐸𝑐 ∩ 𝐹 given by
𝑃 𝐸∩𝐹
𝑃 𝐸|𝐹 = .
𝑃 𝐹

Using this definition,

𝑃 𝐹|𝐸 𝑃(𝐸) 𝑃 𝐹|𝐸 𝑃(𝐸)
𝑃 𝐸|𝐹 = = 𝑐 𝑐
.
𝑃(𝐹) 𝑃 𝐹|𝐸 𝑃 𝐸 + 𝑃 𝐹|𝐸 𝑃(𝐸 )
After test result: Before test result:
P(HIV+ | Tested positive) P(Tested positive | HIV+)
17
Application to HIV Testing
An HIV test gives a positive result with
probability 98% when a patient is indeed
affected by HIV, while it gives a negative
result with probability 99% when a patient
is not affected by HIV. If a person drawn at
random from a population in which 0,1% of
individuals are affected by HIV and she/he is
found positive, what is the probability that
she/he is actually affected by HIV?
18
COVID-19 Tests

19
Monty Hall Revisited
Suppose you pick door No. 1 and Monty
shows you a goat behind door No. 3.
Define the events 𝐴 = {Car behind door 2}
and 𝐵 = Monty opens door 3 . Then,
1
𝑃 𝐵|𝐴 𝑃(𝐴) 1∗ 2
𝑃 𝐴|𝐵 = = 3 = .
𝑃 𝐵 1 1 1 2 3
1∗ + ∗ +0
3 2 2 3
𝑃(𝐵|𝐴𝑐 ) 𝑃(𝐴𝑐 )
20
What is a random variable? What
is a distribution function?
Random Variables
If a coin is tossed three times, the number of
heads that occur is captured by the events
0 1 2 3.
Let 𝑋 represent the number of heads and let
𝑋 = 𝑥 be a realization, where 𝑥 = 0,1,2,3.
Note: The realization of 𝑋 is determined by
chance—thus making it a random variable.

22
Expected Value
Let 𝑋 be a discrete random variable whose
number of possible values is 𝑥1 , … , 𝑥𝑁 .
Definition: The expectation of 𝑋 is
𝑁

𝜇 = 𝐸 𝑋 = ෍ 𝑥𝑖 𝑃(𝑋 = 𝑥𝑖 ) ,
𝑖=1

where 𝑃(𝑋 = 𝑥𝑖 ) is the probability that 𝑋

takes value 𝑥𝑖 .
23
Variance
𝐸 𝑋 is the central tendency of 𝑋 but does
not indicate the dispersion from the mean.
Definition: The variance of 𝑋 is
𝑁

σ2 = Var 𝑋 = ෍ 𝑥𝑖 − 𝐸(𝑋) 2 𝑃 𝑋 = 𝑥𝑖 .
𝑖=1

The standard deviation of 𝑋 is 𝜎 = Var 𝑋 .

24
Bernoulli Distribution
Definition: A random variable 𝑋 has a
Bernoulli distribution with parameter
𝑝 ∈ [0,1] if
𝑃 𝑋 =1 =𝑝 =1−𝑃 𝑋 =0 .
Note: A Bernoulli distributed 𝑋 is the
indicator function of the event 𝑋 = 1 .
Exercise: Calculate 𝐸 𝑋 and Var 𝑋 .
25
Coin Tossing
Experiment: A coin is tossed three times. Let
𝑋 be the number of heads that occur.

Exercise 1: Graph the probability function

𝑃 𝑋 = 𝑥𝑖 .
Exercise 2: Show that 𝐸 𝑋 = 1.5.
Exercise 3: Show that Var 𝑋 = 0.75. What
is the standard deviation of 𝑋? Explain.
26
Continuous Random Variables
A random variable 𝑋 is continuous if it can
assume any value in an interval.
Definition: The probability that 𝑋 assumes a
value between 𝑎 and 𝑏 is
𝑏
𝑃 𝑎 ≤ 𝑋 ≤ 𝑏 = න 𝑓 𝑥 𝑑𝑥,
𝑎

where 𝑓 𝑥 is the density function of 𝑋.

27
Distribution Functions
Definition: The cumulative distribution
function (CDF) of 𝑋 is obtained as
𝑥
𝐹 𝑥 = 𝑃 𝑋 ≤ 𝑥 = න 𝑓 𝑡 𝑑𝑡.
−∞

Note: The CDF gives the probability that 𝑋

takes a value less than or equal to 𝑥.
Useful fact: 𝑃 𝑋 ≥ 𝑥 = 1 − 𝐹 𝑥 .

28
Mean and Variance
Definition: Let 𝑋 be a continuous random
variable. The expectation of 𝑋 is
∞
𝜇 = 𝐸 𝑋 = න 𝑥𝑓 𝑥 𝑑𝑥
−∞

and the variance of 𝑋 is

∞
2 2
σ = Var 𝑋 = න 𝑥−𝐸 𝑋 𝑓 𝑥 𝑑𝑥.
−∞

29
Uniform Distribution

Note: Probabilities are areas under the density

function and above the 𝑥-axis.
30
Mean and Variance
Exercise: Let 𝑋 be uniformly distributed on
the interval 0, 𝑏 , where 𝑏 > 0.
Show that
𝑏 𝑏2
𝐸 𝑋 = and Var 𝑋 = .
2 12
How does the parameter 𝑏 affect the mean
and the variance? Explain.

31
Normal Distribution

Note: The normal distribution is symmetric

around 𝜇 and more spread out for larger 𝜎.
32
Normal Distribution
Definition: A continuous random variable 𝑋
has a normal distribution if its probability
density function is given by
1 1 𝑥−𝜇 2
−
𝑓 𝑥 = 𝑒 2 𝜎 ,
𝜎 2𝜋
where the parameters 𝜇 and 𝜎 are the mean
and the standard deviation of 𝑋, respectively.

33
The CDF of 2
𝑁(𝜇, 𝜎 )
The CDF of the normal distribution must be
computed numerically.
Excel: NORM.DIST function

34
NORM.DIST Function
Exercise: Verify that the probabilities in the
table below are correct.

35
Some Facts

36
Why is the normal distribution so
popular? Who uses it?
The Hype about 2
𝑁(𝜇, 𝜎 )
The normal distribution is perhaps the most
important of all probability distributions.
Business applications: Quality control, risk
management, stock market returns, employee
performance, measurement error…
Statistics: Sample means tend to be normally
distributed—no matter what the distribution
of the original population is.
38
Central Limit Theorem

Fact: For 𝑛 ≥ 30, the sample average σ𝑖 𝑥𝑖 /𝑛

has approximately a normal distribution with
mean 𝑝 and variance 𝑝(1 − 𝑝)/𝑛.
39
How can data be used to make
optimal business decisions?
Data Analytics
Setup: A data analyst collected information
about demand. Extensive testing revealed
that the data is well described by a normal
distribution.
Common normality test: Shapiro-Wilk test.
Business problem: Determine the capacity
such that the probability of a stockout is less
than 2% based on the sample information.
41
Estimating Parameters
Data analysis: The sample average is
σ𝑖 𝑥𝑖
𝑥ҧ =
𝑛
and the standard deviation is estimated by

σ𝑖(𝑥𝑖 − 𝑥)ҧ 2
𝑠= .
𝑛−1
In Excel: AVERAGE and STDEV, respectively.
42
Optimal Decisions
Use the data set “stockouts.xlsx” to estimate
the demand parameters and determine the
constrained optimal capacity.
Hint: Use Excel Solver
to find the constrained
optimal capacity.

43
Numerical Optimization

44
Data-Driven Optimal Capacity

Managerial intuition: Why would a firm try

to limit the likelihood of stockouts? Discuss.
45
Conclusion
This session illustrated how probability and
statistics can be used in business settings:
• Predicting outcomes under uncertainty
about the business environment.
• Learning about the business environment
from sample data.
• Making optimal data-driven decisions.

46
Epilogue
In my view, there are two key success factors
in data-rich business environments:
1. Being able to structure links between the
environment, decisions, and outcomes.
2. Being able to make optimal data-driven
decisions.
Next steps: Data Science Camp, core classes.

47
Feedback
I am interested in learning from you about
your impressions about the class.
Please approach me:
1. In person
2. Via e-mail ([email protected])

Note: Your feedback helps me to improve my

teaching—and to grow as a human being.
48

Cheatsheet Probability and Statistics
100% (1)
Cheatsheet Probability and Statistics
10 pages
Mathematical Statistics
No ratings yet
Mathematical Statistics
7 pages
Proba
No ratings yet
Proba
188 pages
Intro To Probability (Pattern Recognition)
No ratings yet
Intro To Probability (Pattern Recognition)
94 pages
Basics Statistics For Data Analysis
No ratings yet
Basics Statistics For Data Analysis
75 pages
Week04 - 2303 Aplikasi Bisnis S2 UI
No ratings yet
Week04 - 2303 Aplikasi Bisnis S2 UI
46 pages
Unit II
No ratings yet
Unit II
140 pages
Chapter 1: Stochastic Processes: STATS 310 Statistics STATS 210 Foundations of Statistics and Probability
No ratings yet
Chapter 1: Stochastic Processes: STATS 310 Statistics STATS 210 Foundations of Statistics and Probability
12 pages
Lecture 2
No ratings yet
Lecture 2
61 pages
Lecture 9 - Probability COMP7180
No ratings yet
Lecture 9 - Probability COMP7180
58 pages
Probability Slides
No ratings yet
Probability Slides
45 pages
iQRM Warm Up Week 5 February 17 Corrected
No ratings yet
iQRM Warm Up Week 5 February 17 Corrected
39 pages
Chapter6 Probability
No ratings yet
Chapter6 Probability
20 pages
Unit 5 Overview of Probability
No ratings yet
Unit 5 Overview of Probability
21 pages
MM3&4 - Probability and Distributions Summary Notes
No ratings yet
MM3&4 - Probability and Distributions Summary Notes
31 pages
Lecture03 Probability Review
No ratings yet
Lecture03 Probability Review
48 pages
Operations Research Lesson 3-1
No ratings yet
Operations Research Lesson 3-1
42 pages
Lecture 4 - Prob and RV - ST
No ratings yet
Lecture 4 - Prob and RV - ST
14 pages
Lec 2
No ratings yet
Lec 2
46 pages
Stat 350 Study Guide
No ratings yet
Stat 350 Study Guide
37 pages
Discrete Random Variable
No ratings yet
Discrete Random Variable
41 pages
Unit 3 - Probability
No ratings yet
Unit 3 - Probability
33 pages
Unit7 Probability Statistics I-1
No ratings yet
Unit7 Probability Statistics I-1
49 pages
Unit 4.
No ratings yet
Unit 4.
22 pages
Probability Review Stochastic
No ratings yet
Probability Review Stochastic
23 pages
ML - Lec 2 - Review of Probability and Statistics
No ratings yet
ML - Lec 2 - Review of Probability and Statistics
30 pages
01 Lectureslides ProbTheory
No ratings yet
01 Lectureslides ProbTheory
42 pages
Probability Theory
No ratings yet
Probability Theory
8 pages
LRaschke SwingTradingRules
83% (12)
LRaschke SwingTradingRules
5 pages
5 Probability
No ratings yet
5 Probability
51 pages
Learn Distribute
No ratings yet
Learn Distribute
23 pages
Chap2 Discrete Distributions
No ratings yet
Chap2 Discrete Distributions
22 pages
Information & Communication
No ratings yet
Information & Communication
13 pages
Probability and Statistics: Dr. K.W. Chow Mechanical Engineering
No ratings yet
Probability and Statistics: Dr. K.W. Chow Mechanical Engineering
113 pages
Random Variables
No ratings yet
Random Variables
44 pages
Orientation - Basic Mathematics and Statistics - Probability
No ratings yet
Orientation - Basic Mathematics and Statistics - Probability
48 pages
Probability
No ratings yet
Probability
9 pages
Probability Distributions - Training
No ratings yet
Probability Distributions - Training
43 pages
MAS 102 - Topic 1
No ratings yet
MAS 102 - Topic 1
13 pages
Probs-Stats Revision Notes
No ratings yet
Probs-Stats Revision Notes
19 pages
CHP 5
No ratings yet
CHP 5
63 pages
STAT 552 Probability and Statistics Ii: Short Review of S551
No ratings yet
STAT 552 Probability and Statistics Ii: Short Review of S551
51 pages
Probability FoundationalMathofAI S24
No ratings yet
Probability FoundationalMathofAI S24
7 pages
Chapter 6
No ratings yet
Chapter 6
5 pages
Probability Theory: Much Inspired by The Presentation of Kren and Samuelsson
No ratings yet
Probability Theory: Much Inspired by The Presentation of Kren and Samuelsson
27 pages
Lecture 1
No ratings yet
Lecture 1
81 pages
S1) Basic Probability Review
No ratings yet
S1) Basic Probability Review
71 pages
Distribution and Statistical Interference
No ratings yet
Distribution and Statistical Interference
43 pages
Chapter 2
No ratings yet
Chapter 2
8 pages
The Monkey and The Turtle
No ratings yet
The Monkey and The Turtle
2 pages
A 18-Page Statistics & Data Science Cheat Sheets
No ratings yet
A 18-Page Statistics & Data Science Cheat Sheets
18 pages
Basic Probability Review
No ratings yet
Basic Probability Review
77 pages
Probability Basics
No ratings yet
Probability Basics
19 pages
Let Me Introduce Myself
No ratings yet
Let Me Introduce Myself
14 pages
Probability Review
No ratings yet
Probability Review
12 pages
Refresher Probabilities Statistics PDF
No ratings yet
Refresher Probabilities Statistics PDF
3 pages
Stat Cheatsheet (Ver.2)
No ratings yet
Stat Cheatsheet (Ver.2)
2 pages
Sem 6 Notes Maths
No ratings yet
Sem 6 Notes Maths
7 pages
Probability Cheatsheet
No ratings yet
Probability Cheatsheet
4 pages
Narrative Essay: Lost in The Forest
100% (5)
Narrative Essay: Lost in The Forest
4 pages
Detailed Lesson Plan Sample
100% (1)
Detailed Lesson Plan Sample
4 pages
Business Plan Cafe
100% (1)
Business Plan Cafe
10 pages
Functions in Staffing
No ratings yet
Functions in Staffing
11 pages
MIdterm Exam PCES 1stSEm2021
No ratings yet
MIdterm Exam PCES 1stSEm2021
5 pages
R. U. R. (Rossum's Universal Robots), by Karel Capek
No ratings yet
R. U. R. (Rossum's Universal Robots), by Karel Capek
70 pages
Seasons of Life
No ratings yet
Seasons of Life
37 pages
Venerable Mahākassapa: The Father of The Sangha
No ratings yet
Venerable Mahākassapa: The Father of The Sangha
6 pages
Poems Year 6 - Part2
No ratings yet
Poems Year 6 - Part2
8 pages
Agroforestry-Economics MAKRAND GUJAR
No ratings yet
Agroforestry-Economics MAKRAND GUJAR
41 pages
Past Simple VS Past Continuous
No ratings yet
Past Simple VS Past Continuous
2 pages
Test Paper - Modal Verbs
No ratings yet
Test Paper - Modal Verbs
1 page
Revolutionizing Education With Industry 5.0
No ratings yet
Revolutionizing Education With Industry 5.0
6 pages
Classes of Shares
No ratings yet
Classes of Shares
4 pages
in Re Gadon
No ratings yet
in Re Gadon
21 pages
Assessment Centre
No ratings yet
Assessment Centre
34 pages
AV3 Part 1,2.
No ratings yet
AV3 Part 1,2.
12 pages
Economic Constitutions
No ratings yet
Economic Constitutions
21 pages
Pathfit Portfolio Template
No ratings yet
Pathfit Portfolio Template
3 pages
Samwise Gamgee
No ratings yet
Samwise Gamgee
3 pages
7 State Transition Table
No ratings yet
7 State Transition Table
14 pages
Measuring Cultural Dimensions External Validity An
No ratings yet
Measuring Cultural Dimensions External Validity An
9 pages
Level 1 Guide - DEVELOPED
No ratings yet
Level 1 Guide - DEVELOPED
3 pages
Historical Reflection Paper On The Significance of Rizal
No ratings yet
Historical Reflection Paper On The Significance of Rizal
4 pages
Focused Teaching: Promoting Accelerated Learning
No ratings yet
Focused Teaching: Promoting Accelerated Learning
23 pages
The Impact of Different Genres of Music On Teenage
No ratings yet
The Impact of Different Genres of Music On Teenage
11 pages
Present Tense Exercise 9
No ratings yet
Present Tense Exercise 9
3 pages
Probability Theory: A Concise Course
From Everand
Probability Theory: A Concise Course
Y. A. Rozanov
4/5 (2)
Structured Decision Making
From Everand
Structured Decision Making
Andreas Michael Theodorou
No ratings yet
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
Probability Distributions: Six Sigma Thinking, #5
From Everand
Probability Distributions: Six Sigma Thinking, #5
Sumeet Savant
No ratings yet

Probability and Statistics

Uploaded by

Probability and Statistics

Uploaded by

Probability and Statistics

Levels: Alternative futures (first-order uncertainty), range of

Statistics: In statistics, we make an inference

Definition: An event 𝐸 for an experiment is a

Exercise 1: Let 𝐻 or 𝑇 be the result for each

Note: If the outcome 𝐻𝑇𝑇 occurs, then both 𝐸

Note: 𝐸 and 𝐹 are independent if 𝑃 𝐸|𝐹 =

Note: This result has many applications in

Using this definition,

where 𝑃(𝑋 = 𝑥𝑖 ) is the probability that 𝑋

The standard deviation of 𝑋 is 𝜎 = Var 𝑋 .

Exercise 1: Graph the probability function

where 𝑓 𝑥 is the density function of 𝑋.

Note: The CDF gives the probability that 𝑋

and the variance of 𝑋 is

Note: Probabilities are areas under the density

Note: The normal distribution is symmetric

Fact: For 𝑛 ≥ 30, the sample average σ𝑖 𝑥𝑖 /𝑛

Managerial intuition: Why would a firm try

Note: Your feedback helps me to improve my

You might also like