MS3227 Week 1
MS3227 Week 1
Business
Week 1
September 5, 2024
Overview of the course
Tentative plan of the course
Read the book, so you can be well prepared for the assignment and
exams.
2
Course Administration
TA: TBA
Four homework assignments, will be due one week after posting date
(2 hours before the class). We do not accept late submission.
3
Course Administration
4
What is probability
What is probability?
6
What is probability?
7
What is probability?
If I were to toss the coin 10 times, roughly 5 times I will see a head.
7
What is probability?
If I were to toss the coin 10 times, roughly 5 times I will see a head.
7
What is probability?
If I were to toss the coin 10 times, roughly 5 times I will see a head.
7
Wait
8
Wait
8
Wait
8
Wait
8
Wait
8
Experiment and Event
9
Sample space
10
Sample space
10
Sample space
{1, 2, 3, 4, 5, 6}
10
Sample space
{1, 2, 3, 4, 5, 6}
10
Sample space
{1, 2, 3, 4, 5, 6}
10
Sample space
{1, 2, 3, 4, 5, 6}
10
Sample space
{1, 2, 3, 4, 5, 6}
10
Sample space
{1, 2, 3, 4, 5, 6}
Simple event:
11
Simple and compound events
Simple event:
You toss two coins, get HH.
11
Simple and compound events
Simple event:
You toss two coins, get HH.
You roll a die and get 6.
11
Simple and compound events
Simple event:
You toss two coins, get HH.
You roll a die and get 6.
11
Simple and compound events
Simple event:
You toss two coins, get HH.
You roll a die and get 6.
11
Simple and compound events
Simple event:
You toss two coins, get HH.
You roll a die and get 6.
11
Simple and compound events
Simple event:
You toss two coins, get HH.
You roll a die and get 6.
11
Simple and compound events
Simple event:
You toss two coins, get HH.
You roll a die and get 6.
11
Simple and compound events
Simple event:
You toss two coins, get HH.
You roll a die and get 6.
11
Simple and compound events
Simple event:
You toss two coins, get HH.
You roll a die and get 6.
11
Set
12
Set
13
Set
13
Set
13
Set
13
Set
A = B!
13
Set
A = B!
13
Set
A = B!
13
Set
A = B!
13
Set
A = B!
13
Set
A set can be finite (e.g. the set of people in this class), or infinite
(e.g. the set of real numbers).
14
Operation of sets
Let the universal set Ω be the set of all objects we are interested in.
ΩC = ∅.
15
Operation of sets
The union, S ∪ T of two sets S and T is the set of elements that are
in either S or T or both. S ∪ T = {x | x ∈ S or x ∈ T }. S or T .
16
Operation of sets
The union, S ∪ T of two sets S and T is the set of elements that are
in either S or T or both. S ∪ T = {x | x ∈ S or x ∈ T }. S or T .
16
Operation of sets
The union, S ∪ T of two sets S and T is the set of elements that are
in either S or T or both. S ∪ T = {x | x ∈ S or x ∈ T }. S or T .
The difference, S\T is the set of elements that are in S but not T . S
\T = {x | x ∈ S and x ∈
/ T }.
16
Operation of sets
The union, S ∪ T of two sets S and T is the set of elements that are
in either S or T or both. S ∪ T = {x | x ∈ S or x ∈ T }. S or T .
The difference, S\T is the set of elements that are in S but not T . S
\T = {x | x ∈ S and x ∈
/ T }.
16
Sample Space and Events
Sample Space & Event
Event:
{All heads} = {HHH}
(not E ) (A & B ) (A or B )
0 ≤ P(E ) ≤ 1
P(Ω) = 1
20
General Rules of Probability
Disjoint Events (Mutually Exclusive Events)
21
neral Addition
General Rule
Addition Rule
22
neral Addition Rule
General Addition Rule
What is the probability of drawing a jack or a red card from a well
What’s the probability of drawing a jack or a red card from a well
shuffled full full
shuffled deck?
deck?
4 26 2 28
P(jack
P(jack or red)or= red) = P(jack) + P(red)
P(jack)+P(red)−P(jack P(jack
and red) = and
+ −red)=
52 52 52 52
4 26 2 28
= + = 23
VennasDiagrams
shown in Fig. 4.8, with the colored portion representing the event.
General Addition Rule
Complements, intersections,
Relationships Amongunions of events can be represented
Events
visually using Venn diagrams:
Each event E has a corresponding event defined by the condition that “E does
occur.” That event is called the complement of E, denoted (not E). Event (not
Complement
consists of all outcomes notIntersection Union
in E, as shown in the Venn diagram in Fig. 4.9(a).
Ec A\B A[B
FIGURE 4.9
r (a) event (not E),
d (c) event (A or B)
E A B A B
(not E ) (A & B ) ( A or B )
With any two events, say, A and B, we can associate two new events. One n
event is defined
P(A orbyB)the=condition
P(A) + that “both
P(B) − P(A A and
eventand B)event B occur”
7 and is deno
24
ometric shapes) inside the rectangle. In the simplest case, only one event is display
VennasDiagrams
Inequality shown in Fig. 4.8, with the colored portion representing the event.
Complements, intersections,
Relationships Amongunions of events can be represented
Events
visually using Venn diagrams:
Each event E has a corresponding event defined by the condition that “E does
occur.” That event is called the complement of E, denoted (not E). Event (not
Complement
consists of all outcomes notIntersection Union
in E, as shown in the Venn diagram in Fig. 4.9(a).
Ec A\B A[B
FIGURE 4.9
r (a) event (not E),
d (c) event (A or B)
E A B A B
(not E ) (A & B ) ( A or B )
With
If A ⊂ B, any ≤
P(A) twoP(B)
events, say, A and B, we can associate two new events. One n
event is defined by the condition that “both event A and event B occur”
7 and is deno
P(A ∪ B) ≤ P(A) + P(B)
25
The Complement Rule
P(A) + P(Ac ) = 1
26
Example - The Complement Rule
Sample space
P(Ac ) = 18 .
P(A) = 1 − P(Ac ) = 1 − 1
8 = 87 .
27
Conditional Probability
Example:
A card is drawn from a well-shuffled deck.
28
Conditional Probability
P(A | B),
We have
4 4
P(A | B) = 6= P(A) = .
12 52
The numerators are the same, but the denominators change. The
sample space changes given conditional information.
29
Conditional Probability
area of (A ∩ B)
P(A | B) = P(A and B)/P(B) =
area of B
Recall that
P(A) = area of A/area of Ω
30
More examples
31
More examples
32
Example: Age and Rank of Faculty
33
Example: Age and Rank of Faculty
Select one faculty at random, what’s the probability that he / she is a full
professor?
430
P(professor) = 1164
34
Example: Age and Rank of Faculty
470
P(< 40) = 1164
35
Example: Age and Rank of Faculty
Select one faculty at random, what’s the probability that he / she is a full
professor and under 40 years old?
54
P(professor and < 40) = 1164
36
Example: Age and Rank of Faculty
Given a faculty is under 40 years old, what’s the probability that he / she
is a full professor?
54
P(professor |< 40) = 470
37
Example: Age and Rank of Faculty
156
P(40 − 49 | professor) = 430
38
Conditional Probability
P(A and B)
P(A | B) =
P(B)
39
Joint and Marginal Probability
40
Joint and Marginal Probability
Fill in the marginal probability first, which is row / column sum of joint
probability (integrate out one variable).
41
General Multiplication Rule
P(A and B)
P(A | B) =
P(B)
can be used the other way around, which is the general multiplication
rule.
P(A and B) = P(A | B) × P(B)
When you want to calculate P(A and B), but the two terms on the right
side are easier to calculate, use this rule.
42
General Multiplication Rule
43
General Multiplication Rule
A deck of cards is shuffled and the two top cards are placed face down on
a table. What is the probability that neither card is a K?
Let
44
General Multiplication Rule for Multiple Events
45
General Multiplication Rule for Multiple Events
Five cards are dealt from a deck of well-shuffled card. What is the chance
that none of them are hearts ♥?
Let Ai be the event that the i-th card dealt is not a ♥.
P(A1 ) = P(1st card is not a♥) = 39
52 .
Given that the 1st card is not a ♥, the conditional probability that
38
the 2nd is not a ♥= P(A2 | A1 ) = 51 .
Likewise, P(A4 | A1 , A2 , A3 ) = 36
49 , P(A5 | A1 , A2 , A3 , A4 ) = 35
48 .
Keep in mind
47
Independence
Independence
Knowing that the coin landed on a head on the first toss does not
provide any useful information for determining what the coin will
land on in the second toss.
Outcomes of two tosses of a coin are independent.
Knowing that the first card drawn from a deck is an ace does
provide useful information for determining the probability of drawing
an ace in the second draw.
Outcomes of two draws from a deck of cards (w/o replacement) are
dependent.
48
Independence and Conditional Probabilities
Equivalently, one can also check the independency of the events A and B
by check whether P(B | A) = P(B)
49
Practice – Checking for Independence
50
Independence of events vs. independence of variables in a con-
tigency table
Recall that for a two-way contingency table, the two variables are
independent if the row proportions do not change from row to row.
This is consistent with the definition of independence of events since
and if the row proportions do not change from row to row, they will be
equal to the marginal probability of the column variable.
For example of the faculty, P(rank | age) 6= P(rank), the age and rank of
faculty members are dependent.
51
Multiplication rule for independent events
A recent Gallup poll suggests that 25.5% of Texans do not have health
insurance as of June 2012. Assuming that the uninsured rate stayed
constant, what is the probability that two randomly selected Texans are
both uninsured?
0.2552
53
Put everything together
54
Abuse of the multiplication rule
True or false? 0.134 × 0.52 = 0.07 of the U.S. populartion were males
age 65 or older.
False, age and gender are dependent. In particular, women on average
live longer than men. There are more old women than old men.
Among the age 65 or older, only 44% are male (this is the condition
probability!), not 52%. So only 0.134 × 0.44 = 0.059 were males age 65
or older.
55
Tree diagrams and Bayes’
Theorem
Example – Nervous Job Applicant
56
Tree diagram for the nervous job applicant
·
0.3
eed P(N C and S) = 0.3 × 0.9
Succ
Not 0.9
nervous Fail
0.1 P(N C and S C ) = 0.3 × 0.1
57
Nervous job applicant
58
Bayes’ Theorem
59
Medical Testing
60
Enzyme Immunoassay test for HIV
What’s the probabilitythat the tested person is infected if the test was
positive?
P(D)P(T + | D)
P(D | T +) =
P(D)P(T + | D) + P(D C )P(T + | D C )
1/300 × 0.98
=
1/300 × 0.98 + 299/300 × 0.005
= 0.394
61
Tree diagram for HIV test
Posit
ive P (D and T +) = 3001 × 0.98
0.2
Infected Nega
1
tive
0 0.99
30 P(D and T −) = 1
× 0.02
5 300
· 29
30 9
0
ive P(D C and T +) = 299
× 0.005
Posit 300
Not .005
Ne0ga
infected tive
0.99 P(D C and T −) = 299
× 0.995
5 300
Only the red cases are positive test results, out of them, the bold one is
D, have disease.
bold 1/300 × 0.98
P(D | T +) = =
red1 + red2 1/300 × 0.98 + 299/300 × 0.005
62