Lecture 6 Stat 1100Q
Lecture 6 Stat 1100Q
STAT 1100Q-001
LECTURE 6
Probability
Motivation
We’ve discussed Steps 1 and 2 (Producing Data & EDA), and we’re now moving on to Step 3,
Probability. Step 4 is the last 3rd of the course.
Group 1:
Group 2:
From this, we can see that there is variation due to the random sample.
Austin Menger 2
STAT 1100Q-001
The question is: Should the contestant switch? Are the odds of winning higher if he/she
switches?
Step 1: One door has the prize, and the other two have goats. Whatever is behind your door is
what you take home. Suppose that you, the contestant, select door 1.
You
Right now at this step, the chance of the contestant selecting the door with the prize behind it
is
Austin Menger 3
STAT 1100Q-001
Step 2: Now, the Monty Hall (the host) opens one of the other two doors that he knows has a
goat behind it , say Door 3, and then asks if you’d like to switch your door selection from Door 1
to Door 2:
Terminology:
Event → a statement about the nature of the outcome that we’re actually going to get once the
experiment is conducted
- Sidenote – Subset →
Set A = {1,2,3,4}
Set B = {2,4}
Set C = {2,3,5}
Set D = {1,2,3,4}
Looking at just one coin being flipped once, we have two possible events:
Event A:
Event B:
Now, let’s consider an example with more event possibilities, Example 2 – One Die Roll
Event A:
Event B:
Event C:
Event D:
Austin Menger 5
STAT 1100Q-001
Example 3 takes a bit more work. Here, we are looking at a couple that decides to have children
until they have one boy and one girl, but no more than 3 children.
Event A:
Event B:
B C
Now, how does this blend into probability? We can talk about the probability of an event
occurring:
P(B) →
P(C) →
The probability associated with an event describes the likelihood of that event occurring, i.e.
the outcome is one of the possible values dictated by the event.
If P(B) = 0, then
If P(B) = 1, then
Let’s start simple. The probability that you’ll get a Head on a coin flip is 1/2. We all know that.
How is this calculated, though? It ties into relative frequency.
Remember that we defined relative frequency in Lecture 1 as the percent of all subjects that fall
into the category.
In theory, we could flip the coin an infinite number of times. The relative frequency of heads
will be half the number of flips.
In practice, however, we can’t flip a coin infinitely many times. So, we must instead choose
what is considered a very large number, 40,000 times for example like a couple of Berkeley
undergrads did (https://www.stat.berkeley.edu/~aldous/Real-World/coin_tosses.html) . Out of
40,000, theoretically 20,000 should be heads.
However, the relative frequency approach only provides an estimate for P(A). As you can
imagine, if we tossed a coin 40,000 times (1 hour a day for an entire semester according to the
Berkeley undergrads…) we wouldn’t get exactly 20,000 heads. It would vary due to variability,
but the average would be 20,000 if you repeated the experiment a number of times.
Austin Menger 8
STAT 1100Q-001
Note: We control how good this estimate is by the number of times we repeat the random
experiment (increase from 40,000 to 100,000). The more repetitions performed, the closer
this relative frequency gets to the true probability P(Head).
If all the outcomes in the sample space S are equally likely, then we can find P(A) using
the following formula:
Note: It’s NOT always the case that all outcomes are equally likely.
Note: if looking at pairs, order may or may not matter depending on the context.
Austin Menger 9
STAT 1100Q-001
Formulaically:
Note: this rule is especially helpful when finding P(A) is quite complicated or laborious.
Visually:
Notation:
Note: 2 events are called disjoint if they don’t have any outcomes in common
Visually:
Mathematically:
3. The Union of 2 Events (all the outcomes that are in A or B (or both) → all the
outcomes that are in at least one of the 2 events)
Visually:
Austin Menger 11
STAT 1100Q-001
Notation:
a. Are all the possible outcomes equally likely? What is the sample space?
e. What is the 𝑃(𝐴𝑐 )? What does the event 𝐴𝑐 actually mean in context?
Example – Symptoms
Suppose that we are looking at a drug to treat high blood pressure. We know that there is a
14% chance of headache, an 11% chance of indigestion, and a 6% chance of getting both.
Another way to dissect and summarize this information is to use something called a
probability table:
A fundamental principle used to fill in the probability table is that when there are only 2 events
A,B:
To put this in context from our previous example, 14% of all people who use the drug get
headaches. Of that 14%, 6% also get indigestion and the other 8% experience only headaches.
a) Write down the 3 pieces of given information in terms of probabilities involving event F
and event P
c) What is the probability that only Prime Delivery is on time? (put this in symbols and
then answer using the table)
Austin Menger 15
STAT 1100Q-001
d) What is the probability that both services render on-time delivery? (put this in symbols
and then answer using the table)
e) What is the probability that only Fed-Ex renders on-time delivery? (put this in symbols
and then answer using the table)
So, to find 𝑃(𝐴 ∪ 𝐵) in the case where the 2 events are disjoint, we have
a. Write down the 3 pieces of info given in terms of the probabilities of events J
and A
Note: There is another way to word the same question: What is the probability that a family
living in Greenwich, CT owns at least one of the two types of cars?
Austin Menger 17
STAT 1100Q-001
d. Now, what is the probability that a family living in Greenwich, CT owns exactly
one of the two car types mentioned?
Visually:
Formulaically: