Ilovepdf Merged
Ilovepdf Merged
What is probability?
While statistics deals with data and inferences from it, probability deals with the random processes that
are behind data or outcomes.
Do you think that "the sun would rise tomorrow morning". Probably your answer would be a certain "YES".
You are sure about it, aren't you? If you are sure about the outcome of an event, we call it a certainty.
Whenever we’re unsure about the outcome of an event, we can talk about probabilities., i.e., probabilities
of outcomes of an event—how likely they are to occur. Hence, probability is the extent to which an
outcome is likely to happen.
Mathematically,
If you toss a coin, there are two possible outcomes, and each of these outcomes is equally likely. A head
is as likely to come up as a tail.
the probability of getting a head in a toss is = number of heads / total number of outcomes i.e., here
head and tail
If you roll a six-sided die, there are six possible outcomes, and each of these outcomes is equally
likely. A one is as likely to come up as a three. There are six possible outcomes.
Probability can range in between 0 to 1, where 0 means the event to be an impossible one and 1
indicates a certain event.
BASIC CONCEPTS
Experiment and Trial
An experiment is any procedure that can be repeated any number of times, and the possible outcomes
are known. An experiment is said to be random if it has more than one possible outcome.
For example, tossing a coin. We know that there are only 2 possible outcomes head and tail and
tossing can be repeated any number of times. While tossing a coin, the outcome is uncertain i.e., we
don't know whether head or tail would turn.
S= {HH, TT, HT, TH}, where H indicates a Head and T indicates a Tail.
The probability of getting a head is 1/2 and the tail is 1/2. Hence the total probability of all events
(head and tail ) is 1/2 + 1/2 - 1
P(S)=1
2. Tree Diagram
Tree diagram is a pictorial representation of different possible outcomes. It consists of branches
(represented by an arrow) and ends (represented by circles or outcomes). Probability of each branch is
written on the branch, whereas the ends are containing the final outcome.
Toss a coin
From the tree diagram, we can compute , P(HH) = 1/2 * 1/2 = 1/4.
Take a challenge
Draw a tree diagram showing all possible combinations of three children families based on gender.
The possible combinations are {BBB, BBG, BGB, BGG, GBB, GBG, GGB, GGG}, which is the sample space.
Problem:
You are off to the Indian cricket team, and want to be the opening batsman, but that depends who is the
Coach today:with Coach Ravisastri the probability of you being an opening batsman is 0.6, with Coach
Rahul Dravid the probability of you being an opening batsman is 0.3. Ravisastri is Coach more often,
about 6 out of every 10 games (a probability of 0.6). So, what is the probability you will be an opening
batsman today?
ANS: 0.36+0.12=0.48
Another example,
A bag contains 5 red, 4 white and 3 blue balls. You pick a ball at random. The possible colour of the
ball could be red, white or blue.
When a card is drawn at random from a pack of cards, What is the probability of getting a king?
P(King) = 4/52
When two cards are drawn at random, what is the probability that the cards are king and queen?
Here there are actually two ways you can solve the problem.
After the first draw, the card might have been put again in the pack and the second card is drawn under
the same condition as the first card drawn, or
after the first draw, the card might not have been put again in the deck and the second card is drawn
under different conditions as the first card. I.e., for the second draw, the pack had only 51 cards.
The first is the case of replacement and the second is the case of without replacement.
Take a challenge
A box contains 4 purple balls, 5 orange balls and 11 yellow balls. If three balls are drawn from the box
at random without replacement, what is the probability that the first ball is orange, the second ball is
purple, and the third ball is yellow?
4. Complementary Events
The possibility that there will be only two outcomes which states that an event will occur or not. If we
consider the occurrence of an event, then it's complementary is non occurrence of the event. Basically,
the complement of an event occurring is the exact opposite that the probability of it is not occurring.
P(A') = 1-P(A)
A bag contains five blue balls and 6 red balls, a ball is taken at random from the bag. What is the
probability of getting a blue ball?
Totally there are 11 balls in the bag and there are 5 blue balls.
P(B)= 5/11
A ball is taken at random from the bag. What is the probability of getting a red ball?
Let the event of getting a red ball is denoted by 'R'
P(R)= 6/11
A ball is taken at random from the bag. What is the probability of not getting a blue ball?
There are only two different coloured balls in the bag. If a blue ball is not taken, it means that a red ball
is taken. Hence, this question is the same as the above one.
Using complementary,
P(B') = 1- P(B)
= 1-5/11
= 6/11
A dice is rolled, what is the probability that the top face shows anything other than 1?
It means that the top face of the dice might show 2,3,4,5,6
or
P(1') = 1-P(1)
= 1-1/6 = 5/6
For example, if we toss a coin, either head or tail turn up, but not both head and tail at the same time.
If a dice is rolled, we can only have one number shown at the top face. The numbers on the face are
mutually exclusive events.
If A and B are mutually exclusive events then the probability of A happening OR the probability
of B happening is P(A) + P(B).
Solution:
P(3) = 1/6
P(6) = 1/6
Axiom 4
P(A or B) = P(A)+P(B)-P(A and B)
Take a challenge
Three coins are tossed at the same time. A is the event of getting at least 2 heads. B denotes the event
of getting no heads, C is the event of getting heads on the second coin and D is the event of getting
heads on the third coin. Which of these are mutually exclusive?
Solution:
B = { TTT }
B and C, B and D, and A and C are mutually exclusive since they have nothing in their intersection.
6. INDEPENDENT EVENTS
Events A and B are independent events if the probability of Event B occurring is the same whether
event A occurs or not. For example, if you throw two dice, the probability that the second die comes up
3 is independent of what turns up in the top face of the first dice.
When two events A and B are independent, the probability of both occurring is the product of the
probabilities of the individual events. More formally, if events A and B are independent, then the
probability of both A and B occurring is:
This gives the multiplication rule to find the probability of independent events occurring together.
Let A and B be independent events. Then the probability of A and B occurring is:
P(A and B) = P(A ∩ B) = P(A) ˙ P(B)
If you toss a coin twice, what is the probability that it will come up heads both times?
Event A is that the coin comes up heads on the first flip and Event B is that the coin comes up heads
on the second flip. Since both P(A) and P(B) equal 1/2, the probability that both events occur is
Take a challenge
At a parking lot there are 100 vehicles, 65 of which are cars, 26 are bikes and the remaining are
bicycles. If every vehicle is equally likely to leave, find the probability of:
Answer
1. 26/100
2. 65/100
3. 9/100
4. 65/99
Is it 1/2., i.e.., When you toss a coin twice you have to get a head once, is it right.
Can you do the experiment by yourself and record the results. Just toss the coin twice and observe the
result.
Do you get different answers, did you get both tails?. If so, why.
1. Classical Approach
If an experiment has n outcomes, this approach assigns a probability of 1/n to each outcome where
each outcome is assumed to have an equal probability of occurrence. This method is also called the
axiomatic approach.
The relative frequency is not a theoretical quantity, but an experimental one. Probabilities are assigned
on the basis of experimentation or historical data.
Let A be an event of interest, and assume that we performed the same experiment n times. Hence, N is
the number of times an experiment is performed. Let NA be the number of times that A occurred. Now,
consider the relative frequency NA/N. Then, in this method, we “attempt” to define P(A) as: P(A) = lim
N→∞ NA / N.
Limitation:
Hence, for large n, the ratio nA/n may be taken as a reasonable approximation for P(A).
The observed relative frequency is just an approximation to the true probability of an event. However, if
we were able to perform our process more and more times, the relative frequency will eventually
approach the actual probability.
A frequency is the number that tells how many times a data or an event is repeating in an experiment.
Cumulative frequency is the total of all frequencies of all events that are below a particular point in a
given list of events.
Relative frequency is the absolute frequency normalized by the total number of events.
Relative frequency
The relative frequency of an event is defined as the number of times that the event occurs during
experimental trials, divided by the total number of trials conducted.
Relative frequency approach defines probability as either the observed relative frequency of an event
in a very large number of trials or the proportion of times that an event occurs in the long run when
conditions are stable.
Try this
Put 3 Blue balls and five red balls in a bag. Take a ball from the bag and note down the colour. Replace
the ball into the bag and repeat the experiment for 10 times.
Trial 1 2 3 4 5 6 7 8 9 10
Outcome Red Red Blue Blue Blue Red Red Blue Blue Blue
I got a red ball on 4 trials so the relative frequency of the event that a red ball was taken is 4/10 (=0.4).
The theoretical or classical approach of probability indicates that the probability of taking a red ball
from the bag is 5/8 (=0.625)
Is it not? My experimental results are different from the results of my theoretical approach.
Now repeat the experiment another 10 times and combine the result.
I did it and now I got the relative frequency of the event that a red ball was taken is 8/20 (=0.4)
Now I got the relative frequency of the event that a red ball was taken is 15//30 (=0.5)
If you repeat the experiment many times, you could observe the probability of picking a red ball
approaching 0.625.
Hence, The observed relative frequency is just an approximation of the theoretical probability of an
event. If we can perform the experiment more and more times, the relative frequency will eventually
approach the actual probability. This interpretation of probability rests on the important assumption
that our process or experiment can be repeated many times under similar circumstances.
The relative frequency approach uses the relative frequencies of past occurrences as probabilities.
This past data is used to predict the future.
Problems
Solution:
Let us calculate the probabilities for each cell and write first.
1. Probability of a sales person earning below 10000 Rs is (Look into the cell1)=0.05
2. Probability of a sales person earning between 10000-20000 Rs is (Look into the cell2)=0.0833
3. Probability of a sales person earning between 20000-40000 Rs is (Look into the cells 3 &
4)=0.1167+0.4167 = 0.5334
4. Probability of a sales person earning less than 20000 Rs is (Look into the cells 1 & 2)=0.05 + 0.0833
= 0.1333
5. Probability of a sales person earning more than 40000 Rs is (Look into the cells 5 & 6)=0.2333+0.1 =
0.3333
Problem 2:
The poll results of nurses and doctors regarding new leave policy is given below
1. that a nurse randomly selected from the polled group mildly supports the leave policy?
2. that a doctor randomly selected from the polled group is undecided about the leave policy?
3. that a randomly selected doctor mildly or strongly supports the leave policy?
4. that a randomly selected employee is undecided on the leave policy?
5. that a randomly selected employee mildly or strongly opposes the leave policy?
Answer:
1. What is the probability that a nurse randomly selected from the polled group mildly supports the
leave policy?
In total, we have 30 nurses and out of which 11 of them mildly support the leave policy.
2. What is the probability that a doctor randomly selected from the polled group is undecided about the
leave policy?
In total, we have 40 doctors and out of which 2 of them are undecided about the leave policy.
3. What is the probability that a randomly selected doctor mildly or strongly supports the leave policy?
Out of the 40 doctors, 13 of them mildly support and 10 of them strongly supports the wage policy.
Note: Remember, only one opinion is allowed. I.e., opinions are mutually exclusive.
Probability of both mildly and strongly support is zero., Remember, in a polling, only one opinion is
allowed. i.e., opinions are mutually exclusive.
4. What is the probability that a randomly selected employee is undecided on the leave policy?
Note that the question asked is about the employee and not about a nurse or a doctor. An employee
here is a nurse or a doctor.
5. What is the probability that a randomly selected employee mildly or strongly opposes the leave
policy?
Similar to the above, out of the 70 employees, 23 of them mildly or strongly oppose the leave policy (4
nurses mildly and 4 nurses strongly oppose the leave policy; 8 doctors mildly and 7 doctors strongly
oppose the leave policy)
A survey was taken on 60 classes at a University to find the total number of foreign students in each
class. The table below shows the results:
Answer:
Caution: Both the data: foreign students and classes are given as frequencies.
1. What is the probability that the a randomly chosen class has 2 foreign students?
2. What is the probability that a randomly chosen class has at least 3 foreign students?
At Least 3 foreign students means, a class should have a minimum of 3 foreign students., that is a can
have 3 or 4 or 5 foreign students.
Here, there are 3 foreign students in 18 classes, 4 foreign students in 12 classes and 5 foreign
students in 11 classes. Hence, adding all these, there are 41 classes which have a minimum of 3
foreign students.
3. First, find the total number of foreign students in all these 60 classes
Hence, the probability that a student randomly chosen from these 60 classes is a foreigner = total
number of foreign students in 60 classes/ total number of students in 60 classes
= 187/3600
= 0.0519
What is the probability of you passing this subject with the top most grade 'A'?
Your answer could be anything between 0 to 1.
If you are completely sure about getting A, it may be 1 or completely sure about not getting A, it is 0.
You may think that there are two possibilities, getting or not getting and hence the probability would be
1/2.This is a classical or theoretical approach to probability.
In this case, you cannot repeat the course again and again and calculate the probabilities. Even if you
repeat the course, the next time the conditions of the course might be different, the instructor might be
different, the evaluation pattern might be different, .. many changes. So, in this case, the relative
approach to probability might not work.
The answer to this question completely lies on your judgment, based on your previous experiences,
your knowledge, your gut feeling.... It is completely based on your belief and the results vary from
individual to individual, that is the answer is completely subjective and not objective.
Subjective approach to probability is completely based on belief and majorly used for experiments that
could not be repeated.
What is the probability that a randomly selected employee mildly supports the wage policy?
b. What is the probability that a randomly selected employee mildly opposes the wage policy?
c. What is the probability that a randomly selected employee strongly supports the wage policy?
d. What is the probability that a randomly selected employee strongly opposes the wage policy?
3. Can you defend with examples the importance of relative frequency approach to probability is
useful in worldly scenarios?
Module 3 - Venn Diagrams and Probability
Module 3 - Lesson
Venn diagrams are named after their inventor, John Venn, a mathematics professor at Cambridge
University. Venn diagrams gave a new way to approach issues of logic.
When working with more complex problems, we can have three or more events that intersect in various
ways. To solve these problems, we usually want to count the number (or percentage) of outcomes in an
event, or a combination of events. Venn diagrams are a useful tool for recording and visualising the
counts.
If A denotes the event and S is the sample space, the Venn diagram looks like
If we consider two sets A and B, then the Venn Diagram can be drawn as
In case, if A and B are two disjoint sets or if the events A and B are mutually exclusive, then the Venn
diagram is represented as
For example, If there are 10 students who have passed in English and 12 students who have passed in
Maths and 3 students in both, we can draw the diagram as
Note: If there are more number of groups, generally more than three, then Venn diagrams are not preferred.
Only tables are preferred.
Even, if there are only three groups with many subgroups, then also tables are preferred than Venn
diagrams.
1.A group of 128 students were surveyed, and it was found that each of the students surveyed liked at
least one of the following three fruits: apple, blue berry, and cherry.
39 liked apples.
50 liked blue berries.
39 liked cherries.
21 liked apples and blue berries.
18 liked blue berries and cherries.
19 liked apples and cherries.
Exactly, 22 of them liked only any two of these fruits: Apples, Blue berries and cherries.
a. How many students liked apples, but not blue berries or cherries?
b. How many students liked cherries, but not blue berries or apples?
c. How many students liked all of the following three fruits: apples, blue berries, and cherries?
d. How many students liked apples and cherries, but not blue berries?
Solution:
2. In a survey of university students, 64 had taken mechanical course, 94 had taken computer course,
58 had taken psychology course, 28 had taken mechanical and computer, 26 had taken mechanical
and computer, 22 had taken computer and psychology course, and 14 had taken all the three
courses. Find how many had taken one course only.
Step 1 :
Step 2 :
Step 3 :
= 106
Hence, the total number of students who had taken only one course is 106
2.In a survey of 500 people of a community, it was found that 49% liked eating fish, 53% liked eating hot
dog and 62% liked eating bread. Also, 27% liked eating fish and hot dog both, 29% liked eating hot dog
and bread both and 28% liked eating fish and bread both. 5% liked eating none of these foods.
Find the ratio of number of people who like eating only fish to those who like eating only hot dog.
Find the number of people who like eating only one of the three given foods.
Find the number of people who like eating at least two of the given foods.
Solution:∪ H ∪
mber of people who like eating all the three foods = 15 % of 500 = 75.
Ratio of the number of people who like only fish to those who like only hot dog = (9% of 500)/(12% of 500)
= 9/12 = 3:4.
The number of people who like eating only one of the three given foods = (9% + 12% + 20%) of 500 = 205
The number of people who like eating at least two of the given foods=(number of people who like eating
only two of the foods) +(number of people who like eating all the three foods)= (12 + 13 + 14 + 15)% i.e.
54% of 500 = 270.
4.For the purposes of a Entertainment research, a survey of 1000 respondents is conducted in a cit. The
results show that 52 % liked watching Chinese movies, 45% liked watching French movies and 60% liked
watching Russian movies. In addition, 25% liked watching Chinese and French movies, 28% liked
watching Russian movies and French movies and 30% liked watching Chinese and Russian movies. 6%
liked watching none of these movies.
Find the number of respondents who like watching only one of the three movies.
Find the number of respondents who like watching at least two of the given movies.
Let’s represent the data above in a more digestible way using the Venn diagram formula elements:
Now, we are going to apply the Venn diagram formula for 3 circles.
= 20%
It is a great time to make our Venn diagram related to the above situation (problem):
From the Venn diagram example, we can answer our questions with ease.
The number of respondents who like watching all the three movies = 20% of 1000 = 200.
THE PROBABILITY OF A RANDOMLY PICKED RESPONDENT TO LIKE WATCHING ALL THE THREE MOVIES IS 0.2
Number of respondents who like watching only one of the three movies = (17% + 12% + 22%) of 1000 = 510
The probability of a randomly picked respondent to like watching only one of the three movies is 0.51
The number of respondents who like watching at least two of the given movies = (number of respondents
who like watching only two of the movies) +(number of respondents who like watching all the
three movies) = (10 + 5 + 8 + 20)% i.e. 43% of 1000 = 430.
Module 3 - Short Answer Questions
1. Depict AUB in a Venn diagram
2. When do you use Venn diagram to solve problems?
3. If n(A)=30, n(B)=45 and n(AUB)= 55, Find n(A ∩ B)
4. Depict the events A and B, in case if both are mutually independent.
5. If n(A)=30, n(B)=45 and n(A ∩ B) = 10, find n(AUB)
Conditional Probability is a measure of the probability of an event given that another event has already
occurred. If the event of interest is A and the event B is known or assumed to have occurred, “the
conditional probability of A given B”, is usually written as P(A|B).
If A and B are two events, then the conditional probability of A given B is defined as
If P(B)=0, then the conditional probability is undefined. When the event B never occurs, it is
meaningless to talk about the occurrence of event A provided the event B has already occurred.
Note:
P(A/B) = 1
3. P(A'/B) = 1-P(A/B)
What is the probability that two cards drawn at random from a pack of cards will both be Kings?
Can we use the formula for the probability of two independent events,
This is incorrect, because the two events are not independent. If the first card drawn is a King, then the
probability that the second card is also a King is less because there would only be three Kings left in
the pack.
Once the first card chosen is a King, the probability that the second card chosen is also a King is
conditional probability. We need to find the probability of the King in the second draw provided we
have already drawn a king in the first draw. Symbolically,
1. In a card game, suppose a player needs to draw two cards of the same suit in order to win. Of
the 52 cards, there are 13 cards in each suit. Suppose first the player draws a heart. Now the
player wishes to draw a second heart. Since one heart has already been chosen, there are now
12 hearts remaining in a deck of 51 cards. So the conditional probability P(Draw second
heart|First card a heart) = 12/51.
2. A dice is rolled.
1. Find the probability that the number rolled is a six, given that it is even.
2. Find the probability that the number rolled is even, given that it is a six.
1. 1/3
2. 1
Even nos{2,4,6}
P(Even) = 3/6
P(six) = 1/6
=(1/6)/(1/6) = 1
For example, if you know P(A/B) and P(B) we can calculate P(A∩B)
Example,
Suppose an individual applying to an Ivy college determines that he has an 80% chance of being
accepted, and he knows that dormitory housing will only be provided for 60% of all of the accepted
students. The chance of the student being accepted and receiving dormitory housing is defined by
P(Accepted and Dormitory Housing) = P(Dormitory Housing | Accepted)P(Accepted) = (0.60)*(0.80) =
0.48.
Take a challenge
In a factory there are 100 units of iron bearings, 8 of which are defective. We pick three units from
the 100 units at random, What is the probability that none of them are defective?
we have to calculate
P(A1∩A2∩A3)
P(A2/A1) = 91/99
Hence,
=0.7766
Problems
1, A survey was undertaken to understand the marriage age pattern during their first marriage in India
by a sociologist. Out of the 960 individuals, there were 592 women. Of the 592 women, 183 had been
married in their teens, 106 had been married in their twenties, 224 had been married in their thirties
and the remaining in their forties. Among men, 98 had been married in their teens, 159 in their
twenties, 60 in their thirties and the remaining in their forties. Suppose a person is selected at random.
1. Find the probability that the individual selected was a teenager at first marriage.
2. Find the probability that the individual selected was a teenager at first marriage, given that the
person is male
Solutions:
When encountering problems like this, it is always advised to convert the given data in a table format.
First Marriage
Teens Twenties Thirties Forties Total
Women 183 106 224 79 592
Men 98 159 60 51 368
Total 281 265 284 130 960
1. The probability that the individual selected was a teenager at first marriage
= 281/960= 0.2927
2. The probability that the individual selected was a teenager at first marriage, given that the person is
male
2. Suppose that in an adult population the proportion of people who are both obese and diabetic
is 0.09; the proportion of people who are not obese, but suffer diabetic is 0.11; the proportion of people
who are obese but do not suffer diabetic is 0.02; and the proportion of people who are neither obese
nor suffer diabetic is 0.78. An adult is randomly selected from this population.
1. Find the probability that the person selected suffers diabetes given that he is obese.
2. Find the probability that the selected person suffers diabetic given that he is not obese.
3. Compare the two probabilities just found to give an answer to the question as to whether obese
people tend to suffer from diabetes.
Let D denote the event “the person selected suffers diabetic.” Let O denote the event “the person
selected is obese.” The probability information given in the problem is put in a table.
1. The probability that the person selected suffers diabetes given that he is obese is written as,
P(D/O)= P(D∩O)/P(O)
It is a little difficult to calculate P(O) from the table unless you know the formula, rather We can use a
Venn diagram to show this data and make calculations easy.
P(D/O)= P(D∩O)/P(O)
= 0.0.9/0.11 = 0.8182
By formula,
P(D|O)=P(D∩O)/P(O)=0.09/0.11=0.818
2. The probability that the selected person suffers diabetic given that he is not obese is written as
= 0.11/1-0.11
= 0.11/ 0.89
= 0.1234.
P(D|Oc)=0.1234, which indicates a much higher rate of diabetic among people who are obese than
among people who are not obese.
Problem: 3
Answer:
P(B) = 5/8
P(AUB) = ¾.
P(AB)=P(A)+P(B) – P(A ∪ B)
= 3/8+5/8-3/4 = ¼
= ¼ / 5/8 = 2/5
= ¼ / 3/8 = 2/3
2. Your neighbor has 2 children. You learn that he has a son, Shiva. What is the probability that Shiva’s
sibling is a sister?
3. Your neighbor has 2 children. You learn that he has a daughter Shivani. What is the probability that
Shivani’s sibling is a sister?
2. Is P(A/B)= P(B/A). Prove that with an example. Assume all necessary details.
3. If P(A)= 5/14, P( B)=4/14, P(AUB)= 5/14, Are A and B independent? Explain in detail.