Module 2 Notes-1
Module 2 Notes-1
Bernoulli Distribution: A random variable X has the Bernoulli distribution with parameter p(0 ≤p
≤1) if X can take only the values 0 and 1 and the probabilities are
Bernoulli Trials/Process. If the random variables in a finite or infinite sequence X1, X2,...are i, and if
each random variable Xi has the Bernoulli distribution with parameter p, then it is said that
X1,X2,...are Bernoulli trials with parameter p. An infinite sequence of Bernoulli trials is also called
a Bernoulli process.
Binomial Distribution: A random variable X has the binomial distribution with parameters n and p
if X has a discrete distribution for which the p.f. is as follows:
In this distribution, n must be a positive integer, and p must lie in the interval 0 ≤p≤1.
Poisson Distribution: Let λ>0. A random variable X has the Poisson distribution with mean λ if the
p.f. of X is as follows:
Theorem: Mean. The mean of the distribution with p.f. is λ.
Variance. The variance of the Poisson distribution with mean λ is also λ
Moment Generating Function. The m.g.f. of the Poisson distribution with mean λ is
ψ(t)=eλ(et−1) for all real t.
If the random variables X1,...,Xk are independent and if Xi has the Poisson distribution
with mean λi (i=1,...,k), thenthesumX1+...+Xk has the Poisson distribution with mean λ1+...+λk.
Poisson Process: A Poisson process with rate λ per unit time is a process that satisfies the following
two properties:
i. The number of arrival sine very fixed interval of time of length t has the Poisson
distribution with mean λt.
ii. The numbers of arrivals in every collection of disjoint time intervals are independent.
Chi-Square Distributions
χ2 Distributions. For each positive number m, the gamma distribution with parameters α =m/2andβ
=1/2 is called the χ2 distribution with m degrees of freedom.
t Distributions
t Distributions: Consider two independent random variables Y and Z, such that Y has the χ2
distribution with m degrees of freedom and Z has the standard normal distribution. Suppose that a
random variable X is defined by the equation
Properties
Probability Density Function. The p.d.f. of the t distribution with m degrees of freedom is
2.2 Joint distribution of the Sample Mean and Sample Variance
Orthogonal Matrices
Orthogonal Matrix. It is said that an n × n matrix Ais orthogonal if ATA= I, where AT is the transpose
of A.
Q2. A retail company wants to rate the quality of a product by using results from a customer
satisfaction survey. In the survey, the company asks respondents to rate the quality on a scale of one
to five, with one being the poorest quality and five being the highest quality.
The sample size is 25, the sample mean is 4.5 and the standard deviation is 2.5. Calculate the
confidence interval assuming a 97% confidence level:(For a 97% confidence level, Z≈2.17).
Q3. Green Life, a new natural wellness brand, wants to measure its most popular product according
to customer ratings. The business uses a 10-point scale and asks a sample of 64 customers to rate their
favorite products, with one representing the least popular product and 10 representing the most
popular product. If Green Life wants to find out the popularity of a particular item in the customer
survey, sales teams can find the confidence interval. With a 98% confidence level, a sample mean of
8.5 and a standard deviation of 4.75, calculate the confidence interval:(For a 98% confidence level,
Z≈2.33)
Here, P(A) = how likely A happens(Prior knowledge)- The probability of a hypothesis is true before
any evidence is present.
P(B) = how likely B happens(Marginalization)- The probability of observing the evidence.
P(A/B) = how likely A happens given that B has happened(Posterior)-The probability of a hypothesis
is true given the evidence.
P(B/A) = how likely B happens given that A has happened(Likelihood)- The probability of seeing the
evidence if the hypothesis is true.
.
Terms Related to Bayes Theorem
As we have studied about Bayes theorem in detail, let us understand the meanings of a few terms
related to the concept which have been used in the Bayes theorem formula and derivation:
● Conditional Probability - Conditional Probability is the probability of an event A based on
the occurrence of another event B. It is denoted by P(A|B) and represents the probability of A
given that event B has already happened.
● Joint Probability - Joint probability measures the probability of two more events occurring
together and at the same time. For two events A and B, it is denoted by P(A∩B).
● Random Variables - Random variable is a real-valued variable whose possible values are
determined by a random experiment. The probability of such variables is also called
the experimental probability.
● Posterior Probability - Posterior probability is the probability of an event that is calculated
after all the information related to the event has been accounted for. It is also known as
conditional probability.
● Prior Probability - Prior probability is the probability of an event that is calculated before
considering the new information obtained. It is the probability of an outcome that is
determined based on current knowledge before the experiment is performed.
Important Notes on Bayes Theorem
● Bayes theorem is used to determine conditional probability.
● When two events A and B are independent, P(A|B) = P(A) and P(B|A) = P(B)
● Conditional probability can be calculated using the Bayes theorem for continuous random
variables.
Problems based on Bayes Theorem
Example 1: Amy has two bags. Bag I has 7 red and 4 blue balls and bag II has 5 red and 9 blue balls.
Amy draws a ball at random and it turns out to be red. Determine the probability that the ball was
from the bag I using the Bayes theorem.
Solution: Let X and Y be the events that the ball is from the bag I and bag II, respectively. Assume A
to be the event of drawing a red ball. We know that the probability of choosing a bag for drawing a
ball is 1/2, that is,P(X) = P(Y) = 1/2
Since there are 7 red balls out of a total of 11 balls in the bag I, therefore, P(drawing a red ball from
the bag I) = P(A|X) = 7/11
Similarly, P(drawing a red ball from bag II) = P(A|Y) = 5/14
We need to determine the value of P(the ball drawn is from the bag I given that it is a red ball), that
is, P(X|A). To determine this we will use Bayes Theorem. Using Bayes theorem, we have the
following:
= [((7/11)(1/2))/(7/11)(1/2)+(5/14)(1/2)]
= 0.64
Answer: Hence, the probability that the ball is drawn is from bag I is 0.64
Example 2: Assume that the chances of a person having a skin disease are 40%. Assuming that skin
creams and drinking enough water reduces the risk of skin disease by 30% and prescription of a
certain drug reduces its chance by 20%. At a time, a patient can choose any one of the two options
with equal probabilities. It is given that after picking one of the options, the patient selected at random
has the skin disease. Find the probability that the patient picked the option of skin screams and
drinking enough water using the Bayes theorem.
Solution: Assume E1: The patient uses skin creams and drinks enough water; E2: The patient uses
the drug; A: The selected patient has the skin disease
P(E1) = P(E2) = 1/2
Using the probabilities known to us, we have
P(A|E1) = 0.4 × (1-0.3) = 0.28
P(A|E2) = 0.4 × (1-0.2) = 0.32
Using Bayes Theorem, the probability that the selected patient uses skin creams and drinks enough
water is given by,
Example 3: A man is known to speak the truth 3/4 times. He draws a card and reports it is king. Find
the probability that it is actually a king.
Solution:
Answer: Thus the probability that the drawn card is actually a king = 0.5
Null Hypothesis
The null hypothesis is a concise mathematical statement that is used to indicate that there is no
difference between two possibilities. In other words, there is no difference between certain
characteristics of data. This hypothesis assumes that the outcomes of an experiment are based on
chance alone. It is denoted as H0. Hypothesis testing is used to conclude if the null hypothesis can be
rejected or not. Suppose an experiment is conducted to check if girls are shorter than boys at the age
of 5. The null hypothesis will say that they are the same height.
Alternative Hypothesis
The alternative hypothesis is an alternative to the null hypothesis. It is used to show that the
observations of an experiment are due to some real effect. It indicates that there is a statistical
significance between two possible outcomes and can be denoted as H1 or Ha. For the above-
mentioned example, the alternative hypothesis would be that girls are shorter than boys at the age of
5
Define the concept of F-test statistic, with respect to large and small size samples. Also,
list the F-test expression for left tailed, right tailed and two tailed tests.
F Statistic
The f test statistic or simply the f statistic is a value that is compared with the critical value to
check if the null hypothesis should be rejected or not. The f test statistic formula is given below:
F Test Formula
The f test is used to check the equality of variances using hypothesis testing. The f test formula for
different hypothesis tests is given as follows:
2.6 F-Distribution
The F-distribution, also known Fisher-Snedecor distribution, is extensively used to test for
equality of variances from two normal populations.
Additionally, the f-distribution is the ratio of the X1 random chi-square variable with degrees
of freedom ϑ1 and the X2 random chi-square variable with degrees of freedom ϑ2. In other
words, each Chi-Square random variable has been divided by its degrees of freedom.
F-test is to determine whether the two independent estimates of population variance differ
significantly. In this case, F-ratio is:
or
To find out whether the two samples drawn from the normal population have the same variance.
In this case, F-ratio is:
One way to decide between H0 and H1 is to compare P(H0|Y=y) and P(H1|Y=y), and accept
the hypothesis with the higher posterior probability. This is the idea behind the maximum a
posteriori (MAP) test. Here, since we are choosing the hypothesis with the highest probability,
it is relatively easy to show that the error probability is minimized.
To be more specific, according to the MAP test, we choose H0 if and only if
Note that as always, we use the PMF instead of the PDF if Y is a discrete random variable. We
can generalize the MAP test to the case where you have more than two hypotheses. In that case,
again we choose the hypothesis with the highest posterior probability.
Problems
1. An Olympic pistol shooter has a 2/3 chance of hitting the target at each shot. Find the
probability of:
(i) Hitting 10 targets in his first 10 shots
(ii) Hitting exactly 5 targets out of 15
(iii) Hitting exactly 10 targets out of 15
2. The Everlasting Lightbulb company produces light bulbs, which are packaged in boxes
of 20 for shipment. Tests have shown that 4% of their light bulbs are defective.
(a) What is the probability that a box, ready for shipment, contains exactly 3 defective
light bulbs?
(b) What is the probability that the box contains 3 or more defective light bulbs?
(c) Expected value of defective bulbs
(d) Standard deviation of defective bulbs
3. Suppose that on a given weekend the number of accidents at a certain intersection has
the Poisson distribution with mean 0.7. What is the probability that there will be at least
three accidents at the intersection during the weekend?
5. Amy has two bags. Bag I have 7 red and 4 blue balls and bag II has 5 red and 9 blue
balls. Amy draws a ball at random and it turns out to be red. Determine the probability
that the ball was from the bag I using the Bayes theorem.
6. A retail company wants to rate the quality of a product by using results from a customer
satisfaction survey. In the survey, the company asks respondents to rate the quality on
a scale of one to five, with one being the poorest quality and five being the highest
quality. The sample size is 25, the sample mean is 4.5 and the standard deviation is 2.5.
Calculate the confidence interval assuming a 97% confidence level(For a 97%
confidence level, Z≈2.17)