Unit 3 - Probability
Unit 3 - Probability
Unit 3
Probability
The word probability or chance is a very commonly used word.
• Probably it may rain tomorrow
• The Chances that RCB winning the IPL title
In all the example there is an uncertainty about the event happening.
Probability is associated with uncertain events or situations.
Probability is one the important tools in statistics
It helps in providing directions when there is random variations.
Probability theory can be applied to various socio-economic problems .
The importance lies in coping up the uncertainties and helps in predictions.
Probability is the foundation of statistical inference
Probability or chance is how likely something is to happen:
▪ If something has a low probability, it is unlikely to happen.
▪ If something has a high probability, it is likely to happen.
Probability is a numerical measure of the likelihood that an event will occur.
Probability can be expressed as fractions, decimals or percentages, or on a probability scale.
Probability values are always assigned on a scale from 0 to 1.
▪ A probability near zero indicates an event is quite unlikely to occur.
▪ A probability near one indicates an event is almost certain to occur.
Some important concepts
❑ Random experiment
• An experiment that has two or more outcomes which vary in an unpredictable manner
from trail to trail when conducted under uniform conditions.
• The tossing of a coin is random experiment, since it has two specific outcomes –Head
and Tail.
Experiment Experiment Outcomes
• Features of a random experiment
Toss a coin Head, tail
a) It has more than one outcome Inspection a part Defective, non-defective
b) The outcome are unpredictable Conduct a sales call Purchase, no purchase
Roll a die 1, 2, 3, 4, 5, 6
c) The experiment is repeatable
Play a football game Win, lose, tie
• Tossing a coin, rolling a die
❑ Event
• The outcome of an experiment is termed as events or cases.
• Getting a head or a tail in tossing a coin is an event.
• Usually events are denoted by capital letters like A, B, C, etc.
❑ Sample Space
• The set containing of all the events of that random experiment.
• When a coin tossed the sample space is [H, T]
• When two coins are tossed the sample space is [HH, HT, TH, TT]
❑ Equally likely events
o Two events are said be equally likely if any one of them cannot be expected to occur in
preference to other.
o Getting Head and Getting Tail when a coin is tossed.
1/8
3/8
2 3/8
1/8
4 0
X P(x) x p(x)
0 1/8 0
1 3/8 3/8
2 3/8 6/8
3 1/8 3/8
Discrete and continuous probability distribution
A probability distribution may be discrete and continuous.
A r.v is said to be discrete if it assumes only certain specified values in a given interval.
For example, in tossing two coins, the number of heads obtainable can be 0,1,2.
The variable takes values 0,1,2. This is a discrete variable.
The probability distribution of a discrete variable is a discrete probability distribution.
Binomial distribution, Poisson distribution are discrete probability distrbutions.
If a r.v is assuming any real value in a given interval say (a, b), then the variable is said to
be continuous.
The probability distribution of a continuous variable is continuous probability distribution.
Normal distribution.
Discrete Probability Distribution
1) Binomial Distribution (BD)
Consider an event associated to a random experiment.
When the random experiment is repeated a number of times, the event may or may not
occur in each of those experiments.
The occurrence of an event termed as “success” and the non-occurrence of an event is
termed as “failure”. Therefore random experiment has only two possible outcomes
“success” and “failure”.
Normal Distribution
A symmetrical probability distribution.
Most results are located in the middle and few are spread on both sides.
Has the shape of a bell.
Can entirely be described by its mean and standard deviation.
Normality is an important assumption when conducting statistical analysis so that they can
be applied in the right manner.
A symmetrical probability distribution where most results are located in the middle and few
are spread on both sides
Examples:
• The body temperature for healthy humans.
• The heights and weights of adults.
• The thickness and dimensions of a product.
• IQ and standardized test scores.
• Quality control test results.
• Errors in measurements.
Why ND
Used to illustrate the shape and variability of the data.
Used to estimate future process performance.
Normality is an important assumption when conducting statistical analysis.
Certain charts and many statistical inference tests require the data to be normally
distributed.
10) Area under the normal curve is distributed as follows;
▪ 68% of the data fall within 1 standard deviation of the mean.
▪ 95% of the data fall within 2 standard deviations of the mean.
▪ 99.7% of the data fall within 3 standard deviations of the mean.