Prob and Stat Notes
Prob and Stat Notes
401237
Chapter 1: Data and Statistics
Al Albayt University
Data are the facts and figures collected, analyzed, and summarized for
presentation and interpretation.
Ordinal the categories follow a natural order and the order makes sense.
For example: education level, rating scale and medals (gold, sliver,
bronze).
Statistical Studies
Sometimes the data needed for a particular application are not
available through existing sources. In such cases, the data can often be
obtained by conducting a statistical study either experimental or
observational.
Bar Graphs
A bar graph, or bar chart, is a graphical device for depicting qualitative
data summarized in a frequency, relative frequency, or percent
frequency distribution. On one axis of the graph (usually the horizontal
axis), we specify the labels that are used for the classes (categories). A
frequency, relative frequency, or percent frequency scale can be used
for the other axis of the graph (usually the vertical axis). Then, using a
bar of fixed width drawn above each class label, we extend the length
of the bar until we reach the frequency, relative frequency, or percent
frequency of the class. For qualitative data, the bars should be
separated to emphasize the fact that each class is separate.
Pie Chart
The pie chart provides another graphical device for presenting relative
frequency and percent frequency distributions for qualitative data. To
construct a pie chart, we first draw a circle to represent all of the data.
Then we use the relative frequencies to subdivide the circle into sectors,
or parts, that correspond to the relative frequency for each class. Such
that
Example: Assume that the data set below show the soft drink selected in a
sample of 50 soft drink purchases. {Coke Classic, Sprite, Pepsi, Diet Coke,
Coke Classic, Coke Classic, Pepsi, Diet Coke, Coke Classic, Diet Coke, Coke
Classic, Coke Classic, Coke Classic, Diet Coke, Pepsi, Coke Classic, Coke
Classic, Dr. Pepper, Dr. Pepper, Sprite, Coke Classic, Diet Coke, Pepsi,
Diet Coke, Pepsi, Coke Classic, Pepsi, Pepsi, Coke Classic, Pepsi, Coke
Classic, Coke Classic, Pepsi, Dr. Pepper, Pepsi, Pepsi, Sprite, Coke Classic,
Coke Classic, Coke Classic, Sprite, Dr. Pepper, Diet Coke, Dr. Pepper,
Pepsi, Coke Classic, Pepsi, Sprite, Coke Classic, Diet Coke}
Bar Graph
Pie Chart
Al Albayt University
Dot Plot.
A horizontal axis shows the range for the data. Each data value is
represented by a dot placed above the axis.
Histogram.
A histogram is constructed by placing the variable of interest on the
horizontal axis and the frequency, relative frequency, or percent
frequency on the vertical axis. The frequency, relative frequency, or
percent frequency of each class is shown by drawing a rectangle whose
base is determined by the class limits on the horizontal axis and whose
height is the corresponding frequency, relative frequency, or percent
frequency.
Cumulative Distributions.
The cumulative frequency distribution uses the number of classes, class
widths, and class limits developed for the frequency distribution. But,
rather than showing the frequency of each class, the cumulative
frequency distribution shows the number of data items with values less
than or equal to the upper class limit of each class.
Ogive.
The ogive is constructed by plotting a point corresponding to the
cumulative frequency, such that, the horizontal axis and either the
cumulative frequencies, the cumulative relative frequencies, or the
cumulative percent frequencies on the vertical axis.
Time Frequency
(days)
10 -14 4
15 - 19 8
20 - 24 5
25 - 29 2
30 - 34 1
Total 20
Dot Plot
Histogram
Ogive
Al Albayt University
Mean
The mean provides a measure of central location for the data. If the
data are for a sample, the mean is denoted by x̄ ; if the data are for a
population, the mean is denoted by the Greek letter µ. For a sample
with n observations, the formula for the sample mean is as follows.
∑ni=1 xi
X̄ =
n
The formula for the population mean is as follows.
∑N
i =1 xi
µ=
N
Median
The median is another measure of central location, which is the value
in the middle when the data are arranged in ascending order (smallest
value to largest value). With an odd number of observations, the
median is the middle value. An even number of observations has no
single middle value. In this case, we follow convention and define the
median as the average of the values for the middle two observations.
Steps to find the median M of n observations
1 Arranged the observations from the smallest value to largest value.
2 If n is an odd number, then M is the observation at position (n + 1)/2.
3 If n is an even number, then M is the average of the two values on
either side of position (n + 1)/2.
{46 54 42 46 32}
1 32 42 46 46 54
2 n = 5 which is an odd number
5+1
3 M is the observation at position 2 = 3 . Then M = 46.
1 3310 3355 3450 3480 3480 3490 3520 3540 3550 3650 3730 3925
2 n = 12 which is an even number
M is average of the two values on either side of position 1212+1 = 6.5 .
3
Mode
The mode is the value that occurs with greatest frequency. Situations
can arise for which the greatest frequency occurs at two or more
different values. In these instances more than one mode exists.
1 If the data contain exactly two modes, we say that the data are
Bimodal.
2 If data contain more than two modes, we say that the data are
Multimodal.
Example: Consider the previous data {46 54 42 46 32}. Then the
Mode is 46.
Percentiles
The p − th percentile is a value such that at least p percent of the
observations are less than or equal to this value and at least (100 − p )
percent of the observations are greater than or equal to this value.
Steps to calculate the p − th percentile
1 Arranged the observations from the smallest value to largest value.
2 Compute an index i p
i= n
100
where p is the percentile of interest and n is the number of observations.
3 If i is not an integer, round up. The next integer greater than i denotes
the position of the p − th percentile.
4 If i is an integer, the p − th percentile is the average of the values in
positions i and i + 1.
{46 54 42 46 32}
1 32 42 46 46 54
35
2 i = 100 5 = 1.75 which is not an integer, round it up 2 then the
35 − th percentile is 42.
1 3310 3355 3450 3480 3480 3490 3520 3540 3550 3650 3730 3925
85
2 i = 100 12 = 10.2 which is not an integer, round it up 11 then the
85 − th percentile is 3730.
25
1 For Q1 we find the 25 − th percentile i = 100 12 = 3 which is an
integer, then the 25 − th percentile is
3450 + 3480
= 3465.
2
50
2 For Q2 we find the 50 − th percentile i = 100 12 = 6 which is an
integer, then the 50 − th percentile is
3490 + 3520
= 3505.
2
75
3 For Q3 we find the 75 − th percentile i = 100 12 = 9 which is an
integer, then the 75 − th percentile is
3550 + 3650
= 3600.
2
Dr. Ayat Al-Meanazel (A.A.U) 36 / 190
3.2 Measures of Variability
Range.
The simplest measure of variability is the range.
Example: Let us consider the following class size data for a sample of
five college classes. {46 54 42 46 32}
Range = 54 − 32 = 22
{3450 3490 3550 3730 3650 3540 3480 3925 3355 3520 3310 3480}
Interquartile Range
A measure of variability that overcomes the dependency on extreme
values is the interquartile range (IQR). This measure of variability is
the difference between the third quartile, Q3 , and the first quartile, Q1 .
In other words, the interquartile range is the range for the middle 50%
of the data.
IQR = Q3 − Q1
Example: The quartiles for
{3310 3355 3450 3480 3480 3490 3520 3540 3550 3650 3730 3925}
are Q1 = 3465, Q2 = 3505 and Q3 = 3600. Then,
IQR = Q3 − Q1
= 3600 − 3465
= 135
Dr. Ayat Al-Meanazel (A.A.U) 38 / 190
3.2 Measures of Variability
Variance
The variance is a measure of variability that utilizes all the data. The
variance is based on the difference between the value of each
observation (xi ) and the mean.The difference between each xi and the
mean (for a sample x̄, µ for a population) is called a deviation about
the mean. The variance is given in the following way
∑ni=1 (xi − x̄ )2
Sample variance = s 2 =
n−1
N
∑ (xi − µ)2
Population variance = σ2 = i =1
N
Standard Deviation
The standard deviation is defined to be the positive square root of the
variance. Following the notation we adopted for a sample variance and
a population variance, we use s to denote the sample standard
deviation and σ to denote the population standard deviation. The
standard deviation is derived from the variance in the following way
√
Sample standard deviation = s = s 2
√
Population standard deviation = σ = σ2
∑12
i =1 (xi − x̄ )
2 301, 850
s2 = = = 27, 440.91
12 − 1 11
√ √
s = s 2 = 27, 440.91 = 165.65
Box Plot
A box plot is a graphical summary of data that is based on a
five-number summary. A key to the development of a box plot is the
computation of the median and the quartiles, Q1 and Q3 . The
interquartile range, IQR = Q3 − Q1 , is also used. The steps used to
construct the box plot follow.
1 A box is drawn with the ends of the box located at the Q1 and Q3 .
2 A vertical line is drawn in the box at the location of the median (i.e.
Q2 ).
3 By using the interquartile range IQR, limits are located. The limits for
the box plot are the lower fence [LF = Q1 − 1.5(IQR )] and the
upper fence [UF = Q3 + 1.5(IQR )]. Data outside these limits are
considered outliers.
4 Whiskers are dashed lines drawn from the ends of the box to the
smallest and largest values inside the upper and lower fence in Step 3.
5 Finally, the location of each outlier is shown with the symbol ∗.
Al Albayt University
|S | = (2)(2)(2) = 8
|S | = (6)(6) = 36
|S | = (5)(10) = 50
Combinations
A second useful counting rule allows one to count the number of
experimental outcomes when the experiment involves selecting n
objects from a set of N objects. The number of combinations of N
objects taken n at a time is
N N N!
Cn = =
n n!(N − n )!
where
N! = (N )(N − 1)(N − 2)(N − 3) . . . (2)(1)
n! = (n )(n − 1)(n − 2)(n − 3) . . . (2)(1)
0! = 1
Example
1 How many different groups of three letters can be made from these
letters {A, B, C , D, E }?
5 5 5!
| S | = C3 = = = 10
3 3!(5 − 3)!
2 In how many ways we can select six integers from a group of 53?
53 53 53!
|S | = C6 = = = 22, 957, 480
6 6!(53 − 6)!
Permutations
A third counting rule that is sometimes useful is the counting rule for
permutations. It allows one to compute the number of experimental
outcomes when n objects are to be selected from a set of N objects
where the order of selection i The number of combinations of N
objects taken n at a time is important. The same n objects selected in
a different order are considered a different experimental outcome. The
number of permutations of N objects taken n at a time is given by
N!
PnN =
(N − n ) !
Examples
1 How many different groups of three letters can be made from these
letters {A, B, C , D, E }, if no letter is used more than once?
5!
|S | = P35 = = 60
(5 − 3) !
2 In how many ways can a president, a treasurer and a secretary be
chosen from among 7 candidates?
7!
|S | = P37 = = 210
(7 − 3) !
Classical Method.
This method is appropriate when all the experimental outcomes are
equally likely. If n experimental outcomes are possible, a probability of
1/n is assigned to each experimental outcome.
Relative Frequency Method.
This method is appropriate when data are available to estimate the
proportion of the time the experimental outcome will occur if the
experiment is repeated a large number of times.
Subjective Method.
This method is most appropriate when one cannot realistically assume
that the experimental outcomes are equally likely and when little
relevant data are available.
Example Four men visit a city with 5 hotels. what is the probability that
1 each one select any hotel to stay in.[Answer: 1]
2 each one stay in different hotel.[Answer: 0.1920]
Complement of an Event
Given an event A, the complement of A is defined to be the event
consisting of all sample points that are not in A. The complement of A
is denoted by Ac .
In any probability application, either event A or its complement Ac
must occur. Therefore, we have
P (A) + P (Ac ) = 1
Union of Events
Given two events A and B, the union of A and B is the event
containing all sample points belonging to A or B or both. The union is
denoted by A ∪ B.
Intersection of Events
Given two events A and B, the intersection of A and B is the event
containing the sample points belonging to both A and B. The
intersection is denoted by A ∩ B.
Addition Law
Given two events A and B, the addition law is helpful when we are
interested in knowing the probability that event A or event B or both
occur. The addition law is written as follows.
P (A ∪ B ) = P (A) + P (B ) − P (A ∩ B )
P ( A ) = P ( E1 ) + P ( E4 ) + P ( E6 )
= 0.05 + 0.25 + 0.10
= 0.40
P ( B ) = P ( E2 ) + P ( E4 ) + P ( E7 )
= 0.20 + 0.25 + 0.05
= 0.50
P ( C ) = P ( E2 ) + P ( E3 ) + P ( E5 ) + P ( E7 )
= 0.20 + 0.20 + 0.15 + 0.05
= 0.60
P (A ∪ B ) = P (A) + P (B ) − P (A ∩ B )
= 0.40 + 0.50 − 0.25
= 0.65
A ∪ B = { E1 , E2 , E 4 , E 6 , E 7 }
P ( A ∪ B ) = P ( E1 ) + P ( E2 ) + P ( E4 ) + P ( E6 ) + P ( E7 )
= 0.05 + 0.20 + 0.25 + 0.10 + 0.05
= 0.65
Dr. Ayat Al-Meanazel (A.A.U) 69 / 190
4.3 Some Basic Relationships of Probability
P ( A ) = P ( E1 ) + P ( E2 )
= 0.20 + 0.20
= 0.40
P ( B ) = P ( E3 ) + P ( E4 )
= 0.20 + 0.20
= 0.40
P ( C ) = P ( E2 ) + P ( E3 ) + P ( E 5 )
= 0.20 + 0.20 + 0.20
= 0.60
Find P (A ∩ B ).
Event A and B are mutually exclusive since A ∩ B = ∅ then
P (A ∩ B ) = 0
Find P (A ∪ B ).
P (A ∪ B ) = P (A) + P (B )
= 0.40 + 0.40
= 0.80
A ∪ B = { E1 , E2 , E3 , E4 }
P ( A ∪ B ) = P ( E1 ) + P ( E2 ) + P ( E 3 ) + P ( E4 )
= 0.20 + 0.20 + 0.20 + 0.20
= 0.80
Find P (C c ).
P (C ) + P (C c ) = 1
P (C c ) = 1 − P (C )
= 1 − 0.60
= 0.40
C c = { E1 , E4 }
P ( C c ) = P ( E1 ) + P ( E4 )
= 0.20 + 0.20
= 0.40
Conditional Probability
Suppose we have an event A with probability P (A). If we obtain new
information and learn that a related event, denoted by B, already occurred.
Then the conditional probability of A given B can be computed as
P (A ∩ B )
P (A/B ) =
P (B )
Example: Assume that we have two events, A and B, that are independent.
Assume further that we know P (A) = 0.30 and P (B ) = 0.40.
Find P (A ∩ B ).
Find P (B/A).
P (B/A) = P (B ) = 0.40
Find P (A ∪ B ).
P (A ∪ B ) = P (A) + P (B ) − P (A ∩ B )
= 0.30 + 0.40 − 0.12 = 0.58
Example Continued
Find P (A ∩ B c ).
P (A ∩ B c ) = P (A) − P (A ∩ B )
= 0.30 − 0.12 = 0.18
Find P (B/Ac ).
P (B ∩ Ac )
P (B/Ac ) =
P (Ac )
P (B ) − P (B ∩ A)
=
1 − P (A)
0.40 − 0.12 0.28
= = = 0.40
1 − 0.30 0.70
a man.
960
P (M ) = = 0.80
1200
a woman and is promoted.
36
P (W ∩ P ) = = 0.03
1200
promoted given that the officer is a man.
P (P ∩ M )
P (P/M ) =
P (M )
288/1200
=
960/1200
288
= = 0.30
960
P (W ∪ P c ) = P (W ) + P (P c ) − P (W ∩ P c )
240 876 204
= + −
1200 1200 1200
912
= = 0.76
1200
Al Albayt University
Example: Consider the sales of used cars. Over the past 300 days of
operation, sales data show 54 days with no cars sold, 117 days with 1 car
sold, 72 days with 2 cars sold, 42 days with 3 cars sold, 12 days with 4 cars
sold, and 3 days with 5 cars sold. Suppose we consider the experiment of
Example: Suppose that for the experiment of rolling a die we define the
random variable x to be the number of dots on the upward face. For this
experiment, x = 1, 2, 3, 4, 5, 6. Thus, the probability function for this
discrete uniform random variable is
1
f (x ) = for all x = 1, 2, 3, 4, 5, 6
6
E (x ) = µ = ∑ xf (x )
Variance
The variance summarize the variability in the values of a random
variable. The formula for the variance of a discrete random variable
follows.
Var (x ) = σ2 = ∑(x − µ)2 f (x )
The standard deviation, , is defined as the positive square root of the
variance. Thus, the standard deviation for the number of automobiles
sold during a day is √
σ = σ2
Dr. Ayat Al-Meanazel (A.A.U) 90 / 190
5.3 Expected Value and Variance
Example: Consider the sales of used cars. Over the past 300 days of
operation, sales data show 54 days with no cars sold, 117 days with 1 car
sold, 72 days with 2 cars sold, 42 days with 3 cars sold, 12 days with 4 cars
sold, and 3 days with 5 cars sold. Suppose we consider the experiment of
x x −µ (x − µ )2 f(x) (x − µ )2 f (x )
0 0-1.50=-1.50 2.25 0.18 2.25(0.18)=0.4050
1 1-1.50=-0.50 0.25 0.39 0.25(0.39)=0.0975
2 2-1.50=0.50 0.25 0.24 0.25(0.24)=0.0600
3 3-1.50=1.50 2.25 0.14 2.25(0.14)=0.3150
4 4-1.50=2.50 6.25 0.04 6.25(0.04)=0.2500
5 5-1.50=3.50 12.25 0.01 12.25(0.01)=0.1225
Total 1.2500
σ2 = ∑
√
(x − µ)2 f (x ) = 1.25
√
σ= σ2 = 1.25 = 1.118
Find the expected number of service calls, the variance and the standard
deviation in the number of service calls.
x f(x) xf(x)
0 0.10 0(0.10)=0.00
1 0.15 1(0.15)=0.15
2 0.30 2(0.30)=0.60
3 0.20 3(0.20)=0.60
4 0.15 4(0.15)=0.60
5 0.10 5(0.10)=0.50
Total 2.45
µ= ∑ xf (x ) = 2.45
x x −µ (x − µ )2 f(x) (x − µ )2 f (x )
0 0-2.45=-2.45 6.0025 0.10 6.0025(0.10)=0.60025
1 1-2.45=-1.45 2.1025 0.15 2.1025(0.15)=0.31538
2 2-2.45=-0.45 0.2025 0.30 0.2025(0.30)=0.06075
3 3-2.45=0.55 0.3025 0.20 0.3025(0.20)=0.06050
4 4-2.45=1.55 2.4025 0.15 2.4025(0.15)=0.36038
5 5-2.45=2.55 6.5025 0.10 6.5025(0.10)=0.65025
Total 2.04751
σ2 = ∑
√
(x − µ)2 f (x ) = 2.04751
√
σ= σ2 = 2.04751 = 1.43091
Example
Check if the following experiment satisfies the conditions of a binomial
experiment,
You have a box with 4 red balls and 5 green balls. You select a ball, note its
colour and return it to the box. This is done five times. We note the
number of red balls selected.
Let’s check the four conditions:
1 We have a fixed number of trials, n = 5.
2 On each trial we either select a red ball (success) or a green ball
(failure).
3 As we return the ball after each trial, the trials are independent.
4 Each time the probability of success is the same; since
p = P (success ) = P (selectred ) = 49 .
Then the experiment is a binomial experiment.
Dr. Ayat Al-Meanazel (A.A.U) 97 / 190
5.4 Binomial Probability Distribution
Example
Check if the following experiment satisfies the conditions of a binomial
experiment,
You have 7 different classrooms. You will select one student from each
classroom. We note the number of female students selected.
Let’s check the four conditions:
1 We have a fixed number of trials, n = 7.
2 On each trial we either select a female student (success) or a male
student (failure).
3 As we select from different classroom each trial, the trials are
independent.
4 Each time the probability of success is not the same; since the
probability of selecting a female student is different for each classroom.
Then the experiment is not a binomial experiment.
Dr. Ayat Al-Meanazel (A.A.U) 98 / 190
5.4 Binomial Probability Distribution
where
f (r ) = the probability of r successes in n trials
n = the number of trials
(nr) = n!
n Cr = r ! ( n − r ) !
p = the probability of a success on any one trial
1−p = the probability of a failure on any one trial
E (x ) = µ = np
Var (x ) = σ2 = np (1 − p )
Example
In San Francisco, 30% of workers take public transportation daily. In a
sample of 10 workers, what is the probability that
(a) exactly three workers take public transportation daily?
(b) at least three workers take public transportation daily?
Example
Twenty-three percent of automobiles are not covered by insurance. On a
particular weekend, 35 automobiles are involved in traffic accidents.
(a) What is the expected number of these automobiles that are not
covered by insurance?
(b) What is the variance and standard deviation?
Let X be the total number of automobiles that are not covered by
insurance, then X ∼ Bin (35, 0.23).
(a) E (x ) = µ = (35)(0.23) = 8.05.
(b) Var (x√) = σ2√
= (35)(0.23)(0.77) = 6.1985, and
σ = σ2 = 6.1985 = 2.490.
P ( X = r ) = f ( r ) = p ( 1 − p ) (r −1) , for r = 1, 2, . . .
where
f (r ) = the probability of first success in the r th trial
p = the probability of a success on any one trial
1−p = the probability of a failure on any one trial
E (x ) = µ = p1
Var (x ) = σ2 = 1p−2p
Example
Products produced by a machine has a 3% defective rate. What is the
probability that the first defective occurs in
(a) the fifth item inspected?
(b) the first five inspections?
Case 1: If you select one ball with replacement, what is the probability that
the first blue ball is the seventh selection?
Case 2: If you select 8 balls at random, what is the probability that you
will select 3 green balls?
Case 1: If you select one ball with replacement, what is the probability that
the first blue ball is the seventh selection?
Case 2: If you select 8 balls at random, what is the probability that you
will select 3 green balls?
µr e − µ
P (X = r ) = f (r ) = , for r = 0, 1, 2, . . .
r!
where
E (x ) = µ
Var (x ) = µ
Example
During the period of time that a local university takes phone-in
registrations, calls come in at the rate of one every two minutes.
(a) What is the expected number of calls in one hour?
(b) What is the probability of four calls in six minutes?
60
Then number of calls in 1 hour = 2 = 30.
6
Then number of calls in 6 minutes = 2 = 3, so our µ = 3 then
34 e −3
P (x = 4) = f (4) = = 0.1680.
4!
Example
An average of 15 aircraft accidents occur each year.
(a) Compute the mean number of aircraft accidents per month.
(b) Compute the probability of no accidents during a month.
(c) Compute the probability of exactly one accident during a month.
(d) Compute the probability of more than one accident during a month.
15
Then number of accident in 1 month = 12 = 1.25.
1.250 e −1.25
(b) P (x = 0) = f (0) = 0! = 0.2865.
1.251 e −1.25
(c) P (x = 1) = f (1) = 1! = 0.3581.
Al Albayt University
Normal distribution arise naturally for many different random variables, such
as, heights, test scores, and lengths.
Normal Distribution
Normal distribution is symmetrical, bell- shaped density curves defined by a
mean µ and a standard deviation σ. We say that X is normally distributed
if X ∼ N (µ, σ ).
Probability Curve
P (z ≤ 1.00) =
= 0.8413
Probability Curve
P (z ≥ 1.58) =
1 − P (z ≤ 1.58) =
1 - 0.9429 = 0.0571
Probability Curve
P (−0.50 ≤ z ≤ 1.25) =
P (z ≤ 1.25) − P (z ≤ −0.50)=
0.8944 - 0.3085 = 0.5859
Probability Curve
P (−1.00 ≤ z ≤ 1.00) =
P (z ≤ 1.00) − P (z ≤ −1.00)=
0.8413 - 0.1587 = 0.6826
Summary
P (z ≤ b ) → z-table
P (z ≥ a ) → 1 − P (z ≤ a ) → z-table
P (a ≤ z ≤ b ) → P (z ≤ b ) − P (z ≤ a ) → z-table
P (z = b ) = 0
P (z ≤ −b ) = P (z ≥ b )
P (z ≥ −b ) = P (z ≤ b )
P (|z | ≤ b ) = P (−b ≤ z ≤ b )
P (|z | ≥ b ) = 2P (z ≤ −b )
Example Find:
1 P (z < 0.36) = 0.6406.
2 P (z > a) = 0.9750.
P (z > a) =0.9750
1 − P (z < a) =0.9750
P (z < a) =1 − 0.9750
P (z < a) =0.0250 from the table a = −1.96
3 P (0 < z < a) = 0.4700.
P (0 < z < a) =0.4700
P (z < a) − P (z < 0) =0.4700
P (z < a) − 0.5 =0.4700
P (z < a) =0.4700 + 0.5
P (z < a) =0.9700 from the table a = 1.88
Dr. Ayat Al-Meanazel (A.A.U) 133 / 190
6.2 Normal Probability Distribution
Example Find the value of a for each situation P (−a < z < a) = 0.2050.
P (x ≥ a) =0.10
1 − P (x < a) =0.1000
P (x < a) =1 − 0.1000
P (x < a ) =0.9000
x −µ a − 12
P( < ) =0.9000
σ 2
a − 12
P (z < ) =0.9000
2
a − 12
=1.28 → a = 14.56 JD.
2
x b
1 −µ
= 1 − e− µ .
Rb
2 P (x ≤ b ) = P (x < b ) = 0 µ
e dx
R∞ 1 −µ
x a
3 P (x ≥ a ) = P (x > a ) = a µe dx = e− µ .
x a b
1 −µ
= e− µ − e− µ .
Rb
4 P (a ≤ x ≤ b ) = a µ
e dx
P (x ≥ 5 and x > 2) P (x ≥ 5)
P (x ≥ 5/x > 2) = =
P (x > 2) P (x > 2)
5
e− 5
= 2 = 0.5488.
e− 5
Note,
5
e− 5 5 2 3
= e − 5 + 5 = e − 5 = 0.5488.
e − 52
Al Albayt University
∑ni=1 xi nµ nσ2 σ2 √σ
x̄ = n n =µ n2
= n n
Example
The heights of adults male in Jordan are normally distributed with mean
height 172 cm, and variance 144 cm2 .
1 What is the probability that the height of selected male is less than
175 cm?
2 If we select a simple random sample of 25 males, what is the
probability that the sum of there heights more than 4350 cm?
3 If we select a simple random sample of 25 males, what is the
probability that the sample mean is between 165 cm and 179 cm?
Let x be the height of adult male in Jordan, then x ∼ N (172, 12). Which
means µ = 172 and σ = 12.
x −µ 175 − 172
P (x < 175) =P <
(1) σ 12
=P (z < 0.25) = 0.5987.
∑25
i =1 x i ∼ N ( 25µ, 25σ) = N (4300, 60).
∑25
25
i =1 xi − nµ 4350 − 4300
P ∑ xi > 4350 =P √
nσ
>
60
i =1
=P (z > 0.83)
=1 − P (z < 0.83) = 1 − 0.7967 = 0.2033.
Example
The average price of a gallon of gasoline was reported to be $2.34. Use this
price as the population mean, and assume the population standard deviation
is $0.20.
1 What is the probability that the average price of a gallon of gasoline is
within $0.02 of the population mean?
2 What is the probability that the mean price for a sample of 49 service
stations is within $0.04 of the population mean?
3 A sample mean have a 0.95 probability that it is within $0.03 of the
population mean, What is the sample size?
Then We have
−0.03
√ = − 1.96
0.20/ n
√
−0.03 n
= − 1.96
0.20√
−0.03 n = − 0.392
√ −0.392
n =13 (we round −0.03 = 13.06̄ to nearest integer)
n =169.
Al Albayt University
Where,
The Margin of error= z 2α √σn .
z 2α is the z value providing an area of α/2 in the upper tail.
σ
x̄ ± z 2α √ = 24.8 ± 1.7 then µ ∈ (23.1, 26.5).
n
Note,
σ σ σ
x̄ + z 2α √ − x̄ − z 2α √ =2z 2α √
n n n
15
160 − 152 =2(1.96) √
n
2
2(1.96)(15)
(round to nearest integer) n = = 54
8
Dr. Ayat Al-Meanazel (A.A.U) 159 / 190
Population Mean: Standard Deviation Unknown
Where,
The Margin of error= t 2α √sn .
t 2α is the t value providing an area of α/2 in the upper tail of the t
distribution with n − 1 degrees of freedom.
s
x̄ ± t 2α √ = 22.5 ± 1.16 then µ ∈ (21.34, 23.66).
n
Al Albayt University
3*Lower tail 3* zα
t(1−α,n−1)
3*Upper tail 3* z1 − α
t(α,n−1)
Step 4. Conclusion
Type of Test Rule Conclusion
Lower tail ztest < zα 2*We reject H0
ttest < t(1−α,n−1)
Otherwise, we fail to reject H0
Upper tail ztest > z1−α 2*We reject H0
ttest > t(α,n−1)
Otherwise, we fail to reject H0
Two tailed |ztest | > z 2α 2*We reject H0
|ttest | > t( 2α ,n−1)
Otherwise, we fail to reject H0
Example
Consider the following hypothesis test:
H0 : µ ≥80
Ha : µ <80
µ0 80
σ 12
n 100
x̄ 77
α 0.01
Step 4. Conclusion.
Example
Consider the following hypothesis test:
H0 : µ ≤50
Ha : µ >50
A sample of 25 is used, compute and state your conclusion for sample mean
62 and variance 64, use α = 0.05.
µ0 50
s 8
n 25
x̄ 62
α 0.05
Step 4. Conclusion.
Example
Consider the following hypothesis test:
H0 : µ =15
Ha : µ 6=15
µ0 15
σ 11
n 16
x̄ 12
α 0.25
Step 4. Conclusion.
Example
Individuals filing income tax returns prior to March 31 received an average
refund of 1056 JD. A researcher suggests that the population of late filers
who file their tax return during the last five days of the income tax period
has a standard deviation of 1600 JD, and will receive less refunds than do
early filers. For a sample of 400 individuals who filed a tax return during the
last five days the sample mean refund was 910 JD, develop appropriate
hypothesis using α = 0.05 to test the previous suggestion.
µ0 1056
σ 1600
Ha µ < 1056
n 400
x̄ 910
α 0.05
Step 4. Conclusion.
We will reject H0 since ztest = −1.825 < −1.645 = zα .
There is a sufficient evidence that the average refund is less than 1056 JD.
Example
For Jordan, the mean monthly Internet bill is 32.79 JD per household. A
sample of 30 households in southern Jordan showed a sample mean of 35.63
JD with standard deviation of 5.60 JD. Formulate hypothesis for a test to
determine whether the sample data support the claim that the mean
monthly Internet bill in southern Jordan is more than the national mean of
32.79 JD at α = 0.15.
µ0 32.79
s 5.60
Ha µ > 32.79
n 30
x̄ 35.63
α 0.15
Step 4. Conclusion.
We will reject H0 since ttest = 2.778 > 1.055 = t(α,n−1) .
There is a sufficient evidence that the mean monthly Internet bill in
southern Jordan is more than the national mean.
Dr. Ayat Al-Meanazel (A.A.U) 180 / 190
Hypothesis Testing about a Population Mean
Example
The average hourly earnings for production workers assumed to be 14.32 JD
per hour. Assume the population standard deviation σ = 9.45JD. If we
have a sample of 75 production workers with a sample mean of 14.68 JD
per hour, can we conclude that the mean hourly earnings is different than
14.32 JD?
Use α = 0.20.
µ0 14.32
σ 9.45
Ha µ 6= 14.32
n 75
x̄ 14.68
α 0.20
H0 : µ = 14.32 vs Ha : µ 6= 14.32
Step 4. Conclusion.
We fail to reject H0 since |ztest | = 0.330 < 1.28 = |z 2α |.
There is no sufficient evidence that the mean hourly earnings is different
than 14.32 JD.
Dr. Ayat Al-Meanazel (A.A.U) 182 / 190
Probability and Statistics
401237
Chapter 14: Simple Linear Regression
Al Albayt University
E (y ) = β 0 + β 1 x
ŷ = b0 + b1 x
Where, b0 is the y intercept and b1 is the slope. Also we can say as the
value of x increases by one unit, we expect the value of y to increase
| {z } or
if b1 >0
| {z } by |b1 |.
decrease
if b1 <0
∑ni=1 xi ∑ni=1 yi
x̄ = and ȳ =
n n
Step 2: Calculate the slope
b0 = ȳ − b1 x̄
Example
A sample of pizza restaurants from 10 different cities was selected, the xi is
the size of the population (in thousands) and yi is the sales (in thousands of
JD). The values of xi and yi for all i = 1, 2, . . . , 10 are shown in the table
below
i 1 2 3 4 5 6 7 8 9 10
xi 2 6 8 8 12 16 20 20 22 26
yi 58 105 88 118 117 137 157 169 149 202
For any data set (xi , yi ) with i = 1, 2, . . . , n, we can find the following:
the linear regression equation ŷ = b0 + b1 x, where A = b0 and
B = b1 .
the estimated value of y for any given x value.
the error in the estimation Error = y − ŷ .
the correlation coefficient r , where
The following data were collected on the height (inches) and weight
(pounds) of adult swimmers.
Height 68 64 62 65 66
Weight 132 108 102 115 128
The following data are the monthly salaries and the grade point averages for
students who obtained a bachelor’s degree in business administration.