Probability & Statistics
Probability & Statistics
Pilani Campus
Lecture 1
EVENT Events:
A B
A B B A
A B A B
1. A = ( A) = A i.e complement of a complement is A itself Sum Rule Principle : Suppose some event E can occur in m
2. Distributive laws: ways and a second event F occur in n ways, and suppose both
(i) A (B C) = (A B) (A C)
events cannot occur Simultaneously. Then E or F can occur
in m + n ways.
(ii) A (B C) = (A B) (A C)
Example : Suppose there are 8 male professors and 5 female
3. De Morgan’s laws: professors teaching probability class.
(i) A B = A B
(ii) A B = A B A Student can choose a professor in 8 + 5 = 13 Ways
4. Commutative laws: 5. (i) (A B) (A B) = A
Suppose there are 3 different books on probability 5 different
(i)𝐴 ∪ 𝐵 = 𝐵 ∪ 𝐴 (ii) (A B) (B A) = B
books on statistics and 4 different books on numerical
(ii) 𝐴 ∩ 𝐵 = 𝐵 ∩ 𝐴 (iii) A A = ; A A = S or U methods.
Then there are 3 + 5 + 4 = 12 ways to choose one of the books
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
11/9/2024 13
BITS Pilani, Pilani Campus
Pr = 8 P5 = 8! = 8! = 8 7 6 5 4 3 2 1 = 6720 ways
n
7! = 7 · 6 · 5 · 4 · 3 · 2 · 1 = 5040 surveys (8 − 5)! 3! 3 2 1
15
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
Suggested Problems Example 2(3.22)
Example 1: In how many ordered ways can a television direct If among n objects k are alike and the others are all distinct, the numbe
or schedule 6 different commercials during the 6 time slots r of permutations of these n objects taken all together in n!/k!.
allocated to commercials during the telecast of the first per (a) How many permutations are there of the letters of the world CLASS
iod of a hockey game? ?
Answer: Since the CLASS contains 2 letters same , therefore the numbe
Answer: Here n = 6, r = 6
r of permutations of the world CLASS is n!/k!
Therefore nPr = 6P6 =6!/(6-6) ! = 6! = 720 ways. = 5!/2! = (5x4x3x2x1)/2x1 = 60 ways.
OR (b) In how many ways can the television director of Exercise 3.21 fill
First commercial in first slot can be chosen in 6 ways, the 6 time slots allocated to commercials, if there are 4 different co
for 2nd slot remaining 5 commercial can be chosen in 5 ways mmercials, of which a given one is to be shown 3 times while each
and so on. Therefore 6 commercial in 6 slot can be chosen of the others is to be shown once?
Answer: n = 6 commercials out of which one commercial is repeated 3
in
times. Therefore no. of ways is n!/k! = 6!/3! =120 ways
= 6x5x4x3x2x1 = 720 ways
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
An urn contains 5 red and 8 white balls. If seven of The complement of Event E is the set of all outcomes in
them are drawn at random, what is the probability the sample space that are not included in event E.
that the drawn lot contains 2 red and 5 white (Denoted E′ ( E )( and read “E prime.” or E complement)
Solution: Total balls= 5 red+8 white =13 balls P(E) + P (E′ ) = 1 P(E) = 1 – P (E′ ) P (E′ ) = 1 – P(E)
S=13𝐶7 = 429 ways, the number of ways drawing 2 Example 4: There are 5 red chips, 4 blue chips, and 6 white chips
red balls out of 5 is 5C2 ways and number of ways in a basket. Find the probability of randomly selecting a chip that is
drawing 5 white balls from 8 is 8C5 ways not blue.
Therefore P(2R and 5 W balls)= 5c 8c = 140 2 5
13c 429 4
= 0.267
7
P (selecting a blue chip)
15
Using multiplication theorem
4 11
P (not selecting a blue chip) =1− = 0.733
15 15
23 24
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
AXIOMS OF PROBABILITY Suggested Problems
Axiom 1 : For any event A in S , 0 P(A) 1. Example (3.35): An experiment has the four possible mutually
exclusive outcomes A, B, C, D. Check whether the following
Axiom 2 : P(S) =1. assignments of probability are permissible:
26
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
The probability that event A or B will occur is given by A number is choosen from the numbers {1,2,3------50}What is the
probability that the number choosen is divisible by 6 or 8.
P (A U B) = P (A) + P (B) – P (A ∩ B ). Solution: Sample space S= out of 50 numbers one number can be
If events A and B are mutually exclusive, then the rule choosen in 50𝐶1 = 50 ways
can be simplified to P (A U B) = P (A) + P (B).
Let A be the event that the number is divisible by 6 and B be the
Example 1: You roll a dice. Find the probability that you roll a even that the number is divisible by 8. Then
number less than 3 or a 4.
A={6,12,18,24,30,36,42,48}, B={8,16,24,32,40,48}
The events are mutually exclusive P(A)= 8/50 , P(B)= 6/50, P (A ∩ B )=2/50
P (roll a number less than 3 or a 4) = P (number is less than 3) + P (4) Using Addition rule we get
2 1 3 P (A U B) = P (A) + P (B) – P (A ∩ B )
= + = = 0.5
6 6 6 P (A U B)= 8/50+6/50-2/50=12/50
27 28
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
Example 3 Continuation
A basket contains 3 black, 4 white and 5 red balls. 3 balls are 3c 4c 5c
choosen at random. What is the probability that = 3
+ + 3 3
(i) All are white, (ii) All are of same colour (iii) All are of different 12c 12c 12c
3 3 3
colours, (iv) Atleast one white (v) Atmost one is white (vi) one is
red. = 0.06818
Solution: Total balls= 3black+4 white+ 5 red =12 (iii) All are of different colours ⇒ it can be
12!
S = 12c3 =
9!3!
= 220 ways 1B&1W&1R
4c 4 P(All are of different colours)= P(1B&1W&1R)
=
= 3c 4c 5c
3
(i) P(All are white)=
12c 220 3
1
(using mutliplication
1 1
12c
(ii) All are of same colour implies it can be 3B or 3W or 3R
3
principle)
3 4 5
Therefore P(All are of same colour)=P(3B or 3Wor 3R)= = = 0.2727
220
=P(3B)+P(3W)+P(3R)
All are mutually exclusive 29 30
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
(iv) Atleast one is white ⇒ 1.Suppose a student is selected at random from 80 students where
30 are taking mathematics, 20 are taking chemistry and 10 are taking
1W& 2others or 2W & 1 other or 3W & 0 others both. Find the probability ‘p’ that the student is taking Mathematics
P(atleast one is white)=P(1W& 2others)+P(2W & 1 or chemistry?.
other)+P(3W & 0 others) 2. If A and B are events with P(A U B) = 7/8, P(A ∩ B) = ¼ and P(A') = 5/8, find
P(A), P(B) and P(A ∩ B').
(4c1 8c2 ) + (4c2 8c1 ) + (4c3 8c0 )
=
12c3 2
[A=(A ∩B)U(A ∩B') => P(A ∩ B')= P(A) - P(A∩B)]
=
41
= 0.7454 And [B=(A ∩B)U(B ∩A') => P(B ∩ A')= P(B) - P(A∩B)]
220
Sol: (i)P(A)= 1- P(A') =1-5/8=3/8,
(v) P(Atmost one is white)= Do it yourself (ii) Using addition theorem we get
P (A U B) - P (A) + P (A ∩ B ) =P (B)
(vi) P(one is red)= Do it yourself
Therefore P (B)=7/8-3/8+1/4=3/4
(iii) P(A ∩ B')= P(A) - P(A∩B)
= 3/8-1/4=1/8
31 32
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
Continued Continued
Example (3.39.) Show that P(A∩B) ≤ P(A) , P(AUB) ≥P(A) Example (3.51):If the probability of event A is p, the odds that it will occur are given by the
ratio of p to 1-p. Odds are usually given as a ratio of two positive integers having no common
Answer: (a) We know that A = (A∩B)U(A∩B1) factor, and if an event is more likely not to occur than to occur, it is customary to give the
odds that it will not occur rather than the odds that it will occur. What are the odds for or
P(A) =P[(A∩B)U(A∩B1)] = P(A∩B) + P(A∩B1), against the occurrence of an even if its probability is (a) 4/7 (b) 0.05 (c) 0.80
Answer : If the probability of event A is given by P(A) = a / (a+b), then the odd for
since (A∩B) & (A∩B1) are mutually exclusive events occurrence of the event is given by a:b
P(A) ≥ P(A∩B), Since P(A∩B1) ≥ 0 (a) If p = 4/7 = 4/(4+3),
therefore odds for occurrence of the event is 4:3
(b) If p = 0.05 = 5/100 = 1 /20 = 1/(1+19)
(b) We know that AUB = (A∩B1)U(A∩B)U(B∩A1) therefore odd against the occurrence is 19:1
P(AUB)= P(A∩B1) + P(A∩B) + P(B∩A1) = P(A)+ P(B∩A1) , (c) If p = 0.80 = 80/100 = 4/5 = 4/(4+1)
therefore odds for occurrence of the event is 4:1
P(AUB) ≥ P(A) Since P(B∩A1) ≥ 0
35 36
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
BITS Pilani THE ADDITION RULE
Pilani Campus
Lecture 2 38
BITS Pilani, Pilani Campus
Example 2 Example 3
A number is choosen from the numbers {1,2,3------50}What is the A basket contains 3 black, 4 white and 5 red balls. 3 balls are
probability that the number choosen is divisible by 6 or 8. choosen at random. What is the probability that
Solution: Sample space S= out of 50 numbers one number can be (i) All are white, (ii) All are of same colour (iii) All are of different
choosen in colours, (iv) Atleast one white (v) Atmost one is white (vi) one is
red.
Let A be the event that the number is divisible by 6 and B be the Solution: Total balls= 3black+4 white+ 5 red =12
even that the number is divisible by 8. Then
S=
A={6,12,18,24,30,36,42,48}, B={8,16,24,32,40,48}
P(A)= 8/50 , P(B)= 6/50, P (A ∩ B )=2/50 (i) P(All are white)=
Using Addition rule we get
P (A U B) = P (A) + P (B) – P (A ∩ B ) (ii) All are of same colour implies it can be 3B or 3W or 3R
P (A U B)= 8/50+6/50-2/50=12/50 Therefore P(All are of same colour)=P(3B or 3Wor 3R)=
=P(3B)+P(3W)+P(3R)
39 40
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
Continuation Continuation
41 42
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
46
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
CONDITIONAL
Suggested Problems
PROBABILITY
Example (3.51):If the probability of event A is p, the odds that it will occur are
given by the ratio of p to 1-p. Odds are usually given as a ratio of two positive
integers having no common factor, and if an event is more likely not to occur than
to occur, it is customary to give the odds that it will not occur rather than the odds
that it will occur. What are the odds for or against the occurrence of an even if its
probability is (a) 4/7, (b) 0.05, c) 0.80
Answer : If the probability of event A is given by P(A) = a / (a+b), then the odd for
occurrence of the event is given by a:b
(a) If p = 4/7 = 4/(4+3),
therefore odds for occurrence of the event is 4:3
(b) If p = 0.05 = 5/100 = 1 /20 = 1/(1+19)
therefore odd against the occurrence is 19:1
(c) If p = 0.80 = 80/100 = 4/5 = 4/(4+1)
therefore odds for occurrence of the event is 4:1
47 48
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
Examples on Conditional Examples on Conditional
Probability Probability
Solution:
53 54
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
2. A person X can hit a target three times in five shots, a second person
Y twice in five shots and a third person Z thrice in four shots. They
all fire once simultaneously. Find the probability that (i) atleast two
shot hit, (ii) exactly two shots hit.
59
BITS Pilani, Pilani Campus
The Theorem of Total Probability
(tree diagram )
Proof:
The conditional Probability of Ai for any i given B is given as
B
Product is good
64
63
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
Solution Example 2
Let X: “ Customer receives a defective product”.
The chances that an academician, a business man and a
Clearly, P(X)= politician becoming Vice Chancellor of an university are
0.5, 0.3 and 0.2 respectively. The probability that
=0.17 research work will be promoted in the university by these
Therefore, the chance that product is manufactured 3 gentlemen are respectively are 0.8, 0.6 and 0.4. It is
found Research work has been promoted by the
by the plant C is university. What is the chance that an academician has
become the VC?
= = 0.4412
65
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
69 70
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
BITS Pilani
Pilani|Dubai|Goa|Hyderabad Probability Distributions
Symbol Description
n The number of times a trial is repeated. Example : Decide whether the experiment is a binomial
experiment. If it is, specify the values of n, p, and q, and list the possible values
p = P (S) The probability of success in a single trial. of the random variable x. If it is not a binomial experiment, explain why.
q = P (F) The probability of failure in a single trial. (q = 1 – p) You randomly select a card from a deck of cards, and note if the card
The random variable represents a count of the number of is an Ace. You then put the card back and repeat this process 8 times.
x
successes in n trials: x = 0, 1, 2, 3, … , n.
This is a binomial experiment. Each of the 8 selections represent an
independent trial because the card is replaced before the next one is drawn.
There are only two possible outcomes: either the card is an Ace or not.
n =8 p= 4 = 1 q = 1 − 1 = 12 x = 0,1,2,3,4,5,6,7,8
52 13 13 13
Problem : When a coin is tossed 4 times, find the
Binomial Distribution probability of getting (i) exactly one head,(ii) atmost 3
heads, (iii) atleast 2 heads
3
p = the probability of selecting a red chip = = 0.3
10
q = 1 – p = 0.7 P (1) = 3C 1(0.3)1(0.7)2
n=3
= 3(0.3)(0.49)
= 0.441
3
p = the probability of selecting a red chip = = 0.3
10
q = 1 – p = 0.7
n=4
x = 0, 1, 2, 3, 4
x P (x )
0 0.240
1 0.412 The binomial probability
2 0.265 formula is used to find each
3 0.076 probability.
4 0.008
Binomial Table:
Sample Table
4
(f) b(k ;10, 0.30)
k =2
Exercise: 4.19
= x) =
( ) ( )
3 21
6− x
considerations, only 5 out of each 10 are inspected for
( )
x
P( X 24 safety. The 5 cars are randomly chosen from the 10 on
6
the lot. If 2 out of the 10 cars on the lot are below
0
3 21
6
= 0.40316 standards for safety, what is the probability that at least 1
P ( X = 0) =
24
6
i.e., no defective
out of the 5 cars to be inspected will be found not
meeting safety standards?
P ( X = 3) =
(33 ) (321 ) = 0.00988
(624 ) i.e.,3 will be
defective.
N= 10 possible trials n= 5 actual trials
X = 2 possible successes x= 1 actual successes
N= 10 possible trials n= 5 actual trials
X = 2 possible successes x= 1 actual successes
2 (10 − 2 ) 2 8 2! 8! Example : In a communication system one telephone number out of fifteen called between
1 (5 − 1) 1 4
2 pm and 3pm on wee days is busy. What is the probability that if 6 randomly selected
1!1! 4!4! 5 telephone numbers are called
P (1) = = = = = 0.556
10 10 10! (i) not more than three (ii) at least three of them will be busy.
5 5
9 Here x: telephone number is busy x 6− x
1 14
5!5! Ans: Here n = 6, p = 1/15 q = 14/15; P(x) = 6c x
15 15
2 (10 − 2 ) 2 8 2! 8! (i)The probability that not more than three will be busy
2 (5 − 2 ) 2 3
P ( 2) =
=
=
1!1! 3!5!
=
2
= 0.222
=P (x ≤ 3) = P(0) + P(1) + P(2) + P(3) = 0.9997
10 10 10! 9
5 5 (ii) The probability that at least three of them will be busy = P (x ≥ 3)
5!5!
Thus, P(1) + P(2) = 0.556 + 0.222 = 0.778. or = P(3) + P(4) + P(5) + P(6) = 0.005
P(X 1) = 1 − P(X 1) = 1 - P(0) = 1 - 0.2222 = 0.778
BITS Pilani
Pilani Campus
Mean & Variance :
Contact session 5
Question: Find the mean and variance of probability distribution of the number of
heads obtained in 3 flips of balanced coin Variance:
Answer:
X= Find the mean of probability distribution of the number of heads obtained in 3
flips of balanced coin 2 2 2 2 2
1 1 1 1 1
x = 0,1, 2, 3 2 ( X ) = E X − = 0 − f ( 0 ) + 1 − f (1) + 2 − f ( 2 ) + 3 − f ( 3)
2 2 2 2 2
1 3 3 1
f ( 0) = f (1) = f ( 2) = f ( 3) = 3
8 8 8 8 =
4
( X ) = E ( X ) = 0 f ( 0 ) + 1 f (1) + 2 f ( 2 ) + 3 f ( 3)
1 3 3 1
= 0 + 1 + 2 + 3
8 8 8 8
3 6 3 12 3 133 134
= + + = =
8 8 8 8 2
Mean and Variance of the Binomial distribution: Mean of the Binomial distribution (continuing) :
Let X follows binomial distribution with
Putting x=0, first term will be zero
f ( x ) = ( nx ) p x (1 − p )
n− x
, x = 0,1, 2,..., n
1 n ( n − 1)! p p (1 − p )
x −1 n− x
n
=
( x − 1)! ( n − x )!
Where
n=total no of trails x =1
p=probability of success
( n − 1)! p x−1 (1 − p )
n− x
n
= np
E ( X ) = x f ( x ) = x ( nx ) p x (1 − p )
n n
n− x
x =1 ( x − 1)!( n − x )!
x =0 x =0
( n − 1)! p x −1 (1 − p )( ) (
n −1 − x −1)
n n
n!
= x p x (1 − p ) = np
n− x
x =0 x !( n − x )! x =1 ( x − 1)!( n − 1) − ( x − 1)
135 136
b( x; n, p )
Let x − 1 = y; n − 1 = m
= np and 2 = npq
m
m!
= np p y (1 − p )
m− y
y =0 y !( m − y )!
= np
E ( X ) = np
137 138
Example: Example:
Question: In a bag 5 out of 20 are defective. Find the mean of the probability distribution of the number of
Question: Find the mean and variance of the probability distribution of the number of heads obtained in 3 defective in a sample of the randomly chosen for inspection.
flips a balanced coin.
Answer:
Answer: X= Number of heads in flipping a coin thrice
1 1
n = 3; p = ,q =
2 2
1 3
= np = 3 =
2 2
1 1 3
2 = npq = 3 =
2 2 4 139 140
E(X ) = 5; 2 (X ) = 2
E ( aX + b ) = aE ( X ) + b
Va r ( aX + b ) = a 2 var ( X ) E (Y ) = E ( 2 X − 3) = 2 E ( X ) − 3
= 25 − 3 = 7
2 (Y ) = 2 ( 2 X − 3) = 2 2 var ( X )
= 4 2 = 8
141 142
Example: Example:
In 256 sets of 12 tosses of a fair coin, in how many cases may one expect 8 If the mean is 3 and variance is 4 of a random variable 𝑋, check whether 𝑋
heads and 4 tails . follows binomial distribution?
Solution: Let 𝑋 be a random variable denoting the number of heads(remaining Solution: 𝜇 = 𝑛𝑝 = 3 and 𝜎 = 𝑛𝑝𝑞 = 4.
tails). Here 𝑛 = 12 . Therefore,
1 𝑛𝑝 3 1 3
Probability of getting heads 𝑝 = 1/2, therefore 𝑞 = 2 . = 4 . This implies 𝑞 = 4 . Hence 𝑞 = 4/3. This is not possible.
𝑛𝑝𝑞
12 1 8 1 4
Hence, 𝑃 𝑋 = 8 = ( ) = 0.12.
8 2 2
SO 𝑋 does not follow the binomial distribution
Therefore, the number of times to get 8 heads and 4tails appearing in 256
sets= 256*0.12=31 times.
The mean and variance of Binomial distribution are 16 and 8. Find 𝑃 𝑋 ≥ 3 . 1. The probability is 0.02 that an item produced by a factory is defective. A shipment of 10,000
items is sent to a warehouse. Find the expected number 𝐸 defective items and the standard
deviation 𝜎.
1
Solution: 𝜇 = 𝑛𝑝 = 16, 𝜎 = 𝑛𝑝𝑞 = 8 . This gives 𝑞 = 1/2 and hence 𝑝 = 2 . 2. A students takes an 18-questions multiple choice exam, with 4 choices per question.
Suppose one of the choices is obviously incorrect, and the student makes an “educated “
Therefore, 𝑛𝑝 = 16 and this gives 𝑛 = 32. guess of the remaining choices. Find expected number 𝐸 of correct answers and the
standard deviation 𝜎.
1
Thus, 𝑃 𝑋 ≥ 3 = 1 − 𝑃 0 + 𝑃 1 + 𝑃 2 = 0.9999 3. The probability that a man hits the target is 𝑝 = . He fires 100 times. Find the expected
10
number of times he will hit the target.
The standard deviation 𝜎 of a random variable 𝑋 measures the spread of the Suppose 𝑋 is a random variable with mean 𝜇 = 100 and standard deviation 𝜎 = 5.
values of 𝑋 about the mean 𝜇 of 𝑋. Accordingly, for smaller values of 𝜎, we a) Find the conclusion one can derive from Chebyshev’s inequality for 𝑘 = 2 and 𝑘 = 3.
would expect that 𝑋 will be closer to its mean 𝜇. This intuitive expectation is b) Estimate the probability that 𝑋 lies between 80 and 120.
made more precise by the following inequality c) Find an interval [𝑎, 𝑏] about the mean 𝜇 = 100 for which the probability that 𝑋 lies in the
interval is at least 99 percent.
Solution: a) Setting 𝑘 = 2, we get
Chebyshev’s Inequality: 1 1
𝜇 − 𝑘𝜎 = 100 − 2 5 = 90, 𝜇 + 𝑘𝜎 = 100 + 2 5 = 110, 1 − =1− =
𝑘2 22
Let 𝑋 be a random variable with mean 𝜇 and standard deviation 𝜎 . 3
.
4
Then for any positive number 𝑘, the probability that value of 𝑋 lies Thus, from Chebshev’s inequality we can conclude that probability that 𝑋 lies between 90 and
1
in the interval 𝜇 − 𝑘𝜎 ≤ 𝑋 ≤ 𝜇 + 𝑘𝜎 ≥ 1 − 2 . 3
110 is at least .
𝑘 4
8
Similarly for 𝑘 = 3 we get 𝑃 85 ≤ 𝑋 ≤ 115 ≥ .
9
b) Here 𝑘𝜎 = 20, since 𝜎 = 5, we get 5𝑘 = 20 this implies 𝑘 = 4. Thus, by Chebyshev’s
inequality
1 1 15
𝑃 80 ≤ 𝑋 ≤ 120 ≥ 1 − = 1− = .
𝑘2 42 16
1
Example:
c) Here we set, 1 − 𝑘 2 = 0.99 and solve for 𝑘. This yields
1 1
1 − 0.99 ≥ 𝑘 2 or 0.01 = 𝑘 2 this implies 𝑘 = 10.
Thus the desired interval is,
𝑎, 𝑏 = 𝜇 − 𝑘𝜎, 𝜇 + 𝑘𝜎 = 100 − 10 5 , 100 + 10 5 = 50,150 .
150
Exercise
BITS Pilani
Pilani Campus
Ans 𝑏 = 23.4
Contact session 6
𝑒 −𝜆 𝜆 𝑘 An average of 0.61 soldiers died by horse kicks per year in each Prussian army corps. Calculate the
The probability mass function of the Poisson distribution is:𝑃 𝑋 = 𝑘 = probability that exactly two soldiers died in Army Corps, assuming that the number of horse kick
𝑘!
Where deaths per year follows a Poisson distribution.
Solution: 𝑘 = 2 deaths by horse kick
• 𝑋 is random variable following a Poisson Distribution
𝜆=0.61 deaths by horse kick per year.
• 𝑘 is the number of times an event occurs
• 𝑃 𝑋 = 𝑘 probability that the events will occure 𝑘 times 𝑒 −𝜆 𝜆𝑘
𝑃 𝑥=𝑘 =
• 𝑒 is Euler’s constant (approximately 2.718) 𝑘!
158
Exercise
Answer: 1. A customer help center receives on average 3.5 calls every hour.
a) What is the probability that it will receive at most 4 calls every hour?
b) What is the probability that it will receive at least 5 calls every hour? [Ans; a) 0.725, b) 0.2746
159
163 164
Question: If a bank receives on the average =6 bad checks per day, what are the probabilities that it
will receive
165 166
167 168
169 170
171 172
Question: A Company manufactures hydraulic gears and records show that the probability is
0.04 that one of its new gears will fail its inspection Test. What is the probability that the 5th
gear in a day will be the 1st one to fail the test?
173 174
Exercise
BITS Pilani
Pilani Campus
1. Find the probability that at most 5 defective fuses will be found in a box of 200 fuses
if experiences shows that 2% of such fuses are defective. [Ans 0.785]
2. Suppose one percent of the items made by a machine are defective. In a sample of
100 items, find probability that sample contains: (a) no defective item (b) 1
defective item (c) 3 or more defective items.[ Ans (a) 0.368 (b) 0.368 (c) 0.080
3. Suppose 2 percent of the people on the average are left-handed. Find the
probability 3 or more left handed among 100 people.[Ans 0.325]
4. Wireless sets are manufactured with 25 soldered joints, each on the average 1 joint
in 500 defective. How many sets can be expected to the free from defective joints in
a consignment of 10000 sets,
Ans:0.9512
Contact session 7
Definition:
Problem:
Mean and variance of Continuous Probability
distribution
+∞ If the probability density function of a random variable is given by
1. 𝑀𝑒𝑎𝑛(𝜇) = 𝐸𝑥𝑝𝑒𝑐𝑡𝑎𝑡𝑖𝑜𝑛 𝑜𝑓 𝑋 = 𝐸 𝑋 = −∞ 𝑥𝑓 𝑥 𝑑𝑥
x for 0 x 1
+∞
2. 𝑆𝑒𝑐𝑜𝑛𝑑 𝑒𝑥𝑝𝑒𝑐𝑡𝑎𝑡𝑖𝑜𝑛 𝑜𝑓 𝑋 = 𝐸 𝑋2 = −∞ 𝑥 2 𝑓 𝑥 𝑑𝑥 f ( x ) = 2 − x for 1 x 2
2 0 otherwise
3. 𝑉𝑎𝑟𝑖𝑎𝑐𝑒 𝑉 = 𝜎 2 = 𝐸 𝑋 2 − 𝐸 𝑋
Find the probabilities that a random variable having this probability density will
+∞ take on a value (a) between 0.2 and 0.8 (b) between 0.6 and 1.2
= න 𝑥 − 𝜇 2 𝑓 𝑥 𝑑𝑥
−∞
4. 𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 𝜎 = 𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒
0.8
P(0.2 x 0.8) = f ( x)dx
0.2
0.8
= xdx =
0.2
x2
0.8
2 0.2
= 0.3
1.2
P(0.6 x 1.2) = f ( x)dx
0.6
1 1.2
=
0.6
f ( x)dx + f ( x)dx
1
xdx + (2 − x)dx =
1 1.2
1 2 1.2
= x2
2 0.6
+ 2 x− x
2 1
= 0.54
0.6 1
Solution:
Solution:
we know that f(x) = 1
-
2 2
kx kx 2
dx = 1 3 dx = 1
−
3 −1
k
3
x3
3
2
−1
=1
k
9
(9) = 1
k = 1
2
1
2
1 1 x=2
1 2
μ = E(X) = x f ( x) dx = x[ x ] dx = x 3 dx = x 4 = 5/4
− −1
3 3 −1
3 4 x = −1
Variance – Do it as H.W
Practice Problems:
Normal Distribution
Normal curve 3. The total area under the curve is equal to one.
4. The normal curve approaches, but never touches the x-axis as it extends farther and
x farther away from the mean.
5. Between μ − σ and μ + σ (in the center of the curve), the graph curves downward. The
A normal distribution is a continuous probability distribution for graph curves upward to the left of μ − σ and to the right of μ + σ. The points at
which the curve changes from curving upward to curving downward are called the
a random variable, x. The graph of a normal distribution is called inflection points.
the normal curve.
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
Normal Distribution
Inflection points Effect of Varying Probability
Total area = 1 Parameters (µ & ) Probability is area under
curve!
x
μ − 3σ μ − 2σ μ−σ μ μ+σ μ + 2σ μ + 3σ d
P(c x d) = c
f (x)dx ?
If x is a continuous random variable having a normal distribution with mean μ
and standard deviation σ, you can graph a normal curve with the equation f (x)
2
1 x−
1 −
f (x) = e 2
where 2
µ = Mean of the normal random variable x
= Standard deviation
π = 3.1415 . . .
e = 2.71828 . . . x
P(x < a) is obtained from a table of normal probabilities
c d
The standard normal distribution is a normal distribution with a If each data value of a normally distributed random variable x is transformed into a z-score, the
result will be the standard normal distribution.
mean of 0 and a standard deviation of 1.
Example: Example:
Find the cumulative area that corresponds to a z-score of 2.71. Find the cumulative area that corresponds to a z-score of −0.25.
0.0 .5000 .5040 .5080 .5120 .5160 .5199 .5239 .5279 .5319 .5359
−3.4 .0002 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0003
0.1 .5398 .5438 .5478 .5517 .5557 .5596 .5636 .5675 .5714 .5753
0.2 .5793 .5832 .5871 .5910 .5948 .5987 .6026 .6064 .6103 .6141 −3.3 .0003 .0004 .0004 .0004 .0004 .0004 .0004 .0005 .0005 .0005
−0.3 .3483 .3520 .3557 .3594 .3632 .3669 .3707 .3745 .3783 .3821
2.6 .9953 .9955 .9956 .9957 .9959 .9960 .9961 .9962 .9963 .9964
−0.2 .3859 .3897 .3936 .3974 .4013 .4052 .4090 .4129 .4168 .4207
2.7 .9965 .9966 .9967 .9968 .9969 .9970 .9971 .9972 .9973 .9974
−0.1 .4247 .4286 .4325 .4364 .4404 .4443 .4483 .4522 .4562 .4602
2.8 .9974 .9975 .9976 .9977 .9977 .9978 .9979 .9979 .9980 .9981
−0.0 .4641 .4681 .4724 .4761 .4801 .4840 .4880 .4920 .4960 .5000
Find the area by finding 2.7 in the left hand column, and then Find the area by finding −0.2 in the left hand column, and then
moving across the row to the column under 0.01. moving across the row to the column under 0.05.
The area to the left of z = 2.71 is 0.9966. The area to the left of z = −0.25 is 0.4013
Example 1: Find the area under the standard normal curve to the left of z = −2.33.
Always draw the curve!
From the Standard Normal Table, the area is equal to
0.0099.
μ = 78 z = x - μ = 90 - 78
z
σ 8
−2.33
2. Find the area
0
under the standard normal curve to the right of z = 0.94. σ=8 = 1.5
0.8264 1 − 0.8264 = 0.1736
P(x < 90)
From the Standard Normal Table, the area is equal to
0.1736. The probability that a
z x student receives a test
0 0.94
3. Find the area under the standard normal curve between z = −1.98 and z = 1.07. μ =78 90
0.8577 z score less than 90 is
μ =0 1.5?
P(x < 90) = P(z < 1.5) = 0.9332 0.9332.
0.8577 − 0.0239 = 0.8338
0.0239
From the Standard Normal Table, the area is equal to
0.8338.
z
−1.98 0 1.07
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Example: Example:
The average on a statistics test was 78 with a standard deviation of 8. The average on a statistics test was 78 with a standard deviation of 8. If
If the test scores are normally distributed, find the probability that a the test scores are normally distributed, find the probability that a
student receives a test score greater than 85. student receives a test score between 60 and 80.
z1 = x - μ = 60 - 78 = -2.25
σ 8
μ = 78 z = x - μ = 85 - 78 x -μ 80 - 78
σ 8 P(60 < x < 80) z2 = = = 0.25
= 0.875 0.88 σ 8
σ=8 μ = 78
P(x > 85)
The probability that a σ=8 The probability that a
μ =78 85 x x
student receives a test 60 μ =7880 student receives a test
z score greater than 85 is z score between 60 and 80
μ = 0.88
? −2.25
? μ =0 ?0.25
0.1894. is 0.5865.
P(60 < x < 80) = P(−2.25 < z < 0.25) = P(z < 0.25) − P(z < −2.25)
P(x > 85) = P(z > 0.88) = 1 − P(z < 0.88) = 1 − 0.8106 = 0.1894
= 0.5987 − 0.0122 = 0.5865
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Problem:
c) P(13.6 X 18.8)
x − 13.6 − 16.2
z1 = = = −2.08
1.25
x − 18.8 − 16.2
z2 = = = 2.08
1.25
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Practice Problems:
Problem:
The time for oil to percolate Solution : Example: You work in Quality Control for GE. Light bulb life has a normal
Given = 20, P( X 31.5) = 0.25, = ? distribution with = 2000 hours and = 200 hours. What’s the probability that a
to all parts of an engine can x − 31.5 − 20 9.2 bulb will last
z= = =
be treated as a random
A. between 2000 and 2400
variable having a normal P( X 31.5) = 0.25
11.5 11.5
hours?
distribution with mean 20 P( Z ) = 0.25 = 1 − P( Z ) = 0.25
B. less than 1470 hours?
seconds. Find its standard P( Z
11.5
) = 0.75 C. More than 2000 hours?
deviation if the probability is
from table we have
0.25 that it will take a value 11.5
= 0.675 Example: If X is a normal random
greater than 31.5 seconds
= 17.037 variable with mean 3 and the variance 16 then find
(i) P(X< 11) (ii) P(X > - 1) (iii) P(2 < X < 7)
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Solution
Example : A survey indicates that for each trip to a supermarket, a 1. (a) The figure at the left shows a normal curve with μ = 45 minutes and σ = 12
shopper spends an average of 45 minutes with a standard deviation of minutes. The area for x between 24 and 54 minutes is shaded.
12 minutes in the store. The lengths of time spent in the store are The z-scores that correspond to 24 minutes and to 54 minutes are
z1 = 24 – 45 / 12 = -1.75 and z2 = 54 – 45 / 12 = 0.75.
normally distributed and are represented by the variable x. A shopper
So, the probability that a shopper will be in the store between 24 and
enters the store. (a) Find the probability that the shopper will be in 54 minutes is
the store for each interval of time listed below. (b) Interpret your answer P(24 < x < 54) = P( -1.75 < z < 0.75)
when 200 shoppers enter the store. How many shoppers would you = P(z < 0.75 )- P(z < -1.75)
expect to be in the store for each interval of time listed below? = 0.7734 - 0.0401 = 0.7333.
(b) Interpretation When 200 shoppers enter the store, you would expect
1. Between 24 and 54 minutes 2. More than 39 minutes
200(0.7333) = 146.66, or about 147, shoppers to be in the store between 24 and 54
minutes.
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Example:4
In a normal distribution 31% of the items are under 45 and 8% of the items are over 64. Find the mean and
standard deviation of the distribution.
Soln: Let 𝜇 & 𝜎 the mean and standard deviation of the normal distribution
2. (a) The figure at the left shows a normal curve with m = 45 minutes and s = 12 minutes. By data 𝑃 𝑥 < 45 = 0.31 𝑎𝑛𝑑 𝑃 𝑥 > 64 = 0.08
The area for x greater than 39 minutes is shaded.
45−𝜇
The Z- score that corresponds to 39 minutes is z = 39 – 45 / 12 = -0.5. When 𝑥 = 45, 𝑍 =
𝜎
= 𝑧1 (𝑠𝑎𝑦)
So, the probability that a shopper will be in the store more than 39 minutes is
64−𝜇
P(x > 39) = P(z > -0.5) = 1 – P(z < -0.5) = 1 - 0.3085 = 0.6915. When 𝑥 = 64, 𝑍 =
𝜎
= 𝑧2 (𝑠𝑎𝑦)
(b) Interpretation When 200 shoppers enter the store, you would expect 200(0.6915) = 138.3, Thus we have 𝑃 𝑍 < 𝑧1 = 0.31 𝑎𝑛𝑑 𝑃 𝑍 > 𝑧2 = 0.08
or about 138, shoppers to be in the store more than 39 minutes.
i.e0.5 + 𝐹 𝑧1 = 0.31 𝑎𝑛𝑑 0.5 − 𝐹 𝑧2 = 0.08
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
BITS Pilani
Pilani Campus
45 − 𝜇 64 − 𝜇
⟹ = −0.5 𝑎𝑛𝑑 = 1.4
𝜎 𝜎
𝜇 − 0.5𝜎 = 45
Objective
❑ Independence
In many situations , we will be interested two or more characteristic 1. p(x) ≥ 0 for all values of x
(numerically) of the outcome of the random experiment. 2. p(x) = 1
Example1 - two-dimensional sample space: In the development of a new
receiver for the transmission of digital information, each received bit is rated
as acceptable, suspect, or unacceptable, depending on the quality of the where the summation of p(x) is over all possible values of x.
received signal. Let X denote the number of acceptable bits and Y denote
the number of suspect bits .
Example2 Two electronic components of a missile system work in harmony
for the success of the total system. Let X and Y , the two random variables
which denote the life in hours of the two components.
Ex.3 –Both voltage & current might be interest in a certain experiment.
In some random experiments, knowledge of the values y=number of x=number of bars of signal strength
times city
of X does not change any of the probabilities associated name is stated 1 2 3 Marginal
probability
with the values for Y. distribution of
Y
If two random variables are independent, then 4 0.15 0.1 0.05 0.3
f ( x, y ) = f ( x). f ( y ) 3
2
0.02
0.02
0.1
0.03
0.05
0.2
0.17
0.25
at x and y respectively
and f(y) is marginal probability at y i) Obtain the marginal probability distribution of x and y
ii) P(x=1 ∩ y=3)
iii)P(x=2 ∩ y=2)
The conditional probability distribution is also probability iv)P(x=2 ∩ y≤3)
mass function.(PMF) v) P(x+y <4)
vi)P(x=1/ y=1)
vii) Are x and y independent random variable
Example
EX2: The joint probability distribution of random variables x and The joint probability distribution of two random variables X and Y is given by :
y is as bellow P(X=0,Y=1)=1/3, P(X=1,Y=-1)=1/3, P(X=1,Y=1)=1/3.
y\x 1 2 3
Find i) Marginal distribution of X and Y
1 1/12 1/6 0 ii) The conditional probability distribution of X given Y=1
2 0 1/9 1/5
3 1/18 ¼ 2/15
i) Obtain the marginal probability distribution of x and y
ii) Obtain the conditional probability distribution of x; given y =2
Problem-1
Ex5:The joint distribution can be given in tabular form. Let X1 and X2 have the Joint probability distribution f(x1,x2)
For example: of X1 and X2
y\x 0 1 2
0 1/9 2/9 1/9 x1
1 2/9 2/9 0 0 1 2
2 1/9 0 0 x2 0 0.1 0.4 0.1
1 0.2 0.2 0
i) Obtain the marginal probability a) Find P(X1+X2 >1)
distribution of x and y
b) Find the probability distribution f1(x1) = P (X1=x1) of the
ii) Obtain the conditional probability individual random variate X1.
distribution for y given x.
Solution
x1
0 1 2 b)Since the event X1 = 0 is composed of two pairs of
values(0,0) and (0,1) , we add their values (0,0) and (0,1) ;
x2 0 0.1 0.4 0.1
we add their corresponding probabilities to obtain
1 0.2 0.2 0
P(X1 = 0) = f(0,0) + f(0,1) =0.1 + 0.2 =0.3
continuing , we obtain P(X1= 1) = 0.6 and P(X1 = 2) = 0.1
a) The event X1+X2 >1 is composed of the pairs of ie f1(0) = 0.3, f1(1) =0.6 f1(2) = 0.1 is the probability
values (1,1), (2,0) and (2,1) distribution of X1.
Adding their corresponding probabilities
P(X1 + X2 > 1) = f(1,1) + f(2,0) + f(2,1)
= 0.2 + 0.1 + 0
=0.3
Solution
Problem-3
• Two random variables are independent and each has a • Let X and Y be two independent random variables and each
binomial distribution with success probability 0.7 and 4 trials. has a binomial distribution with success probability 0.7 and 4
a) Find the joint probability distribution trials.
b) Find the probability that the first random variable is
greater than the second.
• Here n=4, p=0.7,x=no. of successes (0,1,2,3,4)
• Also, f(x, y) = f(x) f(y) for all x ,y the two random variables are
independent.
247 248
Using binomial distribution, first we find probabilities a) Using the relation f(x, y)=f(x) f(y) we find joint probabilities
X
0 1 2 3 4
0 0.000065 0.000612 0.000214 0.000333 0.000194
1 0.000612 0.0057153 0.0200 0.031116 0.018151
Y 2 0.000214 0.0200 0.070013 0.108909 0.063530
3 0.000333 0.031116 0.108909 0.169414 0.098825
4 0.000194 0.018151 0.063530 0.098825 0.057648
Problem-4
If X has mean 8 and variance 3 while Y has mean -12.5 and variance 2.25 and
b) To find the probability that the first random variable is greater the two are independent.
than the second. Find
i.e. To find f(x>y) a) E(X-Y)
b) Var(X-Y)
f(1>y) = f(1,0)= 0.000612
f(2>y) = f(2,0)+f(2,1)= 0.000214+0.0200=0.020214
f(3>y) = f(3,0)+f(3,1)+f(3,2)=0.140358
f(4>y) = f(4,0)+f(4,1)+f(4,2)+f(4,3)=0.344059
Exercise 3
Exercise 2
• X and Y are two rv’s having the joint pmf
a) If X and Y are independent with 𝑝𝑋 0 = 0.5, 𝑝𝑋 1 = 0.3, 1
𝑓 𝑥, 𝑦 = (2𝑥 + 𝑦), where x and y can assume only the
27
𝑝𝑋 2 = 0.2 and 𝑝𝑌 0 = 0.6, 𝑝𝑌 1 = 0.1,
integer values 0,1 and 2.
𝑝𝑌 2 = 𝑝𝑌 3 = 0.05 and 𝑝𝑌 4 = 0.2.
Find the conditional distribution of Y for X=x.
Display the joint pmf of (X, Y).
b) Compute 𝑃 𝑋 ≤ 1 𝑎𝑛𝑑 𝑌 ≤ 1 from joint pmf and verify that it
equals the product 𝑃 𝑋 ≤ 1 ∙ 𝑃(𝑌 ≤ 1).
c) What is 𝑃(𝑋 + 𝑌 = 0)
d) Compute 𝑃(𝑋 + 𝑌 ≤ 1)
BITS Pilani
Pilani Campus
Joint Distribution (Continuous rv ‘s)
Let X and Y be continuous rv’s. A joint probability density function 𝑓(𝑥, 𝑦) for these two
variables is a function satisfying
1) 𝑓 𝑥, 𝑦 ≥ 0
∞ ∞
2) −∞ −∞ 𝑓(𝑥, 𝑦) 𝑑𝑥 𝑑𝑦 = 1
Also,
𝑃 𝑎 ≤ 𝑋 ≤ 𝑏, 𝑐 ≤ 𝑌 ≤ 𝑑 = න න 𝑓(𝑥, 𝑦) 𝑑𝑦 𝑑
Problem solving Continuous Joint distribution. Suggested 𝑎 𝑐
problems from Sec. 5.10
Marginal Probabilities
Two continuous random variables X and Y are said to be independent if for every
The marginal probability density function of X and Y, denoted by
pair of x and y values
𝑓𝑋(𝑥) and 𝑓𝑌(𝑦) are given by
∞ 𝑓 𝑥, 𝑦 = 𝑓𝑋(𝑥) ∙ 𝑓𝑌(𝑦)
𝑓𝑋 𝑥 = −∞ 𝑓(𝑥, 𝑦) 𝑑𝑦 for −∞ < 𝑋 < ∞
∞
If it is not satisfied, then X and Y are said to be dependent.
𝑓𝑌 𝑦 = −∞ 𝑓(𝑥, 𝑦) 𝑑x for −∞ < Y < ∞
Joint Cumulative distribution Problem 1
Let X and Y be continuous rv’s. If the joint probability density of two random variables
is given by
Let 𝑓(𝑥, 𝑦) be a joint probability density function for these two variables.
−2x −3x
f( x 1 , x 2 ) = 6 e 1 2 for x 1 > 0 , x 2 > 0
Then the joint cumulative distribution function is given by
𝑥 𝑦 = 0 else where.
𝐹(𝑥, 𝑦) = න න 𝑓 𝑥, 𝑑𝑥𝑑𝑦
−∞ −∞
a) The first random variable will take on value between 23 −2x −3x
a) 6e 1 2 dx1dx2 = ( e −2 - e −4 ) (e −6 - e −9 )
1and 2 and the second random variable will take on a 12
value between 2 and 3.
b) The first random variable will take on a value less than = 0.0003
2 and the second random variable will take on a value
2 −2x −3x
greater than 2 b) 6e 1 2 dx1dx2 = ( 1-e −4 ) e −6 = 0.0025
0 2
Solution
Q.A candy company distributed boxes of chocolates with a mixture of creams, toffees, and nuts
coated in both light and dark chocolate. For a randomly selected box, let X and Y, respectively, be
1 1
2
the proportions of the light and dark chocolates that are creams and suppose that the joint density a)
f ( x, y ) d x d y = 5
(2x + 3 y)dxdy
−− 0 0
function is 0 x 1, 0 y 1
1 x=1
2 ( 2 x + 3 y), 2x2 6xy
f (x, y) = 5
elsewhere
= 5
+
5
dy
0, 0 x=0
1
1
2 6y 2y 3y2
= 5 + 5 d y = 5 + 5
a) Verify whether f(x, y)dxdy = 1 0 0
−− 2 3
= + = 1
b) Find P[(X,Y) A], where A is the region {(x,y) | 0<x<½,¼<y<½} 5 5
c) Find g(x) and h(y) for the joint density function.
Solution
Solution
By definition,
b) P ( X , Y ) A = P ( 0 X 12 , 14 Y 1
) y =1
4x + 3
2 1
2 4xy 6y2
= 2 2 2
1 1
g(x) = f (x, y)dy = (2x + 3y)dy = + =
1 0 5 ( 2 x + 3 y ) d x d y − 0
5 5 10 y =0
5
4
1
2
2x2 6xy
x= 1
2 For 0x 1, and g(x)=0 elsewhere.
= 5
+
5
dy
4(1+ 3y)
1
1
x=0 2
h( y) = f (x, y)dx = (2x + 3y)dx =
4
1 1 Similarly, 5
1
2
3y y 3y 2 2
−
5
=
0
+ dy = +
1
10 5 10 10
4
1
4 For 0 y 1, and h(y)=0 elsewhere.
1 1 + 3 − 1 + 3 = 1 3
= 2
10 4 4 1 6 1 6 0
Problem 2 Solution
If the joint probability density of two random variables is given by x x
1 2 −2x −3x
F(x1,x2) = 6e 1 2 dx1dx2
−2x −3x o 0
f( x1 , x2 ) = 6 e 1 2 for x1 > 0 , x2 > 0
for x1>0 , x2 >0
= 0 : else where
=0 else where.
Find the joint cumulative distribution function of the two random −2x −3x
variables. Also find the probability that both random variables will So that F(x1,x2) = (1-e 1 )(1-e 2)
take on values less than 1. for x1 >0 , x2 >0
= 0 ; else where
Problem 3
If t h e joint pro ba bilit y d e n s i t y o f t w o r a n d o m v a r i a b l e s is g i v e n
The probability that both random variables will take on by
values less than 1 is
−2x −3x
1 1 −2x −3x f( x 1 , x 2 ) = 6 e 1 2 for x 1 > 0 , x 2 > 0
F(1,1) = 6 e 1 2 dx1dx2
o 0
= 0 else where.
= (1-e −2)(1-e −3) = 0.8216
F i n d t h e m a r g i n a l d e n s i t y of t h e first r a n d o m variable
Solution Problem 5
Integrating out x 2 we get
If t h e joint p r o b a b ilit y d e n s i t y o f t w o r a n d o m v a r i a b l e s is g i v e n b y
−2x −3x
f 1 (x 1 ) = 6e 1 2 dx for x 1 > 0
2 −2x −3x
0 f( x 1 , x 2 ) = 6 e 1 2 for x 1 > 0 , x 2 > 0
=0 elsewhere
−3x
e
= 0 else where.
2
f1 (x 1 ) = 6e -2x 1 −3
verif y t h a t f(x 1 ,x 2 ) = f 1 (x 1 ) .f 2 (x 2 ) . (i.e. E s t a b l i s h i n g i n d e p e n d e n c e b y
0
f a c t o r i n g t h e join t pr o bab ilit y d ensity)
=-2 e -2x 1 ( e - - e0)
Solution
Integrating out x 2 we get
In the similar way,
−2x −3x −2x
1
f 1 (x 1 ) = 6e 1 2 dx for x 1 > 0 f2(x2) = 6e-3x2 ( e ) for x2>0
0 2 −2 0
=0 elsewhere
−3x
e 2 =0 elsewhere
f1 (x 1 ) = 6e -2x 1 −3
0
f2(x2) = 3 e-3x2 for x2>0
-
=-2 e 1 ( e - e )
-2x 0
= 0 elsewhere
f 1 (x 1 ) = 2e -2x 1 for x 1 >0
= 0 else where
Problem 6
Thus,
If two random variables have the joint probability density
f 1 (x 1 ) . f 2 (x 2 ) = 6 e - 2 x 1 -3 x 2 for x1>0, x2>0
f(x1, x2) = 2 ( x + 2x ) for 0 < x1 < 1 , 0 < x2 < 1
3 1 2
= 0 elsewhere
= 0 else where
Solution
The conditional density of the first random variable given
that the second takes on the value x 2 is given by
First we have to find the marginal density of the
second random variable by integrating out x1 then we 2 ( x + 2x ) 2x + 4x
get f 1 (x 1 /x 2 ) = 3 1 2 = 1 2 for 0 < x 1 < 1, 0 < x 2 <1
1 (1+ 4x ) 1 + 4x
3 2 2
1
f2(x2) = 2 ( x + 2x )dx = 1 (1+ 4x ) for 0 < x2 < 1 and
0
3 1 2 1 3 2 and
= 0 , else where . f 1 (x 1 /x 2 ) = 0 for x 1 0 or x 1 1
x 2 0 or x 2 1
Exercise 1 Exercise 2
If two random variables have the joint density If two random variables have the joint density
Find the probability that 0.2<x<0.5 and 0.4<y<0.6 Find joint cumulative distribution function of two rv and use it to
verify the value obtained in the previous problem.
BITS Pilani
Pilani Campus
Exercise 3
If two random variables have the joint density
➢Sampling is a process of selecting samples (small data) from populations (Big or Huge or
Contact Session -11:M5- Sampling distribution
Massive data). The main purpose of sampling is to analyze populations in an optimum manner
References
and draw valid inferences on them with the help of samples.
Time Type Description
Pre- CS-11 RL 5.1, RL 5.2 RL 5.1: Population and sample, random sample, parameters T1Sec 6.1, 6.2, 6.3
and statistics ➢Sampling is widely used in business as a means of gathering useful information about a
RL 5.2: Central limit theorem, Sampling distribution of mean population.
and variance, t-distribution
During CS-11 CS-11 Problem solving. Suggested problems from Sec. 6.2 and Sec. T1 ➢Data are gathered from samples and conclusions are drawn about the population as a part of
6.3 to cover examples on sampling distribution of mean and
the inferential statistics process
variance, t-distribution, x2 - distribution
➢A sample provides a reasonable means for gathering useful decision-making information that
Post-CS-11 HW Chapter6, Sec. 6.1, 6.2, 6.3 T1
might be otherwise unattainable and unaffordable.
Lab
• A house wife checks by taking three or four grains of rice (sample) from a cooking
pan of whole rice (population) and decides whether the whole rice in the cooking
pan properly cooked or not. Taking a sample instead of conducting a census offers several
advantages
• A pathologist examines a few drops of blood (sample) taken from the blood of
whole body(population) of a patient and decides the level of illness of patient. 1.The sample can save money.
• A quality control engineer checks the quality of few items (sample) in regular 2.The sample can save time.
intervals that are taken from lots of items (population) in the manufacturing process
and decides whether the lot is meeting with the specified standard quality or not. 3.For given resources, the sample can broaden the scope of the
In all the above instances, we can’t directly deal with populations and here sampling is study.
inevitable, that is compulsory and unavoidable and it gives satisfactory results in
understanding the populations in a proper manner. 4.If accessing the population is impossible, the sample is the only
option.
Population • If we collect the data relating to one or more characteristics from all units of
Population is an aggregate or totality of units or objects or things or individuals
the population then it becomes big data or huge data or massive data.
under statistical investigation with respect to one or more characteristics. It is a
• Theoretically, a population may be assumed to follow any of probability
complete or whole set of collection of targeted units for study. The targeted
distribution such as Binomial or Poisson of Normal or Exponential etc.
units may be human beings or non-human beings of interest. A population is
distributions.
also called Universal set.
in a finite population is denoted by N. infected area, the number of stars in the sky etc. furnish infinite populations.
❑ Numbers of students in a class is 50 here N=50, which is finite. That is population finite. Ex:
The number of stars in the sky. Here N is infinite; therefore, the population is an infinite population.
Example:
Sampling: How many different samples of size n=2 can be chosen from a finite population
of size
The process of selecting samples is called sampling.
A) N=7
Note:
A sample with less than 30 terms is called small B) N=24
sample. Answer:
A sample with more than or equal to 30 terms is A) With replacement 7^2
called large sample. Without 7C 2
B) With replacement 24^2
Without 24C 2
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Pilani Campus
Population of Wages of employees of an organization Select different samples of varied sizes
Sample 1
1861 2495 1000 2497 1865 791 2090 2637 1327 1678 3000 2486 820 1678 2070 2638 2490 1865 1000 2090 596 3200
1680 2858 795 2495 2496 2501 1160 1480 1860 2490
Sample 2
2090 2840 2490 2640 659 827 2646 2638 2643 868 2840 2858 3000 2490 2998 3050 2070 2896 3200 2490 3280
1327 1866 1861 2486 2865 3011 2494 1489 1865 2855 Sample 3
2858 3240 2497 2865 656 2093 934 1861 868 795
2840 2499 2093 2660 1165 2600 2085 2640 2998 1861
2956 2495 2865 1865 3000 3019 1670 2858 2642 1680 Sample 4
2086 1000 2497 596 656 875 2085 934 1313
3038 3000 1313 596 656 3240 590 2501 2485 3015
Sample 5
2092 1679 3024 2497 2825 2630 2070 2900 1861 2636
820 1313 3000 2640 596 2640 2600 2495 934 2500
2495 2637 2497 1159 2640 3050 870 2896 2500 2638
926 2860 1481 875 2482 1860 2086 934 3200 2490
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Sample 7
The measures of population namely mean
2858 2490 868 1670 1480 2643 1480 1680 2085 2490 ( ) variance ( 2 ) , standard deviation ( ) are
Sample 8 known as population parameters or
parameters.
2495 2858 1861 2092 2499 3000 2660 1000 1679 926 2660
Sample 9
795 791 3200 2085 2638 2497 2486 1159 2640
The measures computed from the sample
observations namely mean ( x )
variance ( s2 ) standard deviation (s) are
Sample 10
3019 3240 3200 3050 3000 3015 2900 2896 2998
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Sampling Variability
➢The term "sampling variability" refers to the fact that the statistical information from
a sample (called a statistic) will vary as the random sampling is repeated.
Sampling distribution
➢Sampling variability will decrease as the sample size increases.
The probability distribution of a statistic calculated on
➢the samples must be randomly chosen, must be of the same size (not smaller than
the basis of a random sample. 30), and the more samples that are used, the more reliable the information gathered
will be.
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Pilani Campus
Definition
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
ത
Sampling Distribution Of 𝒙 Example
➢ Suppose a small finite population consists of only N = 8 numbers: 54 55 59 63 64 68 69 70
➢ Suppose we take all possible samples of size n = 2 from this population with replacement.
Example
➢Notice that the shape of the histogram for sample means is quite unlike the shape of the histogram for the population.
➢The sample means appear to “pile up” toward the middle of the distribution and “tail off” toward the extremes.
➢As sample sizes become much larger, the sample mean distributions begin to approach a normal distribution and the
variation among the means decreases.
➢The central limit theorem states that sample means are normally
• A population consist of five numbers 2,3,6,8 and 11 Consider all possible samples of size 2
distributed regardless of the shape of the population for large
that can be drawn with replacement from this population. Find a) Mean of population B)
samples and for any sample size with normally distributed
Mean of the sampling distribution of means.
populations.
➢Thus, sample means can be analyzed by using z scores
➢The formula to determine z scores for individual values from a
normal distribution:
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Pilani Campus
Example
Example
Suppose the mean expenditure per customer at a tire store is $85.00, with a standard Suppose that during any hour in a large department store, the average number of shoppers is
deviation of $9.00. If a random sample of 40 customers is taken, what is the probability 448, with a standard deviation of 21 shoppers. What is the probability that a random sample of
that the sample average expenditure per customer for this sample will be $87.00 or more? 49 different shopping hours will yield a sample mean between 441 and 446 shoppers?
Solution For this problem, µ= 448, σ= 21, and n = 49. The problem is to determine P(441<= 𝑥ҧ <=446).
Because the sample size is greater than 30, the central limit theorem
The following
Can be used, and the sample means are normally distributed.
µ= $85 σ= $9
➢The earlier example was based on the assumption that the ➢As the size of the finite population becomes larger in
population was infinitely or extremely large. relation to sample size, the finite correction factor
approaches 1.
➢In cases of a finite population, a statistical adjustment can be
made to the z formula for sample means. The adjustment is ➢Suppose population size N=1000, then
called the finite correction factor Sample size n n/N Correction factor
10 0.01 0.9955
➢Following is the z formula for sample means when samples are 25 0.025 0.9879
drawn from finite populations. 50 0.05 0.9752
100 0.1 0.9492
200 0.2 0.8949
500 0.5 0.7075
BITS Pilani, Pilani Campus BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Exercises:
Problem:
A random sample of size 100 is taken from an infinite population having mean =76 and
2 =256. what is the probability that X will be between 75and 78.
Solution:
n=100 =76 2 =256 = 16
Z =
X −
F (1.25) − F (−0.625)
n
X = 75 75 − 76 = −0.625
1
, Z = 1 = 0.89944 - 0.2676
16
100
= 0.235
X = 78 , Z = 78 − 76 = 1.25
2
2 16
100
P[ 75 X 78 ] =P[ −0.625 Z 1.25 ]
11/9/2024 326
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
A wire bonding process is said in control if the
mean pull strength is 10 pounds. It is known that
the pull-strength measurements are normally
distributed with a std. deviation of 1.5 pounds.
Periodic samples of size 4 are taken from this Z= (7.75 -10)/ (1.5/2) =- 2.25 /0.75 = -3
process and the process is said to be out of control
if a sample mean is less than 7.75 pounds. F(z) = F(-3) = 0.0013
Observe that F(z) is small.
Comment.
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
BITS Pilani
Pilani|Dubai|Goa|Hyderabad
Types of Inference:
i. Estimation
11/9/2024 SS ZG622 Software Project Management – KG Krishna 333 11/9/2024 SS ZG622: Software Project Management – KG Krishna 334
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Estimation:
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Degree of freedom (d.f./d.o.f.)
Sampling Distributions of mean for unknown σ and n < 30 : t-
distribution • It is denoted by Greek alphabet (nu)
x −
t= with (n − 1) d .o. f . • It is the number in statistical calculation and represent how
s/ n many values involved in calculation have the freedom to vary.
Where sample S.D. is calculated by formula • d.o.f. = Number of observations of sample – Number of
population parameters which are calculated using sample data .
1 n
S = i ( − )
2 2
x x
n − 1 i =1
• If there are 'n' observations , the d.o.f. will be (n-1) for
Note : For n>=30 , Normal distribution gives good approximation to t-distribution.
calculating one parameter.
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
= (15+10+30+25)/4
=20
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Critical Value Definition
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Example: 1
11/9/2024 SS ZG622 Software Project Management – KG Krishna 343 11/9/2024 SS ZG622 Software Project Management – KG Krishna 344
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Solution: Example:
• A treatment plant that sends effluent into the river claims the mean
Sample mean= 0.5060 Population mean: 0.5000 suspended solids is never above 40 mg/l. Measurements of the
Sample mean= 10 Sample S.D.=0.0040 suspended solids in river water on Sec 6.4 The Sampling
Distribution of the Variance 207 n = 14 Monday mornings yield x =
46 and s = 9.4 mg/l. Based on data collected over a period of many
years, it is reasonable to assume that the individual measurements
follow a normal distribution. Do the data support or refute the
treatment plant’s claim?
Degree of freedom= n-1=10-1=9
Since t= 4.7334> 3.250 with
level of significance = 0.005 and D.f=9
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Solution: Example:
• The tensile strength (1,000 psi) of a new composite can be
modelled as a normal distribution. A random sample of size
25 specimens has mean x = 45.3 and standard deviation s =
7.9. Does this information tend to support or refute the claim
that the mean of the population is 40.5?
11/9/2024 SS ZG622 Software Project Management – KG Krishna 347 11/9/2024 SS ZG622 Software Project Management – KG Krishna 348
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Solution Chi-square( 2)distribution
• 2distribution curve is not symmetrical, lies
entirely in first quadrant, and hence not a
normal curve.
• 2 varies from 0 to ∞.
Chi-square Distribution for variance.
s2
x ~ N ( , ) then = ( n − 1)
2
with (n − 1) d .o. f .
2
11/9/2024 SS ZG622 Software Project Management – KG Krishna 349 11/9/2024 SS ZG622 Software Project Management – KG Krishna 350
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
11/9/2024 SS ZG622 Software Project Management – KG Krishna 351 11/9/2024 SS ZG622 Software Project Management – KG Krishna 352
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Solution; Exercise
An optical firm purchases glass to be ground into lenses ,
and it is known from past experience that the variance of
the refractive index of this kind of glass is 1.26 .10-4. As it is
important that the various pieces of glass have nearly the
same index of refraction, the firm rejects such a shipment
if the sample variance of 20 pieces selected at random
exceeds 2.00.10-4 .
Assuming that the sample values may be looked upon as a
random sample from a normal population , what is the
probability that a shipment will be rejected even though
2 = 1.26 .10-4 ?
11/9/2024 SS ZG622 Software Project Management – KG Krishna 353
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Solution
Exercise
n
( X − X )2
Using 2 statistic , we get 2 = (n − 1)s2 = i =1 i
Plastic sheeting produced by a machine must be periodically monitored for
2 2
possible fluctuations in thickness. Uncontrollable variation in the viscosity
19(2.00.10−4 ) = of the liquid mold produces some variation in thickness. Based on
= 30.2 experience with a great many samples, when the machine is working well,
1.26.10− 4
an observation on thickness has a normal
distribution with standard deviation σ = 1.35 mm.
And we could see from 2 table that for 19 degrees of Samples of 20 thickness measurements are collected regularly. A value of
freedom , the sample standard deviation exceeding 1.4mm signals concern about the
product. Find the probability that, when σ = 1.35, the next sample will signal
2 0.05 = 30.1 concern about the product.
Thus the probability that a good shipment will be rejected
is less than 0.05
11/9/2024 356
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Solution Interval estimation
n
( X i − X )2
• The chi square statistic χ2 = (n − 1)s = i =1
2
Confidence Interval
2 2
• = 19 · 1.42 /1.22 = 30.6
• For 19 degrees of freedom, χ2 0.05 = 30.1. A range of values within which the population
parameter is expected to occur is called confidence
• The probability of a false signal of concern is less than
0.05. In the long run, a false signal will occur less than 5
interval and it is denoted by 1 − 𝛼
times in 100 samples.
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Confidence Interval
Confidance interval for
For large sample (n 30) when is known.
x − z / 2 . , x + z / 2 . with (1- ) .100% confidance.
n n
For large sample (n 30) when is unknown.
s s
x − z / 2 . , x + z / 2 . with (1- ) .100% confidance.
n n
For small sample (n <30) when is unknown.
s s
x − t / 2 . , x + t / 2 . with (1- ) .100% confidance.
n n
11/9/2024 361 11/9/2024 SS ZG622 Software Project Management – KG Krishna 362
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Question: 3 Solution
11/9/2024 SS ZG622 Software Project Management – KG Krishna 363 11/9/2024 SS ZG622 Software Project Management – KG Krishna 364
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Question:4 Solution
To estimate the average number of pounds of copper recovered
Per ton of ore mine, a sample of 150 tons of ores is monitored
A sample mean of 11 pounds with a sample S.D of 3 pounds was
Obtained. Construct a 95% confidence interval on the mean
numbers of pounds of copper recovered per ton of ore mined
11/9/2024 SS ZG622 Software Project Management – KG Krishna 365 11/9/2024 SS ZG622 Software Project Management – KG Krishna 366
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Question:5
A random sample of size 50 taken from a N(𝜇, 5) has a mean 40.
Obtain a 95% confidence interval for 2 𝜇+3
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Question:6
11/9/2024 SS ZG622 Software Project Management – KG Krishna 369 11/9/2024 SS ZG622 Software Project Management – KG Krishna 370
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Question:7
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Question: 8 Exercise Questions:
• 1. A random sample of size n=100 is taken from a population
with =5.1. Given that the sample mean is x =21.6,
construct a 95% confidence interval for the population mean
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
x i
n
( x − ˆ )
2
ˆ =
i
i =1
ˆ = i =1
11/9/2024
n n
375 11/9/2024 SS ZG622 Software Project Management – KG Krishna 376
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Solution; Exercise Question:
1. In one area along the interstate , the number of dropped
wireless phone connections per call follows a Poisson’s
distribution. From 4 calls, the number of dropped connection is 2
0 3 1.
• A)Find max likelihood estimate of
• B) Obtain max likelihood estimate that the next two calls will
be completed without any accidental drops.
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Question: Question:
• Given 15 yields • The number of traffic accidents in 10 randomly chosen days is
• 5.57 5.76 4.18 4.64 7.02 6.62 6.33 7.24 5.57 7.89 4.67 as below 4 0 6 5 2 1 2 0 4 3. Use these data to estimate
7.24 6.43 5.59 5.39 the proportion of day that had 2 or fewer accidents that year.
• Taking the yields as a random sample from a normal
population • Sample of 10 grains of metallic sand taken from a large sand
• A) Obtain max likelihood estimate of the mean yield and pile have respective lengths(in mm) 2.2 3.4 1.6 0.8 2.7 3.3
variance. 1.6 2.8 2.5 1.9. Estimate the percentage of sand grains in the
entire pile whose length is between 2 and 3mm.
• B) Obtain max likelihood estimate of the coefficient of
variation /
11/9/2024 SS ZG622 Software Project Management – KG Krishna 379 11/9/2024 SS ZG622 Software Project Management – KG Krishna 380
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Question; BITS Pilani
Pilani|Dubai|Goa|Hyderabad
Point Interval
Estimation Estimation
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Estimation
Point Estimation-Methods
Point Estimator
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Point Estimation
Point Estimation
Example
• Sample mean X̂ , is the point estimator for the • Let T be the time that is needed for a specific task in a factory to be
completed. In order to estimate the mean and variance of T, we observe a
mean. random sample T1,T2,⋯⋯,T6. Thus, Ti's are i.i.d. and have the same
• Point estimator for the variance is the mean of the distribution as T. We obtain the following values (in minutes):
• 18,21,17,16,24,20.
random variable Y = ( X − ) 2, given by
• Find the values of the sample mean, the sample variance, and the sample
standard deviation for the observed sample.
𝑛
1 18 + 21 + 17 + 16 + 24 + 20
𝜎ො 2 =
𝑛−1
(𝑋𝑘 − μො )2
Tˆ = = 19.33
𝑘=1 6
1 n
ˆ
S =
2
6 − 1 k =1
( X k − 19.33) 2 = 8.67
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Maximum Likelihood
Maximum Likelihood
Estimation (MLE)
• Suppose we have a random sample x1,x2, ..,xn whose • It is observed that a good estimate of
assumed probability distribution depends on some unknown
parameter θ. unknown parameter θ would be the value of θ
• Ex: that maximizes the probability
1) For Binomial unknown parameters are n, p. i.e. the likelihood of getting the data we
2) For Poisson unknown parameter is λ. observed (this is reason, why we called as
3) For Normal unknown parameters are 𝜇 𝑎𝑛𝑑 𝜎 2 . likelihood function)
Our goal is to find good estimate of θ(population parameter)
using sample and which can be done with the help of MLE.
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
• Let x1,x2, ..,xn be i.i.d. random variables drawn • The maximum likelihood estimate (MLE) of θ is that value of θ
that maximizes likelihood(θ).
from some probability distribution that
• It is defined as
depends on some unknown parameter θ. n
L( ) = f ( xi / )
• The goal of MLE to maximize likelihood i =1
function n
log L( ) = log f ( xi / )
𝐿 𝜃 = 𝑓(x1,x2, ..., xn 𝜃 i =1
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
11/9/2024 393 11/9/2024 394
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Maximum Likelihood for a Binomial Maximum Likelihood
distribution Example 3:
• Suppose we wish to find the maximum likelihood • Consider a sample 0,1,0,0,1,0 from a binomial distribution, with the form
P[X=0]=(1-p), P[X=1]=p. Find the maximum likelihood estimate of p.
estimate (MLE) of θ for a Binomial distribution,
pk (k , ) = nCk k (1 − ) n − k • Soln :
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Maximum Likelihood
Example 5:
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
Maximum Likelihood
Example 6:
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
SSWT ZC111 P & S
CONTACT SESSION 14
BITS Pilani
Pilani Campus
Hypothesis is a decision making statement framed about the The hypothesis formulated for the purpose of its rejection under the assumption that it is true is called the Null hypothesis denoted by 𝑯𝟎 .
assumption referred to as hypothesis which may or may not be 𝑯𝟏 : There is significant difference between process A and B
true. Example:2 To test whether there is a relationship between two variates, we can formulate
𝑯𝟎 : There is no relationship between variates
There are two types of hypothesis 𝑯𝟏 : There is relationship between variates
1. Null hypothesis
2. Alternative hypothesis
409 410
➢ To illustrate the general concepts involved in deciding whether or not a statement about the
➢ It instructs a member of its research staff to paint each of 36
population is true or false, suppose that a customer protection agency wants to test a paint
boards using a different 1- gallon can of paint, with the intention
manufacturer’s claim that the average drying time of his new “ fast-drying ” paint is 20
of rejecting the claim if the mean of the drying time exceeds
minutes.
20.75 minutes .
➢ It instructs a member of its research staff to paint each of 36 boards using a different 1-
gallon can of paint, with the intention of rejecting the claim if the mean of the drying
time exceeds 20.75 minutes .
➢ Otherwise it will accept the claim, in either case it will take
➢ Otherwise it will accept the claim, in either case it will take whatever action is called for in whatever action is called for in its plans.
its plans.
412
413
➢ Level of significance is the probability level in which null hypothesis is rejecting in this level.
➢ By setting the level of significance at a small value, you are saying that you want the
Three types of hypothesis tests
right-tailed test
➢ Commonly used levels of significance:
two-tailed test
= 0.10 = 0.05 = 0.01
The type of test depends on the region of the sampling distribution that favors a
For example, if 0.05 or 5% level of significance is chosen in designing a test of a hypothesis,
rejection of H0.
then there are about 5 chances in 100 that we would reject the hypothesis when it should
be accepted. That is, we are about 95% confident that we have made the right decision, This region is indicated by the alternative hypothesis.
z
-3 -2 -1 0 1 2 3 z
-3 -2 -1 0 1 2 3
Test
statistic Test
statistic
p = 0.94 Condition of
equality
Condition of equality
Identifying Hypotheses
Identifying Hypotheses
Example: For each claim, state H0 and H1. Then determine whether the Example: For each claim, state H0 and H1. Then determine whether the
hypothesis test is a left-tailed, right-tailed, or two-tailed test. Sketch a normal
sampling distribution and shade the area for the P-value. hypothesis test is a left-tailed, right-tailed, or two-tailed test. Sketch a
A university publicizes that the proportion of its students who graduate in 4 normal sampling distribution and shade the area for the P-value.
years is 82%.
A water faucet manufacturer announces that the mean flow rate of a
certain type of faucet is less than 2.5 gallons per minute.
Solution:
H0: p = 0.82 Solution:
H1: p ≠ 0.82 H0: μ ≥ 2.5 gpm P-value area
½ P-value area ½ P-value area
H1: μ < 2.5 gpm
Two-tailed test z
z Left-tailed test -z 0
-z 0 z
1. State the claim mathematically and verbally. Identify the null and
Example: For each claim, state H0 and H1. Then determine whether the
alternative hypotheses.
hypothesis test is a left-tailed, right-tailed, or two-tailed test. Sketch a normal
sampling distribution and shade the area for the P-value. H0: ? Ha: ?
2. Specify the level of significance. This sampling distribution is based
A cereal company advertises that the mean weight of the contents of its 20- α= ? on the assumption that H0 is true.
ounce size cereal boxes is more than 20 ounces. 3. Determine the standardized
Solution: sampling distribution and
P-value area
H0: μ ≤ 20 oz draw its graph. z
0
H1: μ > 20 oz 4. Calculate the test statistic
0 z
z and its standardized value.
Right-tailed test Add it to your sketch. z
0
Test statistic
430
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
7. Write a statement to interpret the decision in the context of the original 2. 0.01?
claim. Solution:
Because 0.0132 > 0.01, you should fail to reject the null hypothesis.
431
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
Example: Finding the P-value Example: Finding the P-value
Find the P-value for a left-tailed hypothesis test with a test statistic of z = -1.99.
Find the P-value for a two-tailed hypothesis test with a test statistic of z = 1.82. Decide
Decide whether to reject H0 if the level of significance is α = 0.01.
whether to reject H0 if the level of significance is α = 0.05.
Solution: Solution:
For a left-tailed test, P = (Area in left tail) For a two-tailed test, P = 2(Area in tail of test statistic)
1 – 0.9656 =
P = 0.0233 P = 2(0.0344)
0.0344
0.9838 = 0.0688
z z
-1.99 0 0 1.82
Because 0.0233 > 0.01, you should fail to reject H0 Because 0.0688 > 0.05, you should fail to reject H0
SOLUTION
EXCERCISES
( 7.42)
n=64 , μ = 190, 𝑋ത = 187
1. A company wants to establish that the mean life of its batteries , when The test statistic (for large sample) is
used in a wireless mouse, is over 187 days. The data will consist of the life
z = (x̅ - μ) / (σ /√n) , so that
lengths of batteries in 64 different wireless mice.
a) Formulate the null and the alternative hypotheses. z α / 2 . (σ /√n) = E, which is the maximum error.
b) If the true mean is 190 days , what error can be made? (a) (i) Null hypothesis 𝐻0 : μ = 187 (ii) Alternative hypothesis 𝐻1 : μ > 187
(b) The Error is E = z α / 2 .σ / √n = x̅- μ
When μ = 190, E is 187 -190 = -3, implies that
➢ The error is of type II.
➢ If it erroneously rejects the null hypothesis that the dam is safe, it would commit the
type I error.
➢ If it erroneously accepts the null hypothesis that the dam is safe, it would commit
2. Suppose that an engineering firm is asked to check the safety of a type II error.
➢ In the classical theory of testing hypothesis we consider cost factors and other
dam. What type of error would it commit if it erroneously rejects the considerations that are partly arbitrary and partly subjective as they will affect the
choice of a sample size, the choice of an alternative hypothesis , the choice of α & β,
null hypothesis that the dam is safe? What would the likely impact of and so forth.
these errors be? ➢ In this approach , the maximum value of α ( the probability of falsely rejecting the null
hypothesis) is controlled.
➢ Even when costs are difficult to determine but the consequences of one error are
much more serious than for the other, the hypotheses should be labeled so that the
most serious error is the type I error. This would be the likely impact of these errors.
2. Suppose that a consumer protection agency wants to test a paint a) The size is large and hence the statistic is given
by
manufacturer’s claim that the average drying time of his new “ fast - z = (x̅ - μ) / (σ /√n)
= (20.50 – 20) / (2.4 / √36)
drying “ paint is 20 minutes . Assume that the S.D. of such drying time
= 0.5 / 0.4 = 1.25
is 2.4 minutes. If the no. of boards painted is 36 so that the paint It follows from a standard normal curve table that
the area of the shaded region is = Area to the right
manufacturer’s claim is rejected for the sample mean which may of Z = 1.25 ( i.e, to the right of
x̅ = 20.50 ), as z is positive.
exceed 20.50 minutes , then find = 0.5 – (Area between μ = 20 & x̅ = 20.5)
=0.5 – 0.3994 (i.e.,area between z=0 & z=1.25
(a) the probability of a Type I error; From the table)
=0.1056. Hence, the probability of erroneously
(b) the probability of a Type II error when μ = 21 minutes. rejecting the claim (i.e, the probability of a Type I
error) is 0.1056.
Now, alternative hypothesis μ < 100 pounds on the basis of the weights
Z = (20.50 – 21) / (2.4 / √36)
of a random sample of n = 40 packages shipped by truck. The
= - 0.5 / 0.4 = -1.25
∴ The area of the shaded region = Area to the left of x̅ = 20.5(∵ z is population has σ = 12 pounds . For what values of x̄ must the null
negative)
hypothesis be rejected if the probability of a Type I error is to be α
= 0.5 – (Area between x̅ = 20.50 & μ = 21 )
= 0.5 – 0.3994 ( area between z= - 1.25 & z=0, from the table) = 0.01?
= 0.1056
SOLUTION
NOTE : 1. If = 0.05, the dividing lines, or critical values, of the criteria are –1.645 and 1.645 for the one-sided
Here the sample is large and so the statistics must be z, given by , alternatives, and
z = (x̄ − μ) / (σ/√ n) = (x̄ - 100) / (12 /√40) –1.96 and 1.96 for the two-sided alternative.
Here given, Ho : μ = 100 2. If = 0.01, the dividing lines of the criteria are –2.33 and 2.33 for the one-sided alternatives, and –2.575
H1 : μ < 100 ( i.e, a one –tailed test) and 2.575 for the two-sided alternative.
We know that for a one –tailed test, Z 0.01 = 2.33 . Example: Suppose that we want to test on the basis of n = 35 determinations and at the
Since H1 is μ < 100 ( less than relation), we have that z = -2.33 0.05 level of significance whether the thermal conductivity of a certain kind of cement
i.e, We have (x̄ -100) / (12 / √40) = - 2.33 brick is 0. 340, as has been claimed. From the information gathered in similar studies, we
i.e., (x̄ -100) √40 = - 2.33 x 12 can expect that the variability of such determinations is given by = 0.010 and by
i.e., x̄ (√40 ) - 100 √40 = - 27.96 experiment the mean of the 35 determinations is 0.343.
(or) x̄ (√40 ) = 100 √40 - 27.96
= 632.46 – 27.96 Solution: Step 1: Null hypothesis: = 0.340
= 604.5 Alternative hypothesis: 0.340
Therefore, x̄ = 604.5 / √40 = 95.579
Thus, for x̄ < 95.58, the null hypothesis will be rejected (when α = 0.01 ).
Step 2: Level of significance: = 0.05.
Step 3: Criterion: Reject the null hypothesis if Z < -1.96 or Z > 1.96, where Solution: 1. Null hypothesis: 28,000 miles
X − Alternative hypothesis : < 28,000 miles.
2. Level of significance: 0.01.
z= 0
3. Criterion: Since the probability of Type-I error is greatest when = 28,000 miles, we proceed as if we were testing the null
/ n hypothesis = 28,000 miles against the alternative hypothesis < 28,000 miles at the 0.01 level of significance. Thus, the null
Step 4: Calculations: hypothesis must be rejected if Z < -2.33, where with replaced by S.
0.343 − 0.340
z= = 1.77
27, 463 − 28, 000
0.010 / 35 4. Calculations: z = = −2.52
1,348 / 40
Step 5: Decision: Since Z = 1.77 falls on the interval from –1.96 to 1.96, the null hypothesis cannot be
5. Decision: Since Z = -2.52 is less than –2.33, the null hypothesis must be rejected at level of significance 0.01. In other words, the
rejected. That is, the difference between µ = 0.343 and = 0.340 can be attributed to chance. trucking firm’s suspicion that
< 28,000 miles is confirmed.
BITS Pilani
Pilani|Dubai|Goa|Hyderabad
450
➢ Following the outline of the preceding section, we begin with ➢ Returning now to the example dealing with the
steps 1 and 2 by writing thermal conductivity of the cement bricks where
α = 0.05, suppose that the mean of the 35
1. Null hypothesis: μ = 0.340 determinations is 0.343
Alternative hypothesis: μ ≠ 0.340 3. So we continue by writing
2. Level of significance: α = 0.05
(Two sided test) Criterion:
The test statistics is Z = (x̅ − μ0) / (σ /√n)
Z = (0.343 − 0.340) / (0.010 / √35) = 1.77 .
452
Solution:
The observed value of the test statistic is ➢ We thus understand that the P-value is the probability of obtaining a value
t = (3.9 − 4.5) / (1.5/ √25) = −2.00 for the test statistic that is as extreme or more extreme than the value
actually observed, where the probability is calculated under the null
Since the alternative is two-sided, we must consider large
hypothesis.
positive values of t as well as large negative values.
From the t table, P value > 0.025
➢ In many practical situations if σ is unknown but the sample size is large, we
must make the approximation of substituting for it, the sample standard
deviation S .
➢ The specifications for a certain kind of ribbon call for a mean breaking
Statistic for small sample test strength of 180 pounds. If five pieces of the ribbon (randomly selected from
concerning mean (normal population) different rolls) have a mean breaking strength of 169.5 pounds with a
standard deviation of 5.7 pounds, test the null hypothesis μ = 180 pounds
t = (x̅ − μ0) / (S / √n) against the alternative hypothesis μ > 180 pounds at the 0.01 level of
significance. Assume that the population distribution is normal.
➢ which is a random variable having the t distribution with (n − 1)
degrees of freedom. The criteria for the one sample t test are
like those for Z but are based on the t distribution.
SOLUTION
STEP 1:
EXERCISES: Null Hypothesis H0 : There is no significant difference & let
μ = 73.2.
1. The mean of ball bearings manufactured by a company is 73.2 inches, with STEP 2:
S.D of 8.6 inches. If a sample of 45 is chosen at random and its mean is Alternative Hypothesis H1: There is significant difference
76.7 inches, test the significant difference between the means, as to which such that μ > 73.2 (One – tailed test).
one is superior at 1% level of significance.
STEP 3:
Since the sample is larger, the test statistic is
Z = (x̄ - μ) / (σ / √n)
= (76.7 – 73.2) / (8.6 / √45)
i.e, z = (3.5 x 6.7) / 8.6 = 2.72
SOLUTION (CONTINUED)
EXERCISES (CONTINUED)
STEP 4:
Let us assume that α = 5% = 0.05 For (n – 1 =) 5 degrees of
freedom, t0.05 = 2.015
EXERCISES (CONTD).
SOLUTION :
(7.57) 9 measurements were made on a key performance indicator as follows: xi (xi - x̄) (xi - x̄)2
x̄ = 1026 / 9
123 106 114 128 113 109 120 102 111 123 9 81
= 114
(a) Conduct a test of hypothesis with the intent of showing that the mean key 106 -8 64
n=9
performance indicator is different from 107. Take α = 0.05 and assume a
114 0 0
normal population.
128 14 196
(b) Based on your conclusion in part (a), what error could you have made?
113 -1 1
109 -5 25
120 6 36
102 -12 144
111 -3 9
556
s2 = Σ ( xi - x̄ )2 / (n -1 ) (7.60)
= 556 / 8 = 69.5 In 64 randomly selected hours of production, the mean and the standard
s = √69.5 = 8.33 deviation of the number of acceptable pieces produced by an automatic
STEP 1: Null Hypothesis Ho : μ = 107 stamping machine are x = 1,038 and s = 146. At the 0.05 level of
significance, does this enable us to reject the null hypothesis μ = 1,000
Alternative Hypothesis H1: μ ≠ 107 (two=tailed test)
STEP 2: t = (x̄ - μ) / (s / √n)
against the alternative hypothesis μ > 1,000?
= ( 114 – 107) x √9 / 8.33 = 21 / 8.33 = 2.52
STEP3: We know that (for α = 0.05) t0.05 for 8 d.f = 2.306
STEP4: Since, calculated value of t > tabulated t value, H0 is rejected.
b) Type I error.
GIVEN: STEP 3:
x̄ = 1038 s = 146 & n= 64 Since the sample is larger, the test statistic is
STEP 1: Z = (x̄ - μ) / (s / √n)
Null Hypothesis H0 : There is no significant difference. = (1038 – 1000) / (146 / √64)
Given μ = 1000. i.e, z = (38 x 8) / 146 = 304 / 146 = 2.08
STEP 2: STEP 4:
Alternative Hypothesis H1: There is significant difference. Given α = 5% = 0.05
i.e, μ > 1000 ( one – tailed test) We know that Zα = 1.645 (for a one-tailed test).
STEP 5:
Since the calculated value of z is > the tabulated value , NULL HYPOTHESIS A builder claims that heat pumps are installed in 70% of all
H0 IS REJECTED. homes being constructed today in the city of Bangalore.
STEP 6: DECISION:
Would you agree with this claim if a random sample of
Thus, we find that there is significant difference between the sample and
population mean. new homes in this city shows that 28 out of 55 had heat
pumps installed? What P-value and confidence interval
are related in this situation?
Lecture 16 480
BITS Pilani, Pilani Campus
CURVE FITTING Method of Least Squares `
Let 𝑦 = 𝑓(𝑥) any function where 𝑥 is independent variable & 𝑦 be the Where a and b are the constants to be determined Let 𝑦𝑖
dependent variable. For 𝑥 = 𝑥1 , 𝑥2 , … , 𝑥𝑛 we can find the corresponding be the value of 𝑦 when 𝑥 = 𝑥𝑖 . Then value of 𝑌𝑖 is called
𝑦 values as 𝑦 = 𝑦1 , 𝑦2 , … , 𝑦𝑛 then the pairs of values the estimated value of 𝑦𝑖
𝑥1 , 𝑦1 , 𝑥2 , 𝑦2 , . . , 𝑥𝑛 , 𝑦𝑛 given 𝑛 points on the curve 𝑦 = 𝑓(𝑥) .
Generally it is not possible to find the actual curve when 𝑛 pairs of
values are given we try to find an approximate curve which most suits 𝑦𝑖 = 𝑎 + 𝑏𝑥𝑖 − − − ②
the given data. Such a curve is referred to as the curve of best fit. The
process of determining the curve of best fit is called curve fitting. The Let 𝑠 = σ 𝑦𝑖 − 𝑌𝑖 2
method generally employed for curve fitting is the method of least
squares . 2
Method of Least Squares Using ① ⟹ 𝑠 = σ 𝑦𝑖 − 𝑎 − 𝑏𝑥𝑖 −−−③
Suppose we wish to have a straight line that serves as best
approximation to the actual curve 𝑦 = 𝑓(𝑥) passing through 𝑛 given We determine 𝑎 & 𝑏 so that 𝑠 is minimum (least) the
points 𝑥1 , 𝑦1 , 𝑥2 , 𝑦2 , . . , 𝑥𝑛 , 𝑦𝑛 . This line be referred to as the line of 𝜕𝑠 𝜕𝑠
necessary conditions for this are = 0 and = 0
best fit and we take its equation as 𝑦 = 𝑎 + 𝑏𝑥−− − ① 𝜕𝑎 𝜕𝑏
481 482
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
Σ(𝑥𝑖𝑦𝑖−𝑎𝑥𝑖−𝑏𝑥𝑖)=0
𝒚 = 𝒂 + 𝒃𝒙 −−−−①
Σ𝑥𝑖𝑦𝑖=𝑎Σ𝑥𝑖+𝑏 Σ𝑥𝑖− − − ⑤
Normal equations to ① are
These ④ & ⑤ are known as normal equations which
are two simultaneous equations for finding 𝑎 & 𝑏
σ 𝑦 = 𝑛𝑎 + 𝑏 σ 𝑥 −−−− ②
substituting these values of 𝑎 & 𝑏 in ① we get the
equation of line of best fit for the given data
σ 𝑥𝑦 = 𝑎 σ 𝑥 + 𝑏 σ 𝑥 2 −−−− ③
483 484
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
Correlation
If 𝑥 and 𝑦 are any two variables such that decrease or increase in one
variable may result in decrease or increase in the other variable. If 𝑥
increases 𝑦 also increases or if 𝑥 decreases 𝑦 also decreases then we
say that 𝑥 & 𝑦 are positively correlated or directly correlated, If 𝑥
increases 𝑦 decreases or if 𝑥 decreases 𝑦 increases then 𝑥 and 𝑦 are
said to be negatively correlated or inversely correlated.
Example: The demand (𝑥) and price (𝑦) are directly correlated
supply (𝑥) & price (𝑦) are inversely correlated.
This correlation is measured by quantity known as correlation coefficient
which is defined as
From the above table σ 𝑥 − 𝑥ҧ 𝑦 − 𝑦ത
② ⟹15=5𝑎+𝑏(18)
𝑟=
𝑛 𝜎𝑥 𝜎𝑦
③ ⟹71=𝑎(18)+𝑏(170) Solving these equations we get 𝑎 = 1.646 and
𝑖𝑒 5𝑎+18𝑏=15 − − −(𝑖) b= 0.3761
18𝑎+110𝑏=71− − −(𝑖𝑖)
485 486
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
487 488
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
Regression
489 490
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
Regression Lines
491 492
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
493 494
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
495 496
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
497 498
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
499 500
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
501 502
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
503 504
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
505 506
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
507 508
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
509 510
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
511 512
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
513 514
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
515 516
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
517 518
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
519 520
BITS Pilani, Pilani Campus BITS Pilani, Pilani Campus
521
BITS Pilani, Pilani Campus