Unit-3 PROPORTIONS
Unit-3 PROPORTIONS
Estimations of Proportions:
x x x x
1 1
x n n x n n
z /2 p z /2
n n n n
x
The proportion of success p
n
x x x x
1 1
x n n x n n
z /2 p z /2
n n n n
36 36 36 36
1 1
36 100 100 36 100 100
1.96 p 1.96
100 100 100 100
0.266 p 0.454
p 1 p
Note that the maximum error of estimate E z /2
n
x
Here p
n
Problem 2: In a sample survey conducted in a large city, 136 of 400 persons answered
yes to the question of whether their city’s public transportation is adequate. With 99%
x 136
confidence, what can we say about the maximum error, if 0.34 is used as
n 400
an estimate of the corresponding true proportion?
x 136
Solution: Since 0.34 and confidence is 99%, then α = 0.01 and
n 400
z /2 z0.005 2.575 .
p 1 p
Then maximum error of estimate E z /2
n
E 2.575
0.34 0.66 0.061
400
Problem 3: What is the size of the smallest sample required to estimate an unknown
proportion of customers who would pay for an additional service, to within a
maximum error of 0.06 with at least 95% confidence?
x x x x
1 1
x n n x n n
z /2 p z /2
n n n n
84 84 84 84
1 1
84 200 200 84 200 200
1.96 p 1.96
200 200 200 200
0.3516 p 0.4200
p 1 p
Solution: we know that maximum error of estimate E z /2
n
p 1 p
The maximum error of estimate is E z /2
n
84 84
1
200 200
2.575
200
0.08987
Problem 6: In a random sample of 400 industrial accidents, it was found that 231
were due at least partially to unsafe working conditions. Construct a 99% confidence
interval for the corresponding true proportion using the large sample confidence
interval formula.
x x x x
1 1
x n n x n n
z /2 p z /2
n n n n
From the given data n = 400, x = 231, α = 1-0.99= 0.01, z /2 z0.005 2.575
x x x x
1 1
x n n x n n
z /2 p z /2
n n n n
15 15 51 15
1 1
15 90 90 15 90 90
1.96 p 1.96
90 90 90 90
0.1667 0.0770 p 0.1667 0.0770
0.0897 p 0.2437
Here we test the null hypothesis p p0 against one of the alternative hypothesis
p p0 , p p0 , or p p0 with the use of the statistic
X np0
Z
n p0 1 p0
Then z0.05=1.645
X np0
The null hypothesis is rejected if Z > 1.645 where the test statistic Z
np0 1 p0
X np0 48 60 0.70
Then Z 1.69
np0 1 p0 60 0.70 0.30
Since z = 1.69 is greater than 1.645, the null hypothesis is rejected. So, we accept
alternative hypothesis. That is p > 0.70 is accepted.
Then z0.05=1.645
X np0
The null hypothesis is rejected if Z > 1.645 where the test statistic Z
np0 1 p0
Since z = 2.1913 is greater than 1.645, the null hypothesis is rejected. So, we accept
alternative hypothesis. That is p > 0.30 is accepted.
Problem 10: The performance of a computer is observed over a period of 2 years to
check the claim that the probability is 0.20 that its downtime will exceed 5 hours in
any given week. Testing the null hypothesis p = 0.20 against the alternative hypothesis
p ≠ 0.20, what can we conclude at the level of significance α= 0.05, if there we only
11 weeks in which the downtime of the computer exceeded 5 hours?
Then z0.025=1.96
The null hypothesis is rejected if Z <-1.96 or Z>1.96 where the test statistic
X np0
Z
np0 1 p0
Since z = -2.391 is less than -1.96, the null hypothesis is rejected. So, we accept
alternative hypothesis. That is p ≠ 0.30 is accepted.
Problem 11: To check on an ambulance service’s claim that at least 40% of its calls
are life-threatening emergencies, a random sample was taken from its files, and it was
found that only 49 of 150 calls were life-threatening emergencies. Can the null
hypothesis p = 0.40 be rejected against the alternative hypothesis
Then z0.05=1.645
X np0
The null hypothesis is rejected if Z <-1.645 where the test statistic Z
np0 1 p0
Since z = -1.8333 is less than -1.645, the null hypothesis is rejected. So, we accept
alternative hypothesis. That is p <0.40 is accepted.
Problem 12: In a random sample of 600 cars making a right turn at a certain
intersections, 157 pulled into the wrong lane. Test the null hypothesis that actually
30% of all drivers make this mistake at the given intersection, using the alternative
hypothesis p ≠ 0.30 and the level of significance
Here we test the null hypothesis p 0.30 against the alternative hypothesis p ≠ 0.30
The null hypothesis is rejected if Z < -Zα/2 or Z > Zα/2 where the test statistic
X np0
Z
np0 1 p0
Then z0.025=1.96
X np0 157 600 0.30
Then Z 2.049
np0 1 p0 600 0.30 0.70
Since z = -2.0491 is less than -1.96, the null hypothesis is rejected. So, we accept
alternative hypothesis. That is p ≠0.30 is accepted.
Then z0.005=2.575
Since z = -2.0491 is between -2.575 and 2.575l, the null hypothesis is accepted. That
is p = 0.30 is accepted.
Conclusion: The null hypothesis is rejected with 95% confidence and accepted with
99% confidence.
In the above table x represents the total number of successes, n – x represents the total
number of failures and n the total number of trails. The entry in the cell belonging to
the ith row and jth column is called the observed frequency oij with i= 1, 2 and j = 1,
2, …, k. Let us denote the observed proportion of success by p . So, the value of p is
x
given by p .
n
Hence the expected number of successes and failures for the jth sample are estimated
by the following formulae:
x
e1 j n j p n j
n
x nx
e2 j n j (1 p) n j (1 ) n j ( )
n n
The quantities e1j and e2j are called the expected cell frequencies for j =1, 2, …, k.
The test statistic for test concerning difference among proportions is given by
2 k (oij eij ) 2
2
i 1 j 1 eij
Decision: Reject the null hypothesis if the value of exceeds with k – 1 degrees
2 2
of freedom.
Problem 13: Samples of three kinds of materials subjected t extreme temperature
changes, produced the results shown in the following table:
Material A Material B Material C Total
Crumbled 41 27 22 90
Remained intact 79 53 78 210
Total 120 80 100 300
Use the 0.05 level of significance to test whether, under the stated conditions, the
probability of crumbing is the same for the three kinds of materials.
Solution:
Null Hypothesis, H0: p1 = p2 = p3
Alternative Hypothesis, H1: p1 , p2 and p3 are not all equal.
Level of significance, α = 0.05
90
p
300
i 1 j 1 eij
Since 4.575 does not exceeds 0.05 (3 – 1) d.f = 5.991, we can’t reject the null
2 2
hypothesis at the 0.05 level of significance. Hence the probability of crumbling is the
same for the three kinds of material.
Problem 14: Four methods are under development for making disks of a
superconductivity material. Fifty disks are made by each method and they are checked
for superconductivity when cooled with liquid nitrogen.
Perform a chi square test with α = 0.05 to test whether the probability of
superconductivity is the same for the four kinds of methods.
Solution:
Null Hypothesis, H0: p1 = p2 = p3= p4
Alternative Hypothesis, H1: p1 , p2 , p3 and p are not all equal.
Level of significance, α = 0.05
120
p
200
i 1 j 1 eij
Since 19.50 exceeds 0.05 (4 – 1) d.f = 7.815, we reject the null hypothesis at the
2 2
0.05 level of significance. Hence the probability of superconductivity is not the same
for four methods.
Problem 15: The following data come from a study in which random samples of the
employees of three government agencies were asked questions about their pension
plan:
Agency 1 Agency 2 Agency 3 Total
For the pension plan 67 84 109 260
Against the pension plan 33 66 41 140
Total 100 150 150 400
Use the 0.01 level of significance to test the null hypothesis that the actual proportions
of employees favoring the pension plan are the same.
Solution:
Null Hypothesis, H0: p1 = p2 = p3
Alternative Hypothesis, H1: p1, p2 and p3 are not all equal.
Level of significance, α = 0.01
260
p
400
i 1 j 1 eij
at the 0.01 level of significance. Hence the probability for favoring the pension plan
by the three agencies is not the same.
Finally, we have to write the decision that either accepting the null hypothesis or
rejecting the null hypothesis.
Also the (1 – α)100% large sample confidence interval for the difference of two
proportions is given by
x1 x1 x2 x2
1 1
x1 x2 n1 n1 n2 n2
z / 2
n1 n2 n1 n2
Problem 16: A study shows that 16 of 200 tractors produced on one assembly line
required extensively adjustments before they could be shipped. While the same was
true for 14 of 400 tractors produced on another assembly line. At the 0.01 level of
significance, does this support the claim that the second production line does superior
work? Also construct a 95% confidence interval for p1 - p2.
Solution:
Null Hypothesis, H0: p1 = p2
Alternative Hypothesis, H1 : p1 > p2
= 2.384
From table 3, Z0.01 = 2.33
Decision: Since Z = 2.384 exceeds Z0.01 = 2.33, we have to reject the null hypothesis.
So, accept the alternative hypothesis. That is the true proportion of tractors requiring
extensive adjustments is greater for first assembly line than for the second.
A 95% confidence interval for the difference of two proportions is given by
x1 x1 x2 x2
1 1
x1 x2 n1 n1 n2 n2
z / 2
n1 n2 n1 n2
0.003 p1 p2 0.087
26 12
200 200 26 12
= with p 0.095
1 1 200 200
0.095(1 0.095)
200 200
0.07
=
(0.2932)(0.1)
= 2.3873
Z0.01 = 2.33
Decision: Since Z = 2.3873 exceeds Z0.01 = 2.33, we have to reject the null hypothesis.
So, accept the alternative hypothesis.
Problem 18: The owner of a machine shop must decide which of two snack-vending
machines to install in his shop. If each machine is tested for 250 times and the first
machine fails to work(neither delivers the snack nor returns the money) 13 times and
the second machine fails to work 7 times, test at the 0.05 level of significance whether
the difference between the corresponding sample proportions is significant.
Solution:
Null Hypothesis, H0: p1 = p2
Alternative Hypothesis, H1 : p1 ≠ p2
Given x1 = 13, x2 = 7, n1 = 250 and n2 = 250
Level of significance, α = 0.05
X1 X 2
n1 n2 X1 X 2
Test Statistic, Z with p
1 1 n1 n2
p(1 p ) ( )
n1 n2
13 7
250 250 13 7
= with p 0.04
1 1 250 250
0.04(1 0.04 )
250 250
0.024
=
(0.1959)(0.0.0844)
= 1.369
Z0.025 = 1.96
Decision: Since Z = 1.369 does not exceed Z0.025 = 1.96, we can’t reject the null
hypothesis. So, accept the null hypothesis.
Home work: A study showed that 64 of 180 persons who saw a photocopying
machine advertised during the telecast of a baseball game and 75 of 180 other persons
who saw it advertised on a variety show remembered the brand name 2 hours later.
Use the 0.05 level of significance whether the difference between the corresponding
sample proportions is significant?