0% found this document useful (0 votes)
57 views26 pages

BCS301 Module 4

Uploaded by

patilchinmay510
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
57 views26 pages

BCS301 Module 4

Uploaded by

patilchinmay510
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 26

4.

1 Sampling variable, central limit theorem and confidence limits


Sampling variable: Variables sampling is a process used to predict the value of a specific
variable within a population.
Central limit theorem: The sample mean follows approximately the normal distribution with
mean and standard deviation .
That is, , where and are mean and standard deviations of the population from
where the sample was selected and the sample size becomes large ( .
Z value of a standard normal distribution:

Probability of a sample mean in an interval:

Confidence limits:

Lower limit is and upper limit is .


Confidence interval is
1. Using central limit theorem evaluate where represents the mean of
a random sample of size 100 from an infinite population with mean and
variance

By data,

By central limit theorem . That is,


For a standard normal distribution,
Therefore,

1
2. An unknown distribution has a mean of 90 and a standard deviation of 15. Samples of
size are drawn randomly from the population. Find the probability that The
sample mean is between 85 and 92 using central limit theorem.

By data,

By central limit theorem . That is,


For a standard normal distribution,
Therefore,

3. A random sample of size 64 is taken from an infinite population having mean 112 and
variance 144. Using central limit theorem find the probability of getting the sample
mean greater than 114.5.

By data,

By central limit theorem . That is,


For a standard normal distribution,
Therefore,

4. An electric lamp manufacturers light bulbs that have a length of life that is
approximately normally distributed with a mean of 800 hours and a standard
deviation of 40 hours. Using the central limit theorem, find the probability that the
random sample of 16 bulbs will have an average life of less than 775 hours.

By data,

By central limit theorem . That is,


For a standard normal distribution,
Therefore,

2
4.2 Test of significance - t test
Working rule:
 Write the null hypothesis H0.

 Find the calculated value using

 Find the critical value using the table at degrees of freedom.


 If calculated value < critical value, Accept H0. H0 is the conclusion.
 If calculated value > critical value, Reject H0. H1 is the conclusion.
Notation: Mean S.D

Sample
Population

1. A Machinist making engine parts with axle diameter of 0.7 inches. A random sample
of 10 parts shows mean diameter 0.742 inches with a SD of 0.04 inches. Based on this
sample, would you say that the work is inferior at 5% level of significance?
[

Since n = 10, small sample, apply t test.


Null hypothesis:
H0: The work is not inferior.
Calculated value:
By data,

Critical value:
Level of significance
Degrees of freedom

Comparison:
Calculated value and Critical value
Therefore, Calculated value Critical value
Conclusion: Reject H0. The work is inferior.

3
2. The nine items of the sample have the following values: 45, 47, 50, 52, 48, 47, 49, 53
and 51. Does the mean of these differ significantly from the assumed mean of 47.5?
[

Since n = 9, apply t test.


Null hypothesis:
H0: There is no significant difference from the assumed mean 47.5
Calculated value:
By data,

Critical value:
Level of significance
Degrees of freedom

Comparison:
Calculated value
Critical value
Calculated value Critical value
Conclusion:
Accept H0. There is no significant difference from the assumed mean 47.5

4
3. A random sample of 10 boys had the following IQ: 70, 120, 110, 101, 88, 83, 95, 98,
107, 100. Does this data support the hypothesis that the population mean of IQ’s is
100 at 5% level of significance? [
Since n = 10, apply t - test.
Null hypothesis:
H0: The population mean of IQ’s is 100.
Calculated value:

Critical value:
Level of significance
Degrees of freedom

Comparison:
Calculated value
Critical value
Calculated value Critical value
Conclusion:
Accept H0. The population mean IQ is 100 at 5% level of significance.

4. A certain stimulus administered to each of 12 patients resulted in the following


increases of blood pressure: 5, 2, 8, -1, 3, 0, -2, 1, 5, 0, 4, 6. Can it be concluded that
the stimulus will in general be accompanied by an increase in blood pressure?
[

Since n = 12, apply paired t - test.


Null hypothesis:
H0: The stimulus will not increase in blood pressure.
Calculated value:

5
7

Critical value:
Level of significance
Degrees of freedom

Comparison:
Calculated value
Critical value
Calculated value Critical value
Conclusion:
Reject H0. The stimulus will increase blood pressure.

5. Eleven students were given a test in statistics. They were given a month’s further
tuition and a second test of equal difficulty was held at the end of it. Do the marks
give evidence that the students have benefitted by extra coaching?
Boys: 1 2 3 4 5 6 7 8 9 10 11
Marks I test: 23 20 19 21 18 20 18 17 23 16 19
Marks II test: 24 19 22 18 20 22 20 20 23 20 17
[

n = 11, apply paired t test.


Null hypothesis:
H0: The students did not have benefitted by extra coaching.
Calculated value:

6
Critical value:
Level of significance
Degrees of freedom

Comparison:
Calculated value
Critical value
Calculated value Critical value
Conclusion:
Accept H0. The students did not have benefit by extra coaching.

6. A group of boys and girls were given an intelligent test. The mean score SD’s and
numbers in each group are as follows:
Mean S.D n
Boys 124 12 18
Girls 121 10 14
Is the mean score of boys significantly different from that of girls?
[

Since = 18, = 14, apply two samples t- test.


Null hypothesis:
H0: the mean score of boys does not differ significantly from that of girls.
Calculated value:
By data,

Critical value:
Level of significance
Degrees of freedom

7
Comparison:
Calculated value
Critical value
Calculated value Critical value
Conclusion:
Accept H0. The mean score of boys does not differ significantly from that of girls.

7. The means of two random samples of sizes 9 and 7 are 196.42 and 198.82 respectively.
The sum of squares of the deviations from the mean are 26.94 and 18.73 respectively.
Can the sample be considered to have been drawn from the same normal population?
[
Since apply two samples t test.
Null hypothesis:
H0: sample is drawn from the same normal population.
Calculated value:
By data,

Critical value:
Level of significance
Degrees of freedom

Comparison:
Calculated value
Critical value
Calculated value Critical value
Conclusion:
Reject H0. Sample is not drawn from the same normal population.

8
8. From a random sample of 10 pigs fed on diet A, the increases in weight in a certain
period were 10, 6, 16, 17, 13, 12, 8, 14, 15, 9 lbs. For another random sample of 12
pigs fed on diet B, the increases in the same period were 7, 13, 22, 15, 12, 14, 18, 8, 21,
23, 10, 17 lbs. Test whether diets A and B differ significantly as regards their effect on
increases in weight? [

Since , apply two samples t test.


Null hypothesis:
H0: diets A and B do not differ significantly.
Calculated value:

By data,

Critical value:
Level of significance
Degrees of freedom

Comparison:
Calculated value
Critical value
Calculated value Critical value
Conclusion:
Accept H0. Diets A and B do not differ significantly.

9
Home work:
9. A machine is expected to produce nails of length 3 inches. A random sample of 25 nails
gave an average length of 31. Inches with standard deviation 0.3. Can it be said that the
machine is producing nails as per specifications? Given
10. Two horses A and B were tested according to the time (seconds) to run a particular race
with the following results:
Horse A 28 30 32 33 33 29 34
Horse B 29 30 30 24 27 29
Test whether you can discriminate between the two horses.
Given that for 11 degrees of freedom
11. Two types of batteries are tested for their length of life and the following results were
obtained:
Battery A Battery B
Mean 500 500
Variance 100 121
Sample size 10 10

Check whether there is a significant difference between two means. [ ]


12. A sample of 12 measurements of the diameter of a metal ball gave the mean 7.38 mm
with standard deviation 1.24 mm. Find 99% confidence limits for actual diameter.
[ ]
Note: Confidence limits for the mean are

13. A group of 10 boys fed on a diet A and another group of 8 boys fed on a different diet B
for a period of six months recorded the following increase in weight (lbs).

Diet A 5 6 8 1 12 4 3 9 6 10
Diet B 2 3 6 8 10 1 2 8
Test whether diet A and B differ significantly regarding their effort on increase in
weight

10
t-table

11
5.4 Test for goodness of fit
--------------------------------------------------------------------------------------------------------
Working rule:
Write observed frequency.
Find expected frequency
Write the null hypothesis The expected frequency distribution is a good fit to the
observed frequency distribution.

Calculated value: Under H0,

Where Observed frequency or tabulated frequency


Expected frequency or theoretical frequency
Critical value:
Level of significance (Always upper tailed)
number of terms, number of constraints.

Degrees of freedom . Where

Use table to get the critical value.


Comparison:
If calculated value < critical value, accept Reject otherwise.
Conclusion:
The expected frequency distribution is a good fit to the observed frequency distribution.
(or)
The expected frequency distribution is not a good fit to the observed frequency distribution.
--------------------------------------------------------------------------------------------------------

12
1. A die is thrown 60 times and the frequency distribution for the number appearing
on the face x is given by the following table:

: 1 2 3 4 5 6

: 15 6 4 7 11 17

Test the hypothesis that the die is unbiased.


Solution:

Observed frequency By data,

Expected frequency Since the die is unbiased,

Therefore,

Null hypothesis The die is unbiased.

Calculated value: Under .

1 15 10 25 2.5
2 6 10 16 1.6
3 4 10 36 3.6
4 7 10 9 0.9
5 11 10 1 0.1
6 17 10 49 4.9
13.6
Critical value:

Level of significance and Degrees of freedom

Comparison:

Calculated value and Critical value

Calculated value Critical Value.

Conclusion: Reject The die is not unbiased.

13
2. The following table gives the number of road accidents that occurred in a large city
during the various days of a week. Test the hypothesis that the accidents are
uniformly distributed over all the days of a week. [

Day: Sun Mon Tue Wed Thu Fri Sat Total


No. of accidents: 14 16 8 12 11 9 14 84
Solution:

Observed frequency By data,

Expected frequency: Accidents are uniformly distributed over all 7 days.

, where

Therefore,

Null hypothesis

The accidents are uniformly distributed over all the days of a week.

Calculated value: Under

Sun 14 12 4
Mon 16 12 16
Tue 8 12 16
Wed 12 12 0
Thu 11 12 1
Fri 9 12 9
Sat 14 12 4

Critical value:

Level of significance and Degrees of freedom

Comparison: Calculated value and Critical value

Calculated value Critical Value.

Conclusion: Accept
The accidents are uniformly distributed over all the days of a week.

14
3. A set of 5 similar coins is tossed 320 times and the result is
Number of heads: 0 1 2 3 4 5
Frequency: 6 27 72 112 71 32
Test the hypothesis that the data follows a binomial distribution. [ ]

Solution:

Observed frequency By data,

Expected frequency: No. of coins

Null hypothesis The data follows Binomial distribution.

Calculated value: Under

0 6 10 16 1.6
1 27 50 529 10.58
2 72 100 784 7.84
3 112 100 144 1.44
4 71 50 441 8.82
5 32 10 484 48.4
78.68
Critical value: Level of significance - the number of frequencies available.
Degrees of freedom

Comparison: Calculated value and Critical value

Calculated value Critical Value.

Conclusion: Reject The data does not follow Binomial distribution.

15
4. Fit a Poisson distribution to the following data and test for its goodness of fit at
level of significance 0.05.

0 1 2 3 4

419 352 154 56 19 [ ]

Solution:

Observed frequency: By data,

Expected frequency:

Therefore,

Null hypothesis The data follows Poisson distribution.

Calculated value: Under

0 419 405+1 169 0.4033


1 352 366 196 0.5355
2 154 165 121 0.7333
3 56 50 36 0.7200
4 19 11+2 36 2.7692
5.1613
Numbers added in only to preserve .

Critical value: Level of significance

Number of frequencies . Number of constraints

Degrees of freedom

Comparison: Calculated value and Critical value

Calculated value Critical Value.

Conclusion: Accept The data follows Poisson distribution.

16
5. Genetic theory states that children having one parent of blood type M and other
blood type N will always be one of the three types M, MN, N and that the
proportion of these types will on average 1:2:1. A report says that out of 300
children having one M parent and one N parent, 30% were found to be type M ,
45% of type MN and remainder of type N. Test the hypothesis by test.

Observed frequency: By data,

Expected frequency:

Null hypothesis The proportion of these types is on average 1:2:1

Calculated value: Under

M 30% of 300
MN 45% of 300
M 25% of 300

Critical value: Level of significance

Number of frequencies . Number of constraints

Degrees of freedom

Comparison: Calculated value and Critical value

Calculated value Critical Value.

Conclusion: Accept The proportion of these types is on average 1:2:1

17
6. In experiments on Pea breading, the following frequencies of seeds were obtained:

Round Wrinkled Round Wrinkled Total


and yellow and yellow and green and green
315 101 108 32 556

Theory predicts that the frequencies should be in proportions . Examine


the correspondence between theory and experiment. [ ]

Solution:

Observed frequency: By data,

Expected frequency: Frequencies are in the proportion

Null hypothesis The frequencies are in proportion

Calculated value: Under

RY 315 4 0.0128
WY 101 9 0.0865
RG 108 16 0.1538
WG 32 9 0.2571
0.5102

Critical value: Level of significance

Number of frequencies . Number of constraints

Degrees of freedom

Comparison: Calculated value and Critical value

Calculated value Critical Value.

Conclusion: Accept The frequencies are in proportion

18
Tabulated values of

1 3.84 6.64
2 5.99 9.21
3 7.82 11.34
4 9.49 13.28
5 11.07 15.09
6 12.59 16.81
7 14.07 18.48
8 15.51 20.09
9 16.92 21.67
10 18.31 23.21

-------------------------------------------------------------------------------------------------------
7. The theory predicates the proportion of beans in the four groups
should be in the ratio In an experiment with 1600 beans the numbers in the
four groups were 882, 313, 287 and 118. Does the experimental result support the
theory? [
]
8. Records taken of the number of male and female births in 800 families having four
children are as follows:
Number of male births 0 1 2 3 4
Number of female births 4 3 2 1 0
Number of families 32 178 290 236 94
Test whether the data are consistent with the hypothesis that the binomial law holds and
the chance of male birth is equal to that of female birth. [ ]

-------------------------------------------------------------------------------------------------------------------------

19
4.3 F Distribution
Introduction:
 The F test is named in honour of the great statistician R A Fisher. The objective of the F
test is to find out whether the two independent estimates of population variance differ
significantly or whether the two samples may be regarded as drawn from the same
population having the same variance.
 For carrying out the test significance, The ratio F is defined as

where

 The calculated value of F is compared with the tabulated value for and at or
level of significance. – Degrees of freedom for sample having larger variance and
- Degrees of freedom for sample having smaller variance.
 If the calculated value is less than the tabulated value the null hypothesis is accepted and
it is inferred that both the samples come from the same population.
 Since F test is based on the ratio of two variances, it is also known as the variance ratio
test.
 The ratio of two variances follows a distribution called the F distribution.
 F- Test is based on the following assumptions:
(i) The values in each group are normally distributed.
(ii) The variance within each group should equal for all groups
(iii) The error (variation of each value around its own group mean) should be
independent for each value.

20
1. Two random samples were drawn from two normal populations and their values are:

Test whether two populations have the same variance at the level of significance.

Assume The two populations have the same variance.

To Find: and

Calculated value:

Critical value:
Degrees of freedom
At 5% level of significance,
Conclusion:
Since Calculated value Critical value, accept
Therefore, the two populations have the same variance.

21
2. In a sample of 8 observations, the sum of square of deviations of items from the mean
was 84.4. In another sample of 10 observations, the value was found to be 102.6. Test
whether the difference is significant at level.

Assume There is no significant difference in the variances of the two samples.


By data,

Therefore,

Calculated value:

Critical value:
Degrees of freedom
At 5% level of significance,

Conclusion:
Calculated value Critical value. Accept .
There is no significant difference in the variances of the two samples at 5% level of
significance.

22
3. Two samples are drawn from two normal populations. From the following data, test
whether the two samples have the same variance at 5% level:
Sample 1: 60 65 71 74 76 82 85 87
Sample 2: 61 66 67 85 78 63 85 86 88 91

Assume Two samples drawn from two normal populations have the same variance.

To Find: and

Calculated value:

Critical value:
Degrees of freedom
At 5% level of significance,
Conclusion:
Since Calculated value Critical value, accept
Therefore, the two samples taken from the two populations have the same variance.

23
4. The following data present the yields in quintals of common 10 subdivisors of equal
area of two agricultural plots:
Plot 1: 6.2 5.7 6.5 6.0 6.3 5.8 5.7 6.0 6.0 5.8
Plot 2: 5.6 5.9 5.6 5.7 5.8 5.7 6.0 5.5 5.7 5.5
Test whether two samples taken from two random populations have the same
variance. ( )
Assume Two samples drawn from two normal populations have the same variance.

To Find: and

Calculated value:

Critical value:
Degrees of freedom
At 5% level of significance,
Conclusion:
Since Calculated value Critical value, accept
Therefore, the two samples taken from the two populations have the same variance.

24
5. Two samples of sizes 9 and 8 give the sum of squares of deviations from their
respective means equal to 160 square inches and 91 square inches respectively. Can
these be regarded as drawn from the same population?

Assume Two samples are drawn from the same population.


By data,

Therefore,

Calculated value:

Critical value:
Degrees of freedom
At 5% level of significance,

Conclusion:
Calculated value Critical value. Accept .
Two samples are drawn from the same population.

25
6. Measurements on the length of a copper wire were taken in two experiments A and B
as under:
A’s Measurements (mm): 12.29 12.25 11.86 12.13 12.44 12.78 12.77 11.90 12.47
B’s Measurements (mm): 12.39 12.46 12.34 12.22 11.98 12.46 12.23 12.06
Test whether B’s measurements are more accurate than A’s. (The readings taken in
both cases being unbiased.)
Assume Both measurements have the same variance.

To Find: and

Calculated value:

Critical value:
Degrees of freedom
At 5% level of significance,
Conclusion:
Since Calculated value Critical value, accept
Therefore, Both measurements have the same variance.

26

You might also like