0% found this document useful (0 votes)

39 views25 pages

ISO Module 4 BCS301

Uploaded by

naikmeghana369

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views25 pages

ISO Module 4 BCS301

Uploaded by

naikmeghana369

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 25

MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

MODULE-4
Statistical Inference 2: Sampling variables, central limit theorem and confidences limit for unknown
mean. Test of Significance for means of two small samples, students-‘t’ distribution, and Chi-square
distribution as a test of goodness of fit. F-Distribution.

Lecture-1 Statistical Inference-2: Sampling variables, central limit theorem.

Sampling of variables: Each member of the population gives a value of variable and the population is a
frequency distribution of variables.
Thus, a random sample of size 𝑛 from the population is same as selecting n values of variables from those of the
distribution.
Sampling distribution: The probability distribution of a statistic is called a sampling distribution.
Sampling distribution of the mean: The probability distribution of 𝑋̅ is called the sampling distribution of the
mean.
The first important sampling distribution to be considered is that of the mean 𝑋̅. Suppose that a random sample
of n observations is taken from a normal population with mean 𝜇 and variance 𝜎 2 .
Each observation of 𝑋𝑖 , 𝑖 = 1, 2, . . . , 𝑛 of the random sample will then have the same normal distribution as
the population being sampled.
1
Hence, we conclude that 𝑋̅ = (𝑋1 + 𝑋2 + ··· +𝑋𝑛 ) has a normal distribution with mean,
𝑛
1
𝜇𝑋̅ = (𝜇 + 𝜇 + ⋯ + 𝜇) = 𝜇
𝑛

𝑛 𝑡𝑒𝑟𝑚𝑠

Similarly,
2
1 2 2 2
𝜎2
𝜎 𝑋̅ = 2 (𝜎 + 𝜎 + ⋯ + 𝜎 ) =
𝑛 𝑛

𝑛 𝑡𝑒𝑟𝑚𝑠
Therefore, if a population is distributed normally with mean 𝜇 and standard deviation 𝜎, then the means of all
𝜎
positive random samples of size 𝑛, are also distributed normally with mean 𝜇 and standard error 𝑛 .
√

If we are sampling from a population with unknown distribution, either finite or infinite, the sampling
𝜎2
distribution of 𝑋̅ will still be approximately normal with mean 𝜇 and variance , provided that the sample size
𝑛
is large.
This amazing result is an immediate consequence of the following theorem, called the Central Limit Theorem.

Central limit theorem: If the variable 𝑋 has a non-normal distribution with mean 𝜇 and standard deviation 𝜎,
𝑥−𝜇
then the limiting distribution of 𝑧 = 𝜎 as 𝑛 ⟶ ∞, is the standard normal distribution.
√𝑛

This theorem holds good for a sample of 25 or more which is regarded as large.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

1
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

1. An electrical firm manufactures light bulbs that have a length of life that is approximately normally
distributed, with mean equal to 800 hours and a standard deviation of 40 hours. Find the probability that a
random sample of 16 bulbs will have an average life of less than 775 hours.
Solution: Given that 𝜇 = 800 and 𝜎 = 40.
The sampling distribution of 𝑥̅ will be approximately normal,
𝜎
with 𝜇 = 800 and standard error 𝜎𝑋̅ = 𝑛 = 40/ √16 = 10.
√
𝑥−𝜇 𝑥−800
𝑧= 𝜎 = .
10
√𝑛

probability that a random sample of 16 bulbs will have an average life of less than 775 hours is
𝑃(𝑥 < 775) = 𝑃(𝑧 < −2.5) = 0.5 − 𝐴(2.5) = 0.5 − 0.4938 = 0.0062.
2. The mean of a certain normal population is equal to the standard error of the mean of samples of 100 from
that distribution. Find the probability that the mean of the sample of 25 from the distribution will be negative.

Solution: Let 𝜇 be the mean and 𝜎 standard deviation of the distribution.

𝜎
Given that for 𝑛 = 100, 𝜇 = standard error =
√𝑛
𝜎
⟹ 𝜇 = 10 .
𝑥−𝜇 5𝑥
For 𝑛 = 25 , we have 𝑧 = 𝜎 = − 0.5
𝜎
5

Since 𝑥 < 0, 𝑧 < −0.5.

Therefore 𝑃( 𝑥 < 0) = 𝑃(𝑍 < −0.5) = 0.5 − 𝐴(0.5) = 0.5 − 0.1915 = 0.3085.

Review questions:

1. The probability distribution of 𝑋̅ is called?

2. If a population is distributed normally with mean 𝜇, then 𝑋̅ has a normal distribution with mean?
3. Standard error of the all positive random samples of size 𝑛 is?
4. Variance of the all positive random samples of size 𝑛 is?
5. Standard error of the all positive random samples of size 36 with standard deviation of the population 3 is?

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

2
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Lecture- 2 Central limit theorem. Problems.

1. Traveling between two campuses of a university in a city via shuttle bus takes, on average, 28 minutes with a
standard deviation of 5 minutes. In a given week, a bus transported passengers 40 times. What is the probability
that the average transport time was more than 30 minutes? Assume the mean time is measured to the nearest
minute.
Solution: Given that 𝜇 = 28 and 𝜎 = 5.
The sampling distribution of 𝑥̅ will be approximately normal,
𝜎
and standard error 𝜎𝑥̅ = = 5/ √40 = 0.7906.
√𝑛

𝑥−𝜇 𝑥−28
𝑧= 𝜎 = 0.7906.
√𝑛

Since the time is measured on a continuous scale to the nearest minute, an 𝑥̅ greater than 30 is equivalent to
𝑥̅ ≥ 30.5.
Probability that a random sample of 40 times will have an average transport time was more than 30 minutes is

𝑃(𝑥 > 30.5) = 𝑃(𝑧 > 3.1622) = 0.5 − 𝐴(3.1622) = 0.5 − 0.4992 = 0.0008.

2. A sample of 900 members is found to have a mean of 3.4𝑐𝑚 . Can it be reasonably regarded as truly random
sample from a large population with mean 3.25𝑐𝑚 and standard deviation 1.61cm?

Solution: Given that 𝑥 = 3.4𝑐𝑚, 𝑛 = 900, 𝜇 = 3.25 𝑎𝑛𝑑 𝜎 = 1.61𝑐𝑚.

𝑥−𝜇 3.4−3.25
∴ 𝑧= 𝜎 = 1.61 = 2.795 > 2.58.
√𝑛 30

Deviation of sample mean from the mean is significant, and hence it cannot be regarded as random sample.

3. An important manufacturing process produces cylindrical component parts for the automotive industry. It is
important that the process produce parts having a mean diameter of 5.0 millimeters. The engineer involved
conjectures that the population mean is 5.0 millimeters. An experiment is conducted in which 100 parts
produced by the process are selected randomly and the diameter measured on each. It is known that the
population standard deviation is σ = 0.1 millimeter. The experiment indicates a sample average diameter of 𝑥 =
5.027 millimeters. Does this sample information appear to support or refute the engineer’s conjecture?
Solution: H: Sample data 𝑥 = 5.027 support the conjecture 𝜇 = 5.0.
Given that 𝜇 = 5 and 𝜎 = 0.1.
The sampling distribution of 𝑥̅ will be approximately normal,
𝜎
with 𝑥 = 5.027 and standard error 𝜎𝑥̅ = = 0.1/ √100 = 0.01.
√𝑛

𝑥−𝜇 0.027
𝑧= 𝜎 = = 2.7 > 2.58.
0.01
√𝑛

Therefore, the data does not support the conjecture that 𝜇 = 5.0.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

3
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Review questions:
1. For what values of difference between the sample mean and mean of the population, null hypothesis is
rejected in 0.05 level of significance if the standard deviation of the population 5 and sample size is 100?
2. For what values of difference between the sample mean and mean of the population null hypothesis is
accepted in 0.01 level of significance if the standard deviation of the population 5 and sample size is 100?
3. Standard error of the all positive random samples is 1, and variance of the population is 25, then the size of
the sample is?

4. If a population is distributed normally with mean 𝜇, then 𝑋̅ has a normal distribution with mean?

5. For what values of difference between the sample mean and mean of the population, hypothesis that sample
mean is more than population mean is rejected in 0.05 level of significance if the standard deviation of the
population 5 and sample size is 100?
6. For what values of difference between the sample mean and mean of the population, hypothesis that sample
mean is less than population mean is accepted in 0.05 level of significance if the standard deviation of the
population 5 and sample size is 100?

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

4
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Lecture-3 Confidences limit for unknown mean.

𝑥−𝜇 𝜎 𝜎 𝜎 𝜎
|𝑧| < 𝑧0 ⟹ | 𝜎 | < 𝑧0 ⟹ 𝜇 − 𝑧0 < 𝑥 < 𝜇 + 𝑧0 and 𝑥 − 𝑧0 < 𝜇 < 𝑥 + 𝑧0 .
√𝑛 √𝑛 √𝑛 √𝑛
√𝑛

1. A soft-drink machine is regulated so that the amount of drink dispensed averages 240 milliliters with a
standard deviation of 15 milliliters. Periodically, the machine is checked by taking a sample of 40 drinks and
computing the average content. If the mean of the 40 drinks is a value within the interval 𝜇𝑥̅ ± 2𝜎𝑥̅ , the
machine is thought to be operating satisfactorily; otherwise, adjustments are made. The company official found
the mean of 40 drinks to be 𝑥 = 236 milliliters and concluded that the machine needed no adjustment. Was this
a reasonable decision?
Solution: Given that 𝜇𝑥̅ = 𝜇 = 240, 𝜎 = 15 .
𝜎
𝑛 = 40, 𝜎𝑥̅ = = 2.3717 and 𝑥 = 236.
√𝑛

If the mean of the 40 drinks is a value within the interval 𝜇𝑥̅ ± 2𝜎𝑥̅ ,
then confident limit of 𝑥 is 𝜇𝑥̅ − 2𝜎𝑥̅ ≤ 𝑥 ≤ 𝜇𝑥̅ + 2𝜎𝑥̅ ⟹ 235.26 ≤ 𝑥 ≤ 244.74
Since 𝑥 = 236, which is within the limit. Hence, yes, the decision is reasonable.
2. Traveling between two campuses of a university in a city via shuttle bus takes, on average, 28 minutes with a
standard deviation of 5 minutes. In a given week, a bus transported passengers 40 times. Find the confident
limit of the average transport time at 5% level of significance.
Solution: Given that 𝜇 = 28 and 𝜎 = 5.
The sampling distribution of 𝑥̅ will be approximately normal,
𝜎
and standard error 𝜎𝑥̅ = = 5/ √40 = 0.7906.
√𝑛
𝑥−𝜇 𝑥−28
𝑧= 𝜎 = 0.7906.
√𝑛

𝜎 𝜎
Confident limit of the average transport time is 𝜇 − 𝑧0 < 𝑥 < 𝜇 + 𝑧0
√𝑛 √𝑛

28 − 1.96 × 0.7906 < 𝑥 < 28 + 1.96 × 0.7906

That is 26.45 < 𝑥 < 29.55.
3. A sample of 900 members is found to have a mean of 3.4𝑐𝑚. Find the confident limit for mean of population
if standard deviation is 1.61cm at 0.01 level of significance.
Solution: Given that 𝑥 = 3.4 and 𝜎 = 1.61.
The sampling distribution of 𝑥̅ will be approximately normal,
𝜎
and standard error 𝜎𝑥̅ = = 1.61/ √900 = 0.0537.
√𝑛
𝜎 𝜎
Confident limit of the population mean is 𝑥 − 𝑧0 < 𝜇 < 𝑥 + 𝑧0
√ 𝑛 √𝑛

3.4 − 2.58 × 0.0537 < 𝜇 < 3.4 + 2.58 × 0.0537

That is 3.2615 < 𝜇 < 3.5385.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

5
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Review questions:
1. Difference between the sample mean and mean of the population 𝑧 < 0 , then the hypothesis that sample
mean is more than population mean is rejected or accepted?
2. Difference between the sample mean and mean of the population 𝑧 > 0 , then the hypothesis that sample
mean is less than population mean is rejected or accepted?

3. If 𝑧 < −2.33 , then the hypothesis that sample mean is less than population mean is rejected or accepted in
1% level?

4. If |𝑧| > 2.33 , is the alternate hypothesis that sample mean is less than population mean is accepted?

5. If |𝑧| > 1.645 , is the alternate hypothesis that sample mean is more than population mean is rejected?

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

6
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Lecture-4 Test of significance for means of two large samples:

Test of significance for means of two large samples:

1 1 𝑥1 ~𝑥2
Standard errors of the means of the two samples of same population is 𝑒 = 𝜎√𝑛 + 𝑛 and 𝑧 = .
1 2 𝑒

If 𝑧 > 2.58, then the sampling is not simple or samples are not drawn from the same population.
If 𝑧 > 1.96, then the difference is significant at 5% level of significance.

If independent samples of size 𝑛1 and 𝑛2 are drawn at random from two different populations, discrete or
continuous, with means 𝜇1 and 𝜇2 and variances 𝜎12 and 𝜎22 , respectively, then the sampling distribution of the
differences of means, 𝑥1 − 𝑥2 , is approximately normally distributed with mean 𝜇𝑥1 −𝑥2 = 𝜇1 − 𝜇2 ,
𝜎2 𝜎2
and standard error 𝑒 = √𝑛1 + 𝑛2 .
1 2

( 𝑥1 −𝑥2 )−( 𝜇1 −𝜇2 )

Hence, 𝑧= is approximately a standard normal variable.
𝑒

𝑥1 ~𝑥2
If the means of different populations are same then, 𝑧 = .
𝑒

Examples:
1. Two independent experiments are run in which two different types of paint are compared. Eighteen
specimens are painted using type A, and the drying time, in hours, is recorded for each. The same is done with
type B. The population standard deviations are both known to be 1. Assuming that the mean drying time is
equal for the two types of paint, find the probability that the difference 𝑥1 − 𝑥2 in the sample is at least 15
minutes, where 𝑥1 and 𝑥2 are average drying times for samples of size 18 for type A and B respectively.
Solution: Given that 𝑛1 = 𝑛2 = 18, 𝜎1 = 𝜎2 = 1 and 𝜇1 = 𝜇2 .

𝜎2 𝜎2 1 1 1
Standard error 𝑒 = √𝑛1 + 𝑛2 = √18 + 18 = 3 .
1 2

( 𝑥1 −𝑥2 )−( 𝜇1 −𝜇2 )

𝑧= = 3( 𝑥1 − 𝑥2 )
𝑒

𝑃(𝑥1 − 𝑥2 > 0.25) = 𝑃(𝑧 > 0.75) = 0.5 − A(0.75) = 0.5 − 0.2734 = 0.2266.

2. The means of simple samples of sizes 1000 and 2000 are 67.5 and 68.0 cm respectively. Can the samples are
drawn from the same population of S.D. 2.5cm?
Solution: H: Let samples are drawn from the same population of S.D. 2.5cm.

Given that 𝑥1 = 67.5, 𝑥2 = 68.0 , 𝑛1 = 1000 , 𝑛2 = 2000 𝑎𝑛𝑑 𝜎 = 2.5 .

1 1
𝑒 = 𝜎√𝑛 + 𝑛 = 0.0968 ,
1 2

𝑥1 ~𝑥2 0.5
𝑧= = 0.0968 = 5.16 > 2.58. Difference is significant, hypothesis is rejected.
𝑒

Hence the samples are not drawn from the same population.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

7
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

3. A sample of height of 6400 soldiers has a mean of 67.85 inches and S.D. 2.56 inches while a sample of
heights of 1600 sailors has a mean of 68.55 inches and S.D. 2.52 inches. Do the data indicate that sailors are on
the average taller than the soldiers? Test the hypothesis at 1% level of significance.

Solution: 𝐻1 : Sailors are on the average taller than the soldiers.

𝐻0 : Sailors and soldiers are on the average same height.

Given that 𝑥1 = 67.85, 𝑥2 = 68.55 , 𝑛1 = 6400 , 𝑛2 = 1600 , 𝜎1 = 2.56, 𝜎2 = 2.52 .

𝜎2 𝜎2
𝑒 = √𝑛1 + 𝑛2 = 0.0707 .
1 2

𝑥2 −𝑥1 0.7
and 𝑧 = = 0.0707 = 9.9010 > 2.33. (1% level of significance in one-tailed test).
𝑒

This is highly significant. 𝐻0 is rejected and hence alternate hypothesis 𝐻1 is accepted.

Therefore, sailors are on the average taller than the soldiers.

Review questions:
1. To test the two samples of same population, 𝜇1 − 𝜇2 = ?
2. To test the two samples of different populations with same mean, 𝜇1 − 𝜇2 = ?
1 1
3. If the standard error 𝑒 = 𝜎√𝑛 + 𝑛 then 𝑧 =?
1 2

𝜎2 𝜎2
4. If the standard error 𝑒 = √𝑛1 + 𝑛2 then 𝑧 =?
1 2

𝑥2 −𝑥1
5. If 𝜇1 − 𝜇2 = 0.5 , 𝑒 = 1 and 𝑧 = = 2.3 then null hypothesis is rejected or accepted in 5% level.?
𝑒
𝑥2 −𝑥1
6. If 𝜇1 − 𝜇2 = 0.5 , 𝑒 = 0.5 and 𝑧 = = 3 then null hypothesis is rejected or accepted in 1% level.?
𝑒

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

8
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Lecture-5 Test of Significance for means of two large samples problems

1. The television picture tubes of manufacturer A have a mean lifetime of 6.5 years and a standard deviation of
0.9 year, while those of manufacturer B have a mean lifetime of 6.0 years and a standard deviation of 0.8 year.
What is the probability that a random sample of 36 tubes from manufacturer A will have a mean lifetime that is
at least 1 year more than the mean lifetime of a sample of 49 tubes from manufacturer B?
Solution: Given that
A B
𝜇1 = 6.5 𝜇2 = 6
𝜎1 = 0.9 𝜎2 = 0.8
𝑛1 = 36 𝑛2 = 49

𝜎12 𝜎22 ( 𝑥1 −𝑥2 )−( 𝜇1 −𝜇2 ) ( 𝑥1 −𝑥2 )−0.5

Clearly 𝜇1 − 𝜇2 = 0.5 and standard error 𝑒 = √ + = 0.1886. 𝑧 = =
𝑛1 𝑛2 𝑒 0.1886

The probability that the mean lifetime for 36 tubes from manufacturer A will be at least 1 year longer than the
mean lifetime for 49 tubes from manufacturer B is
𝑃( 𝑥1 − 𝑥2 ≥ 1) = 𝑃(𝑧 ≥ 2.6511) = 0.5 − 𝐴(2.6511) = 0.5 − 0.4959 = 0.0041 .
2. Two independent experiments are run in which two different types of paint are compared. If someone did the
experiment 10,000 times under the condition that 𝜇1 = 𝜇2 , If the population standard deviations are both known
to be 1.0, in how many of those 10,000 experiments would there be a difference 𝑥1 − 𝑥2 that was as large as (or
larger than) 1.0?
𝜎2 𝜎2 1 1
Solution: Clearly 𝜇1 − 𝜇2 = 0 and standard error 𝑒 = √𝑛1 + 𝑛2 = √10,000 + 10,000 = 0.0141.
1 2

( 𝑥1 −𝑥2 )−( 𝜇1 −𝜇2 ) ( 𝑥1 −𝑥2 )

Then, 𝑧 = =
𝑒 0.0141

The probability that the difference 𝑥1 − 𝑥2 that was as large as (or larger than) 1.0 is
𝑃( 𝑥1 − 𝑥2 ≥ 1) = 𝑃(𝑧 ≥ 70.922) = 0.5 − 𝐴(70.922) = 0.5 − 0.5 = 0 .
Hence, none of the 10,000 experiments with a difference 𝑥1 − 𝑥2 more than 1.0.
3. The mean score for freshmen on an aptitude test at a certain college is 540, with a standard deviation of 50.
Assume the means to be measured to any degree of accuracy. What is the probability that two groups selected at
random, consisting of 32 and 50 students, respectively, will differ in their mean scores by (a) more than 20
points? (b) an amount between 5 and 10 points.
Solution: Given that, 𝜇 = 540 , 𝜎 = 50, ∴ 𝜇𝑥1 −𝑥2 = 𝜇1 − 𝜇2 = 0 .

1 1 1 1
𝑛1 = 32, 𝑛2 = 50 , 𝑒 = 𝜎√𝑛 + 𝑛 = 50√32 + 50 = 11.3192 .
1 2

( 𝑥1 −𝑥2 ) ( 𝑥1 −𝑥2 )
𝑧= = .
𝑒 11.3192

a) Probability that two groups will differ in their mean scores by more than 20 points
= 𝑃(𝑥1 − 𝑥2 > 20) + 𝑃(𝑥2 − 𝑥1 > 20)
= 2𝑃(𝑧 > 1.7669) = 2(0.5 − 𝐴(1.7669)) = 2(0.5 − 0.4616) = 0.0768.
DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.
9
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

b) Probability that two groups will differ in their mean scores by between 5 and 10 points
= 𝑃(5 ≤ |𝑥1 − 𝑥2 | ≤ 10)
= 2𝑃(0.4417 ≤ 𝑧 ≤ 0.8835) = 2(𝐴(0.8835) − 𝐴(0.4417)) = 2(0.3106 − 0.1700) = 0.2812.

Review questions:
1. To test the two samples of different populations with same mean, 𝑧 = ?
𝑥2 −𝑥1
2. To test the two samples of different populations, if 𝑧 = then standard error 𝑒 =?
𝑒

𝜎2 𝜎2
3. If the standard error 𝑒 = √𝑛1 + 𝑛2 then 𝑧 =?
1 2

𝑥2 −𝑥1
4. If 𝜇1 − 𝜇2 = 1 , 𝑒 = 1 and 𝑧 = = 3 then null hypothesis is rejected or accepted in 1% level.?
𝑒
𝑥2 −𝑥1
5. If 𝜇1 − 𝜇2 = 0.5 , 𝑒 = 0.5 and 𝑧 = = 4 then null hypothesis is rejected or accepted in 1% level.?
𝑒

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

10
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Lecture-6 (Tutorial) Central limit theorem. Problems.

1. If all possible samples of size 16 are drawn from a normal population with mean equal to 50 and standard
deviation equal to 5, what is the probability that a sample mean 𝑥 will fall in the interval from
𝜇 − 1.9𝑒 to 𝜇 − 0.4𝑒 ? Assume that the sample means can be measured to any degree of accuracy
Solution: Given that 𝜇 = 50, 𝜎 = 5 and 𝑛 = 16.
𝑥−𝜇 𝑥−50
𝑧= 𝜎 = .
1.25
√𝑛

Hence, 𝑃(𝜇 − 1.9𝑒 ≤ 𝑥 ≤ 𝜇 − 0.4𝑒) = 𝑃(50 − 2.375 ≤ 𝑥 ≤ 50 − 0.5)

= 𝑃(47.625 ≤ 𝑥 ≤ 49.5) = 𝑃(−1.9 ≤ 𝑧 ≤ −0.4)
= 𝐴(1.9) − 𝐴(0.4) = 0.4713 − 0.1554 = 0.3159 .
2. A certain type of thread is manufactured with a mean tensile strength of 78.3 kilograms and a standard
deviation of 5.6 kilograms. How is the variance of the sample mean changed when the sample size is (a)
increased from 64 to 196? (b) decreased from 784 to 49?
Solution: Given that 𝜇 = 78.3, 𝜎 = 5.6 .
𝜎2
Variance of the sample mean is .
𝑛
𝜎2
(a) If 𝑛 = 64, Variance of the sample mean is = 0.49 .
𝑛
𝜎2
If 𝑛 = 196, Variance of the sample mean is 𝑛 = 0.16 .
Therefore, Variance is reduced from 0.49 to 0.16
𝜎2
(b) If 𝑛 = 784, Variance of the sample mean is = 0.04 .
𝑛
𝜎2
If 𝑛 = 49, Variance of the sample mean is = 0.64 .
𝑛
Variance is increased from 0.04 to 0.64.
3. If a certain machine makes electrical resistors having a mean resistance of 40 ohms and a standard deviation
of 2 ohms, what is the probability that a random sample of 36 of these resistors will have a combined resistance
of more than 1458 ohms?
Solution: Given that 𝜇 = 40, 𝜎 =2.
𝜎 1
𝑛 = 36, 𝜎𝑥̅ = =3
√𝑛
𝑥−𝜇 𝑥−40
𝑧= 𝜎 = 1 = 3(𝑥 − 40)
√𝑛 3

Probability that a random sample of 36 of these resistors will have a combined resistance of more than 1458
1458
ohms = 𝑃 ( 𝑥 > 36 ) = 𝑃( 𝑥 > 40.5 ) = 𝑃(𝑧 > 1.5) = 0.5 − 𝐴(1.5) = 0.5 − 0.4332 = 0.0668.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

11
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Lecture-7 Test of Significance for means of two small samples. Students-‘t’ distribution, as a test of
goodness of fit.

𝑺𝒕𝒖𝒅𝒆𝒏𝒕’𝒔 𝒕 − 𝑫𝒊𝒔𝒕𝒓𝒊𝒃𝒖𝒕𝒊𝒐𝒏: Consider a small sample of size 𝑛, drawn from a normal population with
mean 𝜇 and S.D. 𝜎 . If 𝑥 𝑎𝑛𝑑 𝜎𝑠 be the sample mean and S.D. Then the statistic, 𝑡 is defined as
𝑥−𝜇
𝑡= √𝑛 − 1, where 𝜈 = 𝑛 − 1 denotes the degree of freedom of 𝑡.
𝜎𝑠
𝑦0
Sampling distribution for 𝑡 is called Student’s t − Distribution is given by 𝑦 = 𝜈+1 .
𝑡2 2
(1+ )
𝜈
∞
The probability 𝑃 that the value of 𝑡 will exceed 𝑡0 is 𝑃 = ∫𝑡 𝑦 𝑑𝑡
0
Where 𝑦0 is a constant such that the area under the curve is unity.

t-curve

Normal curve

0 t

Significance test of a sample mean: Given a random sample 𝑥1 , 𝑥2 , 𝑥3 , ⋯ ⋯ 𝑥𝑛 from a normal population, we
have to test the hypothesis that the mean of the population is 𝜇.
𝑥−𝜇
For this, we first calculate 𝑡 = √𝑛 − 1
𝜎𝑠
∑𝑛
1 𝑥𝑖 1 𝑛 ∑𝑛 2 𝑛
𝑖=1 𝑥𝑖 −(∑𝑖=1 𝑥𝑖 )
2
Where, 𝑥 = , 𝜎𝑠 2 = 𝑛−1 ∑𝑛1(𝑥𝑖 − 𝑥)2 = .
𝑛 𝑛(𝑛−1)

Then find the value of 𝑃 for the given d.f. from the table.
If the calculated 𝑡 > 𝑡0.05 , the difference between 𝑥 and 𝜇 is said to be significant at 5% level of significance.
If 𝑡 > 𝑡0.01 , the difference between 𝑥 and 𝜇 is said to be significant at 1% level of significance.
If 𝑡 < 𝑡0.05 , the data is said to be consistent with the hypothesis.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

12
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Critical values of 𝑡, for two tail tests.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

13
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Examples:
1. A certain stimulus administered to each of 12 patients resulted in the following increases of blood pressure:
5, 2, 8, -1, 3, 0, -2, 1, 5, 0, 4, 6. Can it be concluded that the stimulus will in general be accompanied by an
increase in blood pressure.
Solution: 𝐻1 : The stimulus will increase the blood pressure.
𝐻0 : The stimulus does not change the B.P.
Taking the population to be normal with mean 0 and S.D. 𝜎 .
∑𝑛
1 𝑥𝑖
𝑥= = 2.5833,
𝑛
1
𝜎𝑠 2 = ∑𝑛1(𝑥𝑖 − 𝑥)2
𝑛−1
1
= [5.8404 + 0.3402 + 29.3406 + 12.84 + 0.1736 + 6.6734 + 21.0066 + 2.5068 + 5.8404 + 6.6734 + 2.0070 + 11.6738]
11

= 9.5378
𝑛 ∑𝑛 2 𝑛
𝑖=1 𝑥𝑖 −(∑𝑖=1 𝑥𝑖 )
2 12×185−312
Or, 𝜎𝑠 2 = = = 9.5379.
𝑛(𝑛−1) 12×11

∴ 𝜎𝑠 = 3.0883
𝑥−𝜇 2.5833−0
Now 𝑡= √𝑛 − 1 = √11 = 2.7743.
𝜎𝑠 3.0883

For 𝜈 =11, from the table 𝑡0.05 = 1.8. for single-tailed test.
Since, 𝑡 > 𝑡0.05, the difference between 𝑥 and 𝜇 is said to be significant at 5% level of significance.
Therefore hypothesis is rejected, that is the stimulus will increase the B.P.

Review questions:
1. Find 𝑡0.025 for 𝜈 = 8 in one tailed test.
2. Find 𝑡0.05 for 𝜈 = 30 in one tailed test.
3. Find 𝑡0.1 for 𝜈 = 9 in one tailed test.
4. Find 𝑡0.005 for 𝜈 = 12 in one tailed test.
5. The t-curve is symmetrical about the line .

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

14
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Lecture-8 Students-‘t’ distribution, problems

1. The nine items of a sample have the following values: 45, 47, 50, 52, 48, 47, 49, 53 and 51. Does the mean of
these differ significantly from the assumed mean of 47.5? Test the hypothesis in 5% significant level.
Solution: 𝐻1 : 𝑥 ≠ 𝜇 = 47.5 .
𝐻0 : 𝑥 = 𝜇 = 47.5.
∑𝑛
1 𝑥𝑖
∑𝑛1 𝑥𝑖 = 442, ∑𝑛𝑖=1 𝑥𝑖 2 = 21762, ∴ 𝑥 = = 49.1111,
𝑛
𝑛 ∑𝑛 2 𝑛
𝑖=1 𝑥𝑖 −(∑𝑖=1 𝑥𝑖 )
2 9×21762−4422
𝜎𝑠 2 = = = 6.8611.
𝑛(𝑛−1) 9×8

∴ 𝜎𝑠 = 2.6194, given that 𝜇 = 47.5 .

𝑥−𝜇 49.1111−47.5
𝑡= √𝑛 − 1 = √8 = 1.7395.
𝜎𝑥 2.6196
For 𝜈 = 8, 𝑡0.05 = 2.31 , since 𝑡 < 𝑡0.05 , the value of t is not significant at 5% level of significance. Thus
the test provides no evidence against the population mean being 47.5.

2. Ten individuals are chosen at random from a population and their heights in inches are found to be 63, 63,
66, 67, 68, 69, 70, 70, 71, 71. Test the hypothesis that the mean height of the universe is 66 inches.
(For d.f. 9, 𝑡0.05 = 2.262 )

Solution: 𝐻: 𝑥 = 𝜇 = 66.
∑𝑛
1 𝑥𝑖
∑𝑛1 𝑥𝑖 = 678, ∑𝑛𝑖=1 𝑥𝑖 2 = 46050, ∴ 𝑥 = = 67.8,
𝑛
𝑛 ∑𝑛 2 𝑛
𝑖=1 𝑥𝑖 −(∑𝑖=1 𝑥𝑖 )
2 10×46050−6782
𝜎𝑠 2 = = = 9.0667.
𝑛(𝑛−1) 10×9

∴ 𝜎𝑠 = 3.0111, given that 𝜇 = 66 .

𝑥−𝜇 67.8−66
𝑡= √𝑛 − 1 = √9 = 1.7934.
𝜎𝑠 3.0111
For 𝜈 = 9, 𝑡0.05 = 2.262 , since 𝑡 < 𝑡0.05 , the value of t is not significant at 5% level of significance.
Accept the hypothesis. Thus the test provides no evidence against the population mean being 66 inches.

3. A machinist is making engine parts with axle diameter of 0.7 inch. A random sample of 10 parts shows mean
diameter 0.742 inch with a S.D 0.04 inch. On the basis of this sample, would you say that the work is inferior?
Solution: 𝐻1 : 𝑥 ≠ 𝜇 = 0.7 (The work is inferior)
𝐻0 : 𝑥 = 𝜇 = 7. (The work is not inferior)
i.e. there is no significant difference between 𝑥 & 𝜇 .
Given that, 𝑥 = 0.742, 𝜇 = 0.7, 𝜎𝑠 = 0.04, 𝑛 = 10.
𝑥−𝜇 0.742−0.7
𝑡= √𝑛 − 1 = √9 = 3.15.
𝜎𝑠 0.04

For 𝜈 = 9, 𝑡0.05 = 2.262 , since 𝑡 > 𝑡0.05 , the value of t is significant at 5% level of significance.
This implies that 𝑥 differs significantly from 𝜇 and null hypothesis is rejected. Hence the work is inferior.

4. For a random sample of 16 values with mean 41inches, and the sum of the squares of the deviations from the
mean is 135 𝑖𝑛𝑐ℎ𝑒𝑠 2 . Estimate the 95% confident limits for the mean of the population.
(𝑡0.05 = 2.13 for𝜈 = 15)
DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.
15
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Solution: Given that, 𝑥 = 41, ∑𝑛1(𝑥𝑖 − 𝑥)2 = 135, 𝑛 = 16 . ∴ 𝜈 = 15.

1 135
𝜎𝑠 2 = 𝑛−1 ∑𝑛1(𝑥𝑖 − 𝑥)2 = = 9. ∴ 𝜎𝑠 = 3.
15
𝜎𝑠
|𝑡| < 2.13 ⟹ |𝑥 − 𝜇| < 2.13
√𝑛−1

⟹ |𝑥 − 𝜇| < 1.65, approximately.

⟹ 41 − 1.65 < 𝜇 < 41 + 1.65
Or 39.35 < 𝜇 < 42.65.

Review questions:
1. The 𝑡-test is applicable to samples for which 𝑛 is .
2. The t-curve attains its maximum value at .
3. Find 𝑡0.0005 for 𝜈 = 13 in one tailed test.
4. Find 𝑡0.0005 for 𝜈 = 30 in one tailed test.
5. Find 𝑡0.1 for 𝜈 = 15 in one tailed test.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

16
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Lecture-9 Students-‘t’ distribution, Significance test of difference between sample means.

𝑥~𝑦 1 1
Significance test of difference between sample means: 𝑡 = , where 𝑒 = 𝜎√𝑛 + 𝑛
𝑒 1 2

𝑛 𝑛
∑1 1 𝑥𝑖 ∑1 2 𝑦𝑖
Where 𝑥= , 𝑦= ,
𝑛1 𝑛2

𝑛 𝑛 𝑛 𝑛
1 𝑛 𝑛 1 𝑛1 ∑1 1 𝑥𝑖 2 −(∑1 1 𝑥𝑖 )2 𝑛2 ∑1 2 𝑦𝑖 2 −(∑1 2 𝑦𝑖 )2
𝜎2 = 𝑛 {∑1 1(𝑥𝑖 − 𝑥)2 + ∑1 2 (𝑦𝑖 − 𝑦)2 } = 𝑛 { + }.
1 +𝑛2 −2 1 +𝑛2 −2 𝑛1 𝑛2

1
For the different standard deviation, 𝜎 2 = 𝑛 {(𝑛1 − 1)𝜎𝑥 2 + (𝑛2 − 1)𝜎𝑦 2 }
1 +𝑛2 −2

If the two samples are of same size then,

1 2 𝑛 ∑𝑛 2 𝑛 2
1 𝑑𝑖 −(∑1 𝑑𝑖 ) 𝑑−0
𝜎 2 = 𝑛−1 ∑𝑛1(𝑑𝑖 − 𝑑) = and 𝑡 = √𝑛 , where 𝑑 = 𝑥 − 𝑦 .
𝑛(𝑛−1) 𝜎

1. Two horses A and B were tested according to the time (in seconds) to run a particular race gives the
following results

Horse A 28 30 32 33 33 29 34
Horse B 29 30 30 24 27 29

Test whether you can discriminate between the two horses. (For, 𝜈 = 11, 𝑡0.05 = 2.2, 𝑡0.02 = 2.72)
𝑛 𝑛
𝑛 ∑1 1 𝑥𝑖 ∑1 2 𝑦𝑖
Solution: ∑1 1 𝑥𝑖 = 219, ∑𝑛1 2 𝑦𝑖 = 169 𝑥= = 31.2857, 𝑦 = = 28.1667 ,
𝑛1 𝑛2

∑𝑛1 1 𝑥𝑖 2 = 6883, ∑𝑛1 2 𝑦𝑖 2 = 4787

𝑛 𝑛 𝑛 𝑛
1 𝑛1 ∑1 1 𝑥𝑖 2 −(∑1 1 𝑥𝑖 )2 𝑛2 ∑1 2 𝑦𝑖 2 −(∑1 2 𝑦𝑖 )2
𝜎2 = 𝑛 { + } = 5.2965. 𝜎 = 2.3014.
1 +𝑛2 −2 𝑛1 𝑛2

1 1
𝑒 = 𝜎√𝑛 + 𝑛 = 1.2804.
1 2

𝐻1 : There is a discriminate between the two horses, i.e. 𝑥 ≠ 𝑦

𝐻0 : There is no discriminate between the two horses, i.e. 𝑥 = 𝑦
𝑥~𝑦
𝑡 = 𝑒 = 2.4360. ⟹ 𝑡0.05 < 𝑡 < 𝑡0.02 . (𝜈 = 𝑛1 + 𝑛2 − 2 = 11)
Therefore the discrimination between the horses is significant at 5% level but not at 2% level.

2. A group of boys and girls were given an intelligence test. The mean score, S.D.s and numbers in each group
are as follows.
Boys Girls
Mean 124 121
S.D. 12 10
𝑛 18 14

Is the mean score of boys significantly different from that of girls?

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

17
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Solution: Let 𝑥 be the boy’s score, 𝑦 be girl’s core.

𝐻1 : Mean score of boys significantly different from that of girls. 𝑥 ≠ 𝑦
𝐻0 : Mean score of boys are not significantly different from that of girls. 𝑥 = 𝑦
Given that, 𝑥 = 124, 𝑦 = 121 , 𝜎𝑥 = 12, 𝜎𝑦 = 10, 𝑛1 = 18 , 𝑛2 = 14 .
1
𝜎2 = 𝑛 {(𝑛1 − 1)𝜎𝑥 2 + (𝑛2 − 1)𝜎𝑦 2 }
1 +𝑛2 −2
1
= 30 {17 × 144 + 13 × 100} = 124.9333.

1 1
∴ 𝜎 = 11.1774. 𝑒 = 𝜎√𝑛 + 𝑛 = 3.9830
1 2

𝑥~𝑦 124−121
𝑡= = = 0.7532 < 2.04 = 𝑡0.05 . (for 𝜈 = 30).
𝑒 3.9830

Null hypothesis is accepted. Mean score of boys are not significantly different from that of girls.
3. Eleven school boys were given a test in drawing. Further they were given a month’s tuition and a second test
of equal difficulty was held at the end of it. Do the marks give the evidence that students have benefitted by
extra coaching? ( For d.f. 𝜈 =10, 𝑡0.05 = 1.812 for one tailed test)
Boys 1 2 3 4 5 6 7 8 9 10 11
I-test 23 20 19 21 18 20 18 17 23 16 19
II-test 24 19 22 18 20 22 20 20 23 20 17
Solution:

∑ 𝑑 = 11, ∑ 𝑑2 = 61
∑𝑑 𝑥2 𝑥1 𝑑 = 𝑥2 − 𝑥1
𝑑= =1 24 23 1
𝑛
1 2 𝑛 ∑𝑛 2 𝑛 2 11×61−112
19 20 -1
1 𝑑𝑖 −(∑1 𝑑𝑖 )
𝜎𝑠 2 = 𝑛−1 {∑𝑛1(𝑑𝑖 − 𝑑) } = = =5. 22 19 3
𝑛(𝑛−1) 11×10
18 21 -3
∴ 𝜎𝑠 = 2.2361 . 20 18 2
𝐻1 : Students had benefitted by extra coaching. 𝑑 > 0, or ̅̅̅
𝑥2 > ̅̅̅
𝑥1 22 20 2
𝐻0 : Students have not been benefitted by extra coaching. 𝑑 ≤ 0. 20 18 2
Then the mean of the difference between the marks 𝜇 = 0. 20 17 3
𝑑−𝜇 √11 23 23 0
𝑡= √𝑛 = 2.2361 = 1.4832 < 𝑡0.05 = 1.812. 20 16 4
𝜎𝑠
17 19 -2
Hence difference is not significant. Accept 𝐻0 , reject 𝐻1 .
∑ 𝑑 = 11
There is no evidence that the students have benefitted by extra coaching.

Review questions:
1. If the two samples are of same size then expected value of difference is?
2. If the two samples are of same size, and are of different standard deviations, then 𝜎 2 =?
3. If the two samples are of same size 𝑛, then 𝜎 2 =?
4. If the two samples are of same size 𝑛, to test ̅̅̅
𝑥1 = 𝑥
̅̅̅2 the degree of freedom is?
5. If the two samples are of same size 𝑛, to test mean of differences is zero, the degree of freedom is?

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

18
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Lecture-10 Chi-square distribution as a test of goodness of fit.

CHI-SQUARE (𝝌𝟐 ) TEST: If 𝑂𝑖 and 𝐸𝑖 are observed and expected frequencies for 𝑖 = 1,2 ⋯ 𝑛.
(𝑂𝑖 −𝐸𝑖 )2
Then 𝜒2 = ∑ with 𝑛 − 1 degrees of freedom.
𝐸𝑖
𝜒2 𝜈−1
2 −
The equation of 𝜒 curve is 𝑦 = 𝑦0 𝑒 2 ( 𝜒2) 2 , where 𝜈 = 𝑛 − 1.

Goodness of fit: The value of 𝜒 2 is used to test whether the deviations of the observed frequencies from
theoretical frequencies are significant or not.

Mean of 𝜒 2 distribution is degree of freedom 𝜈 = 𝑛 − 1, and variance is 2𝜈.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

19
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Examples:
1. In experiments on pea breeding, the following frequencies of seeds were obtained.
Round and Wrinkled Round and Wrinkled Total
yellow and yellow green and green
315 101 108 32 556
Theory predicts that the frequencies should be in proportions 9: 3: 3: 1. Examine the correspondence between
theory and experiment.
9 3 3 1
Theoretical frequencies are 16 × 556, 16 × 556, 16 × 556, 16 × 556 .
i.e. 313, 104, 104, 35.
(𝑂𝑖 −𝐸𝑖 )2 4 9 16 9
𝜒2 = ∑ = 313 + 104 + 104 + 35 = 0.5103 .
𝐸𝑖
2 2
For d.f. 𝜈 = 𝑛 − 1 = 3, 𝜒0.05 = 7.815. Since calculated value of 𝜒 2 is much less than 𝜒0.05 , there is a very
high degree of agreement between theory and experiment.

2. A set of five similar coins is tossed 320 times and the result is
No. of heads 0 1 2 3 4 5
𝑓 6 27 72 112 71 32
Test the hypothesis that the data follow a binomial distribution.
5
𝑛 𝐶
Solution: In binomial distribution 𝑃(𝑥) = 𝐶𝑥 𝑝 𝑥 𝑞 𝑛−𝑥 = 32𝑥 .
No. of heads 0 1 2 3 4 5
𝑂𝑥 6 27 72 112 71 32
5
𝐶𝑥 10 50 100 100 50 10
𝐸𝑥 = × 320
32
(𝑂𝑖 −𝐸𝑖 )2 16 529 784 144 441 484
𝜒2 = ∑ = 10 + + 100 + 100 + + = 78.68 .
𝐸𝑖 50 50 10
2 2
For d.f. 𝜈 = 𝑛 − 1 = 5, 𝜒0.05 = 11.07. Since calculated value of 𝜒 2 is much greater than 𝜒0.05 , the
hypothesis that the data follow the binomial distribution is rejected.

3. The following table gives the number of aircraft accidents that occurred during the various days of the week.
Find whether the accidents are uniformly distributed over the week.
Days Sun Mon Tue Wed Thru Fri Sat Total
No. of accidents 14 16 8 12 11 9 14 84
2
For d.f. 𝜈 = 6 , 𝜒0.05 = 12.59
Solution: If 𝑂𝑖 and 𝐸𝑖 are observed and expected frequencies for 𝑖 = 1,2 ⋯ 7.
Clearly 𝐸𝑖 = 12 𝑓𝑜𝑟 𝑒𝑎𝑐ℎ 𝑖
(𝑂𝑖 −𝐸𝑖 )2 4+16+16+0+1+9+4
Then 𝜒2 = ∑ = = 4.1667 < 12.59 .
𝐸𝑖 12
2
Since calculated value of 𝜒 2 is much less than 𝜒0.05 , the accidents are uniformly distributed over the week.
4. A machine is supposed to mix peanuts, hazelnuts, cashews, and pecans in the ratio 5:2:2:1. A can containing
500 of these mixed nuts was found to have 269 peanuts, 112 hazelnuts, 74 cashews, and 45 pecans. At the 0.05
level of significance, test the hypothesis that the machine is mixing the nuts in the ratio 5:2:2:1.
Solution: Since the ratio of the peanuts, hazelnuts, cashews, and pecans is 5:2:2:1, i.e. 50%, 20%, 20% and 10%
respectively.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

20
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Therefore, expected number of peanuts, hazelnuts, cashews, and pecans out of 500 is 250, 100, 100 and 50
respectively. Observed values are 269, 112, 74, and 45 respectively.
(𝑂𝑖 −𝐸𝑖 )2 361 144 676 25
𝜒2 = ∑ = 250 + 100 + 100 + 50 = 10.144 > 7.815 .
𝐸𝑖
2
Since calculated value of 𝜒 2 is more than 𝜒0.05 , reject the hypothesis that the machine is mixing the nuts in the
ratio 5:2:2:1.

Review questions:
1. If the standard deviation of a 𝜒 2 distribution is 10, then its degree of freedom is .
2. Mean and variance of 𝜒 2 distribution with degree of freedom is 8 are and respectively.
3. For the degree of freedom 1, 𝜒 2 distribution reduces to distribution.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

21
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Lecture-11 F-Distribution.

F-Distribution: Let 𝑥1 , 𝑥2 , 𝑥3 ⋯ 𝑥𝑛1 and 𝑦1 , 𝑦2 , 𝑦3 ⋯ 𝑦𝑛1 are two independent random samples of a normal
populations with equal standard deviation 𝜎 . Let 𝑥 and 𝑦 are sample mean,
1 𝑛 1 𝑛
𝑠1 2 = 𝑛 {∑1 1(𝑥𝑖 − 𝑥)2 } and 𝑠2 2 = 𝑛 {∑1 2 (𝑦𝑖 − 𝑦)2 } be the sample variance.
1 −1 2 −1

𝑠 2 𝑠 2
Then define 𝐹 = 𝑠1 2 or 𝐹 = 𝑠2 2 depending on either 𝑠1 2 > 𝑠2 2 or 𝑠2 2 > 𝑠1 2 respectively.
2 1

This gives F-distribution (or variance ratio distribution). Clearly F-distribution depends only on 𝜈1 and 𝜈2 .
𝐹𝛼 (𝜈1 , 𝜈2 ) is the value of 𝐹 for 𝜈1 and 𝜈2 such that area to the right of 𝐹𝛼 is 𝛼.

Snedecor’s F-table for 5% and 1% significance levels are given below.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

22
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

F-distribution is useful for testing the equality of population means by comparing the sample variances.

Examples:
1. Two samples of sizes 9 and 8 give the sum of squares of deviations from their respective means equal to 160
and 91 respectively. Can these be regarded as drawn from the same normal population?
Solution: Given that ∑91(𝑥𝑖 − 𝑥)2 = 160 and ∑81(𝑦𝑖 − 𝑦)2 = 91
1 𝑛 160 1 𝑛 91
𝑠1 2 = 𝑛 {∑1 1(𝑥𝑖 − 𝑥)2 } = = 20 , 𝑠2 2 = 𝑛 {∑1 2(𝑦𝑖 − 𝑦)2 } = = 13.
1 −1 8 2 −1 7

𝑠 2 20
𝐹 = 𝑠1 2 = 13 = 1.5385.
2

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

23
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

From the table, 𝐹0.05 (8, 7) = 3.73. Since the calculated value of 𝐹 < 𝐹0.05 , populations variances are not
significantly different. Hence, the samples can be regarded as drawn from the same normal population.

2. Two independent samples of size 7 and 6 have the following values.

Sample-A 28 30 32 33 33 29 34
Sample-B 29 30 30 24 27 29

Examine whether the samples have been drawn from normal populations having the same variance.
Given that 𝐹0.05 (6, 5) = 4.95 and 𝐹0.05 (5, 6) = 4.39 .

Solution: Let the samples have been drawn from normal populations having the same variance.
𝑛
∑1 1 𝑥𝑖 28+30+32+33+33+29+34 219
𝑥= = = = 31.2857
𝑛1 7 7

𝑛 𝑛
1 𝑛 𝑛1 ∑1 1 𝑥𝑖 2 −(∑1 1 𝑥𝑖 )2 7×6883−2192
𝑠1 2 = 𝑛 −1
{∑1 1(𝑥𝑖 − 𝑥)2 } = 𝑛1 (𝑛1 −1)
= 7×6
= 5.2381
1

𝑛
∑1 2 𝑦𝑖 29+30+30+24+27+29 169
𝑦= = = = 28.1667
𝑛2 6 6

𝑛 𝑛
1 𝑛 𝑛2 ∑1 2 𝑦𝑖 2 −(∑1 2 𝑦𝑖 )2 6×4787−1692
𝑠2 2 = 𝑛 {∑1 2(𝑦𝑖 − 𝑦)2 } = = = 5.3667
2 −1 𝑛2 (𝑛2 −1) 6×5

𝑠2 2 5.3667
𝐹= = = 1.0245.
𝑠1 2 5.2381

Given that, 𝐹0.05 (5, 6) = 4.39. Clearly the calculated value of 𝐹 < 𝐹0.05 , the samples can be regarded as
drawn from the same normal population.

Review questions:

1. If 𝑠2 2 > 𝑠1 2 , then, 𝐹 = .

𝑆12
2. If = 0.4, then, 𝐹 = .
𝑆2

𝑆12
3. If 𝑛1 = 7, 𝑛2 = 5, > 1 , to test the hypothesis whether 𝐹∝ (6, 4) is used or 𝐹∝ (4, 6) is used?
𝑆2

𝑆12
4. If 𝑛1 = 9, 𝑛2 = 7, < 1 , to test the hypothesis whether 𝐹∝ (8, 6) is used or 𝐹∝ (6, 8) is used?
𝑆2

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

24
MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Lecture-12 (Tutorial) Tutorial 8: Proof of short cut method.

1 𝑛 ∑𝑛 2 𝑛
𝑖=1 𝑥𝑖 −(∑𝑖=1 𝑥𝑖 )
2
Prove that 𝜎 2 = 𝑛−1 ∑𝑛1(𝑥𝑖 − 𝑥)2 = 𝑛(𝑛−1)

2
Proof: ∑𝑛1(𝑥𝑖 − 𝑥)2 = ∑𝑛1(𝑥𝑖 2 − 2𝑥𝑖 𝑥 + 𝑥 )
2
= ∑𝑛1(𝑥𝑖 2 − 2𝑥𝑖 𝑥 + 𝑥 )
2
= ∑𝑛1 𝑥𝑖 2 − ∑𝑛1 2𝑥𝑖 𝑥 + ∑𝑛1 𝑥
2 ∑𝑛
1 𝑥𝑖
= ∑𝑛1 𝑥𝑖 2 − 2𝑥 ∑𝑛1 𝑥𝑖 + 𝑥 ∑𝑛1 1 (Since 𝑥 = , ∑𝑛1 𝑥𝑖 = 𝑛𝑥 and ∑𝑛1 1 = 𝑛)
𝑛
2 2
= ∑𝑛1 𝑥𝑖 2 − 2𝑛𝑥 + 𝑛𝑥
2 ∑𝑛
1 𝑥𝑖 2 (∑𝑛
𝑖=1 𝑥𝑖 )
2
= ∑𝑛1 𝑥𝑖 2 − 𝑛𝑥 (Since 𝑥 = , 𝑥 = )
𝑛 𝑛2
(∑𝑛
𝑖=1 𝑥𝑖 )
2
= ∑𝑛1 𝑥𝑖 2 − 𝑛
𝑛 ∑𝑛 2
1 𝑥𝑖 − (∑𝑛
𝑖=1 𝑥𝑖 )
2
= 𝑛
𝑛 ∑𝑛 2 𝑛
𝑖=1 𝑥𝑖 −(∑𝑖=1 𝑥𝑖 )
2
∴ 𝜎2 = .
𝑛(𝑛−1)

Hence, in t-distribution, for two samples of different size,

𝑛 𝑛 𝑛 𝑛
1 𝑛 𝑛 1 𝑛1 ∑1 1 𝑥𝑖 2 −(∑1 1 𝑥𝑖 )2 𝑛2 ∑1 2 𝑦𝑖 2 −(∑1 2 𝑦𝑖 )2
𝜎2 = 𝑛 {∑1 1(𝑥𝑖 − 𝑥)2 + ∑1 2 (𝑦𝑖 − 𝑦)2 } = 𝑛 { + }
1 +𝑛2 −2 1 +𝑛2 −2 𝑛1 𝑛2

If the two samples are of same size 𝑛, then

1 𝑛 𝑛 1 𝑛 ∑𝑛 2 𝑛
1 𝑥𝑖 −(∑1 𝑥𝑖 )
2 𝑛 ∑𝑛 2 𝑛
1 𝑦𝑖 −(∑1 𝑦𝑖 )
2
𝜎2 = 𝑛 {∑1 1(𝑥𝑖 − 𝑥)2 + ∑1 2 (𝑦𝑖 − 𝑦)2 } = 2𝑛−2 { + }
1 +𝑛2 −2 𝑛 𝑛

𝑛 ∑𝑛 2 𝑛 2 𝑛 2 𝑛
1 𝑥𝑖 −(∑1 𝑥𝑖 ) +𝑛 ∑1 𝑦𝑖 −(∑1 𝑦𝑖 )
2
= 2𝑛(𝑛−1)

If the two samples are of same size, and are of different standard deviations, then
1
𝜎2 = 𝑛 {(𝑛1 − 1)𝜎𝑥 2 + (𝑛2 − 1)𝜎𝑦 2 }
1 +𝑛2 −2

1
= 2𝑛−2 {(𝑛 − 1)𝜎𝑥 2 + (𝑛 − 1)𝜎𝑦 2 }
1
= 2 {𝜎𝑥 2 + 𝜎𝑦 2 } .

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Nurtured Womb e Book
100% (5)
Nurtured Womb e Book
22 pages
Step To Sample Book
100% (1)
Step To Sample Book
120 pages
Notes Statistics
No ratings yet
Notes Statistics
50 pages
Name N Address Details DEAF Aug 2019
No ratings yet
Name N Address Details DEAF Aug 2019
428 pages
Sampling and Sampling Distributions (Autosaved)
0% (1)
Sampling and Sampling Distributions (Autosaved)
74 pages
Chapter Two Fundamentals of Marketing Estimation and Hypothesis Testing
No ratings yet
Chapter Two Fundamentals of Marketing Estimation and Hypothesis Testing
73 pages
Sampling Dist
No ratings yet
Sampling Dist
40 pages
Chapter 10 QBM
No ratings yet
Chapter 10 QBM
38 pages
Sampling Distribution
No ratings yet
Sampling Distribution
127 pages
Normal Prob - Sampling Distr and Estimation-2022
No ratings yet
Normal Prob - Sampling Distr and Estimation-2022
27 pages
Facility Inspection Checklist
No ratings yet
Facility Inspection Checklist
2 pages
Unit-4 - Confidence Interval and CLT
No ratings yet
Unit-4 - Confidence Interval and CLT
29 pages
MC Math 13 Module 10
No ratings yet
MC Math 13 Module 10
15 pages
Week 6
No ratings yet
Week 6
14 pages
SBP Ing
No ratings yet
SBP Ing
18 pages
Internal Audit Report: 1. Summary of Findings
No ratings yet
Internal Audit Report: 1. Summary of Findings
7 pages
Sampling Distributions
No ratings yet
Sampling Distributions
36 pages
Sampling Distributions and Confidence Intervals For The Mean
No ratings yet
Sampling Distributions and Confidence Intervals For The Mean
19 pages
Statistics Homework Help, Statistics Tutoring, Statistics Tutor - by Online Tutor Site
No ratings yet
Statistics Homework Help, Statistics Tutoring, Statistics Tutor - by Online Tutor Site
30 pages
Central Limit Theorem
100% (3)
Central Limit Theorem
38 pages
Business Plan of Rapido Deliveries
No ratings yet
Business Plan of Rapido Deliveries
85 pages
Statistics and Probability: Quarter 3 - Module 6: Central Limit Theorem
No ratings yet
Statistics and Probability: Quarter 3 - Module 6: Central Limit Theorem
17 pages
Chapter4 - Sampling Distribution - S
No ratings yet
Chapter4 - Sampling Distribution - S
89 pages
Revision SB Chap 8 12 Updated 1
No ratings yet
Revision SB Chap 8 12 Updated 1
44 pages
Chapter4A Single-Sample Confidence-Interval S
No ratings yet
Chapter4A Single-Sample Confidence-Interval S
103 pages
Chapter 3 Radiation
100% (1)
Chapter 3 Radiation
36 pages
Assignment Inferential Statistics 1
No ratings yet
Assignment Inferential Statistics 1
5 pages
Nasrin Mcom Projctbbb
No ratings yet
Nasrin Mcom Projctbbb
62 pages
Towards A Third Food Regime: Behind The Transformation
No ratings yet
Towards A Third Food Regime: Behind The Transformation
13 pages
Math 1060 - Lecture 5
No ratings yet
Math 1060 - Lecture 5
9 pages
Notes STA408 - Chapter 2 PDF
No ratings yet
Notes STA408 - Chapter 2 PDF
4 pages
Sampling Distributions of Sample Means
No ratings yet
Sampling Distributions of Sample Means
7 pages
Nursing in Research in Malawi
100% (1)
Nursing in Research in Malawi
28 pages
Statistics
No ratings yet
Statistics
49 pages
Stat - Prob 11 - Q3 - SLM - WK6-8
No ratings yet
Stat - Prob 11 - Q3 - SLM - WK6-8
34 pages
MGMT 222 Ch. III
No ratings yet
MGMT 222 Ch. III
10 pages
6.5 - The Central Limit Theorem: Objectives
No ratings yet
6.5 - The Central Limit Theorem: Objectives
6 pages
Sampling and Sampling Distribution - Part 2
No ratings yet
Sampling and Sampling Distribution - Part 2
34 pages
The Normal Distribution Estimation Correlation
100% (1)
The Normal Distribution Estimation Correlation
16 pages
GENG5507 Stat TutSheet 5 Solutions
No ratings yet
GENG5507 Stat TutSheet 5 Solutions
5 pages
Module 6
No ratings yet
Module 6
12 pages
Chapter 8
No ratings yet
Chapter 8
7 pages
Stats Quiz
No ratings yet
Stats Quiz
6 pages
Statistics and Probability Module 7: Week 7: Third Quarter
No ratings yet
Statistics and Probability Module 7: Week 7: Third Quarter
7 pages
Module 3 Ie Stat 1
No ratings yet
Module 3 Ie Stat 1
12 pages
Value Oriented Education
No ratings yet
Value Oriented Education
10 pages
Math11 SP Q3 M7
No ratings yet
Math11 SP Q3 M7
16 pages
Greek Lit Quiz
No ratings yet
Greek Lit Quiz
2 pages
Topic 12 Central Limit Theorem PDF
100% (2)
Topic 12 Central Limit Theorem PDF
4 pages
Applied Statistics and Probability For Engineers Chapter - 8
No ratings yet
Applied Statistics and Probability For Engineers Chapter - 8
13 pages
Motion To Disqualify Allen Baddour
No ratings yet
Motion To Disqualify Allen Baddour
12 pages
Module 25 - Statistics 2
No ratings yet
Module 25 - Statistics 2
9 pages
Hong Kong History
No ratings yet
Hong Kong History
5 pages
Originators Guide Rules v2.3 Nov 06
No ratings yet
Originators Guide Rules v2.3 Nov 06
171 pages
Lesson 7 For Basic Mathematics
No ratings yet
Lesson 7 For Basic Mathematics
27 pages
Statistics M6
No ratings yet
Statistics M6
18 pages
Stat - Prob-11 Q3 SLM WK6 Students
No ratings yet
Stat - Prob-11 Q3 SLM WK6 Students
6 pages
Share Statistics - Probability - Q3 - Mod6 - Central-Limit-Theorem
No ratings yet
Share Statistics - Probability - Q3 - Mod6 - Central-Limit-Theorem
17 pages
Rcsi PHD Thesis
100% (2)
Rcsi PHD Thesis
6 pages
Mineral Resource Conflict Jharkhand
No ratings yet
Mineral Resource Conflict Jharkhand
20 pages
2024 Estimation
No ratings yet
2024 Estimation
91 pages
Inspection Report 14 N
No ratings yet
Inspection Report 14 N
1 page
Modals of Probability 2
No ratings yet
Modals of Probability 2
2 pages
ERPNEXT
No ratings yet
ERPNEXT
5 pages
Instruction Manual Fieldvue dvc2000 Digital Valve Controller Fisher en 135208
No ratings yet
Instruction Manual Fieldvue dvc2000 Digital Valve Controller Fisher en 135208
80 pages
Paper Eng
No ratings yet
Paper Eng
21 pages
Module 4 Class
No ratings yet
Module 4 Class
65 pages
Michael Jackson Dissertation
100% (2)
Michael Jackson Dissertation
4 pages
Legal Basis of International Relation
No ratings yet
Legal Basis of International Relation
4 pages
Analysing Data of Stats
No ratings yet
Analysing Data of Stats
6 pages
Probability & Statistical Analysis
No ratings yet
Probability & Statistical Analysis
28 pages
CH I - Sampling and Sampling Distributions
No ratings yet
CH I - Sampling and Sampling Distributions
13 pages
Q4 L1 Central Limit Theorem
No ratings yet
Q4 L1 Central Limit Theorem
24 pages
Sp-Module 6
No ratings yet
Sp-Module 6
14 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
17 pages
Module 6 Central Limit Theorem
No ratings yet
Module 6 Central Limit Theorem
40 pages
Questionabnk - Ddco 2024
No ratings yet
Questionabnk - Ddco 2024
4 pages
Ddco Manual
No ratings yet
Ddco Manual
25 pages
Central Limit Theorem
No ratings yet
Central Limit Theorem
4 pages
DS 5
No ratings yet
DS 5
23 pages
Importing Gold Dore
No ratings yet
Importing Gold Dore
3 pages
Qs Content
No ratings yet
Qs Content
3 pages
Ddco Module - 4 New
No ratings yet
Ddco Module - 4 New
27 pages
Module Grade 11
No ratings yet
Module Grade 11
8 pages
Central Limit Theorem Grade 11 Group 4
No ratings yet
Central Limit Theorem Grade 11 Group 4
7 pages
Os Module4
No ratings yet
Os Module4
3 pages
Ds Module1
No ratings yet
Ds Module1
63 pages
Java - Viva Questions
No ratings yet
Java - Viva Questions
2 pages
Os Module4
No ratings yet
Os Module4
27 pages
ISO Module2 BCS301
No ratings yet
ISO Module2 BCS301
28 pages
Module 5 Welding
No ratings yet
Module 5 Welding
119 pages
Statistics For Economists Lecture V
No ratings yet
Statistics For Economists Lecture V
37 pages
The Philosophy of Fear and Freedom
No ratings yet
The Philosophy of Fear and Freedom
2 pages
Chapter 8 New
No ratings yet
Chapter 8 New
34 pages
Oral Cancer Essay
No ratings yet
Oral Cancer Essay
3 pages
Central Limit Theorem
No ratings yet
Central Limit Theorem
5 pages
Lesson 13
No ratings yet
Lesson 13
23 pages
USB Devices As VMFS Datastore in Vsphere ESXi 70 Virtennet
No ratings yet
USB Devices As VMFS Datastore in Vsphere ESXi 70 Virtennet
14 pages

ISO Module 4 BCS301

Uploaded by

ISO Module 4 BCS301

Uploaded by

MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2024

Lecture-1 Statistical Inference-2: Sampling variables, central limit theorem.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Solution: Let 𝜇 be the mean and 𝜎 standard deviation of the distribution.

Since 𝑥 < 0, 𝑧 < −0.5.

1. The probability distribution of 𝑋̅ is called?

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Lecture- 2 Central limit theorem. Problems.

Solution: Given that 𝑥 = 3.4𝑐𝑚, 𝑛 = 900, 𝜇 = 3.25 𝑎𝑛𝑑 𝜎 = 1.61𝑐𝑚.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Lecture-3 Confidences limit for unknown mean.

28 − 1.96 × 0.7906 < 𝑥 < 28 + 1.96 × 0.7906

3.4 − 2.58 × 0.0537 < 𝜇 < 3.4 + 2.58 × 0.0537

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Lecture-4 Test of significance for means of two large samples:

Test of significance for means of two large samples:

( 𝑥1 −𝑥2 )−( 𝜇1 −𝜇2 )

( 𝑥1 −𝑥2 )−( 𝜇1 −𝜇2 )

Given that 𝑥1 = 67.5, 𝑥2 = 68.0 , 𝑛1 = 1000 , 𝑛2 = 2000 𝑎𝑛𝑑 𝜎 = 2.5 .

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Solution: 𝐻1 : Sailors are on the average taller than the soldiers.

Given that 𝑥1 = 67.85, 𝑥2 = 68.55 , 𝑛1 = 6400 , 𝑛2 = 1600 , 𝜎1 = 2.56, 𝜎2 = 2.52 .

This is highly significant. 𝐻0 is rejected and hence alternate hypothesis 𝐻1 is accepted.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Lecture-5 Test of Significance for means of two large samples problems

𝜎12 𝜎22 ( 𝑥1 −𝑥2 )−( 𝜇1 −𝜇2 ) ( 𝑥1 −𝑥2 )−0.5

( 𝑥1 −𝑥2 )−( 𝜇1 −𝜇2 ) ( 𝑥1 −𝑥2 )

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Lecture-6 (Tutorial) Central limit theorem. Problems.

Hence, 𝑃(𝜇 − 1.9𝑒 ≤ 𝑥 ≤ 𝜇 − 0.4𝑒) = 𝑃(50 − 2.375 ≤ 𝑥 ≤ 50 − 0.5)

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Critical values of 𝑡, for two tail tests.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Lecture-8 Students-‘t’ distribution, problems

∴ 𝜎𝑠 = 2.6194, given that 𝜇 = 47.5 .

∴ 𝜎𝑠 = 3.0111, given that 𝜇 = 66 .

Solution: Given that, 𝑥 = 41, ∑𝑛1(𝑥𝑖 − 𝑥)2 = 135, 𝑛 = 16 . ∴ 𝜈 = 15.

⟹ |𝑥 − 𝜇| < 1.65, approximately.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Lecture-9 Students-‘t’ distribution, Significance test of difference between sample means.

If the two samples are of same size then,

∑𝑛1 1 𝑥𝑖 2 = 6883, ∑𝑛1 2 𝑦𝑖 2 = 4787

𝐻1 : There is a discriminate between the two horses, i.e. 𝑥 ≠ 𝑦

Is the mean score of boys significantly different from that of girls?

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Solution: Let 𝑥 be the boy’s score, 𝑦 be girl’s core.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Lecture-10 Chi-square distribution as a test of goodness of fit.

Mean of 𝜒 2 distribution is degree of freedom 𝜈 = 𝑛 − 1, and variance is 2𝜈.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Snedecor’s F-table for 5% and 1% significance levels are given below.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

2. Two independent samples of size 7 and 6 have the following values.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

Lecture-12 (Tutorial) Tutorial 8: Proof of short cut method.

Hence, in t-distribution, for two samples of different size,

If the two samples are of same size 𝑛, then

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.

You might also like