6.1 Basic Statistic
6.1 Basic Statistic
While the study of statistics is a field in itself, there are certain simple and fundamental
statistical terms that are commonly used in the SQC Departments. An understanding of these
terms and their proper interpretation will aid in understanding the fuller implications of the
test results. The person who wishes to pursue this study further should read some standard
textbooks on statistics.
1.0 SAMPLING
Sampling is used to obtain information about a large group, called the population, from a
smaller representative group, called the sample. The method of drawing samples from the
population or 'sampling' should be such that the sample becomes a true representative of the
whole population.
Table – 1: Yarn Breaking Strength Values (The results are arranged in ascending order)
11.5 15.3 16.3 16.8 18.3
12.4 15.4 16.4 16.8 18.4
13.2 15.5 16.4 16.8 18.5
13.4 15.6 16.4 16.9 18.7
13.6 15.7 16.5 17.2 19.1
14.2 15.8 16.5 17.3 19.3
14.3 15.9 16.6 17.4 19.6
14.6 16.2 16.7 17.5 20.1
14.7 16.3 16.7 17.6 20.7
14.8 16.3 16.7 17.9 21.8
Now, frequency of a value of a variable (here yarn strength) is the number of times it occurs in
a given series & observations. The range of the strength values obtained is divided into a
number of groups or classes like 11.0 – 11.9, 12.0 - 12.9 and so on up to 21.0 - 21.9. Such
groups are called Class Intervals. Between 13.0 - 13.9, there are 3 values 13.2, 13.4 and 13.6
and so here the frequency (called Class Frequency) is 3. Thus a table may be constructed as
following.
Group Tally Mark Frequency Group Tally Mark Frequency
11.0 – 11.9 I 1 17.0 – 17.9 IIII I 6
12.0 – 12.9 I 1 18.0 – 18.9 IIII 4
13.0 – 13.9 III 3 19.0 – 19.9 III 3
14.0 – 14.9 IIII 5 20.0 – 20.9 II 2
15.0 – 15.9 IIII II 7 21.0 – 21.9 I 1
16.0 – 16.9 IIII IIII IIII II 17
3.2 Mode
The Mode is the value that occurs most frequently, and it is possible for a group of data
to have more than one mode. In this respect, the mode differs from the mean and the median
as there are always only one mean and one median.
3.3 Median
The Median is the middle one of a set of values (observations) arranged in descending order.
When the number (N) of values is an even number, the average of the two central items is
taken as the median. If the number of values is odd, then the median is the value of the item
which is item number (N + 1)/2 from the end.
Mostly, the measure of central tendency used in textile testing laboratory will be the
arithmetic mean or average. The mode and median are defined merely to show how they
differ from the mean.
4.1 Range
The range of a set of observations is the difference between the maximum and the minimum
values and thus it represents the maximum possible difference between any two observations
in a set. Thus,
Range = Maximum value - Minimum Value
Example :
X X2 X X2
3 9 7 49
6 36 4 16
8 64 3 9
2 4 5 25
5 25 6 36
4 16 6 36
3 9 7 49
5 25 8 64
6 36 2 4
90 512
N = 18 ∑X = 90 X = 512
2
Example: Using the data given as example for the standard deviation, the coefficient of
1.91
variation may be determined as CV% = x 100 = 38.2%
5.0
6.1 SE of Mean
The standard error of the mean is given by
SEmean = σ1 /√n .....(5)
where σ1 is the true standard deviation of the population and n is the number of specimens in
the sample. Since σ1, is not known, the sample value, σs is substituted in place of σ1 as
approximation, but only in case of samples having specimens more than 30.
When the good estimate of true CV% is not available, the sample CV% value may be
substituted in place of true CV%, as approximation, provided the number of specimens tested
is more than 30.
The Table - 3 gives the sample sizes directly for confidence levels of 95% and 99%, and for
different confidence limits as % of the average.
Example
Let us determine the minimum number of tests required to estimate the average strength of a
yarn with confidence limits of ± 5% of the average when the CV% of yam strength is known to
be 10%. From the Table - A.3, we can easily find that
Minimum number of tests for 95% confidence = 16
Minimum number of tests for 99% confidence = 27
Example
On carrying out breaking load measurements with 100 yarn specimens, a sample average
value of 3.60 kg and standard deviation of 0.64 kg had been obtained. Let us determine the
95% confidence limits. From the Table - 4, with sample size of 100 and for 95% confidence
level, we find that the multiplying factor F2 is ± 0.199. Hence, 95% confidence limits will be ±
0.199 x 0.64 = ± 0.13 which means that the probability is 95% that the true average yarn
breaking load value lies between 3.47 and 3.73 kg (i.e. 3.60 ± 0.13).
Example - 1
65 end-breakages are observed in a spinning machine in 3 hours. Let us determine the 95%
confidence limits of the true end-breakage rate, i.e. end- breakages per hour.
From the Table - 5 (by interpolating between the listed values), the 95% confidence limits of
total recurrences can be found as 50 and 81 end-breakages in hours. Hence, the confidence
limits of the true end-breakage rate are 50/8 and 81/8, or 6 and 10 per hour. Thus, the true
end-breakage rate lies between 6 and 10 per hour with a probability of 95%.
Example - 2
A measurement of number of slubs per Km of yarn length showed a value of 8 where the
target average was 3. Let us examine whether the target was exceeded or not.
From Table-5, the lower and upper 95% confidence limits for 3 occurrences are 1 and 9
respectively. Thus, if the target of 3 is being maintains test values of 1 and 9 would be
expected to occur in approximately* 19 cases out of 20. We may conclude that there is no
evidence from this single test that the target is not being maintained.
**Tabulating values to whole numbers results in the confidence being not exactly 95% and 99%,
particularly for small numbers, i.e. below 10
Table-3
Sample Sizes for 95% Confidence Level
Table-5
Approximate Confidence Limit for Occurrence of an Infrequent Event