Fundamentals of Statistics
Fundamentals of Statistics
for P and n, (since the sample mean xand samplce variance s are known from the saple valucs\, we ge
the corresponding estimates of the population paramelers.
This technique is commonly uscd if we have to cstimate thc theorctical trequencies of o
distribution by fitting an appropriate probability distribution to il.
Inthe above discussion, thereader is exposed to basic conccpts and notions in point cstimation, A
detailedstudy of thesc topics is, however, beyond the scope of the book.
16-3. SAMPLING DISTRIBUTION OF A STATISTIC
As alrcady statcd, sampling distribution of a statistic plays a very inportant role in the stalistical
inference - both in the cstimation theory as well as testing of hypothesis. In Chapter 15, [$ 154-1] we Baye
the concept of sampling distribution of astatistic. We will rccapitulate the ideas briefly and study its use in
detail.
If we take a sample of size n from a population of size N, then here are C, = k; (say). possble
samples. We can compute the statistic, say, t for cach of these samples. Let ,, h, . be the values of the
statistic for these kpossible samples. Thus, the statistic tmay be regarded as a random variable which can
take any one of the values ,, b, ..., i. The set of the values of t constitutes, what is known as the sampling
distribution of the statistic t.
We can compute the mcan, variance and other statistical constants of the sampling distribution of the
statistic 1. For example,
...(1618)
Mean - E()-=,(say)
and Var (t) - -Eo]'-}«-i2 ... (l6-19)
The standard deviation of the sampling distribution ofa statistic is known as its standard error (S.E).
. .(l6:20)
SE () =\Var (/)=(-)2
The sampling distribution of a statistic and its standard error are extensivcly used in Statistics :
(i) To deermine the precision of the sample estimate of some population parameter, which is grve
the reciprocal of the S.E. of 1hesampling distribution of the estimate. Thus, if is a statistic used to ct
the paramcter 0 then
. .(16:21)
Prccision of =
S.E. ()
value in the
(ii) To test if the sample statistic differs significantly from the corresponding hypothetical
the ditterence
Can
population, i.e., to test the significance of the difference (t - 0). By this we mean: tluctuations O
Ieween the sample estimate und the corresponding population, Parameter be attributed to
ampling thesae
estimates of
(iii) To test the significance of the difference between two independent sample
population paramcter.
(iv) Toobtain point estimates of the population parameters.
TESTING OF HYPOTHESIS 16-7
THEORY 0F ESTIMATION AND
ahiain interval estimates of the population parameters. i.e., to obtain probable limits bctween
thc truc valuc of the paramcter may be cxpccted to Jic. Thesc limits are provided by
which
... (1622)
to/)isthe significant or
critical value of t al level of significancc o foratwo-tailed test. [For details
where
sec§ 1669].
1B4:1. Sampling Distribution of Man. We have alrcady discusscd in §1512that ifx,x), ... . x,is
population of size N. then
arandom sample (without replacement) of sizen rom a tinitc
Var (x ) V-1 . .(1623)
n/N
CNoo, i.e., if wc take samplcs from an infinite (very large) population so that thcsampling fraction
can be neglcctcd, thcn rewriting (16:23), we get
Var (r) ~ 0 , - n g
N
[.N NifNis large]
In addition to the above properties, the sampling distribution of x has a very important asymptotic
propcrty i.e., as n’ o, This is embodied in the Central Limit Theorem given bclow.
16:3-2. Central Limit Theorem. We know that if x, x, , x, is a. random sample from normal
population with mcan u and variance o², then the sample mean x is also normally distributed with mean u
and variance a/n, i.e.,x ~ N(u, o²/n). This result is true cven if the population from which the samples are
rawn is not normal, provided the sample size is sufficiently large as statcd in the following Central Limit
Iheorem, which is one of the most fundamental theorems in Statistics.
y X, X, ..,. x, is a random sample of size n from any population, then the sample mean (x ) is
nornally distributed with mean u and variance g'/n provided n is sufficiently large. In other words,
X~N (Lu, g'/n), asymptotically i.e., as n ’ , Larger the sample size, better will be the approximation.
Om practical point of view, the approximation is fairly good if n> 30.
Remark. In fact, above result is simply a deduction from thc more generalised central limit theorem
Which is statcd below:
Central Limit Theorem. "If X, X2, ..., X, are independent random variables following any
diassytrmptibutoitoicn,ally normally
then under certain very general conditions, their sum X = X + X, + . + X,, is
distributed,i.e., Xfollows normal distribution as n ’ o,
16-8 distributions of most of
FUNDAMENTALS OF STATISTICS
becn proved that thc sampling thc stalislics, like
it has
By using this thcorem, (P - P2); difference of
difference of sample
proportions
sample proportion (p),standard deviation (s - s;) etc., are )
asymptotically normal, i.e., mans
the
sample
difference of sample
corresponding to any one of
variates
then by central limit
thcorem,
thesc statistics is N(0, for large samples. Thus, if t
is any standaslrdaliisscic,
1- EO ~ N(0, 1)
oo, This
S.E. ()
result, viz., the distribution of any
Sample
stutistic in its
f
. .(1625)
standardised
asymptotically, i.e., as n
n’ o, is Cxtensively used in Large Testslarpe (Chapter
17] and form is
asymptotically normal as
confidence limits for the population parameters when samples are also in
constructing
Example 16:2. Apopulation consists of five numbers (2, 3, 6, 8, I). Consider all possible samples of
size two whichcan be drawn with replacement fromn this population. Calculate the standurd| error of sample
mean. Solution. In random sampling with replacement, any onc of the five numbers 2, 3, 6, 8, I| drawn in
the first draw can be associated with any one of these five numbers drawnin the sccond draw and hence the
total number of possible samples of size 2 is 5 x5 = 25 and is given by the cross product,:as given below,.
(2, 3, 6, 8, 11) x(2, 3, 6, 8, 11)
(2, 3) (2, 6) (2, 8) (2, 11)
(2, 2) (3, 8)
(3,2) (3, 3) (3, 6) (3, 11)
(6,3) (6,6) (6, 8) (6, 11)
(6, 2)
(8, 2) (8, 3) (8, 6) (8, 8) (8, 11)
(11,2) (11, 3) (11, 6) (11,8) (11, 11)
The corresponding sample means are :
Total
E() = 25 150
25
'=
6
Example 16:3. Apopulation consists of 4 members 0, 4, 6 and 6. Draw all possible samples of size 2
tawn with replacement. Find the sapling distribution of sample mean. Hence find the mean and variance
ofthe sample mean. [L.C.WA. (Intermediate), Dec. 1999]
Solution. Population values are given to be :0, 4, 6 and 6.
Since the samples of size 2 are drawn with replacement, there are 4 x 4 = 16 samples, which are given
ynt cross product {0, 4, 6, 6} x {0, 4, 6, 6} and are enumerated in the following table.
Sr. No.
Sample Sample Sample Sr. No. Sample Sample Sample
values totals mean values totals mean
0,0 0 9 6.0 6
2
0,4 4 2 10 6, 4 10
3
0, 6 6 3 6, 6 12 6
4
0, 6 6 3 12 6, 6 12 6
4.0 13 6, 0 6 3
6
4, 4 4 14 6, 4 10
4,6 10 15 6, 6 12 6
4,6 10 5 16 6,6 12 6
The
Valuseamofplsanple
ing distribution of sample mean (x ) is given below.
Irequency (è) mean n(x) 2 3 4 6 Total
}f- l6
Probabil ty 1 2 4 4
2 4 4 4
16 16 16 l6 16 16 16
16-10 FUNDAMENTALS OF STATISTICS
6x4
E(x) - r p (r) =0 + 2x +3xit 4xt 5x+
4+12 +t41 20 +24) 64 = 4
2 + 25x. + 36 Xi46
E(?) x'.p(r ) =0+4 x 16 + 9x + 16x
(8+36 + 16+ 100 + 144) = 304
16 19
16= 3.
Var (x) = E( x?)- [E(x )] = 19- 42= 19-
Example 16:4. The narks scored by five students in a test of Statistics carrying 100 marke ous ta
is drawn without replacemen, draw up
50, 60, and 40. Ifa simple random sample of'size 4 the
distribution of sample mean. Hence, find the standard error of the sample mean. sampling
J.C.W.A. (Intermediate), Dec. 19971
Solution. From a population of5 units, asimple random sample without replacemcnt, of size 4can be
drawn in SC, =SC, =5, ways. Hence, the total number of possible samples is 5, as gIven in the
table. fol owing
Sample o. Sample values Total
(1) (2) (3)
Sample
mean
(4)=9
[The samples of size 4 are obtained on 1 50, 60, 50, 60 220 55
picking 4 observations out of the 5 given 2 60, 50, 60, 40 210 52.5
observations 50, 60, 50, 60 and 40, starting 3 50, 60, 40, 50 200 50
with the first obscrvation viz., 50 and then 4 60, 40, 50, 60 210 52.5
moving in a cyclic order.] 40, 50, 60, 50 200 50