Time Series 2nd Year
Time Series 2nd Year
SAMPLING TECHNIQUES
UNIT-I
BASIC CONCEPTS
Population: The set of all possible observation under study is called population. It is denoted
by (N).
Sample: It is the subset of a population that has been collected through data collection. It is
denoted by (n).
Sample Space: The totality all sample points consistent with the method of sample adapter
will be called sample space. For example: S H , T When tossing a coin.
Estimator: An estimator is a random sample and may take different values from sample to
sample.
Unbiased estimator: An estimator (t) is set to be unbiased estimator for the parameter ( ) ,
if Et or Et 0 . Similarly E t Bt is known as biased estimator.
Standard Error: The positive square root of variance is called standard error of the
estimator.
Finite Population: A population said to be finite if it has countable number of values. For
example: number of students in the M.S.University.
Sampling Unit: The contribution of population which all the individuals to the sample from
the population that cannot be further subdivided for the purpose of sampling at time is called
sampling unit. For example, to know that the average income of a family, the head of family
is called sampling unit.
Sampling Frame: we need to frame the structure of the survey for adapting any sampling
procedure, it is essential to have a list or map to identify each sampling unit by a number,
such a list or map is called sampling frame. For example, If list of voters in a particular place.
1
NEED FOR SAMPLING
The sampling methods have been extensively used for a variety of purposes and in great
diversity of situations. In practice it may not be possible to collect the information on all units
of a population due to various reasons such as
The total count of all units of the population for a certain characteristic is known as
complete enumeration or census survey. The money, man power and time required for
carrying out complete enumeration is generally large and there are many situations with
limited means where the complete enumeration will not be possible.
When only a part of the population is selected denoted as sample and examine it is
called sample enumeration or sample survey.
1) Less time: There is considerable saving in time and labor since only a part of the
population has to be examined. The sampling results can we obtain more much
rapidly and it can analysis such faster since relatively and process.
2) Reduced cost of the survey: Sampling usually results in reduction of cost, in terms
of money and in terms time. All though the amount of labor and expenses are invalid
in collecting information. All generally greater per unit of the sample then complete
enumeration the total cost of the sample survey is the expected to be much smaller
than that of complete census of since in most of the cases our resources or limited in
terms of money and the time with in which the results of the survey should be obtain
it is usually imperative to the sampling rather than complete to sampling rather than
complete enumerations.
3) Greater accuracy of results: The results of the sample are usually much more
reliable than those obtained from a complete census due to the following reasons.
2
a. It is always possible to determine the extent of the sampling errors.
4) Greater scope: Sample survey generally has greater scope as compared with
complete census the complete enumeration is impracticable rather inconceivable if the
survey requires highly trained personal and more sophisticated equipment for the
collection and analysis of the data. Since sample survey saves in time and money it is
possible to have a through and intensive enquiry because detailed information can be
obtained from a small group of respondents.
5) If the population is too large, for example of trees in a jungle we are left with no way
but to resort to sampling.
7) If the population is hypothetical for example while tossing a coin, the process may
continue indefinitely. Sampling method is the only scientific method of estimating
parameters of the universe/population.
The first step is to define clear and concrete terms of the objectives of the survey. The
sponsors of the survey should take care that these objectives are commensurate with the
3
available resources in terms of money, man power and the time limit required for the
availability of the results of the survey.
The population that is the aggregate of objects from which sample is chosen should be
defined in clear and unambiguous terms.
The population must be capable of division into sampling units for purpose of the
sample selection. The sampling units must cover the entire population and must be distinct
unambiguous and not overlapping in the sense that every element of the population belongs
to one and only one sampling units
In order to cover the population decided upon there should be some list map or other
acceptable material called the frame to serve as a guide for the population to be covered.
4) Data to be collected:
The data should be collected keeping in view the objectives of the survey. We should
not have the tendency to collect too many data some of which are never subsequently
examined and analyzed.
Having decided about the type of the data to be collected the next important part of
the sample survey is the construction of the questionnaire or schedule of the enquires which
requires skill special technique as well as familiarity with the subject matter under study. The
question should be clear, brief collaborative non offending courteous in tone unambiguous
and to the point, so that not much scope of guessing is the left on the part of the respondent or
interviewer suitable and detailed instruction for filling up the questionnaire or schedule
should also prepared.
i) Interview method:
In this method the investigator goes from house to house and interviews the
individuals personally. He asks the questions one by one and fills up the schedule on
the basis of the information guess by the individuals.
7) Non respondent:
Quite often the data cannot be collected for the sampled units. This incompleteness is
called non response which obviously tends to change the results. In such cases of response
should be handled with caution in order to draw unbiased and valid conclusions.
4
8) Selection of proper sampling designs:
The size of the sample (n) the procedure of selection and the estimation of the
population parameters along with their margins of uncertainty are some of the important
statistical problems that should receive that careful attention.
A number of designs for the selection of a sample are available and a judicious
selection will guarantee good and available estimation.
It is absolutely essential that the person should be thoroughly trained in locating the
sample units, recording the measurements the methods of the collection of required data
before starting the field work. The success of a survey to a great extent depends upon the
reliable field work. It is very necessary to make provision for adequate supervisory staff for
inspection after field work.
An initial quality check should be carried out by the supervisory staff while
the investigators are in the field.
Before carrying out the tabulation of the data we must decide about the
procedure for the quality of the data. For the large scale survey, mission tabulation
will obliviously be much quicker and economical.
c) Statistical analysis:
Statistical analysis should be made only after the data has been properly
scrutinized, edited and tabulated. Different method of estimation may be available for
the same data, appropriate formulae should be used to provide final estimates of the
required information.
Any complete survey is helpful in providing a note of caution and taking lesson from
it for designing future surveys. The information gained from any completed sample in the
form of the data regarding means, standard deviation and the nature of the variability of the
principle of the measurements tougher sampling.
5
PRINCIPLE OF SAMPLING SURVEY:
This principle as its origin in the mathematical theory of probability. According to the
king, the law of statistical regularity lays down moderately large number of items chosen at
random from a large group are almost sure on the average to possess the characteristics of the
large principle stress the desirability and the importance of selecting the sample at random so
that each every unit in the population has an equal chances of being selecting the sample.
2) Principle of validity:
By the validity of a sample design we mean that it should enable us to obtain valid
test and estimates about the parameters of the population. The samples obtain by the
techniques of the probability sampling, satisfy this principle.
The principle improves upon obtaining optimum results in terms of efficiency and the
cost of the design with the resources at our disposal. The reciprocal of the sampling variance
of an estimate provides a measure of its efficiency while a measure of the cost of the design is
provided by the total expenses incurred in terms money and man hour.
LIMITATION OF SAMPLING:
Advantage:
Disadvantage:
i) Proper care should be taken in the planning and execution of the sample,
survey otherwise the results obtained might be inaccurate and misleading.
ii) Sampling theory requires services of the trained and qualified person and
sophisticated equipment for its planning execution and analysis. In the absence
of these results, the sample survey are not trust worthy.
iii) however, if the information required about each and every unit of the universe
there is no way but to resort to complete enumeration. If time and money are
6
not important factors or if the universe is not too large, a complete census may
be better than any sampling.
TYPES OF SAMPLING:
In this type of sampling, the sample selected is with definite purpose in view and the
choice of sampling units depends greatly on the discretion and the judgment of the
investigated. These samplings suffer from the drawback of favoritism and nepotism
depending upon believes and prejudices of the investigation and thus does not give a
representative sample of the population. This sampling method is seldom used and cannot be
recommended for general use since it is biased due to element of subjectiveness or the part of
the investigator. However, if the investigator is experience and skilled and the sampling is
carefully applied, then judgment sample may yield valuable results.
2) Probability sampling:
3) Mixed sampling:
If the samples are selected partly according to some laws of chance and partly
according to fixed sampling rule, they are termed as mixed samples and the techniques of
selecting such sample is known as mixed sampling.
The different types of sampling given above have a number of variations. Some of
which may be listed below.
QUESTIONNAIRE:
Merits:
a. Questionnaire method is economical.
b. it can we widely used when the area of investing is large.
c. it saves money labor and time.
d. error in the investing isvery small because the information is explain directly
to the respondence.
Demerits:
i. In this method there is no direct connection between the investigator and the
respondent. Therefore we can sure about the accuracy and reliability of the
information.
ii. This method is suitable only for literate people in many countries illiterate
people cannot and replay the questionnaire.
iv. People may not give correct answer, thus one is lead to false conclusion.
v. Sometimes the information may not will to gives the return answer.
SCHEDULE:
8
gets answer to the question in schedule and records their answers he explain clearly the
adjective and purpose of the enquiry.
Merits:
ii. It yield reliable accurate result because the enumerates or educated and trained.
iv. Even if the respondence it liberate this technique can be widely used.
v. As the enumerators personally obtain the information there is less Choice for the
non-respondence.
Demerits:
9
UNIT-II
The errors involved in the collection, processing and analysis data in a survey may be
classified as,
i) Sampling error
ii) Non-sampling error
Sampling error:
The error which arises due to only a sample being to estimate the population being
used to estimate the population parameter is tripped sample error or sampling fluctuations.
The error is inherent and available in any and every sampling scheme. A sample with the
smallest sampling error will always be consisted a good reprehensive of the population.
This error can be reduced by increasing the size of the sample unfair the degree of
sampling error is inversely proportional to the square root of the sample size.
Besides sampling error, the sample estimate may be subject to the other errors which
group together is term non sampling error. The main sources of non-sampling errors are,
4) In practice the census survey results many suffer from non-sampling error.
Although these may be free from sampling error. The non-sampling error is likely to
increase with increase in sample size while sampling error decrease with increase in
sample size.
Sampling Error α 1
n
where n- sampling units.
Bias:
The different between estimator t and the parameter is called bias are error. An
estimator t is said to be unbiased estimator for the parameter if Et , otherwise
biased thus bias is given by E t Bt .
10
The sampling variance of (t) is defined by V t E t E t .
2
The non-sampling errors occur at any one are more of this stages of the survey
planning field work and tabulation of the survey data. These errors are badly classify errors
follows,
Type 1: Non Response Error
If the sampling frame is not abraded or old frame is use on account of economic or time
saving device it may lead to bias as the targeted population is not enumerated. These a use of
such frames may lead either to inclusion of some units not belonging to the population or to
omission of units belonging to the population. Such procedure mat brings unknown bias. In
some situation apart of sampled units may refuses to respond to the question or may be not at
home at the time of interview. It may also lead to this type of error. It can be seen that the
method will provide biased estimate. Some of the main sources assigned to these errors may
be as follows:
iii. Difficulty arising arrivers and Sourness on the part of respondents or faulty method of
enumeration data collection.
11
Type 2: Response Error
These errors refer in general to the difference between the individual true value and
the corresponding sample value irrespective of the resend of discrepancy. Sometimes there
may be interaction between both of them and it may be inflate these errors. The
measurements device or technique mat is defective and may cause observational errors may
be assigned as under,
c. Problems involved in data collection and other types of errors on the part of
respondence.
12
UNIT-III
If units is selected and noted then return to the population before the next drawing is
made and this procedure repeated „n‟ times it gives raise to simple random sample of n units
this processing is generally known as simple random sampling replacement.
In this procedure is repeated till n distinct units or selected and the reparation are
ignored it called simple random sampling without replacement.
Theorem: 3.1
n
The probability of specified unit is being included in the sample is equal to
N
Proof:
Since, the specified unit can be included in the sample size „n‟ with „n‟ mutually
exclusive ways that is it can be selected in that the sample at the r th draw r 1,2,n that
is PE r
1
r=1,2,3…n.
N
n
1
So that the probability that specify units selected in the sample =
r 1 N
1 1 1 1
= .... (n times)
N N N N
n
=
N
Theorem: 3.2
The probability that specified unit of the population is being selected at any given
draw is equal to the probability of its being selected at the first draw.
Proof:
Let E r be the event that specified unit is selected at the r th draw.
Therefore, PE r = (The probability that specified unit is not selected in any of the previous
(r-1) draw) X (The probability that it is selected at the r th with the
condition that it is not selected in the previous (r-1) draw)
13
that is,
N 1 N 2 N (r 1)
P E r =
1
N N 1 N (r 2) N (r 1)
N (r 1) 1
=
N N (r 1)
1
=
N
Note: The above theorem leads the property of simple random sampling without
replacement.
Since the theory of sampling is based on the assumption of random sampling, the
technique of random sampling is basic significance some of the procedure used for selecting
a random sample are follows.
1. Lottery method
2. Random sampling tables method.
1. Lottery method:
This is method where a ticket/chit many be associated with each unit of population.
Thus each sampling unit has its identification mark from identification 1 to N. the procedure
of selecting an individual is simple. All the ticket/chits are placed in a container, drum or
metallic spherical device in which a before each draw. Draw of tickets/chits may be
continued until a sample of the required size is obtained.
This procedure of numbering units on tickets/chits and selecting one after reshuffling
becomes cabpersons when the population size is large it may be lather difficult to achieve a
through shuffling in practice human bias be accruing in this method.
14
A practical methods of selecting of a random sample is to choose units one by one with the
help of the table of random numbers. By considering two digits numbers we can obtain
numbers from 00 to 99, all having same frequency. Similarly, three or more digits number
may be obtain combining three or more rows or columns of these tables. The simplest way
selecting a sample of the required size is by selecting a random number from 1 to N and then
taking the unit bearing that numbers. The use random number is, therefore modify some of
these modify procedure are
i. Remainder approach
ii. Quotient approach
iii. Independent choice of digits.
Remainder approach:
Let N be the r digits number and let its r digits highest n=multiple be N. A random
number K is chosen from 1 to N and the unit with the serial equal to remainder obtain on
dividing K by N is selected if the remainder is zero the last units is zero. For example,
i. Let N=123
ii. The highest three digit multiple of 123 is 984.
iii. For selecting unit one random number from 001 to 984 that as selected
iv. Get the random number be selected 287
v. Dividing 287 by 123 is equal to 41.
vi. Hence the unit within the serial number 41 is selected in the sample.
Quotient Approach:
N'
Let N be the r digit number and r digits highest multiple be the N ' such that q.
N
A random k is chosen from 0 to ( N '1) . Diving k by q the quotient r is obtained and the unit
bearing serial r 1 is selected in the sample. For example,
i) N=16
ii) The highest two digit multiple of 16 is 96 i.e., N ' =96, Hence q=6
iii) Let the two digit random number chosen be 65 which lies between 0 to 95
iv) Diving 65 by 6 is equal to 10
v) Hence the unit bearing number 10-1=9 is selected in the sample.
This method consists of the selection of two random numbers which are combined to
form a one random number. One number is chosen according to the first digit and other
15
according to the remaining digits of the population size. If the number is chosen is zero the
last unit is chosen. But if the number is made up is greater than or equal to N, the number is
rejected and the operation is repeated.
For Example: Select a random sample of 11 households from a list of 112 households in a
village.
(i) By using the 3-digit random numbers given in column 1 to 3, 4 to 6, and so on of the
random number table and rejecting numbers greater than 112 (also the number 000), we have
for the sample bearing serial numbers 033, 051, 052, 099, 102, 081, 092, 013, 017, 076 and
079.
(ii) In the above procedure, a large number of random number is rejected. Hence, a
commonly used device, i.e., remainder approach, is employed to avoid the rejection of such
large numbers. The greatest three-digit multiple of 112 is 896. By using three-digit random
numbers as above, the sample will comprise of households with serial numbers 086, 033,
049, 097, 051, 052, 066,107, 015, 106 and 020.
(iii) In case the quotient approach is applied, the 3digit multiple of 112 is 869 and 869/112=8.
Using the same random numbers and dividing them by 8, we have the sample of households
with list numbers 025, 004, 020, 026, 006, 006, 092, 041, 085, 027 and 086 with the
replacement method and with list numbers 025, 004, 020, 026, 006, 092, 041, 085, 027, 086
and 042 without the replacement method.
Let us consider a finite population of N units and let y be the character under
condensation. The capital letters are used to describe the characteristics of the population
whereas the small letter refers to the sample letters refers to sample observation. For
example:
The N population units may be denoted by U 1 , U 2 ....U N and the small n sample units
will be denoted by u1 , u 2 ....u n .
Let Yi i 1,2...N be the value if the character for the i th unit in the population and
the corresponding small letters denoted the value of the character for the units selected in
1 N
the sample. Then the refine population mean, Y y i
N i 1
1 n
Sample mean, y n y i
n i 1
Sample mean y n may also be written alternatively as follows,
1 n
yn ai y i
n i 1
16
where,
th
1if i unit is included in the sample
ai
th
0 if i unit is include in the sample
2
1 n
s
2
i n
n 1 i 1
y y
1 n 2 2
yi n yn
n 1 i 1
n
Population total y i
i 1
n
Sample total = yi
i 1
2
N
i
Y Y
Population variance 2 i 1
N
2
n
y y
i
i 1
Sample variance s 2
n 1
n s2 N 2
Sample variance of SRSWOR V y 1 where s 2
N n N 1
1
2
Sample variance of SRSWR V y
n N
N n
Estimation of population total y yi N y
n i 1
n
y
Estimation of population mean y i y
i 1 n
17
Theorem: 3.3
E yi Y (2)
1 n 1
E y y n. y
n i 1 n
E y Y .
Theorem: 3.4
E s2 S 2
Proof:
We know that, Sample Variance,
1 n 2
s
2
yi y
n 1 i 1
18
1 n
n 1 i 1
yi n y n
2 2
1 n 2 yi
2
yi n n
n 1 i 1
1 n 2 n n
2
yi n 2
n 1 i 1 i 1
yi
1 n 2 1 n 2 n
i
n 1 i 1
y
n i 1
y i yi y j
i j 1
1 n 2 1 n
1 n
y
nn 1 i 1
y 2
nn 1 i j 1
yi y j
n 1 i 1
i i
n
1 1 1 n
yi2 yi y j
i 1 n 1 nn 1 nn 1 i j 1
2 n 1
n n
1
yi yi y j
i 1 nn 1 nn 1 i j 1
1 n 2 1 n
s2 i nn 1 i
n i 1
y
j 1
yi y j
1 n
E yi y j
n
1
E s2
n i 1
E y i2
nn 1 i j 1
(1)
n
Consider, E y i2
E a i y i2
i 1
yi2 E ai
yi2 0 pai 1 1 pai 1
o i th unit of the sample is not included in the sample size n
y 2
i
(1 i unt of the sample is included in the sample size n)
th
1 n
yi2 0 1
n N
n
E ( yi ) 2 yi2 (2)
N
E y i y j E ai a j y i y j
n
Consider,
i 1
19
y y E a a
n
i j i j
i j 1
y y 0 Pa
1, a j 1 1 Pai 1, a j 1
n
i j i
i j 1
y y Pa 1 a j 1
n
i j i
i j 1
y y Pa 1Pa j 1
n
i j i
i j 1
P i th unit of the sample include in the sample
n
yi y j j th unit of the sample
i j 1
P include
n
n n 1
yi y j (3)
i j 1 N N 1
1 n nn 1 n
n
1
E s2
nN
y
i 1
2
nn 1 N N 1 i j 1
i yi y j
1 n 1 n
yi2 yi y j
N i 1 N N 1 i j 1
1 n 2 2
i y N Y N
N 1 i 1
S2
Therefore , E s 2 S 2 .
Theorem: 3.5
In simple random sampling without replacement the variance of the sample mean is
given by,
1 1
var y n S 2
n N
N n 2
S
nN
Proof:
2
var y n E y n E y n
2 2
E y Y N E y n Y N (1)
20
2
2
1 n
Consider, E y n E yi
n i 1
2
1 n
2
E y
n i 1 i
n
2 2
1 n
E yi E yi y j
n i 1 i j 1
1 n 2 n
2
E yi E yi y j (2)
n i 1 i j 1
Consider,
n n
E y i2 E a i y i2
i 1 i 1
n
y i2 E a i
i 1
n
n n
y i2 E (ai ) (3)
i 1 N N
But,
2
N
n
y i YN y i N Y N
2 2
i 1 i 1
2
N N 2
y i2 y i YN N Y N
i 1 i 1
N
1 N 2
yi2 N 1S 2 N YN2 NYN2 S2 i N
N 1 i 1
y Y (4)
i 1
21
n 2
n
E yi N 1S 2 N YN2 ...........(5)
i 1 N
n n
E yi y j E ai a j yi y j
i j 1 i j 1
y y E a a
n
i j i j
i j 1
n
nn 1
y y N N 1
i j
i j 1
nn 1 n
2
N
yi yi2
N N 1 i 1 i 1
nn 1
NYN N 1S 2 NYN
N N 1
2 2
nn 1 2 2 2
N Y N N 1S N Y N
2
N N 1
nn 1 2
N Y N N 1 N 1 N 1S 2
N N 1
n nn 1N 1
E yi y j
N YN S 2
2
i j 1 N N 1
nn 1 2
2
N Y N S (6)
N
n N N
n( N 1) 2 Nn 2 n(n 1) N n(n 1) 2
S 2 YN S
Nn n N n2 N Nn 2
( N 1) 2 1 2 (n 1) 2 (n 1) 2
S YN YN S
Nn n n Nn
N 2 1 2 1 2 n 2 1 2 n 2 1 2
S S YN YN YN S S
Nn Nn n N n Nn Nn
S2 1 2 1 2 1 1 1 2
S YN YN2 YN2 S 2 S
n Nn n n N Nn
S2 S2
YN
2
n N
1 1
S 2 YN2 (7)
n N
22
Substitute eqn. 7 in eqn. in 1,
1 1 2
var y n S YN YN
2 2
n N
1 1
S
n N
N n 2
var ( y n ) S
Nn
i. since the sample units are selected at the random going each units at equal
chances of being selected the subjectively or personal bias is completely
eliminated as such a simple random sampling is more representative population
of a judgment is compared to purposive sampling.
ii. the statisticians are as certain the efficiency of the estimates of the parameters
by considering the sampling distribution of the statistics or estimates i.e, if the
sample size „n‟ increase, y n an estimates of Y N becomes more efficient.
Drawbacks:
i) The selection of simple random sampling requires up to data frame, i.e a completely
catalogued population from which samples are to be drawn. Frequently it is the
virtually impossible to identify the units in the population before the sample is drawn
and this restricts the use of sample random sampling.
ii) Administrative inconvenience: A simple random sample may results in the selection
of the sampling units are widely spread geographically and in such case that the cost
of collecting data may be much in terms of time and money.
iii) At times a simple random sampling might most non-random looking results. For
example, If we draw a random sample of size from a pack of cards we may get all the
cards of the same suit. However, the probability of such an outcome is extremely
small.
iv) For a given precision the simple random sampling usually requires a larger sample
size as compared to stratified sampling.
23
PROPLEM 1: Draw a random sample (without replacement) of size 15 from a population of
size 500.
Solution:
Identify the 500 units in the population with the numbers from any of the random
number series.
Starting at random with any number on that page and moving row-wise, column-wise
or diagonally, we select one by one the three digited numbers, discarded the numbers
over 500, until 15 numbers below 500 are obtained.
Finally, the units in the population, corresponding to these 15 numbers will constitute
our random sample without replacement.
82 102 88 93 97 38 103 92
102 62 63 72 64 68 59 69
73 65 46 79 87 84 29 52
28 36 46 79 87 84 29 52
56 66 42 37 35 97 32 35
89 99 54 72 26 67 18 27
60 72 33 42 52 82 14 22
57 73 63 61 63 92 40 58
62 61 43 25 42 36 17 30
75 87 47 56 76 36 35 44
56 51 111 73 93 58 49 89
50 80 54 55 91 12 82 76
Select a sample of 25 plants by using simple random sampling method. Also calculate the 25
samples and verify whether the sample mean is equal to the population mean.
Solution:
Kapas Yield of 96 plants
(ie.) Population, N=96
24
N 96
X i X i
Population Mean(X N ) i 1
i 1
N 96
82 102 88 ... 12 82 76 5798
60.40
96 96
82 102 88 93 97 38 103 92
1 13 25 37 49 61 73 85
102 62 63 72 64 68 59 69
2 14 26 38 50 62 74 86
73 65 46 79 87 84 29 52
3 15 27 39 51 63 75 87
28 36 46 79 87 84 29 52
4 16 28 40 52 64 76 88
56 66 42 37 35 97 32 35
5 17 29 41 53 65 77 89
89 99 54 72 26 67 18 27
6 18 30 42 54 66 78 90
60 72 33 42 52 82 14 22
7 19 31 43 55 67 79 91
57 73 63 61 63 92 40 58
8 20 32 44 56 68 80 92
62 61 43 25 42 36 17 30
9 21 33 45 57 69 81 93
75 87 47 56 76 36 35 44
10 22 34 46 58 70 82 94
56 51 111 73 93 58 49 89
11 23 35 47 59 71 83 95
50 80 54 55 91 12 82 76
12 24 36 48 60 72 84 96
S.No. Random Number Rapsi Yield S.No. Random Number Rapsi Yield
1. 19 72 14. 34 47
2. 16 36 15. 76 29
3. 88 52 16. 93 30
4. 02 102 17. 61 38
5. 47 73 18. 09 62
6. 73 103 19. 06 89
7. 46 56 20. 84 82
8. 15 65 21. 25 88
9. 14 62 22. 29 42
25
10. 04 28 23. 94 44
11. 65 97 24. 48 55
12. 23 51 25. 55 52
13. 89 35
n 25
xi x i
Sample Mean(x n ) i 1
i 1
n 25
72 36 52 ... 55 52 1490
59.6
25 25
REMARKS:
For population:
Let us suppose that a population with n units U 1 , U 2 ......U n is classified into two
mutually disjoint and exhaustive classes A and A such that A + A =N
A
Then, proportion of units processing the given attributes is ,
N
A
i.e P A PN
N
Q=the proportion of units which do not possess the given attributes,
A
Q 1 P
N
Where, P & Q represents population of success and failure respectively in the population.
With the i th sampling units let us associated a variate yi i 1,2,....N defined as follows
1if it belong to class A
yi
0 if it belong to class A
N
Thus y
i 1
i A the number of units in the population processing the given attribute.
For sample:
With the i th sampled units. Let us associated a variate yi i 1,2,....N defined as follows,
1if i sampled unit posses the given attribute
th
yi
0 if i sampled unit is does not posses sin g the given attributes
th
Thus, yi a the number of units possessing the given attributes
27
Population mean:
N
x i
w.k.t x i 1
N
the total number of units in the population possessing the attributes is „A‟
N
i.e ., x A
i 1 i
A NP
X P
N N
X P
Sample mean:
N
x i
w.k.t x i 1
N
the total number of sampling units possessing the attribute is „a‟
n
x
i 1
i a
a np
i.e, x p
n n
x p
also yi2 a np
For population mean square:
W.K.T,
2
Yi YN
1 N
S 2
N 1 i 1
1 N
S2 Yi NYN2
N 1 i 1
N
1
NP NP 2 Yi A
N 1 i 1
NP
(1 p )
N 1
NPQ
S2
N 1
28
For sample mean square:
W.K.T,
2
1 N
s
2
yi y N
n 1 i 1
1 N 2
s2
n 1 i 1
y i ny n
N
1
np np 2 yi a
n 1 i 1
np
(1 p )
n 1
npq
s2
n 1
Theorem: 3.6
Lemma:
29
N
yi
YN (3)
i 1 N
Hence proof lemma is E ( y n ) YN
n N
W.k.t y a & Y
i 1
i
i 1
i A
a np & A NP
Substitute the above information in equation 3
E ( y n ) YN E ( p ) P
Remarks:
E ( NP ) NP
N E ( P)
NP
Theorem: 3.7
CONFIDENCE LIMITS:
After having the estimate of an unknown parameter it becomes necessary measure the
reliability of these estimates and the construct some confidence limits with the given degree
of confidence if we assume that the estimated y n is normally.
Distributed about the Population Mean Y lower and upper confidence limits for the
population mean are given,
30
1
1 f 2
Y L y t ,n 1
s
n
And
1
1 f 2
Y U y t ,n 1 s
n
where t ,n 1 stands for the value of students „t‟ distribution with (n-1) degrees of freedom at
level of significance. Similarly the confidence limits for the population,
t ,n1 s 1 f
YL Ny
n
And,
t ,n 1 s 1 f
YU Ny
n
In sampling analysis the most ticklish question is: What should be the size of the sample or
how large or small should be „n‟. If the sample size („n‟) is too small, it may not serve to
achieve the objectives and if it is too large, we may incur huge cost and waste resources. As a
general rule, one can say that the sample must be of an optimum size i.e., it should neither be
excessively large nor too small. Technically, the sample size should be large enough to give a
confidence interval of desired width and as such the size of the sample must be chosen by
some logical process before sample is taken from the universe. Size of the sample should be
determined by a researcher keeping in view the following points:
Nature of study: If items are to be intensively and continuously studied, the sample
should be small. For a general survey the size of the sample should be large, but a
small sample is considered appropriate in technical surveys.
Type of sampling: Sampling technique plays an important part in determining the size
of the sample. A small random sample is apt to be much superior to a larger but badly
selected sample.
31
doubling the accuracy for a fixed significance level, the sample size has to be
increased fourfold.
Availability of finance: In practice, size of the sample depends upon the amount of
money available for the study purposes. This factor should be kept in view while
determining the size of sample for large samples result in increasing the cost of
sampling estimates.
There are two alternative approaches for determining the size of the sample. The first
approach is “to specify the precision of estimation desired and then to determine the sample
size necessary to insure it” and the second approach “uses Bayesian statistics to weigh the
cost of additional information against the expected value of the additional information.” The
first approach is capable of giving a mathematical solution, and as such is a frequently used
technique of determining „n‟. The limitation of this technique is that it does not analyse the
cost of gathering information vis-a-vis the expected value of information. The second
approach is theoretically optimal, but it is seldom used because of the difficulty involved in
measuring the value of information. Hence, we shall mainly concentrate here on the first
approach.
To begin with, it can be stated that whenever a sample study is made, there arises some
sampling error which can be controlled by selecting a sample of adequate size. Researcher
will have to specify the precision that he wants in respect of his estimates concerning the
population parameters. For instance, a researcher may like to estimate the mean of the
universe within 3 of the true mean with 95 per cent confidence. In this case we will say that
the desired precision is ±3, i.e., if the sample mean is Rs 100, the true value of the mean will
be no less than Rs 97 and no more than Rs 103. In other words, all this means that the
acceptable error, e, is equal to 3. Keeping this in view, we can now explain the determination
of sample size so that specified precision is ensured.
(a)
p
The confidence interval for the population, is given by X z
n
where X = sample mean
z = the value of the standard variate at a given confidence level (to be read from the
table giving the areas under normal curve as shown in appendix) and it is 1.96 for
a 95% confidence level;
n = size of the sample
p = standard deviation of the population ( to be estimated from past experience or on
the basis of a trial sample). Suppose, we have p = 4.8 for purpose.
32
If the difference between m and X or the acceptable error is to be kept within ±3 of the
sample mean with 95% confidence, then we can express the acceptable error, „e‟ as equal to
p
ez
n
4.8
3 1.96
n
Hence, n
1.96 (4.8) 2
2
9.834 10
32
In a general way, if we want to estimate in a population with standard deviation
p with an error no greater than “e” by calculating a confidence interval with confidence
z 2 2
corresponding to z, the necessary sample size, n equals as under n 2 .
e
All this is applicable when the population happens to be infinite. But in case of finite
population, the above stated formula for determining sample size will become
z 2 .N . * p
n
( N 1)e 2 z 2 p2
*In case of infinite population, the confidence interval for is given by
X z
p N n
n N 1
where
N n is the finite population multiplier and all other mean thing as stated above.
N 1
If the precision is taken as equal “e” then we have,
e z.
p
.
N n
n N 1
p2 N n
e2 z 2 . .
n N 1
z 2 . p2 .N z 2 . p2 .n
e ( N 1)
2
n n
z . . N
2 2
e 2 ( N 1) z 2 . p2 p
n
z . . N
2 2
n p
e ( N 1) z 2 . p2
2
Stratification means division into layers auxiliary information related to the character
under study may be used to divided population into various groups such that,
Thus a population consisting of „N‟ sampling units is divided into „k‟ relatively
homogenous mutually disjoint subgroups terms as strata of sizes N1 , N 2 N k such that
k
N N i if a sample random sampling (generally without replacement) of sizes
i 1
k
ni (i 1,2...k ) is drawn from the stratum respectively such that n ni .
i 1
This sample is termed as stratified sample of size „n‟ and the technique of drawing
such a sample is called stratified random sampling.
k
n ni = total sample size from all strata.
i 1
1 k Ni
y n population mean yi
N i 1 j 1 j
1 k k
N i YNi Wi YNi s
N i 1 i 1
2
1 ni
si2 ij
ni 1 j 1
y y ni , i 1,2...k
34
Theorem: 4.1
If in every stratum the sample estimator y i is unbiased and samples are drawn
independently in difference strata then y st is an unbiased estimator of the population mean
and its sampling variance is given by.
k
V y st wi2 v y i
i
Proof:
k
N i yi
w.k.t y st
i 1 N
taking expectation on both sides,
k N y
E ( y st ) E i i
i 1 N
k
N
i E yi
i 1 N
k
Wi E y i
i 1
E y st Yst
Hence, Sample mean is unbiased estimator of the population for given stratified random
sampling.
To prove:
k
V ( y st ) Wi 2V yi
i 1
N i2
k
2 V yi
i 1 N
2
N
k
i V yi
i 1 N
k
var( y st ) Wi 2V ( yi )
i 1
Theorem: 4.2
35
1 1 2
var( y i ) S i
ni N i
k N yi
v y st v i
i 1 N
k
N2
i2 v y i
i 1 N
N 1 1 2
k 2
i S i
i 1 N ni N i
2
N S
2
k k
N2 S2
i i i2 i
i 1 N ni i 1 N Ni
k
S2 k
N S2 N
Wi 2 i i i Wi i
i 1 ni i 1 N N N
k
S i2 k
S i2
Wi 2
Wi
i 1 ni i 1 N
1. In stratified sampling the allocation of the sample to different strata is done by the
consideration of three factors such as,
i. The total number of units in the stratum.
ii. The variability with in this stratum and
iii. The cost in taking observation per sampling unit in the stratum.
3. There are four method of allocation of sample size is to different strata in a stratified
sampling procedure these are,
i. Equal allocation
ii. Proportional allocation
iii. Neyman‟s allocation
iv. Optimum allocation
Equal Allocation:
36
Proportional Allocation:
Neyman’s Allocation:
This allocation of the total sample size to strata of is called minimum variance
allocation and is due to Neyman (1934). In this allocation is assume that the sampling cost
per unit among difference strata in the same and the size of the its fixed. Sample size is are
nW S
allocated by ni k i i
Wi S i
i
Ni
w.k.t wi
N
nN i S i
ni k
N S
i
i i
n N
Optimum Allocation:
In this method of allocation a sample sizes „ ni ‟ in the respective strata are determine
with the view to minimize variance for y st for a specified cost of containing the sample
survey or to minimize the cost for a specified value of variance of ( y st ) . The simplest cost
k
function in stratified sampling that we can take is , c a ni ci , where overhead cost a is
i
constant and ci is the average cost of surveying one units in the i th stratum which may
depend the nature and size of the units in the stratum.
Then the allocation of a given sample of size „n‟ to different and given cost
k
function c a ni ci ,
i
37
Wi S i
c
ni n
i
k
wi s i
i c
i
N i Si
variance of estimate y st is minimum of ni then
ci
N i Si
c
ni n
i
k
ns
i ic i
i
Proof:
w.k.t the variance of the stratified random sampling,
1 k si2
v( y st ) 2 N i N i ni ..........(1)
N i 1 ni
k
The cost function, c c0 ni ci ..........(2)
i
ni
0 2
N
i 1 ni
0 0 0
i 1
ci 0
1 k
N i2 S i2 k
n2
i 1 ni
i 1
ci 0
k
N 2S 2
i2 i2 ci 0
i 1 N ni
38
N i2 S i2
2 2 c i 0
N ni
N i2 S i2
c i 2 2
N ni
N i2 S i2
n 2 2
N c i
i
N i S i2
ni
N ci
Here is unknown and we have the value of is fixed. The sum overall all stratum,
k
i.e., n ni
i 1
k
N i Si
N
i 1 c i
k
1 N i Si
n
N
i 1 ci
......(5)
N S 1
/ ci
N
i i
ni
k
1 N i Si
n
N i 1 ci
ni n
N S i i / c i
k
N i Si
i 1 ci
In stratified random sampling the v( y st ) is minimum for a fixed total size of the
sample „n‟.
nW S nN S N
ni k i i k i i where , Wi i
Wi S i N i S i
N
i i
Proof:
Here the total sample size „n‟ is fixed then we have to point „ ni ‟ such that var y st is
k
minimum subject to c ni n
i
39
k
The cost function is written as c ni n
i
w.k.t,
The variance of stratified random sampling
1 k s2
var y st 2 N i N i ni i .......(2)
N i 1 ni
Consider, the function,
v y st wor c
k
s i2 k
N i N i ni
1
ni n
N2
i 1 ni i 1
2 2
1 k NS 1 k NS k
2 i i 2 i i ni n
N i 1 ni N i 1 i 1
1 N i S i2
k k
2
N
i 1 ni2
i 1
(1) 0
k
N S2
2i i2 0
i 1 ni N
2
NS
2i i2
ni N
N i S i2
ni2
N 2
Taking square root on both sides,
NS
ni i i ........(3)
N
Here is fixed the total overall stratum.
k
n ni
i 1
k
N i Si
n ........(4)
N i 1
Dividing equation 3 in by equation 4, we get
ni N S /N NS
k i i k i i
N i Si / N Ni Si
n
i 1 i 1
40
nN i S i
ni k
N S
i 1
i i
The comparative study of simple random sampling without stratification and stratified
random sampling under different systems of allocation such as proportion allocation and
Neyman‟s optimum allocation.
The variance of estimate of population mean in simple random sampling is given by,
1 1
var y st srss S 2
n N
2
k k
1
where, S 2 ij N
N 1 i 1 j 1
Y Y
The variance of the estimate of the population mean in stratified random sampling with
proportional allocation is given by,
1 k S i2
var y st prop 2 N i N i ni
N i 1 ni
N i ni 2
k
Ni
S i
i 1 n i N2
k
p N N
i2 i 1S i2 pi i
i 1 N ni N
k
p N
i 1 S i2
i 1 N n
1 N k
1 pi S i2
Nn i 1
1 1 k
pi S i2 .........(2)
n N i 1
In order to compare equation 1 and 2 first express S 2 terms of S i2
N i Yij YN
1 k
2
Let S 2 ........(3)
N 1 i 1
Adding and subtracting Y Ni in equation 3 we get,
2
N 1 i 1 j 1
S2
1 k Ni
N 1 i 1 j 1
Yij YNi YNi YN 2
41
k Ni
S 2 N 1 Yij Yni YNi YN 2 YNi YN Yij YN
k Ni k Ni
2 2
i 1 j 1 i 1 j i 1 j 1
Y YNi 0
Ni
i.e., ij
j 1
Assume that N i and N are sufficiently large so that take N i 1 N i and N 1 N then,
NS 2 N i S i2 N i YNi YN
k k
2
i 1 i 1
S i i YNi YN
k k
Ni 2 N
S2
2
i 1 N i 1 N
S 2 pi S i2 pi YNi YN
k k
2
.........(4)
i 1 i 1
n N i 1 i 1
1 1 1 1
var y n srs pi S i2 pi YNi YN
k k
2
n N i 1 n N i 1
1 1 k
var y n srs var y st prop pi YNi YN
2
n N i 1
42
nN i S i
ni k
in equation 2 we get,
N S
i 1
i i
k
k
N i Ni Si
var y st ney 2 N i 1S i2
1 i 1
N i 1 nN i S i
k 2
1 N S
2 .i i N i S i2
N i 1 nN i S i
k 2
1 N
2 i N i S i2
N i 1 n
k
1 N S2 2
N S2
i 2 i i 2i
n i 1 N N
1 k 2 2 pi S i2
pi S i
n i 1 N
2
1 k 1 k
pi S i pi S i
2
n i 1 n i 1
k
1
pi S i2 pi S i2
n i 1
1 k
pi S i2 pi S i2 pi S i2 pi S i2
n i 1
1 k
k
pi S i2 2 pi S i2 pi S i2 S w pi S i
n i 1 i 1
1 k
pi S i S w
n i 1
........(4)
var y st prop
var y st ney 0
var y st prop
var y st ney
The variance of estimate of the population mean in stratified random sampling with
neyman‟s optimum allocation.
1 k 1 k
var y st ney p i S i p i S i2
n i 1
2
N i 1
........(1)
1 1 k 1 1 k
pi S i2 pi YNi YN pi S i pi S i2
2 1 k 2 1 k
n N i 1 n N i 1 n i 1 N i 1
1 k 1 1 k
pi S i2 pi S i2 pi YNi YN pi S i pi S i2
1 k 2 1 k 2 1 k
n i 1 N i 1 n N i 1 n i 1 N i 1
1 1 1
pi S i2 pi S i2 p i YNi YN
k k k
2
n i 1 i 1 n N i 1
1 k
p i S i S w 2
1 1 k
pi YNi YN
2
.........(3)
n i 1 n N i 1
Theorem: 4.5
Proof:
n S2
w.k.t, varsrs 1 .........(1)
N n
k
S2
v y srswor 2 N i N i ni i
1
and ...........(2)
N i 1 ni
the population correction is ignored,
44
S2
varsrs .........(3)
n
w.k.t, the allocation of sample size in stratified random sampling with proportion allocation
is,
nN i
ni
N
Substitute ni in equation 2 we get,
k
nN i NS i 2
var y st prop 2
1
Ni Ni
N i 1 N nN i
1 k n S i2
var y st prop i 1 N n
N
N i 1
k 2
NS
var y st prop 2 i i
1
..........(4)
N i 1 n
w.k.t, the allocation of sample size in stratified random sampling with neyman‟s optimum
allocation is,
nN S
ni k i i
Ni Si
i
var y st ney
1 k
N S 3
1 k
2 N S
i i 2
i i
N i 1 nN i S i N2 i 1
k N Si 2
var y st ney
1
N S 1
2 i
i i
N2 i 1 nN i S i
2 n
k
var y st ney
1
2
N i S i 1
2
nN i 1 Ni
2
1 k
var y st ney 2
Ni Si .......(5) f . p.c ignored
nN i 1
From equation 3, 4 and 5
varsrs var prop varney
45
UNIT-V
A sampling technique in which only the first unit is selected with the help of random
numbers and the rest get selected automatically according to some pre-designed pattern is
known as systematic random sampling. Systematic random sampling is also referred to
briefly as systematic sampling. Suppose N units of the population are numbered from 1 to N
in some order. Let N=nk, where n is the sample size and k is an integer, and a random
number less than or equal to k be selected and every kth unit thereafter. The resultant sample
is called every kth systematic sample and such a procedure termed linear systematic sampling.
If N nk, and every kth unit be included in a circular manner till the whole list is exhausted it
will be called circular systematic sampling.
K possible systematic sample together with their means are given in the following
table. Thus k rows of the table given the k systematic sample. The column of the above table
are also sometimes referred to as k strata. From the above table the N units in the population
occurs once one of the k sample and thus has an equal chance of being included in the
sample. Since the probability of selecting the i th sample i 1,2,3...k as the systematic is .
1
k
We get,
1 k
E y i y i y .
k i 1
Thus if N=nk the sample mean provide an unbiased estimate of the population mean.
v y sys y i. y
1 k
k i 1
Theorem 5.1:
If N=nk the systematic sample mean y sys is an unbiased estimate of the population mean y
Proof:
E y sy
1 k
w.k.t yi
k i 1
1 k n y ij
k i 1 j 1 n
1 k n
yij
nk i 1 j 1
1 k n
y ij N nk
N i 1 j 1
y
E y sy y
46
Theorem 5.2:
vy sys N 1
2
S2
n 1
S wsy
N n
Proof:
S2
1 k n
ij Y Y
N 1 i 1 j 1
Add & subtract by y i ,
2
i 1 j 1
2 2
i 1 j 1 i 1 j 1 i 1 j 1
2 2
i 1 j 1 i 1 j 1
2
yi Y
k k
N 1S 2
k n 1S 2
wsy .........(1)
i 1 j 1
k k
v y yi y N v y yi y
1 2 2
w.k.t,
N i 1 i 1
now consider,
k n
2
2
y
k n
i Y y i y
i 1 j 1 i 1 j 1
nk v yi ..............(2)
Substitute equation 2 in equation 1 we get,
N 1S 2 k n 1S 2 wsy nk v yi
Divided on both sides by nk
N 1S 2 k n 1 S 2 nk v y
wsy i
nk nk nk
N 1 S 2 k n 1 S 2 v y
wsy i
nk nk
v y i
N 1 S 2 n 1 S 2
wsy
N n
Hence, v y sys
N 1 S 2 n 1 S 2
wsy
N n
47
Theorem 5.3: Systematic sampling vs. simple random sampling.
The systematic sample is more precise then a simple random sample without
replacement if the mean square within the systematic sample is larger than the population
mean square. In other words systematic sampling will yield better results only if the units
within the same sample are heterogeneous.
Proof:
w.k.t the variance of simple random sample is without replacement is given by,
N n S
2
v y srs
N n
Also we know that the variance of systematic sample is given by,
vy sys N 1
2
S2
n 1
S wsy
N n
For claiming the above statement the assumption in given by,
v y srs v y sys
N n S
2 2
S2
N 1 n 1
S wsy
N n N n
n 1 S wsy N 1 S N n S
2 2 2
n N N n
N 1 N n
2
n 1 S wsy
S2
n N Nn
n N 1 n n
2
n 1 S wsy
S2
n Nn
nN n N n
2
n 1 S wsy
S2
n Nn
nN N
2
n 1 S wsy
S2
n Nn
N n 1
2
n 1 S wsy
S2
n Nn
S 2 wsy S2
Theorem 5.4:
nk 1 S 2
var y srs . 1 n 1
nk n
Where is the intra class correlation between the units of the same systematic sampling and
is given by,
y y y ij y
k n
ij
i 1 j j 1
nk n 1 2
48
y y y ij y
k n
ij
i 1 j j 1
n 1n 1s 2
N 2 N 1S 2
Since ,
nk 2 nk 1S 2
Proof:
w.k.t,
2
v y srs y i.
k
1
y
k i 1
2
1 k 1 n 1 n
v y srs y ij y j
k i 1 n j 1 n j 1
2
1 k 1 n
v y srs y ij y j
k i 1 n j 1
2
1 k n
v y srs 2 y ij y j
n k i 1 j 1
2
n
n kv y srs y ij y j
k
2
i 1 j 1
k n
n 2 kv y srs yij y j yij y j yij y j
n
i 1 j 1 j j 1
v y srs nk 1S n 1nk 1S 2
2
v y srs
nk 1 . S 2 1 n 1
nk n
Remarks:
If v y srs 0
nk 1 . S 2 1 n 1 0
nk n
1 n 1 0
n 1 1
1
n 1
1
Thus the minimum value of is when v y srs 0
n 1
49
Theorem 5.5: Systemic sampling vs. Simple Random Sampling without Replacement.
The relative efficiency of the estimate of the population mean is systematic sampling
over simple random sampling without replacement is given by,
var y srswor
E
var y sys
N n 2
S
Nn
nk 1S 2
2
1 n 1
n k
nk 1
nk 11 n 1
Obviously this depend on the value of
nk 1
If E 1 1
nk 11 n 1
nk n nk 1 nk 1n 1
nk n nk 1 nk 1n 1
n 1 (nk 1)(n 1)
1 (nk 1)
1
nk 1
Thus systematic sampling would be more efficiency as compared with simple random
sampling without replacement
1
If
nk 1
On other can simple random sampling without replacement with the supervise to the
1
systematic sampling
nk 1
Let us now record the population of N=nk units to be divided into „n‟ strata
corresponding to the N columns and suppose that one units is drawn randomly from in each
1 k
stratum thus stratified random sampling of size n the, mean of the j th stratum, y j y ij
k i 1
k n n
1 1
Population mean, y
nk i 1 j 1
y ij y j
n j 1
yij y j
k
1
Stratum mean square, S 2j
2
N j 1 i 1
yij y j
1 k
2
k 1 i 1
50
1 k n
S wst =pooled mean square between units within strata
2
yij
nk i 1 j 1
wst in the correlation co-efficient between deviations from stratum means of pair of items
that are in the same systematic sample.
Thus,
E yij y j yij y j
wst
E yij y j
i 1 y ij y j y ij y j
k n
1 j 1
wst
k (n 1)
y y j
k n
1
ij
nk i 1 j 1
i 1 y ij y j y ij y j
k n
j 1
wst
(n 1)n(k 1) S wst
2
Theorem 5.7:
nk 1 2
var y sys .S wst 1 n 1 wst
nk
Proof:
2
var y sys y i y
1 k
k i 1
2
1 n
var y sys
2 k
1 1 n
ij
i 1 n j 1
y y j
k n j 1
2
1 k n
var y sys 2 y ij y j
n k i 1 j 1
1
var y sys 2 yij yij yij yij yij yij
k k k k
2
n k i 1 j 1 i 1 j j 1
1
n k
var y sys 2 n(k 1) S 2 wst n(n 1)(k 1) wst S wst
2
n(k 1) S 2 wst
var y sys 1 n(n 1) wst
n2k
(k 1)
var y sys 1 n(n 1) wst
nk
var y st 2 yij y j
1 k n
n k i 1 j 1
k 1 2
var y st S wst
nk
The relative efficiency of the estimate of the population mean in systematic sampling
stratified random sampling is given by,
k 1 2
var y st
S wst
nk 1
E
var y sys (k 1) 2 n (n 1) wst
S wst 1 (n 1)
nk
Thus the relative efficiency of systematic sampling gives difference upon the values
are wst and are nothing can be concluded in general. If wst 0 then E 1 thus in this case
stratified random sampling will provided better estimate of y .if wst 0 then E 1 and
thus in the case both sampling provided estimates of y with equal precision.
Demerits:
the main disadvantage of systematic sampling is that systematic sample are not in a
general random sample since the requirement in merits to is rarely fulfilled.
if N is not a multiple of n then as the actual sample size is different from that
requirement.
sample mean is not an unbiased estimate of the population mean.
systematic sampling may yield highly estimates if there are periodic features
associated with the sampling interval.
52
APPENDIX
61 37 37 06 29 96 54 18 15 08 28 21 40 64 85 81 52 42 60 87 50 39 69 24 54 67 10 61 26 96
30 13 85 98 57 19 22 45 50 31 76 16 26 65 79 38 80 95 99 60 97 32 95 38 05 54 94 52 51 13
95 11 75 45 00 72 66 66 97 27 51 83 52 02 27 41 69 72 67 18 95 25 23 85 88 04 74 31 17 63
17 08 99 87 82 89 93 96 38 42 24 99 14 20 23 77 77 30 66 95 28 48 82 62 15 50 31 30 21 99
23 84 05 51 62 85 93 82 75 90 86 68 98 40 87 93 42 37 82 31 63 00 93 03 49 06 64 98 28 65
38 43 10 09 48 90 98 84 88 10 28 22 72 93 97 41 24 94 13 54 51 65 46 06 20 25 44 06 22 49
37 72 55 05 51 43 30 00 72 36 23 91 73 45 94 82 81 77 66 93 40 95 01 73 95 12 86 91 19 04
65 68 39 29 61 76 44 76 56 61 27 79 93 18 02 57 29 82 32 40 93 91 06 52 17 02 56 47 76 09
12 84 69 90 17 89 57 07 48 77 53 06 77 54 00 79 87 88 42 78 70 53 99 02 13 65 08 72 61 09
16 24 64 00 57 44 36 91 22 98 80 87 42 75 50 04 11 82 27 59 47 79 19 59 39 52 23 64 92 64
36 20 51 02 05 11 04 03 90 52 21 55 81 48 53 37 95 67 83 93 46 48 49 60 93 61 67 77 81 75
81 46 54 47 88 08 15 23 49 57 53 41 57 77 89 04 45 11 20 86 35 88 47 04 08 60 57 48 17 14
00 20 67 08 68 75 72 39 08 79 49 36 46 54 14 88 14 61 95 72 11 76 31 18 54 77 30 37 05 13
86 21 64 79 77 11 49 61 37 98 96 40 19 16 18 03 04 61 52 77 88 81 57 77 22 96 52 37 52 35
41 48 87 32 12 63 81 05 52 12 59 82 90 90 06 81 32 77 91 79 82 19 61 77 18 70 57 53 44 71
34 49 18 35 39 09 22 88 33 56 12 72 48 68 49 31 55 05 31 77 74 10 55 72 94 82 56 71 84 46
10 84 03 08 39 13 67 18 94 83 81 98 98 81 93 23 92 23 89 11 49 73 54 16 73 91 29 75 24 31
77 51 26 79 84 46 03 11 91 20 84 50 90 97 51 97 87 87 52 11 33 18 94 82 83 12 58 64 21 39
53
92 32 80 57 19 14 25 94 83 01 97 61 69 64 90 77 68 65 95 41 54 39 37 94 99 09 54 99 39 48
81 46 13 83 71 79 05 73 26 55 89 34 54 60 60 42 33 84 37 02 68 24 63 62 95 06 07 28 91 64
29 94 28 32 42 17 30 50 47 50 35 85 51 33 19 95 71 93 49 43 64 93 62 67 87 67 16 59 15 51
72 92 92 88 95 75 93 19 20 25 84 58 66 90 60 73 85 43 07 33 40 87 84 96 08 17 33 50 87 76
29 10 09 67 99 76 28 22 89 52 48 20 60 97 32 03 96 75 08 18 44 54 39 24 90 43 61 75 03 62
89 06 72 87 21 81 44 82 79 90 64 48 69 64 78 17 35 49 38 97 24 30 08 07 02 49 17 61 44 57
97 88 35 46 04 24 63 47 94 30 49 25 39 16 92 41 78 47 57 99 20 95 84 01 14 60 87 51 06 80
10 44 83 30 15 86 37 30 61 46 66 09 14 06 22 79 11 15 90 98 06 29 67 73 96 52 57 71 47 90
63 27 80 67 28 37 73 38 55 31 19 63 79 04 32 36 11 55 63 19 82 41 88 02 15 96 66 03 59 73
82 35 14 99 53 05 11 61 98 35 02 38 99 34 43 76 52 45 58 64 91 32 22 46 53 18 09 48 12 83
82 17 50 34 40 38 05 74 35 60 02 95 31 01 96 46 75 69 11 23 93 19 21 08 31 49 30 55 29 56
81 51 30 65 69 42 25 79 82 71 38 61 39 69 04 91 78 28 92 09 14 26 74 94 66 24 87 12 54 99
00 73 77 22 60 57 96 57 77 27 48 54 94 06 42 04 77 89 80 91 23 73 80 95 21 92 49 81 70 70
20 95 66 37 92 09 61 16 24 67 39 52 56 94 88 38 47 09 93 80 36 21 58 84 65 58 55 38 17 76
60 86 49 05 64 03 31 03 01 93 31 80 35 63 08 73 03 66 31 38 53 30 93 11 53 96 83 70 25 16
35 59 54 51 74 86 36 52 67 96 87 64 35 94 28 53 09 28 91 45 56 84 67 28 11 01 19 27 88 70
58 70 27 42 49 32 29 84 81 26 73 84 23 91 19 29 69 37 62 23 18 06 84 63 76 70 75 40 99 90
51 73 94 59 02 17 48 55 50 22 15 61 95 24 85 53 39 97 34 07 54 18 13 85 61 20 13 68 86 18
35 63 45 25 72 77 85 26 55 68 04 66 35 55 93 05 69 26 92 11 34 95 73 34 44 64 07 35 69 87
17 15 15 91 52 25 21 19 40 96 46 48 06 62 94 69 85 35 24 63 74 83 04 14 63 37 39 70 00 37
38 13 97 34 03 11 22 45 29 74 39 54 15 28 71 32 67 94 80 16 47 44 14 84 47 78 63 05 73 94
01 08 75 82 15 68 29 98 57 88 69 90 13 07 95 36 83 81 55 82 51 00 98 67 46 20 95 94 61 63
54
83 23 27 53 79 60 19 77 82 71 38 91 94 76 24 92 81 24 44 12 17 50 18 08 66 25 22 45 12 52
74 55 87 40 13 43 50 61 30 24 77 31 43 57 65 08 99 34 76 84 88 36 86 21 70 73 32 56 82 33
65 28 70 15 04 38 75 26 29 41 97 77 70 18 16 76 14 62 58 93 97 14 45 62 99 45 39 18 07 36
71 04 46 40 74 99 13 86 22 85 26 49 89 34 01 24 39 57 18 15 20 24 72 11 63 67 17 98 01 97
52 61 30 66 03 41 21 39 83 39 92 04 19 91 71 28 82 27 63 07 79 04 45 09 20 11 33 94 81 17
40 53 21 02 85 92 53 90 99 02 16 84 17 22 57 95 79 01 57 14 22 23 80 11 22 11 36 08 43 24
57 92 36 95 65 66 34 22 64 44 61 48 97 49 70 80 46 40 31 27 57 45 73 11 22 31 62 35 67 08
20 68 16 90 17 64 12 12 18 88 91 85 67 73 97 05 18 16 19 88 08 70 48 78 30 23 22 35 74 82
77 75 14 74 86 48 09 06 64 31 66 99 10 10 09 68 14 45 17 47 63 88 09 92 54 45 51 69 14 78
45 15 31 75 71 62 83 08 50 35 94 29 75 02 22 90 56 25 86 97 13 36 72 00 21 26 37 09 90 92
71 27 42 16 71 13 16 03 23 31 27 53 46 00 25 58 08 17 24 81
Note: Numbers are blocked in groups of two digits for convenience only. In using this table you can
read numbers of any number of digits in any way you want.
55
II. Area under Normal Curve
z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
0.0 0.0000 0.004 0.008 0.012 0.016 0.0199 0.0239 0.0279 0.0319 0.0359
0.1 0.0398 0.0438 0.0478 0.0517 0.0557 0.0596 0.0636 0.0675 0.0714 0.0753
0.2 0.0793 0.0832 0.0871 0.091 0.0948 0.0987 0.1026 0.1064 0.1103 0.1141
0.3 0.1179 0.1217 0.1255 0.1293 0.1331 0.1368 0.1406 0.1443 0.148 0.1517
0.4 0.1554 0.1591 0.1628 0.1664 0.1700 0.1736 0.1772 0.1808 0.1844 0.1879
0.5 0.1915 0.195 0.1985 0.2019 0.2054 0.2088 0.2123 0.2157 0.219 0.2224
0.6 0.2257 0.2291 0.2324 0.2357 0.2389 0.2422 0.2454 0.2486 0.2517 0.2549
0.7 0.2580 0.2611 0.2642 0.2673 0.2903 0.2734 0.2764 0.2794 0.2823 0.2852
0.8 0.2881 0.2910 0.2939 0.2967 0.2995 0.3023 0.3051 0.3078 0.3106 0.3133
0.9 0.3159 0.3186 0.3212 0.3238 0.3264 0.3289 0.3315 0.334 0.3365 0.3389
1.0 0.3413 0.3438 0.3461 0.3485 0.3508 0.3531 0.3554 0.3577 0.3599 0.3621
1.1 0.3643 0.3665 0.3686 0.3708 0.3729 0.3749 0.3770 0.3790 0.3810 0.3830
1.2 0.3849 0.3869 0.3888 0.3907 0.3925 0.3944 0.3962 0.398 0.3997 0.4015
1.3 0.4032 0.4049 0.4066 0.4082 0.4099 0.4115 0.4131 0.4147 0.4162 0.4177
1.4 0.4192 0.4207 0.4222 0.4236 0.4251 0.4265 0.4279 0.4292 0.4306 0.4319
1.5 0.4332 0.4345 0.4357 0.4370 0.4382 0.4394 0.4406 0.4418 0.4429 0.4441
1.6 0.4452 0.4463 0.4474 0.4484 0.4495 0.4505 0.4515 0.4525 0.4535 0.4545
1.7 0.4554 0.4564 0.4573 0.4582 0.4591 0.4599 0.4608 0.4616 0.4625 0.4633
1.8 0.4641 0.4649 0.4656 0.4664 0.4671 0.4678 0.4686 0.4693 0.4699 0.4706
1.9 0.4713 0.4719 0.4726 0.4732 0.4738 0.4744 0.4750 0.4756 0.4761 0.4767
2.0 0.4772 0.4778 0.4783 0.4788 0.4793 0.4798 0.4803 0.4808 0.4812 0.4817
2.1 0.4821 0.4826 0.4830 0.4834 0.4838 0.4842 0.4846 0.4850 0.4854 0.4857
2.2 0.4861 0.4864 0.4868 0.4871 0.4875 0.4878 0.4881 0.4884 0.4887 0.4890
2.3 0.4893 0.4896 0.4898 0.4901 0.4904 0.4906 0.4909 0.4911 0.4913 0.4916
2.4 0.4918 0.4920 0.4922 0.4925 0.4927 0.4929 0.4931 0.4932 0.4934 0.4936
2.5 0.4938 0.4940 0.4941 0.4943 0.4945 0.4946 0.4948 0.4949 0.4951 0.4952
2.6 0.4953 0.4955 0.4956 0.4957 0.4959 0.4960 0.4961 0.4962 0.4963 0.4964
2.7 0.4965 0.4966 0.4967 0.4968 0.4969 0.4970 0.4971 0.4972 0.4973 0.4974
2.8 0.4974 0.4975 0.4976 0.4977 0.4977 0.4978 0.4979 0.4979 0.4980 0.4981
2.9 0.4981 0.4982 0.4982 0.4983 0.4984 0.4984 0.4985 0.4985 0.4986 0.4986
3.0 0.4987 0.4987 0.4987 0.4988 0.4988 0.4989 0.4989 0.4989 0.4990 0.4990
56