Lecture 13
Lecture 13
V1 (Yˆ ) = N 2 (1 − f ) ......................(6.2.2.10)
n
Judging by this approximation, the ratio method will give a more precise result whenever
RS x
2ρ >
Sy
or ....................................(6.2.2.11)
C.V ( x)
ρ>
2C.V ( y )
Thus, the issue depends by and large on the strength of correlation between y and x . If x is
the same character as y but has been measured on a previous occasion, the coefficient of
variation may be taken to be equal. In that case it pays to use the ratio method of estimation if
ρ exceeds 0.5. But one should not be dogmatic about the inequality (6.2.2.11) since it is
based on an approximation. Ratio estimator is at its best when the relation between y and x
is a straight line through the origin i.e. y = kx .
Conversely, the regression estimation is used to estimate the population mean when the
regression line of y on x does not pass through the origin but makes an intercept along the
y − axis( y ≠ kx) .
yd = y − k ( x − X )............(6.2.2.12)
82
The estimator y d is called difference estimator, which is unbiased for Y .
Its variance is
V ( yd ) = V ( y ) + k 2V ( x ) − 2kCov( x , y )
1− f 2
= ( S y + k 2 S x2 − 2kS xy )..................................(6.2.2.13)
n
To find the best value of k to use, we differentiate V ( yd ) with respect to k and equate it to
S xy
kop = = β , the population regression coefficient.
S x2
1− f n
∑ {( yi − y ) − k ( xi − x )}
2
Vˆ ( yd ) =
n(n − 1) i =1
1− f 2
=
n
( s y + k 2 sx2 − 2ksxy ) .....................................(6.2.2.14)
yd = y − β ( x − X )............(6.2.2.15)
In practice β is not known but can be estimated from the sample at hand. The sample
sxy
estimator of β is βˆ = .
sx2
Using this estimated value of β we have the linear regression estimator of the mean
ylr = y − βˆ ( x − X ).......................(6.2.2.16)
1− f 2 ˆ2 2
Vˆ ( ylr ) =
n
( )
s y + β s x − 2 βˆ sxy ...............(6.2.2.17)
Or
83
1− f 2
Vˆ ( ylr ) = s y (1 − ρˆ 2 )
n
where .......................(6.2.2.18)
sxy2
ρ = 2 2
ˆ 2
s y sx
Example 6.2
For the data in example 6.1, estimate the total area under maize cultivation using the method
of regression estimation.
Solution
Since the population regression coefficient is not known, it is estimated from the sample data.
From example 6.1
sxy 88302.982
βˆ = 2
= = 0.46387
s x 190361.93
ylr = y − βˆ ( x − X )
= 530.0755 − 0.46387(829.1885 − 952.4995) = 587.2758
sxy2 (88302.982) 2
ρˆ 2 = = = 0.66256
sx2 s y2 (190361.93)(61822.741)
84
1− f 2
Vˆ (Yˆlr ) = N 2 s y (1 − ρˆ 2 )
n
= (88) (0.03864)(61822.742)(0.33744)
2
= 6, 242,338.104
In-Text Question
The ratio method is at its best when y = kx . True or False
In-Text Answer
True
SAQ 6.1
Define ratio estimation
SAQ 6.2
Explain the use of regression estimation
85
Notes on SAQ
SAQ 6.1
Ration estimation is a technique that uses available auxiliary information which is connected
with the variable of interest
SAQ 6.2
This is another method of using auxiliary information to improve the estimate of the
population mean or total of the character under study
References
Cochran, W.G, (1977); Sampling Techniques third edition, New York: John Wiley & Sons
Daroga Singh and Chaudhary F.S, (1986); Theory and Analysis of Sample Survey Design,
New Delhi: Wiley Eastern Limited
Des Raj and Promod Chandhok (1998); Sample Survey Theory, New Delhi: Narosa
Publishing House
Kish L. (1965); Survey Sampling, New York: John Wiley & Sons
Okafor F.C (2002); Sampling Survey Theory with Applications, Nsukka: Afro-Orbis
Publishers
86