Ratio Estimators in Simple Random Sampling Using Information On Auxiliary Attribute
Ratio Estimators in Simple Random Sampling Using Information On Auxiliary Attribute
Florentin Smarandache
University of New Mexico, USA
Published in:
Rajesh Singh, Pankaj Chauhan, Nirmala Sawan, Florentin Smarandache (Editors)
AUXILIARY INFORMATION AND A PRIORI VALUES IN CONSTRUCTION
OF IMPROVED ESTIMATORS
Renaissance High Press, Ann Arbor, USA, 2007
(ISBN-10): 1-59973-046-4
(ISBN-13): 978-1-59973-046-2
pp. 7 - 17
Abstract
Some ratio estimators for estimating the population mean of the variable under
study, which make use of information regarding the population proportion possessing
certain attribute, are proposed. Under simple random sampling without replacement
(SRSWOR) scheme, the expressions of bias and mean-squared error (MSE) up to the first
order of approximation are derived. The results obtained have been illustrated
7
1. Introduction
The use of auxiliary information can increase the precision of an estimator when
study variable y is highly correlated with auxiliary variable x. There exist situations when
information is available in the form of attribute φ , which is highly correlated with y. For
example
c) Amount of yield of wheat crop and a particular variety of wheat etc. (see
and φ i denote the observations on variable y and φ respectively for ith unit (i = 1,2,....N) .
Suppose there is a complete dichotomy in the population with respect to the presence or
absence of an attribute, say φ , and it is assumed that attribute φ takes only the two
= 0, otherwise.
N n
Let A = ∑ φi and a = ∑ φ i denote the total number of units in the population and
i =1 i =1
A a
sample respectively possessing attribute φ . Let P = and p = denote the proportion
N n
Taking into consideration the point biserial correlation between a variable and an
attribute, Naik and Gupta (1996) defined ratio estimator of population mean when the
8
prior information of population proportion of units, possessing the same attribute is
available, as follows:
⎛P⎞
t NG = y⎜⎜ ⎟⎟ (1.1)
⎝p⎠
here y is the sample mean of variable of interest. The MSE of t NG up to the first order of
approximation is
⎛1− f ⎞ 2
MSE ( t NG ) = ⎜ [
⎟ S y + R 1 Sφ − 2R 1S yφ
2 2
] (1.2)
⎝ n ⎠
where
n
f= ,
Y
R1 = , S =
2 1 N
∑ (y i − Y )2 , S =
2
φ
1 N
∑ (φi − P )2 ,
N − 1 i =1 N − 1 i=1
y
N P
∑ (φi − P )(y i − Y ) .
1 N
S yφ =
N − 1 i=1
In the present paper, some ratio estimators for estimating the population mean of
the variable under study, which make use of information regarding the population
proportion possessing certain attribute, are proposed. The expressions of bias and MSE
have been obtained. The numerical illustrations have also been done by taking some
y + b φ ( P − p)
t1 = P = R *P (2.1)
p
s yφ y + b φ ( P − p) ⎛ 1 ⎞n
where b φ = , R* = , s φ2 = ⎜ ⎟∑ (φ i − p ) and
2
s φ2 p ⎝ n − 1 ⎠ i =1
⎛ 1 ⎞n
s yφ = ⎜ ⎟∑ (φ i − p )(y i − Y ) .
⎝ n − 1 ⎠ i =1
9
Remark 1: When we put b φ = 0 in (2.1), the proposed estimator turns to the Naik and
MSE of this estimator can be found by using Taylor series expansion given by
∂f (c, d) ∂f (c, d)
f (p, y) ≅ f (P, y) + (p − P) + ( y − Y) (2.2)
∂c P ,Y
∂c P ,Y
Expression (2.2) can be applied to the proposed estimator in order to obtain MSE
equation as follows:
⎛ y bφP ⎞ 1
≅ −⎜⎜ 2 + 2 ⎟⎟ (p − P) + ( y − Y)
⎝p p ⎠ P ,Y p P,Y
E(R − R1 )
* 2
≅
(Y + B P ) φ
2
V ( p) −
2( Y + Bφ P) 1
Cov(p, y) + 2 V( y)
P4 P 3
P
1 ⎧ ( Y + Bφ P ) 2( Y + Bφ P) ⎫
2
≅ 2⎨ 2
V ( p) − Cov(p, y) + V( y)⎬ (2.3)
P ⎩ P P ⎭
Sφy ρ pbS y
where B φ = = .
Sφ2 Sφ
S yφ
ρ pb = , is the point biserial correlation coefficient.
S ySφ
Now,
MSE ( t1 ) = P 2 E(R1 − R φ ) 2
≅
(Y + B P ) V(p ) − 2(Y + B P ) Cov(p, y ) + V(y)
φ
2
φ
(2.4)
2
P P
10
Simplifying (2.4), we get MSE of t1 as
⎛1− f
MSE (t1 ) ≅ ⎜ [
⎞ 2 2
(
⎟ R 1 Sφ + Sy 1 − ρpb
2 2
)] (2.5)
⎝ n ⎠
Several authors have used prior value of certain population parameters (s) to find
more precise estimates. Searls (1964) used Coefficient of Variation (CV) of study
(1964) work, Sen (1978), Sisodiya and Dwivedi (1981), and Upadhyaya and Singh
(1984) used the known CV of the auxiliary character for estimating population mean of a
study character in ratio method of estimation. The use of prior value of Coefficient of
Kurtosis in estimating the population variance of study character y was first made by
Singh et al. (1973). Later, used by and Searls and Intarapanich (1990), Upadhyaya and
Singh (1999), Singh (2003) and Singh et al. (2004) in the estimation of population mean
of study character. Recently Singh and Tailor (2003) proposed a modified ratio estimator
In next section, we propose some ratio estimators for estimating the population
mean of the variable under study using known parameters of the attribute φ such as
coefficient of variation Cp, Kurtosis (β2 (φ) ) and point biserial correlation coefficient ρpb .
3. Suggested Estimators
y + b φ ( P − p)
t= (m1P + m 2 ) (3.1)
(m1p + m 2 )
where m1 (≠ 0) , m2 are either real number or the functions of the known parameters of
11
The following scheme presents some of the important estimators of the population
Estimator Values of
m1 m2
y + b φ ( P − p) 1 1
t1 = P
p
y + b φ ( P − p) 1 β2 (φ)
t2 = [P + β2 (φ)]
(p + β2 (φ))
y + b φ ( P − p)
t3 =
(p + Cp ) P + Cp [ ] 1 Cp
y + b φ ( P − p) ρpb
t4 =
(p + ρpb )) P + ρpb [ ] 1
y + b φ ( P − p) β2 (φ)
t5 = [
(pβ2 (φ) + Cp ) Pβ2 (φ) + Cp ] Cp
y + b φ ( P − p) β2 (φ)
t6 = [
(pCp + β2 (φ)) PCp + β2 (φ) ] Cp
y + b φ ( P − p) ρpb
t7 = [
(pCp + ρpb ) PCp + ρpb ] Cp
y + b φ ( P − p) ρpb
t8 = [
(pρpb + Cp ) Pρpb + Cp ] Cp
y + b φ ( P − p) β2 (φ) ρpb
t9 = [
(pβ2 (φ) + ρpb ) Pβ2 (φ) + ρpb ]
y + b φ ( P − p) ρpb β2 (φ)
t10 = [
(pρpb + β2 (φ)) Pρpb + β2 (φ) ]
12
Following the approach of section 2, we obtain the MSE expression for these
proposed estimators as –
⎛1− f
MSE ( t i ) ≅ ⎜
⎞
[ ]
⎟ R iSφ + Sy (1 − ρpb ) ,
2 2 2
( i = 1,2,3,....,10 ) (3.2)
⎝ n ⎠
Y Y Y Y
where R 1 = , R2 = , R3 = , R4 = ,
P P + β2 (φ) P + Cp P + ρpb
4. Efficiency comparisons
⎛1− f ⎞ 2
V ( y) = ⎜ ⎟Sy
⎝ n ⎠
(4.1)
V ( y) − MSE ( t i ) ≥ 0 , i = 1,2,.....,10
Sφ2 2
⇒ ρ > 2 Ri
2
pb
Sy
(4.2)
When this condition is satisfied, proposed estimators are more efficient than
13
Now, we compare the MSE of the proposed estimators with the MSE of
Naik and Gupta [2] estimator t NG . From (3.2) and (1.1) we have
Sφ2 2
2
pb
Sy
[
⇒ ρ ≥ 2 R i − R φ2 + 2R φ K yp ] (4.3)
Cy
where K yp = ρ yp .
Cp
5. Empirical Study
The data for the empirical study is taken from natural population data set
In the below table 5.1 percent relative efficiencies (PRE) of various estimators
14
Table 5.1: PRE of different estimators of Y with respect to y .
y 100
t NG 11.61
t1 7.36
t2 236.55
t3 227.69
t4 208.09
t5 185.42
t6 230.72
t7 185.27
t8 230.77
t9 152.37
t10 237.81
which uses some known values of population proportion performs better than the usual
Conclusion:
15
We have suggested some ratio estimators for estimating Y which uses
some known value of population proportion. For practical purposes the choice of the
References
Jhajj, H. S., Sharma, M. K. and Grover, L. K., A family of estimators of population mean
using information on auxiliary attribute. Pakistan Journal of Statistics, 22
(1), 43-50 (2006).
Naik, V. D. and Gupta, P. C., A note on estimation of mean with known population
proportion of an auxiliary character. Journal of the Indian Society of
Agricultural Statistics, 48 (2), 151-158 (1996).
Ray, S. K. and Singh, R. K., Difference-cum-ratio type estimators. Journal of the Indian
Statistical Association, 19, 147-151 (1981).
Searls, D. T., The utilization of known coefficient of variation in the estimation procedure.
Journal of the American Statistical Association, 59, 1125-1126 (1964).
Searls, D. T. and Intarapanich, P., A note on an estimator for the variance that utilizes the
kurtosis. The American Statistician, 44, 295-296 (1990).
Sen, A. R., Estimation of the population mean when the coefficient of variation is known.
Communications in Statistics Theory and Methods A, 7, 657-672 (1978).
Singh, G. N., On the improvement of product method of estimation in sample surveys.
Journal of the Indian Society of Agricultural Statistics, 56 (3), 267-275
(2003).
Singh H. P. and Tailor, R., Use of known correlation coefficient in estimating the finite
population mean. Statistics in Transition, 6, 555-560 (2003).
Singh H. P., Tailor, R., Tailor, R. and Kakran, M. S., An improved estimator of
population mean using power transformation. Journal of the Indian
Society of Agricultural Statistics, 58 (2), 223-230 (2004).
16
Singh, J., Pandey, B. N. and Hirano, K., On the utilization of a known coefficient of
kurtosis in the estimation procedure of variance. Annals of the Institute of
Statistical Mathematics, 25, 51-55 (1973).
17