Open Journal of Mathematical Sciences
ISSN: 2523-0212 (Online) 2616-4906 (Print)
DOI: 10.30538/oms2020.0110
A modified efficient difference-type estimator for population mean under two-phase sampling design
A. E. Anieting\(^1\), J. K. Mosugu
Department of Statistics, University of Uyo, Uyo, Nigeria (A.E.A.)
National Open University of Nigeria, Abuja, Nigeria (J.K.M.)
\(^{1}\)Corresponding Author: akaninyeneanieting@uniuyo.edu.ng
Abstract
Keywords:
1. Introduction
Auxiliary information is used in survey sampling, either at the estimation stage or in the construction of an estimator, to improve designs and increase the efficiency of estimators. Laplace [1] initiated the use of auxiliary information in ratio-type estimation. Since then, statisticians have devoted considerable attention to constructing new and more efficient estimators of population parameters [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13]. Khan and Al-Hossain [14] suggested a generalized chain ratio in regression estimator for the population mean using two auxiliary variables. In this work, a modified difference-type estimator for the population mean under two-phase sampling is suggested [15].
First, we give some definitions and notation. Consider a finite population \(U =\{U_1, U_2, U_3, \dots, U_N\}\) of \(N\) distinct units. Let \(y\) and \(x\) be the study and the auxiliary variables, taking values \(y_i\) and \(x_i\) respectively on the \(i^{th}\) unit, \(i = 1, 2, 3, \dots, N\), with population means \[\overline{Y}= \frac{1}{N} \sum_{i=1}^{N} y_i\] and \[\overline{X}= \frac{1}{N} \sum_{i=1}^{N} x_i\] respectively.
Also let \[S^2_x= \frac{1}{N-1}\sum_{i=1}^{N} (x_i-\overline{X})^2\] and \[S^2_y = \frac{1}{N-1}\sum_{i=1}^{N} (y_i-\overline{Y})^2\] be the population variances of the auxiliary and the study variables respectively, let \(C_x\) and \(C_y\) be the coefficients of variation of the auxiliary and the study variables respectively, and let \(\rho_{yx}\) be the correlation coefficient between \(y\) and \(x\).
Let the sample means of \(x\) and \(y\) be \[\overline{x}=\frac{1}{n}\sum_{i=1}^{n} x_i\] and \[\overline{y}=\frac{1}{n}\sum_{i=1}^{n} y_i\] respectively. Also let \[\widehat{S}^{2}_{y}= \frac{1}{n-1}\sum_{i=1}^{n} (y_i- \overline{y})^{2}\] and \[\widehat{S}^{2}_{x} =\frac{1}{n-1}\sum_{i=1}^{n} (x_i- \overline{x})^2\] be the corresponding sample variances of the study and the auxiliary variables respectively. Let \[S_{yx} = \frac{\sum_{i=1}^{N} (y_i-\overline{Y})(x_i-\overline{X})}{N-1},\] \[S_{yz} = \frac{\sum_{i=1}^{N} (y_i-\overline{Y})(z_i-\overline{Z})}{N-1}\] and \[S_{xz} =\frac{\sum_{i=1}^{N} (z_i-\overline{Z})(x_i-\overline{X})}{N-1}\] be the population covariances between the variables named in their subscripts, where \(z\) is a second auxiliary variable with population mean \(\overline{Z}\) and population standard deviation \(S_z\). Similarly, \[b_{yx}=\frac{\widehat{S}_{yx}}{\widehat{S}^2_x}\] is the sample regression coefficient of \(y\) on \(x\) based on a sample of size \(n\), where \(\widehat{S}_{yx}\) is the sample covariance of \(y\) and \(x\). Also, \[C_y=\frac{S_y}{\overline{Y}},\,\, C_x=\frac{S_x}{\overline{X}}\,\,\text{ and }\,\,C_z=\frac{S_z}{\overline{Z}}\] are the coefficients of variation of the study and the auxiliary variables. Finally, \(\theta=\frac{1}{n}-\frac{1}{N},\ \theta_1=\frac{1}{n'}-\frac{1}{N}\) and \(\theta_2=\frac{1}{n}-\frac{1}{n'}\), where \(n'\) and \(n\) are the first-phase and second-phase sample sizes.
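As a rough illustration of the notation above, the following Python sketch computes the population quantities and the three \(\theta\) terms for a small artificial population. The function name `population_summary`, the toy data and the sample sizes are assumptions made for illustration only; they are not taken from the paper or from the data set in [14].

```python
import random


def population_summary(y, x, n, n_first):
    """Population-level quantities in the notation of Section 1.

    y, x    : study and auxiliary values for all N units
    n       : second-phase sample size
    n_first : first-phase sample size n' (n < n' < N)
    """
    N = len(y)
    y_bar = sum(y) / N                       # population mean of y
    x_bar = sum(x) / N                       # population mean of x
    s2_y = sum((v - y_bar) ** 2 for v in y) / (N - 1)
    s2_x = sum((v - x_bar) ** 2 for v in x) / (N - 1)
    s_yx = sum((a - y_bar) * (b - x_bar) for a, b in zip(y, x)) / (N - 1)
    return {
        "Y_bar": y_bar,
        "X_bar": x_bar,
        "S2_y": s2_y,
        "S2_x": s2_x,
        "S_yx": s_yx,
        "C_y": s2_y ** 0.5 / y_bar,          # coefficient of variation of y
        "C_x": s2_x ** 0.5 / x_bar,          # coefficient of variation of x
        "rho_yx": s_yx / (s2_y * s2_x) ** 0.5,
        "theta": 1 / n - 1 / N,
        "theta_1": 1 / n_first - 1 / N,
        "theta_2": 1 / n - 1 / n_first,
    }


# Hypothetical toy population (NOT the data set used in the paper).
random.seed(1)
x = [random.uniform(10, 50) for _ in range(200)]
y = [2.0 * xi + random.gauss(0, 5) for xi in x]

summary = population_summary(y, x, n=20, n_first=60)
for name, value in summary.items():
    print(f"{name}: {value:.4f}")
```

Note that \(\theta = \theta_1 + \theta_2\) by construction, which gives a quick sanity check on any implementation of these terms.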
2. Some existing estimators
Consider a finite population of \(N\) units. To estimate the population mean \(\overline{Y}\), it is assumed that the correlation between \(y\) and \(x\) is greater than the correlation between \(y\) and \(z\) (i.e. \(\rho_{yx} > \rho_{yz}\)). Suppose that the population mean \(\overline{X}\) of the auxiliary variable \(x\) is unknown, but that information on another, cheaply obtained auxiliary variable \(z\), which is closely related to \(x\) but less closely related to \(y\) than \(x\) is, is available for all units in the population. Two-phase sampling is appropriate in such a situation. In the double sampling scheme, a large first-phase sample of size \(n'\) (\(n' < N\)) is drawn from the population \(U\) by simple random sampling without replacement (SRSWOR), and \(x\) and \(z\) are measured to estimate \(\overline{X}\) and \(\overline{Z}\). In the second phase, a subsample of size \(n\) (\(n < n'\)) is drawn by SRSWOR from the first-phase sample, or directly from the population \(U\), and the study variable \(y\) is observed. The variance of the usual estimator \(t_0 = \overline{y}=\frac{1}{n}\sum_{i=1}^{n} y_i\), up to the first order of approximation, is given by \[\mathrm{Var}(t_0) = \theta S^2_y.\]
3. The proposed estimator
On the basis of Khan and Al-Hossain [14], a modified difference-type estimator for the population mean under the two-phase sampling scheme using two auxiliary variables is proposed as
4. Comparison of efficiency
In this section, the proposed estimator is compared with other existing estimators.
- By (1) and (13)
- By (11) and (13)
- By (3) and (13)
- By (7) and (13)
5. Numerical comparison
Using the data set given in [14], the mean squared errors (MSEs) and the percent relative efficiencies (PREs) of the estimators with respect to \(t_0\) are given in Table 1.
Table 1
Estimators | MSEs | PREs |
---|---|---|
\(t_0\) | 1.7525 | 100 |
\(t_1\) | 1.5032 | 116.59 |
\(t_3\) | 1.2793 | 137.00 |
\(t_5\) | 1.1312 | 154.92 |
\(t_m\) | 0.8206 | 213.56 |
\(t_{ae}\) | 0.6693 | 261.84 |
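The PRE column of Table 1 can be checked from the MSE column. The sketch below assumes the usual definition \(\mathrm{PRE}(t, t_0) = 100 \times \mathrm{MSE}(t_0)/\mathrm{MSE}(t)\); the function name `percent_relative_efficiency` is an illustrative choice, and the MSE values are copied from Table 1.

```python
# MSE values copied from Table 1; t_0 is the baseline estimator.
mse = {
    "t_0": 1.7525,
    "t_1": 1.5032,
    "t_3": 1.2793,
    "t_5": 1.1312,
    "t_m": 0.8206,
    "t_ae": 0.6693,
}


def percent_relative_efficiency(mse_table, baseline="t_0"):
    """Return 100 * MSE(baseline) / MSE(t) for every estimator t."""
    base = mse_table[baseline]
    return {name: 100.0 * base / value for name, value in mse_table.items()}


pre = percent_relative_efficiency(mse)
for name, value in pre.items():
    print(f"{name}: {value:.2f}")
```

Because the tabulated MSEs are rounded to four decimal places, the recomputed PREs can differ from the tabulated ones in the second decimal place.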
6. Conclusion
Table 1 shows that the proposed estimator has a smaller mean squared error and a higher percent relative efficiency than the other existing estimators. Hence, the proposed estimator is more efficient and is recommended for use in practice for difference-type estimation.
Author Contributions
All authors contributed equally to the writing of this paper. All authors read and approved the final manuscript.
Conflict of Interests
The authors declare no conflict of interest.
References
- Laplace, P. S. (1820). Théorie analytique des probabilités. Courcier.
- Hansen, M. H., & Hurwitz, W. N. (1943). On the theory of sampling from finite populations. The Annals of Mathematical Statistics, 14(4), 333-362.
- Sukhatme, B. V. (1962). Some ratio-type estimators in two-phase sampling. Journal of the American Statistical Association, 57(299), 628-632.
- Srivastava, S. K. (1970). A Two-Phase Sampling Estimator in Sample Surveys. Australian Journal of Statistics, 12(1), 23-27.
- Chand, L. (1975). Some ratio-type estimators based on two or more auxiliary variables. Unpublished Ph.D. dissertation, Iowa State University, Ames 1975.
- Cochran, W. G. (1977). Sampling techniques (3rd ed.). New York: John Wiley & Sons.
- Kiregyera, B. (1980). A chain ratio-type estimator in finite population double sampling using two auxiliary variables. Metrika, 27(1), 217-223.
- Kiregyera, B. (1984). Regression-type estimators using two auxiliary variables and the model of double sampling from finite populations. Metrika, 31(1), 215-226.
- Khare, B. B., Srivastava, U., & Kumar, K. (2013). A generalized chain ratio in regression estimator for population mean using two auxiliary characters in sample survey. Journal of Scientific Research, 57, 147-153.
- Bahl, S., & Tuteja, R. (1991). Ratio and product type exponential estimators. Journal of information and optimization sciences, 12(1), 159-164.
- Singh, H. P., Singh, S., & Kim, J. M. (2006). General families of chain ratio type estimators of the population mean with known coefficient of variation of the second auxiliary variable in two phase sampling. Journal of the Korean Statistical Society, 35(4), 377-395.
- Singh, R., Chauhan, P., Sawan, N., & Smarandache, F. (2011). Improved exponential estimator for population variance using two auxiliary variables. Italian Journal of Pure and Applied Mathematics, 28, 101-108.
- Singh, B. K., & Choudhury, S. (2012). Exponential chain ratio and product type estimators for finite population mean under double sampling scheme. Journal of Science Frontier Research in Mathematics and Design Sciences, 12(6), 0975-5896.
- Khan, M., & Al-Hossain, A. Y. (2016). A note on a difference-type estimator for population mean under two-phase sampling design. SpringerPlus, 5(1), 1-7.
- Singh, G., & Majhi, D. (2014). Some chain-type exponential estimators of population mean in two-phase sampling. Statistics in Transition new series, Glówny Urzad Statystyczny (Polska), 15(2), 221-230.