Regression analysis of data from a cluster sample

Dan Pfeffermann, Gad Nathan

Research output: Contribution to journalArticlepeer-review

50 Scopus citations

Abstract

For the case of different regression relationships in different subgroups of a finite population, only part of which are sampled, the extended least squares estimator of any weighted average of the distinct coefficients is derived, under assumptions relating only to the first two moments of the distribution of the coefficients. Under these assumptions, the estimator is shown to be the best linear ξ-unbiased estimator, while under further distributional assumptions, it is also the Bayesian estimator for a quadratic loss function. For the case of unknown variances a method for estimating them from the sample is proposed. The empirical estimator thus obtained is shown to perform well by a simulation comparison with the optimal estimator and with other proposed empirical estimators.

Original languageEnglish
Pages (from-to)681-689
Number of pages9
JournalJournal of the American Statistical Association
Volume76
Issue number375
DOIs
StatePublished - Sep 1981

Keywords

  • Bayesian estimator
  • Cluster sampling
  • Complex samples
  • Extended least squares
  • Regression

Fingerprint

Dive into the research topics of 'Regression analysis of data from a cluster sample'. Together they form a unique fingerprint.

Cite this