Modelling of complex survey data: Why model? Why is it a problem? How can we approach it?

Danny Pfeffermann*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

58 Scopus citations

Abstract

This article attempts to answer the three questions appearing in the title. It starts by discussing unique features of complex survey data not shared by other data sets, which require special attention but suggest a large variety of diverse inference procedures. Next a large number of different approaches proposed in the literature for handling these features are reviewed with discussion on their merits and limitations. The approaches differ in the conditions underlying their use, additional data required for their application, goodness of fit testing, the inference objectives that they accommodate, statistical efficiency, computational demands, and the skills required from analysts fitting the model. The last part of the paper presents simulation results, which compare the approaches when estimating linear regression coefficients from a stratified sample in terms of bias, variance, and coverage rates. It concludes with a short discussion of pending issues.

Original languageEnglish
Pages (from-to)115-136
Number of pages22
JournalSurvey Methodology
Volume37
Issue number2
StatePublished - 21 Dec 2011

Keywords

  • Informative sampling
  • Likelihood-based methods
  • NMAR nonresponse
  • Probability weighting
  • Randomization distribution
  • Sample model

Fingerprint

Dive into the research topics of 'Modelling of complex survey data: Why model? Why is it a problem? How can we approach it?'. Together they form a unique fingerprint.

Cite this