Abstract
This article attempts to answer the three questions appearing in the title. It starts by discussing unique features of complex survey data not shared by other data sets, which require special attention but suggest a large variety of diverse inference procedures. Next a large number of different approaches proposed in the literature for handling these features are reviewed with discussion on their merits and limitations. The approaches differ in the conditions underlying their use, additional data required for their application, goodness of fit testing, the inference objectives that they accommodate, statistical efficiency, computational demands, and the skills required from analysts fitting the model. The last part of the paper presents simulation results, which compare the approaches when estimating linear regression coefficients from a stratified sample in terms of bias, variance, and coverage rates. It concludes with a short discussion of pending issues.
Original language | English |
---|---|
Pages (from-to) | 115-136 |
Number of pages | 22 |
Journal | Survey Methodology |
Volume | 37 |
Issue number | 2 |
State | Published - 21 Dec 2011 |
Keywords
- Informative sampling
- Likelihood-based methods
- NMAR nonresponse
- Probability weighting
- Randomization distribution
- Sample model