Cornell University
Library
Cornell UniversityLibrary

eCommons

Help
Log In(current)
DigitalCollections@ILR
ILR School
  1. Home
  2. ILR School
  3. Centers, Institutes, Programs
  4. Labor Dynamics Institute
  5. NSF Census Research Network
  6. Triangle Census Research Network (TCRN)
  7. Bayesian multiple imputation for large-scale categorical data with structural zeros

Bayesian multiple imputation for large-scale categorical data with structural zeros

File(s)
SurvMeth14.pdf (151.73 KB)
main article pre-print
Permanent Link(s)
https://hdl.handle.net/1813/34889
Collections
Triangle Census Research Network (TCRN)
Author
Manrique-Vallier, D.
Reiter, J. P.
Abstract

We propose an approach for multiple imputation of items missing at random in large-scale surveys with exclusively categorical variables that have structural zeros. Our approach is to use mixtures of multinomial distributions as imputation engines, accounting for structural zeros by conceiving of the observed data as a truncated sample from a hypothetical population without structural zeros. This approach has several appealing features: imputations are generated from coherent, Bayesian joint models that automatically capture complex dependencies and readily scale to large numbers of variables. We outline a Gibbs sampling algorithm for implementing the approach, and we illustrate its potential with a repeated sampling study using public use census microdata from the state of New York, USA.

Sponsorship
National Science Foundation
Date Issued
2013-12-18
Publisher
Survey Methodology
Keywords
Latent class
•
Log-linear
•
Missing
•
Mixture
•
Multinomial
•
Nonresponse
Type
article

Site Statistics | Help

About eCommons | Policies | Terms of use | Contact Us

copyright © 2002-2026 Cornell University Library | Privacy | Web Accessibility Assistance