Measuring inequality using censored data: a multiple-imputation approach to estimation and inference

To measure income inequality with right-censored (top-coded) data, we propose multiple-imputation methods for estimation and inference. Censored observations are multiply imputed using draws from a flexible parametric model fitted to the censored distribution, yielding a partially synthetic data set from which point and variance estimates can be derived using complete-data methods and appropriate combination formulae. The methods are illustrated using US Current Population Survey data and the generalized beta of the second kind distribution as the imputation model. With Current Population Survey internal data, we find few statistically significant differences in income inequality for pairs of years between 1995 and 2004. We also show that using Current Population Survey public use data with cell mean imputations may lead to incorrect inferences. Multiply-imputed public use data provide an intermediate solution.

en

http://eprints.lse.ac.uk/32013/