Vilhuber, Lars; Miranda, Javier
We describe and analyze a method that blends records from both observed and synthetic microdata into public-use tabulations on establishment statistics. The resulting tables use synthetic data only in potentially sensitive cells. We describe different algorithms, and present preliminary results when applied to the Census Bureau's Business Dynamics Statistics and Synthetic Longitudinal Business Database, highlighting accuracy and protection afforded by the method when compared to existing public-use tabulations (with suppressions).
Vilhuber acknowledges support through NSF Grants SES-1042181 and BCS-0941226.
Statistical Journal of the International Association for Official Statistics
synthetic data; statistical disclosure limitation; confidentiality protection; gross job flows; local labor markets; time-series
Accepted version: Statistical Journal of the International Association for Official Statistics, forthcoming in 2016.
Previously Published As
Statistical Journal of the IAOS, vol. 32, no. 1, pp. 69-80, 2016