Cornell University
Library
Cornell UniversityLibrary

eCommons

Help
Log In(current)
DigitalCollections@ILR
ILR School
  1. Home
  2. ILR School
  3. Centers, Institutes, Programs
  4. Labor Dynamics Institute
  5. Labor Dynamics Institute Publications
  6. Expanding the Role of Synthetic Data at the U.S. Census Bureau

Expanding the Role of Synthetic Data at the U.S. Census Bureau

File(s)
ISIpaper_2013_JLM_8_13_2013_5pm.pdf (148.56 KB)
0-LouisJarminMiranda_ISI_2013_20130822.pdf (1.05 MB)
Presentation (PDF)
1-LouisJarminMiranda_ISI_2013_20130822.pptx (1.13 MB)
Presentation (PPTX)
Permanent Link(s)
https://hdl.handle.net/1813/89087
Collections
Labor Dynamics Institute Publications
Author
Jarmin, Ron
Louis, Thomas A.
Miranda, Javier
Abstract

National Statistical offices (NSOs) create official statistics from data collected directly from survey respondents, from government administrative records and from other third party sources. The raw source data, regardless of origin, is usually considered to be confidential. In the case of the U.S. Census Bureau, confidentiality of survey and administrative records microdata is mandated by statute, and this mandate to protect confidentiality is often at odds with the needs of data users to extract as much information as possible from rich microdata. Traditional disclosure protection techniques applied to resolve this tension have resulted in official data products that come no where close to fully utilizing the information content of the underlying microdata. Typically, these products take for the form of basic, aggregate tabulations. In a few cases anonymized public-use micro samples are made available, but these are increasingly under risk of re-identification by the ever larger amounts of information about individuals and firms that is available in the public domain. One potential approach for overcoming these risks is to release products based on synthetic or partially synthetic data where values are simulated from statistical models designed to mimic the (joint) distributions of the underlying microdata rather than making the actual underlying microdata available. We discuss recent Census Bureau work to develop and deploy such products. We also discuss the benefits and challenges involved with extending the scope of synthetic data products in official statistics.

Description
Presented at World Statistical Congress 2013.
Date Issued
2013-01-01
Keywords
cofidentiality synthetic data
•
official statistics

Site Statistics | Help

About eCommons | Policies | Terms of use | Contact Us

copyright © 2002-2026 Cornell University Library | Privacy | Web Accessibility Assistance