Dynamically consistent noise infusion and partially synthetic data as confidentiality protection measures for related time-series
dc.contributor.author | Abowd, John | |
dc.contributor.author | Gittings, Kaj | |
dc.contributor.author | McKinney, Kevin L. | |
dc.contributor.author | Stevens, Bryce E. | |
dc.contributor.author | Vilhuber, Lars | |
dc.contributor.author | Woodcock, Simon | |
dc.date.accessioned | 2020-12-06T22:17:24Z | |
dc.date.available | 2020-12-06T22:17:24Z | |
dc.date.issued | 2012-04-24 | |
dc.description | Presented at FCSM. | |
dc.description.abstract | The Census Bureau's Quarterly Workforce Indicators (QWI) provide detailed quarterly statistics on employment measures such as worker and job ows, tabulated by detailed worker characteristics in various combinations. The data are released for detailed NAICS industries and for several levels of geography, the lowest aggregation of which are counties. OnTheMap, another Census Bureau product, provides a subset of these tabulations at the tract level. Disclosure avoidance methods are required to protect the information about individuals and businesses that contribute to the underlying data. The QWI disclosure avoidance mechanism we describe here relies heavily on the use of noise infusion through a permanent multiplicative noise distortion factor, used for magnitudes, counts, differences and ratios. There is minimal suppression and no complementary suppressions. To our knowledge, the release in 2003 of the QWI was the first large-scale use of noise infusion in any official statistical product. We show that the released statistics are analytically valid along several critical dimensions -- measures are unbiased and time series properties are preserved. We provide an analysis of the degree to which con dentiality is protected. Furthermore, we show how the judicious use of synthetic data, injected into the tabulation process, can completely eliminate suppressions, maintain analytical validity, and increase the protection of the underlying con dential data. | |
dc.description.legacydownloads | fcsm2012_noise_synthetic.pdf: 398 downloads, before Oct. 1, 2020. | |
dc.identifier.other | 2915483 | |
dc.identifier.uri | https://hdl.handle.net/1813/89082 | |
dc.language.iso | en_US | |
dc.rights | Required Publisher Statement: Copyright held by authors. | |
dc.subject | noise infusion | |
dc.subject | synthetic data | |
dc.subject | statistical disclosure limitation | |
dc.subject | time-series | |
dc.subject | local labor markets | |
dc.subject | gross job | |
dc.title | Dynamically consistent noise infusion and partially synthetic data as confidentiality protection measures for related time-series | |
local.authorAffiliation | Abowd, John: John.Abowd@cornell.edu Cornell University | |
local.authorAffiliation | Gittings, Kaj: Louisiana State University | |
local.authorAffiliation | McKinney, Kevin L.: U.S. Census Bureau | |
local.authorAffiliation | Stevens, Bryce E.: U.S. Consumer Finance Protection Bureau | |
local.authorAffiliation | Vilhuber, Lars: lv39@cornell.edu Cornell University | |
local.authorAffiliation | Woodcock, Simon: Simon Fraser University |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- fcsm2012_noise_synthetic.pdf
- Size:
- 455.33 KB
- Format:
- Adobe Portable Document Format