Dynamically consistent noise infusion and partially synthetic data as confidentiality protection measures for related time-series

dc.contributor.authorAbowd, John
dc.contributor.authorGittings, Kaj
dc.contributor.authorMcKinney, Kevin L.
dc.contributor.authorStevens, Bryce E.
dc.contributor.authorVilhuber, Lars
dc.contributor.authorWoodcock, Simon
dc.date.accessioned2020-12-06T22:17:24Z
dc.date.available2020-12-06T22:17:24Z
dc.date.issued2012-04-24
dc.descriptionPresented at FCSM.
dc.description.abstractThe Census Bureau's Quarterly Workforce Indicators (QWI) provide detailed quarterly statistics on employment measures such as worker and job ows, tabulated by detailed worker characteristics in various combinations. The data are released for detailed NAICS industries and for several levels of geography, the lowest aggregation of which are counties. OnTheMap, another Census Bureau product, provides a subset of these tabulations at the tract level. Disclosure avoidance methods are required to protect the information about individuals and businesses that contribute to the underlying data. The QWI disclosure avoidance mechanism we describe here relies heavily on the use of noise infusion through a permanent multiplicative noise distortion factor, used for magnitudes, counts, differences and ratios. There is minimal suppression and no complementary suppressions. To our knowledge, the release in 2003 of the QWI was the first large-scale use of noise infusion in any official statistical product. We show that the released statistics are analytically valid along several critical dimensions -- measures are unbiased and time series properties are preserved. We provide an analysis of the degree to which con dentiality is protected. Furthermore, we show how the judicious use of synthetic data, injected into the tabulation process, can completely eliminate suppressions, maintain analytical validity, and increase the protection of the underlying con dential data.
dc.description.legacydownloadsfcsm2012_noise_synthetic.pdf: 398 downloads, before Oct. 1, 2020.
dc.identifier.other2915483
dc.identifier.urihttps://hdl.handle.net/1813/89082
dc.language.isoen_US
dc.rightsRequired Publisher Statement: Copyright held by authors.
dc.subjectnoise infusion
dc.subjectsynthetic data
dc.subjectstatistical disclosure limitation
dc.subjecttime-series
dc.subjectlocal labor markets
dc.subjectgross job
dc.titleDynamically consistent noise infusion and partially synthetic data as confidentiality protection measures for related time-series
local.authorAffiliationAbowd, John: John.Abowd@cornell.edu Cornell University
local.authorAffiliationGittings, Kaj: Louisiana State University
local.authorAffiliationMcKinney, Kevin L.: U.S. Census Bureau
local.authorAffiliationStevens, Bryce E.: U.S. Consumer Finance Protection Bureau
local.authorAffiliationVilhuber, Lars: lv39@cornell.edu Cornell University
local.authorAffiliationWoodcock, Simon: Simon Fraser University
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
fcsm2012_noise_synthetic.pdf
Size:
455.33 KB
Format:
Adobe Portable Document Format