Cornell University
Library
Cornell UniversityLibrary

eCommons

Help
Log In(current)
DigitalCollections@ILR
ILR School
  1. Home
  2. ILR School
  3. Centers, Institutes, Programs
  4. Labor Dynamics Institute
  5. Labor Dynamics Institute Publications
  6. How Protective Are Synthetic Data?

How Protective Are Synthetic Data?

File(s)
Abowd_Vilhuber2008_Chapter_HowProtectiveAreSyntheticData.pdf (365.87 KB)
Permanent Link(s)
https://hdl.handle.net/1813/89075
Collections
Labor Dynamics Institute Publications
Author
Abowd, John
Vilhuber, Lars
Abstract

This short paper provides a synthesis of the statistical disclosure limitation and computer science data privacy approaches to measuring the confidentiality protections provided by fully synthetic data. Since all elements of the data records in the release file derived from fully synthetic data are sampled from an appropriate probability distribution, they do not represent “real data,” but there is still a disclosure risk. In SDL this risk is summarized by the inferential disclosure probability. In privacy-protected database queries, this risk is measured by the differential privacy ratio. The two are closely related. This result (not new) is demonstrated and examples are provided from recent work

Date Issued
2008-01-01
Keywords
Conditional Distribution
•
Release Data
•
Synthetic Data
•
Laplace Distribution
•
Differential Privacy
Related Version
Published as: Abowd J.M., Vilhuber L. (2008) How Protective Are Synthetic Data?. In: Domingo-Ferrer J., Saygın Y. (eds) Privacy in Statistical Databases. PSD 2008. Lecture Notes in Computer Science, vol 5262. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87471-3_20
Related DOI
https://doi.org/10.1007/978-3-540-87471-3_20

Site Statistics | Help

About eCommons | Policies | Terms of use | Contact Us

copyright © 2002-2026 Cornell University Library | Privacy | Web Accessibility Assistance