Replicating the Synthetic LBD with German Establishment Data

dc.contributor.author: Drechsler, Jörg
dc.contributor.author: Vilhuber, Lars
dc.date.accessioned: 2020-12-06T22:17:28Z
dc.date.available: 2020-12-06T22:17:28Z
dc.date.issued: 2013-04-15
dc.description: Presented at the World Statistical Congress 2013.
dc.description.abstract: One major criticism against the use of synthetic data has been that the efforts necessary to generate useful synthetic data are so intense that many statistical agencies cannot afford them. However, we argue in this paper that the field is still evolving and many lessons that have been learned in the early years of synthetic data generation can now be used in the development of new synthetic data products, considerably reducing the required investments. We evaluate whether synthetic data algorithms that have been developed in the U.S. to generate a synthetic version of the Longitudinal Business Database (LBD) can easily be transferred to generate a similar data product for other countries. We construct a German data product with information comparable to the LBD - the German Longitudinal Business Database (GLBD) - that is generated from different administrative sources at the Institute for Employment Research, Germany. In a second stage, the algorithms developed for the synthesis of the LBD will be applied to the GLBD. Extensive evaluations will illustrate whether the algorithms provide useful synthetic data without further adjustment. The ultimate goal of the project is to provide access to multiple synthetic datasets similar to the SynLBD at Cornell to enable comparative studies between countries. The Synthetic GLBD is a first step towards that goal.
dc.description.legacydownloads: ISIpaper_2013.pdf: 191 downloads, before Oct. 1, 2020.
dc.description.legacydownloads: 0-ISIpaper_Drechsler_Vilhuber_Online_Appendix.pdf: 22 downloads, before Oct. 1, 2020.
dc.description.legacydownloads: 1-WSC2013_Drechsler.pdf: 59 downloads, before Oct. 1, 2020.
dc.identifier.other: 4030355
dc.identifier.uri: https://hdl.handle.net/1813/89091
dc.language.iso: en_US
dc.subject: confidentiality
dc.subject: comparative studies
dc.subject: German Longitudinal Business Database
dc.subject: synthetic data
dc.title: Replicating the Synthetic LBD with German Establishment Data
local.authorAffiliation: Drechsler, Jörg: joerg.drechsler@iab.de IAB
local.authorAffiliation: Vilhuber, Lars: lv39@cornell.edu Cornell University

Files

Original bundle
Name: ISIpaper_2013.pdf
Size: 155.21 KB
Format: Adobe Portable Document Format

Name: 0-ISIpaper_Drechsler_Vilhuber_Online_Appendix.pdf
Size: 88.34 KB
Format: Adobe Portable Document Format
Description: Online appendix (tables)

Name: 1-WSC2013_Drechsler.pdf
Size: 286.64 KB
Format: Adobe Portable Document Format
Description: Presentation at WSC 2013