dc.contributor.author: Vilhuber, Lars
dc.contributor.author: Abowd, John
dc.contributor.author: Block, William
dc.contributor.author: Lagoze, Carl
dc.contributor.author: Williams, Jeremy
dc.date.accessioned: 2013-11-15T13:28:52Z
dc.date.available: 2013-11-15T13:28:52Z
dc.date.issued: 2013-11
dc.identifier.uri: https://hdl.handle.net/1813/34534
dc.description.abstract: Social science researchers are increasingly interested in making use of confidential micro-data that contains linkages to the identities of people, corporations, etc. The value of this linking lies in the potential to join these identifiable entities with external data such as genome data, geospatial information, and the like. Leveraging these linkages is an essential aspect of “big data” scholarship. However, the utility of these confidential data for scholarship is compromised by the complex nature of their management and curation. This makes it difficult to fulfill US federal data management mandates and interferes with basic scholarly practices such as validation and reuse of existing results. We describe in this paper our work on the CED2AR prototype, a first step in providing researchers with a tool that spans the confidential/publicly-accessible divide, making it possible for researchers to identify, search, access, and cite those data. The particular points of interest in our work are the cloaking of metadata fields and the expression of provenance chains. For the former, we make use of existing fields in the DDI (Data Documentation Initiative) specification and suggest some minor changes to the specification. For the latter problem, we investigate the integration of DDI with recent work by the W3C PROV working group that has developed a generalizable and extensible model for expressing data provenance. [en_US]
dc.description.sponsorship: NSF Grant #1131848 [en_US]
dc.publisher: 2013 Federal Committee on Statistical Methodology Research Conference [en_US]
dc.subject: PROV [en_US]
dc.subject: DDI [en_US]
dc.subject: Provenance [en_US]
dc.subject: Confidentiality [en_US]
dc.title: Managing Confidentiality and Provenance across Mixed Private and Publicly-Accessed Data and Metadata [en_US]
dc.type: presentation [en_US]

