Cornell University
Library
Cornell UniversityLibrary

eCommons

Help
Log In(current)
  1. Home
  2. Cornell University Graduate School
  3. Cornell Theses and Dissertations
  4. Defining and Enforcing Privacy in Data Sharing

Defining and Enforcing Privacy in Data Sharing

File(s)
thesis-ashwin.pdf (866.35 KB)
Permanent Link(s)
https://hdl.handle.net/1813/11192
Collections
Cornell Theses and Dissertations
Author
Machanavajjhala, Ashwin Kumar VenkataNaga
Abstract

Recent advances in processing and storing information has led to an explosion of data collection. Many organizations like the Census, hospitals and even search engine companies collect, analyze and distribute personal information in return for useful services. However, the collected data track entire public and private lives of individuals, thus resulting in an immense privacy risk of unauthorized disclosure. This dissertation presents novel conceptual and practical tools to ensure privacy of individuals while enabling the dissemination of valuable data about humans to improve their lives. Our contributions include novel formal definitions of the privacy risk arising from unauthorized disclosure, and practical algorithms for enforcing these definitions of privacy.

We consider two distinct settings of data dissemination that require different notions of privacy. In the first part of this dissertation, we consider a setting where no sensitive information should be disclosed. We consider the problem of deciding whether answering a query on a relational database leads to any disclosure of sensitive information. This problem was shown to be intractable; we propose practical algorithms for a reasonably large set of query classes.

In the second part of the dissertation, we consider the problem of publishing anonymous'' aggregate information about populations of individuals while preserving the privacy of individual-specific information. We present a novel framework for reasoning about the privacy risk in this setting. We also propose the first formal privacy definition and practical algorithms for publishing anonymous'' data that provably guarantees privacy of the individuals contributing to the data while releasing useful aggregate information. We also present a case study of applying formal privacy definitions to a real Census data publishing application.

Description
Prof. Johannes Gehrke, Committee Chair.
Prof. Eva Tardos, Committee Member.
Prof. Joseph Halpern, Committee Member.
Dr. Jayavel Shanmugasundaram, Committee Member.
Date Issued
2008-07-31T21:19:44Z
Keywords
Privacy
•
Databases
•
K-Anonymity
•
L-Diversity
•
Access Control
•
Anonymous Data Publishing
Type
dissertation or thesis

Site Statistics | Help

About eCommons | Policies | Terms of use | Contact Us

copyright © 2002-2026 Cornell University Library | Privacy | Web Accessibility Assistance