    • Do Single Mothers in the United States use the Earned Income Tax Credit to Reduce Unsecured Debt? 

      Shaefer, H. Luke; Song, Xiaoqing; Williams Shanks, Trina R. (National Poverty Center, 2011-10)
      The Earned Income Tax Credit (EITC) is a refundable credit for low-income workers that is mainly targeted at families with children. This study uses the Survey of Income and Program Participation’s (SIPP) topical modules ...
    • NCRN Meeting Fall 2011 

      Vilhuber, Lars (2011-12-19)
    • Estimating identification disclosure risk using mixed membership models 

      Manrique-Vallier, Daniel; Reiter, Jerome (Journal of the American Statistical Association, 2012)
      Statistical agencies and other organizations that disseminate data are obligated to protect data subjects' confidentiality. For example, ill-intentioned individuals might link data subjects to records in other databases ...
    • A Proposed Solution to the Archiving and Curation of Confidential Scientific Inputs 

      Abowd, John M.; Vilhuber, Lars; Block, William (Springer, 2012)
      We develop the core of a method for solving the data archive and curation problem that confronts the custodians of restricted-access research data and the scientific users of such data. Our solution recognizes the dual ...
    • The NSF-Census Research Network: Cornell Node 

      Block, William C.; Lagoze, Carl; Vilhuber, Lars; Brown, Warren A.; Williams, Jeremy; Arguillas, Florio (International Association for Social Science Information Services & Technology, 2012-06-06)
      Cornell University has received a $3M NSF-Census Research Network (NCRN) award to improve the documentation and discoverability of both public and restricted data from the federal statistical system. The current internal ...
    • NCRN Meeting Fall 2012 

      Vilhuber, Lars (2012-09-14)
    • Cornell Node of the NSF-Census Research Network - Annual Report to NSF for 2012 

      Abowd, John M.; Block, William; Li, Ping; Vilhuber, Lars (2012-10-15)
      This is the annual report by the Cornell node of the NSF-Census Research Network to NSF for 2012.
    • An Early Prototype of the Comprehensive Extensible Data Documentation and Access Repository (CED2AR) 

      Block, William C.; Williams, Jeremy; Abowd, John M.; Vilhuber, Lars; Lagoze, Carl (European DDI User Conference (EDDI12), 2012-12-03)
      This presentation will demonstrate the latest DDI-related technological developments of Cornell University’s $3 million NSF-Census Research Network (NCRN) award, dedicated to improving the documentation, discoverability, ...
    • Asymptotic Theory of Cepstral Random Fields 

      McElroy, T.S.; Holan, S.H. (Annals of Statistics, 2013)
      Random fields play a central role in the analysis of spatially correlated data and, as a result, have a significant impact on a broad array of scientific applications. Given the importance of this topic, there has been a ...
    • Encoding Provenance Metadata for Social Science Datasets 

      Lagoze, Carl; Williams, Jeremy; Vilhuber, Lars (Springer, 2013)
      Recording provenance is a key requirement for data-centric scholarship, allowing researchers to evaluate the integrity of source data sets and reproduce, and thereby validate, results. Provenance has become even more ...
    • Presentation: Revisiting the Economics of Privacy: Population Statistics and Privacy as Public Goods 

      Abowd, John (Data Linkage and Anonymisation Scoping Meeting, Isaac Newton Institute for Mathematical Sciences, Cambridge, UK, 2013-01)
      Anonymization and data quality are intimately linked. Although this link has been properly acknowledged in the Computer Science and Statistical Disclosure Limitation literatures, economics offers a framework for ...
    • Data Management of Confidential Data 

      Lagoze, Carl; Block, William C.; Williams, Jeremy; Abowd, John M.; Vilhuber, Lars (8th International Digital Curation Conference, 2013-01)
      Social science researchers increasingly make use of data that is confidential because it contains linkages to the identities of people, corporations, etc. The value of this data lies in the ability to join the identifiable ...
    • Credible interval estimates for official statistics with survey nonresponse 

      Manski, Charles F. (2013-04)
      Government agencies commonly report official statistics based on survey data as point estimates, without accompanying measures of error. In the absence of agency guidance, users of the statistics can only conjecture the ...
    • Improving User Access to Metadata for Public and Restricted Use US Federal Statistical Files 

      Block, William C.; Williams, Jeremy; Vilhuber, Lars; Lagoze, Carl; Brown, Warren; Abowd, John M. (2013-04-03)
      Presentation at NADDI 2013
    • NCRN Meeting Spring 2013 

      Vilhuber, Lars (2013-05-16)
    • b-Bit Minwise Hashing in Practice 

      Li, Ping; Shrivastava, Anshumali; König, Arnd Christian (Fifth Asia-Pacific Symposium on Internetware, 2013-10)
      Minwise hashing is a standard technique in the context of search for approximating set similarities. The recent work [26, 32] demonstrated a potential use of b-bit minwise hashing [23, 24] for efficient search and ...
    • Encoding Provenance of Social Science Data: Integrating PROV with DDI 

      Lagoze, Carl; Block, William C; Williams, Jeremy; Abowd, John; Vilhuber, Lars (2013-10-09)
      Provenance is a key component of evaluating the integrity and reusability of data for scholarship. While recording and providing access to provenance has always been important, it is even more critical in the web environment ...
    • Cornell Node of the NSF-Census Research Network - Annual Report to NSF for 2013 

      Abowd, John M.; Block, William; Li, Ping; Vilhuber, Lars (2013-10-15)
      This is the annual report by the Cornell node of the NSF-Census Research Network to NSF for 2013.
    • Fast Near Neighbor Search in High-Dimensional Binary Data 

      Shrivastava, Anshumali; Li, Ping (Internetware 2013, 2013-10-23)
      Numerous applications in search, databases, machine learning, and computer vision, can benefit from efficient algorithms for near neighbor search. This paper proposes a simple framework for fast near neighbor search in ...
    • Managing Confidentiality and Provenance across Mixed Private and Publicly-Accessed Data and Metadata 

      Vilhuber, Lars; Abowd, John; Block, William; Lagoze, Carl; Williams, Jeremy (2013 Federal Committee on Statistical Methodology Research Conference, 2013-11)
      Social science researchers are increasingly interested in making use of confidential micro-data that contains linkages to the identities of people, corporations, etc. The value of this linking lies in the potential to join ...