Now showing items 1-20 of 156

    • Asymptotic Theory of Cepstral Random Fields 

      McElroy, T.S.; Holan, S.H. (Annals of Statistics, 2013)
      Random fields play a central role in the analysis of spatially correlated data and, as a result,have a significant impact on a broad array of scientific applications. Given the importance of this topic, there has been a ...
    • b-Bit Minwise Hashing in Practice 

      Li, Ping; Shrivastava, Anshumali; König, Arnd Christian (Fifth Asia-Pacific Symposium on Internetware, 2013-10)
      Minwise hashing is a standard technique in the context of search for approximating set similarities. The recent work [26, 32] demon- strated a potential use of b-bit minwise hashing [23, 24] for ef- ficient search and ...
    • Bayesian multiple imputation for large-scale categorical data with structural zeros 

      Manrique-Vallier, D.; Reiter, J. P. (Survey Methodology, 2013-12-18)
      We propose an approach for multiple imputation of items missing at random in large-scale surveys with exclusively categorical variables that have structural zeros. Our approach is to use mixtures of multinomial distributions ...
    • Bayesian Semiparametric Hierarchical Empirical Likelihood Spatial Models 

      Porter, Aaron T.; Holan, Scott H.; Wikle, Christopher K. (2014-05-07)
      We introduce a general Bayesian hierarchical framework that incorporates a flexible nonparametric data model specification through the use of empirical likelihood methodology, which we term semiparametric hierarchical ...
    • Boosting Models for Edit, Imputation and Prediction of Multiple Response Outcomes 

      Li, Ping; Abowd, John M. (2014-02-05)
      In this paper, we propose a statistical framework that generalizes the classical logit model to predict multiple responses (i.e., multi-label classification). We develop an effective implementation based on boosting and ...
    • CED 2 AR: The Comprehensive Extensible Data Documentation and Access Repository 

      Lagoze, Carl; Vilhuber, Lars; Williams, Jeremy; Perry, Benjamin; Block, William C. (Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on, 2014-12-04)
      We describe the design, implementation, and deployment of the Comprehensive Extensible Data Documentation and Access Repository (CED 2 AR). This is a metadata repository system that allows researchers to search, browse, ...
    • Cell Suppression as used to Protect Magnitude Data Tables 

      Massell, Paul B. (2015-04-01)
      The most common data products released by the Economic Directorate of the Census Bureau are magnitude data tables. Common magnitude variables in these tables are ‘sales’ (aka ‘receipts’), and ‘number of employees’. In ...
    • Collaborative Editing of DDI Metadata: The Latest from the CED2AR Project 

      Perry, Benjamin; Kambhampaty, Venkata; Brumsted, Kyle; Vilhuber, Lars; Block, William (2014-12-02)
    • Communicating Uncertainty in Official Economic Statistics 

      Manski, Charles (2014-04)
      Federal statistical agencies in the United States and analogous agencies elsewhere commonly report official economic statistics as point estimates, without accompanying measures of error. Users of the statistics may ...
    • Communicating Uncertainty in Official Economic Statistics: An Appraisal Fifty Years after Morgenstern 

      Manski, Charles F. (2014-10)
      Federal statistical agencies in the United States and analogous agencies elsewhere commonly report official economic statistics as point estimates, without accompanying measures of error. Users of the statistics may ...
    • Confidentiality of the SynLBD 

      Vilhuber, Lars; Kinney, Saki (2017-05-09)
      We describe the confidentiality protection provided by the SynLBD. The presentation was originally prepared by Saki Kinney for the World Statistics Congress 2013.
    • Confidentiality Protection and Physical Safeguards 

      Vilhuber, Lars (2017-02-09)
      Confidentiality protection is a multi-layered concept, involving statistical (cryptographic) methods and physical safeguards. When providing access to researchers (both internal to the agency and external academic), a ...
    • Confidentiality Protection and Physical Safeguards (LatAm version) 

      Vilhuber, Lars (2017-06-07)
      Confidentiality protection is a multi-layered concept, involving statistical (cryptographic) methods and physical safeguards. When providing access to researchers (both internal to the agency and external academic), a ...
    • Connecting Researchers with Data: Discovery, Documentation, Access and Security 

      Brown, Warren A.; Jacobs, Stephanie; Schiller, David; Heining, Jörg (2015-12-02)
      The Cornell Institute for Social and Economic Research (CISER), Cornell University and the Institute for Employment Research (IAB), German Federal Employment Agency are collaborating to expand use of IAB’s confidential ...
    • Credible interval estimates for official statistics with survey nonresponse 

      Manski, Charles F. (2013-04)
      Government agencies commonly report official statistics based on survey data as point estimates, without accompanying measures of error. In the absence of agency guidance, users of the statistics can only conjecture the ...
    • Crowdsourcing Metadata – Challenges and Outlook 

      Vilhuber, Lars (2016-04-29)
      Presentation at Annual Workshop of the Canadian Data Liberation Initiative
    • Dasymetric Modeling and Uncertainty 

      Nagle, Nicholas N.; Buttenfield, Barbara P.; Leyk, Stefan; Spielman, Seth (2013-11-07)
      Survey weights are often adjusted so that the estimated totals match with known benchmark totals. This practice is limited by the requirement that benchmarks be perfectly known and the tendency for survey weight variability ...
    • Data Management of Confidential Data 

      Lagoze, Carl; Block, William C.; Williams, Jeremy; Abowd, John M.; Vilhuber, Lars (8th International Digital Curation Conference, 2013-01)
      Social science researchers increasingly make use of data that is confidential because it contains linkages to the identities of people, corporations, etc. The value of this data lies in the ability to join the identifiable ...
    • Disclosure Limitation and Confidentiality Protection in Linked Data 

      Vilhuber, Lars (Quebec Inter-University Centre for Social Statistics (QICSS) Conference, 2016-11-30)
      Lars Vilhuber speaks about “Disclosure Limitation and Confidentiality Protection in Linked Data” at the Center for Interuniversity Research and Analysis of Organizations‘s conference on “Facilitate the access to Quebec ...
    • Do Single Mothers in the United States use the Earned Income Tax Credit to Reduce Unsecured Debt? 

      Shaefer, H. Luke; Song, Xiaoqing; Williams Shanks, Trina R. (National Poverty Center, 2011-10)
      The Earned Income Tax Credit (EITC) is a refundable credit for low-income workers that is mainly targeted at families with children. This study uses the Survey of Income and Program Participation’s (SIPP) topical modules ...