Skip to main content
Journal cover image

Sampling with synthesis: A new approach for releasing public use census microdata

Publication ,  Journal Article
Drechsler, J; Reiter, JP
Published in: Journal of the American Statistical Association
December 1, 2010

Many statistical agencies disseminate samples of census microdata, that is, data on individual records, to the public. Before releasing the microdata, agencies typically alter identifying or sensitive values to protect data subjects' confidentiality, for example by coarsening, perturbing, or swapping data. These standard disclosure limitation techniques distort relationships and distributional features in the original data, especially when applied with high intensity. Furthermore, it can be difficult for analysts of the masked public use data to adjust inferences for the effects of the disclosure limitation. Motivated by these shortcomings, we propose an approach to census microdata dissemination called sampling with synthesis. The basic idea is to replace the identifying or sensitive values in the census with multiple imputations, and release samples from these multiply-imputed populations. We demonstrate that sampling with synthesis can improve the quality of public use data relative to sampling followed by standard statistical disclosure limitation; simulation results showing this are available online as supplemental material. We derive methods for analyzing the multiple datasets generated by sampling with synthesis. We present algorithms for selecting which census values to synthesize based on considerations of disclosure risk and data utility. We illustrate sampling with synthesis on a population constructed with data from the U.S. Current Population Survey. © 2010.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Journal of the American Statistical Association

DOI

ISSN

0162-1459

Publication Date

December 1, 2010

Volume

105

Issue

492

Start / End Page

1347 / 1357

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 3802 Econometrics
  • 1603 Demography
  • 1403 Econometrics
  • 0104 Statistics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Drechsler, J., & Reiter, J. P. (2010). Sampling with synthesis: A new approach for releasing public use census microdata. Journal of the American Statistical Association, 105(492), 1347–1357. https://doi.org/10.1198/jasa.2010.ap09480
Drechsler, J., and J. P. Reiter. “Sampling with synthesis: A new approach for releasing public use census microdata.” Journal of the American Statistical Association 105, no. 492 (December 1, 2010): 1347–57. https://doi.org/10.1198/jasa.2010.ap09480.
Drechsler J, Reiter JP. Sampling with synthesis: A new approach for releasing public use census microdata. Journal of the American Statistical Association. 2010 Dec 1;105(492):1347–57.
Drechsler, J., and J. P. Reiter. “Sampling with synthesis: A new approach for releasing public use census microdata.” Journal of the American Statistical Association, vol. 105, no. 492, Dec. 2010, pp. 1347–57. Scopus, doi:10.1198/jasa.2010.ap09480.
Drechsler J, Reiter JP. Sampling with synthesis: A new approach for releasing public use census microdata. Journal of the American Statistical Association. 2010 Dec 1;105(492):1347–1357.
Journal cover image

Published In

Journal of the American Statistical Association

DOI

ISSN

0162-1459

Publication Date

December 1, 2010

Volume

105

Issue

492

Start / End Page

1347 / 1357

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 3802 Econometrics
  • 1603 Demography
  • 1403 Econometrics
  • 0104 Statistics