
Imputation of confidential data sets with spatial locations using disease mapping models.
Publication
, Journal Article
Paiva, T; Chakraborty, A; Reiter, J; Gelfand, A
Published in: Statistics in medicine
May 2014
Data that include fine geographic information, such as census tract or street block identifiers, can be difficult to release as public use files. Fine geography provides information that ill-intentioned data users can use to identify individuals. We propose to release data with simulated geographies, so as to enable spatial analyses while reducing disclosure risks. We fit disease mapping models that predict areal-level counts from attributes in the file and sample new locations based on the estimated models. We illustrate this approach using data on causes of death in North Carolina, including evaluations of the disclosure risks and analytic validity that can result from releasing synthetic geographies.
Duke Scholars
Published In
Statistics in medicine
DOI
EISSN
1097-0258
ISSN
0277-6715
Publication Date
May 2014
Volume
33
Issue
11
Start / End Page
1928 / 1945
Related Subject Headings
- Statistics & Probability
- North Carolina
- Models, Statistical
- Humans
- Geographic Mapping
- Datasets as Topic
- Cause of Death
- 4905 Statistics
- 4202 Epidemiology
- 1117 Public Health and Health Services
Citation
APA
Chicago
ICMJE
MLA
NLM
Paiva, T., Chakraborty, A., Reiter, J., & Gelfand, A. (2014). Imputation of confidential data sets with spatial locations using disease mapping models. Statistics in Medicine, 33(11), 1928–1945. https://doi.org/10.1002/sim.6078
Paiva, Thais, Avishek Chakraborty, Jerry Reiter, and Alan Gelfand. “Imputation of confidential data sets with spatial locations using disease mapping models.” Statistics in Medicine 33, no. 11 (May 2014): 1928–45. https://doi.org/10.1002/sim.6078.
Paiva T, Chakraborty A, Reiter J, Gelfand A. Imputation of confidential data sets with spatial locations using disease mapping models. Statistics in medicine. 2014 May;33(11):1928–45.
Paiva, Thais, et al. “Imputation of confidential data sets with spatial locations using disease mapping models.” Statistics in Medicine, vol. 33, no. 11, May 2014, pp. 1928–45. Epmc, doi:10.1002/sim.6078.
Paiva T, Chakraborty A, Reiter J, Gelfand A. Imputation of confidential data sets with spatial locations using disease mapping models. Statistics in medicine. 2014 May;33(11):1928–1945.

Published In
Statistics in medicine
DOI
EISSN
1097-0258
ISSN
0277-6715
Publication Date
May 2014
Volume
33
Issue
11
Start / End Page
1928 / 1945
Related Subject Headings
- Statistics & Probability
- North Carolina
- Models, Statistical
- Humans
- Geographic Mapping
- Datasets as Topic
- Cause of Death
- 4905 Statistics
- 4202 Epidemiology
- 1117 Public Health and Health Services