Secure, privacy-preserving analysis of distributed databases

Publication, Journal Article
Karr, AF; Fulp, WJ; Vera, F; Young, SS; Lin, X; Reiter, JP
Published in: Technometrics
August 1, 2007

In industrial and government settings, there is often a need to perform statistical analyses that require data stored in multiple distributed databases. However, the barriers to literally integrating these data can be substantial, even insurmountable. In this article we show how tools from information technology - specifically, secure multiparty computation and networking - can be used to perform statistically valid analyses of distributed databases. The common characteristic of these methods is that the owners share sufficient statistics computed on the local databases in a way that protects each owner's data from the other owners. Our focus is on horizontally partitioned data, in which data records rather than attributes are spread among the databases. We present protocols for securely performing regression, maximum likelihood estimation, and Bayesian analysis, as well as secure construction of contingency tables. We outline three current research directions: a software system implementing the protocols, secure EM algorithms, and partially trusted third parties, which reduce incentives for owners to be dishonest. © 2007 American Statistical Association and the American Society for Quality.
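
The idea of sharing sufficient statistics across horizontally partitioned data can be illustrated with a small simulation. The sketch below is not the article's protocol, which the abstract does not spell out. It assumes a ring-based secure summation (a common building block in secure multiparty computation), semi-honest owners, and a simple linear regression with one predictor. All function names (`secure_sum`, `local_stats`, `fit_pooled`), the modulus, and the fixed-point scale are hypothetical choices made for illustration, and the "network" is simulated inside one process.

```python
import secrets

MODULUS = 2**64   # assumption: far larger than any true pooled total
SCALE = 10**6     # assumption: fixed-point scale for real-valued statistics


def encode(value):
    """Map a real number to an integer residue modulo MODULUS."""
    return round(value * SCALE) % MODULUS


def decode(residue):
    """Invert encode(), treating the upper half of the range as negative."""
    signed = residue if residue < MODULUS // 2 else residue - MODULUS
    return signed / SCALE


def secure_sum(local_values, modulus=MODULUS):
    """Simulate ring-based secure summation of one integer per owner.

    The first owner adds a random mask before passing the running total
    on, so each later owner sees only a masked partial sum. The first
    owner removes the mask once the total comes back around the ring.
    """
    mask = secrets.randbelow(modulus)
    running = mask
    for value in local_values:          # one hop per owner
        running = (running + value) % modulus
    return (running - mask) % modulus


def local_stats(records):
    """Sufficient statistics for simple linear regression on (x, y) pairs."""
    n = len(records)
    sx = sum(x for x, _ in records)
    sy = sum(y for _, y in records)
    sxx = sum(x * x for x, _ in records)
    sxy = sum(x * y for x, y in records)
    return [n, sx, sy, sxx, sxy]


def fit_pooled(owners):
    """Fit y = intercept + slope * x from securely summed local statistics.

    No owner's raw records leave local_stats(); only masked sums circulate.
    Assumes the pooled x values are not all identical (nonzero denominator).
    """
    per_owner = [local_stats(records) for records in owners]
    totals = [
        decode(secure_sum([encode(stats[k]) for stats in per_owner]))
        for k in range(5)
    ]
    n, sx, sy, sxx, sxy = totals
    slope = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    intercept = (sy - slope * sx) / n
    return slope, intercept


if __name__ == "__main__":
    # Three hypothetical owners holding different records of the same attributes.
    owners = [[(0, 1), (1, 3)], [(2, 5), (3, 7)], [(4, 9)]]
    print(fit_pooled(owners))
```

Because the pooled sums equal what a single analyst would compute on the combined records, the fitted coefficients match an ordinary least-squares fit on the integrated data, up to the fixed-point rounding. A real deployment would also need authenticated channels and protection against colluding owners, neither of which this sketch addresses.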

Published In

Technometrics

DOI

10.1198/004017007000000209

ISSN

0040-1706

Publication Date

August 1, 2007

Volume

49

Issue

3

Start / End Page

335 / 345

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 0104 Statistics
 

Citation

APA: Karr, A. F., Fulp, W. J., Vera, F., Young, S. S., Lin, X., & Reiter, J. P. (2007). Secure, privacy-preserving analysis of distributed databases. Technometrics, 49(3), 335–345. https://doi.org/10.1198/004017007000000209

Chicago: Karr, A. F., W. J. Fulp, F. Vera, S. S. Young, X. Lin, and J. P. Reiter. “Secure, privacy-preserving analysis of distributed databases.” Technometrics 49, no. 3 (August 1, 2007): 335–45. https://doi.org/10.1198/004017007000000209.

ICMJE: Karr AF, Fulp WJ, Vera F, Young SS, Lin X, Reiter JP. Secure, privacy-preserving analysis of distributed databases. Technometrics. 2007 Aug 1;49(3):335–45.

MLA: Karr, A. F., et al. “Secure, privacy-preserving analysis of distributed databases.” Technometrics, vol. 49, no. 3, Aug. 2007, pp. 335–45. Scopus, doi:10.1198/004017007000000209.

NLM: Karr AF, Fulp WJ, Vera F, Young SS, Lin X, Reiter JP. Secure, privacy-preserving analysis of distributed databases. Technometrics. 2007 Aug 1;49(3):335–345.