Skip to main content

Privacy preserving regression modelling via distributed computation

Publication ,  Journal Article
Sanil, AP; Karr, AF; Lin, X; Reiter, JP
Published in: Kdd 2004 Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
January 1, 2004

Reluctance of data owners to share their possibly confidential or proprietary data with others who own related databases is a serious impediment to conducting a mutually beneficial data mining analysis. We address the case of vertically partitioned data - multiple data owners/agencies each possess a few attributes of every data record. We focus on the case of the agencies wanting to conduct a linear regression analysis with complete records without disclosing values of their own attributes. This paper describes an algorithm that enables such agencies to compute the exact regression coefficients of the global regression equation and also perform some basic goodness-of-fit diagnostics while protecting the confidentiality of their data. In more general settings beyond the privacy scenario, this algorithm can also be viewed as method for the distributed computation for regression analyses.

Duke Scholars

Published In

Kdd 2004 Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

DOI

Publication Date

January 1, 2004

Start / End Page

677 / 682
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Sanil, A. P., Karr, A. F., Lin, X., & Reiter, J. P. (2004). Privacy preserving regression modelling via distributed computation. Kdd 2004 Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 677–682. https://doi.org/10.1145/1014052.1014139
Sanil, A. P., A. F. Karr, X. Lin, and J. P. Reiter. “Privacy preserving regression modelling via distributed computation.” Kdd 2004 Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, January 1, 2004, 677–82. https://doi.org/10.1145/1014052.1014139.
Sanil AP, Karr AF, Lin X, Reiter JP. Privacy preserving regression modelling via distributed computation. Kdd 2004 Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2004 Jan 1;677–82.
Sanil, A. P., et al. “Privacy preserving regression modelling via distributed computation.” Kdd 2004 Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Jan. 2004, pp. 677–82. Scopus, doi:10.1145/1014052.1014139.
Sanil AP, Karr AF, Lin X, Reiter JP. Privacy preserving regression modelling via distributed computation. Kdd 2004 Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2004 Jan 1;677–682.

Published In

Kdd 2004 Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

DOI

Publication Date

January 1, 2004

Start / End Page

677 / 682