Skip to main content
construction release_alert
Scholars@Duke will be undergoing maintenance April 11-15. Some features may be unavailable during this time.
cancel

PrivateSQL: A differentially private SQL query engine

Publication ,  Conference
Kotsogiannis, I; Tao, Y; He, X; Fanaeepour, M; Machanavajjhala, A; Hay, M; Miklau, G
Published in: Proceedings of the VLDB Endowment
January 1, 2018

Differential privacy is considered a de facto standard for private data analysis. However, the definition and much of the supporting literature applies to flat tables. While there exist variants of the definition and specialized algorithms for specific types of relational data (e.g. graphs), there isn't a general privacy definition for multi-relational schemas with constraints, and no system that permits accurate differentially private answering of SQL queries while imposing a fixed privacy budget across all queries posed by the analyst. This work presents PrivateSQL, a first-of-its-kind endto-end differentially private relational database system. PrivateSQL allows an analyst to query data stored in a standard database management system using a rich class of SQL counting queries. PrivateSQL adopts a novel generalization of differential privacy to multi-relational data that takes into account constraints in the schema like foreign keys, and allows the data owner to flexibly specify entities in the schema that need privacy. PrivateSQL ensures a fixed privacy loss across all the queries posed by the analyst by answering queries on private synopses generated from several views over the base relation that are tuned to have low error on a representative query workload. We experimentally evaluate PrivateSQL on a real-world dataset and a workload of more than 3; 600 queries. We show that for 50% of the queries PrivateSQL offers at least 1; 000x better error rates than solutions adapted from prior work.

Duke Scholars

Published In

Proceedings of the VLDB Endowment

DOI

EISSN

2150-8097

Publication Date

January 1, 2018

Volume

12

Issue

11

Start / End Page

1371 / 1384

Related Subject Headings

  • 4605 Data management and data science
  • 0807 Library and Information Studies
  • 0806 Information Systems
  • 0802 Computation Theory and Mathematics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Kotsogiannis, I., Tao, Y., He, X., Fanaeepour, M., Machanavajjhala, A., Hay, M., & Miklau, G. (2018). PrivateSQL: A differentially private SQL query engine. In Proceedings of the VLDB Endowment (Vol. 12, pp. 1371–1384). https://doi.org/10.14778/3342263.3342274
Kotsogiannis, I., Y. Tao, X. He, M. Fanaeepour, A. Machanavajjhala, M. Hay, and G. Miklau. “PrivateSQL: A differentially private SQL query engine.” In Proceedings of the VLDB Endowment, 12:1371–84, 2018. https://doi.org/10.14778/3342263.3342274.
Kotsogiannis I, Tao Y, He X, Fanaeepour M, Machanavajjhala A, Hay M, et al. PrivateSQL: A differentially private SQL query engine. In: Proceedings of the VLDB Endowment. 2018. p. 1371–84.
Kotsogiannis, I., et al. “PrivateSQL: A differentially private SQL query engine.” Proceedings of the VLDB Endowment, vol. 12, no. 11, 2018, pp. 1371–84. Scopus, doi:10.14778/3342263.3342274.
Kotsogiannis I, Tao Y, He X, Fanaeepour M, Machanavajjhala A, Hay M, Miklau G. PrivateSQL: A differentially private SQL query engine. Proceedings of the VLDB Endowment. 2018. p. 1371–1384.

Published In

Proceedings of the VLDB Endowment

DOI

EISSN

2150-8097

Publication Date

January 1, 2018

Volume

12

Issue

11

Start / End Page

1371 / 1384

Related Subject Headings

  • 4605 Data management and data science
  • 0807 Library and Information Studies
  • 0806 Information Systems
  • 0802 Computation Theory and Mathematics