Skip to main content

Optimizing error of highdimensional statistical queries under differential privacy

Publication ,  Conference
McKenna, R; Miklau, G; Hay, M; Machanavajjhala, A
Published in: Proceedings of the VLDB Endowment
January 1, 2018

Differentially private algorithms for answering sets of predicate counting queries on a sensitive database have many applications. Organizations that collect individual-level data, such as statistical agencies and medical institutions, use them to safely release summary tabulations. However, existing techniques are accurate only on a narrow class of query workloads, or are extremely slow, especially when analyzing more than one or two dimensions of the data. In this work we propose HDMM, a new differentially private algorithm for answering a workload of predicate counting queries, that is especially effective for higher-dimensional datasets. HDMM represents query workloads using an implicit matrix representation and exploits this compact representation to efficiently search (a subset of) the space of differentially private algorithms for one that answers the input query workload with high accuracy. We empirically show that HDMM can efficiently answer queries with lower error than state-of-the-art techniques on a variety of low and high dimensional datasets.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Proceedings of the VLDB Endowment

DOI

EISSN

2150-8097

Publication Date

January 1, 2018

Volume

11

Issue

10

Start / End Page

1206 / 1219

Related Subject Headings

  • 4605 Data management and data science
  • 0807 Library and Information Studies
  • 0806 Information Systems
  • 0802 Computation Theory and Mathematics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
McKenna, R., Miklau, G., Hay, M., & Machanavajjhala, A. (2018). Optimizing error of highdimensional statistical queries under differential privacy. In Proceedings of the VLDB Endowment (Vol. 11, pp. 1206–1219). https://doi.org/10.14778/3231751.3231769
McKenna, R., G. Miklau, M. Hay, and A. Machanavajjhala. “Optimizing error of highdimensional statistical queries under differential privacy.” In Proceedings of the VLDB Endowment, 11:1206–19, 2018. https://doi.org/10.14778/3231751.3231769.
McKenna R, Miklau G, Hay M, Machanavajjhala A. Optimizing error of highdimensional statistical queries under differential privacy. In: Proceedings of the VLDB Endowment. 2018. p. 1206–19.
McKenna, R., et al. “Optimizing error of highdimensional statistical queries under differential privacy.” Proceedings of the VLDB Endowment, vol. 11, no. 10, 2018, pp. 1206–19. Scopus, doi:10.14778/3231751.3231769.
McKenna R, Miklau G, Hay M, Machanavajjhala A. Optimizing error of highdimensional statistical queries under differential privacy. Proceedings of the VLDB Endowment. 2018. p. 1206–1219.

Published In

Proceedings of the VLDB Endowment

DOI

EISSN

2150-8097

Publication Date

January 1, 2018

Volume

11

Issue

10

Start / End Page

1206 / 1219

Related Subject Headings

  • 4605 Data management and data science
  • 0807 Library and Information Studies
  • 0806 Information Systems
  • 0802 Computation Theory and Mathematics