Skip to main content

PSynDB: Accurate and accessible private data generation

Publication ,  Conference
Huang, Z; McKenna, R; Bissias, G; Miklau, G; Hay, M; Machanavajjhala, A
Published in: Proceedings of the VLDB Endowment
January 1, 2018

Across many application domains, trusted parties who collect sensitive information need mechanisms to safely disseminate data. A favored approach is to generate synthetic data: a dataset similar to the original, hopefully retaining its statistical features, but one that does not reveal the private information of contributors to the data. We present PSynDB, a web-based synthetic table generator that is built on recent privacy technologies [10, 11, 15]. PSynDB satisfies the formal guarantee of differential privacy and generates synthetic tables with high accuracy for tasks that the user specifies as important. PSynDB allows users to browse expected error rates before running the mechanism, a useful feature for making important policy decisions, such as setting the privacy loss budget. When the user has finished configuration, the tool outputs a data synthesis program that can be ported to a trusted environment. There it can be safely executed on the private data to produce the private synthetic dataset for broad dissemination.

Duke Scholars

Published In

Proceedings of the VLDB Endowment

DOI

EISSN

2150-8097

Publication Date

January 1, 2018

Volume

12

Issue

12

Start / End Page

1918 / 1921

Related Subject Headings

  • 4605 Data management and data science
  • 0807 Library and Information Studies
  • 0806 Information Systems
  • 0802 Computation Theory and Mathematics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Huang, Z., McKenna, R., Bissias, G., Miklau, G., Hay, M., & Machanavajjhala, A. (2018). PSynDB: Accurate and accessible private data generation. In Proceedings of the VLDB Endowment (Vol. 12, pp. 1918–1921). https://doi.org/10.14778/3352063.3352099
Huang, Z., R. McKenna, G. Bissias, G. Miklau, M. Hay, and A. Machanavajjhala. “PSynDB: Accurate and accessible private data generation.” In Proceedings of the VLDB Endowment, 12:1918–21, 2018. https://doi.org/10.14778/3352063.3352099.
Huang Z, McKenna R, Bissias G, Miklau G, Hay M, Machanavajjhala A. PSynDB: Accurate and accessible private data generation. In: Proceedings of the VLDB Endowment. 2018. p. 1918–21.
Huang, Z., et al. “PSynDB: Accurate and accessible private data generation.” Proceedings of the VLDB Endowment, vol. 12, no. 12, 2018, pp. 1918–21. Scopus, doi:10.14778/3352063.3352099.
Huang Z, McKenna R, Bissias G, Miklau G, Hay M, Machanavajjhala A. PSynDB: Accurate and accessible private data generation. Proceedings of the VLDB Endowment. 2018. p. 1918–1921.

Published In

Proceedings of the VLDB Endowment

DOI

EISSN

2150-8097

Publication Date

January 1, 2018

Volume

12

Issue

12

Start / End Page

1918 / 1921

Related Subject Headings

  • 4605 Data management and data science
  • 0807 Library and Information Studies
  • 0806 Information Systems
  • 0802 Computation Theory and Mathematics