Skip to main content

DP2-Pub: Differentially Private High-Dimensional Data Publication With Invariant Post Randomization

Publication ,  Journal Article
Jiang, H; Yu, H; Cheng, X; Pei, J; Pless, R; Yu, J
Published in: IEEE Transactions on Knowledge and Data Engineering
October 1, 2023

A large amount of high-dimensional and heterogeneous data appear in practical applications, which are often published to third parties for data analysis, recommendations, targeted advertising, and reliable predictions. However, publishing these data may disclose personal sensitive information, resulting in an increasing concern on privacy violations. Privacy-preserving data publishing has received considerable attention in recent years. Unfortunately, the differentially private publication of high dimensional data remains a challenging problem. In this paper, we propose a differentially private high-dimensional data publication mechanism (DP2-Pub) that runs in two phases: a Markov-blanket-based attribute clustering phase and an invariant post randomization (PRAM) phase. Specifically, splitting attributes into several low-dimensional clusters with high intra-cluster cohesion and low inter-cluster coupling helps obtain a reasonable allocation of privacy budget, while a double-perturbation mechanism satisfying local differential privacy facilitates an invariant PRAM to ensure no loss of statistical information and thus significantly preserves data utility. We also extend our DP2-Pub mechanism to the scenario with a semi-honest server which satisfies local differential privacy. We conduct extensive experiments on four real-world datasets and the experimental results demonstrate that our mechanism can significantly improve the data utility of the published data while satisfying differential privacy.

Duke Scholars

Published In

IEEE Transactions on Knowledge and Data Engineering

DOI

EISSN

1558-2191

ISSN

1041-4347

Publication Date

October 1, 2023

Volume

35

Issue

10

Start / End Page

10831 / 10844

Related Subject Headings

  • Information Systems
  • 46 Information and computing sciences
  • 08 Information and Computing Sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Jiang, H., Yu, H., Cheng, X., Pei, J., Pless, R., & Yu, J. (2023). DP2-Pub: Differentially Private High-Dimensional Data Publication With Invariant Post Randomization. IEEE Transactions on Knowledge and Data Engineering, 35(10), 10831–10844. https://doi.org/10.1109/TKDE.2023.3265605
Jiang, H., H. Yu, X. Cheng, J. Pei, R. Pless, and J. Yu. “DP2-Pub: Differentially Private High-Dimensional Data Publication With Invariant Post Randomization.” IEEE Transactions on Knowledge and Data Engineering 35, no. 10 (October 1, 2023): 10831–44. https://doi.org/10.1109/TKDE.2023.3265605.
Jiang H, Yu H, Cheng X, Pei J, Pless R, Yu J. DP2-Pub: Differentially Private High-Dimensional Data Publication With Invariant Post Randomization. IEEE Transactions on Knowledge and Data Engineering. 2023 Oct 1;35(10):10831–44.
Jiang, H., et al. “DP2-Pub: Differentially Private High-Dimensional Data Publication With Invariant Post Randomization.” IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 10, Oct. 2023, pp. 10831–44. Scopus, doi:10.1109/TKDE.2023.3265605.
Jiang H, Yu H, Cheng X, Pei J, Pless R, Yu J. DP2-Pub: Differentially Private High-Dimensional Data Publication With Invariant Post Randomization. IEEE Transactions on Knowledge and Data Engineering. 2023 Oct 1;35(10):10831–10844.

Published In

IEEE Transactions on Knowledge and Data Engineering

DOI

EISSN

1558-2191

ISSN

1041-4347

Publication Date

October 1, 2023

Volume

35

Issue

10

Start / End Page

10831 / 10844

Related Subject Headings

  • Information Systems
  • 46 Information and computing sciences
  • 08 Information and Computing Sciences