Skip to main content
Journal cover image

Publishing anonymous survey rating data

Publication ,  Journal Article
Sun, X; Wang, H; Li, J; Pei, J
Published in: Data Mining and Knowledge Discovery
November 1, 2011

We study the challenges of protecting privacy of individuals in the large public survey rating data in this paper. Recent study shows that personal information in supposedly anonymous movie rating records are de-identified. The survey rating data usually contains both ratings of sensitive and non-sensitive issues. The ratings of sensitive issues involve personal privacy. Even though the survey participants do not reveal any of their ratings, their survey records are potentially identifiable by using information from other public sources. None of the existing anonymisation principles (e.g., k-anonymity, l-diversity, etc.) can effectively prevent such breaches in large survey rating data sets. We tackle the problem by defining a principle called (k, ε)-anonymity model to protect privacy. Intuitively, the principle requires that, for each transaction t in the given survey rating data T , at least (k - 1) other transactions in T must have ratings similar to t, where the similarity is controlled by ε. The (k, ε)-anonymity model is formulated by its graphical representation and a specific graph-anonymisation problem is studied by adopting graph modification with graph theory. Various cases are analyzed and methods are developed to make the updated graph meet (k, ε) requirements. The methods are applied to two real-life data sets to demonstrate their efficiency and practical utility. © The Author(s) 2010.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Data Mining and Knowledge Discovery

DOI

ISSN

1384-5810

Publication Date

November 1, 2011

Volume

23

Issue

3

Start / End Page

379 / 406

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences
  • 0806 Information Systems
  • 0804 Data Format
  • 0801 Artificial Intelligence and Image Processing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Sun, X., Wang, H., Li, J., & Pei, J. (2011). Publishing anonymous survey rating data. Data Mining and Knowledge Discovery, 23(3), 379–406. https://doi.org/10.1007/s10618-010-0208-4
Sun, X., H. Wang, J. Li, and J. Pei. “Publishing anonymous survey rating data.” Data Mining and Knowledge Discovery 23, no. 3 (November 1, 2011): 379–406. https://doi.org/10.1007/s10618-010-0208-4.
Sun X, Wang H, Li J, Pei J. Publishing anonymous survey rating data. Data Mining and Knowledge Discovery. 2011 Nov 1;23(3):379–406.
Sun, X., et al. “Publishing anonymous survey rating data.” Data Mining and Knowledge Discovery, vol. 23, no. 3, Nov. 2011, pp. 379–406. Scopus, doi:10.1007/s10618-010-0208-4.
Sun X, Wang H, Li J, Pei J. Publishing anonymous survey rating data. Data Mining and Knowledge Discovery. 2011 Nov 1;23(3):379–406.
Journal cover image

Published In

Data Mining and Knowledge Discovery

DOI

ISSN

1384-5810

Publication Date

November 1, 2011

Volume

23

Issue

3

Start / End Page

379 / 406

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences
  • 0806 Information Systems
  • 0804 Data Format
  • 0801 Artificial Intelligence and Image Processing