Skip to main content

Probabilistic skylines on uncertain data

Publication ,  Conference
Pei, J; Jiang, B; Lin, X; Yuan, Y
Published in: 33rd International Conference on Very Large Data Bases, VLDB 2007 - Conference Proceedings
January 1, 2007

Uncertain data are inherent in some important applications. Although a considerable amount of research has been dedicated to modeling uncertain data and answering some types of queries on uncertain data, how to conduct advanced analysis on uncertain data remains an open problem at large. In this paper, we tackle the problem of skyline analysis on uncertain data. We propose a novel probabilistic skyline model where an uncertain object may take a probability to be in the skyline, and a p-skyline contains all the objects whose skyline probabilities are at least p. Computing probabilistic skylines on large uncertain data sets is challenging. We develop two efficient algorithms. The bottom-up algorithm computes the skyline probabilities of some selected instances of uncertain objects, and uses those instances to prune other instances and uncertain objects effectively. The top-down algorithm recursively partitions the instances of uncertain objects into subsets, and prunes subsets and objects aggressively. Our experimental results on both the real NBA player data set and the benchmark synthetic data sets show that probabilistic skylines are interesting and useful, and our two algorithms are efficient on large data sets, and complementary to each other in performance.

Duke Scholars

Published In

33rd International Conference on Very Large Data Bases, VLDB 2007 - Conference Proceedings

Publication Date

January 1, 2007

Start / End Page

15 / 26
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Pei, J., Jiang, B., Lin, X., & Yuan, Y. (2007). Probabilistic skylines on uncertain data. In 33rd International Conference on Very Large Data Bases, VLDB 2007 - Conference Proceedings (pp. 15–26).
Pei, J., B. Jiang, X. Lin, and Y. Yuan. “Probabilistic skylines on uncertain data.” In 33rd International Conference on Very Large Data Bases, VLDB 2007 - Conference Proceedings, 15–26, 2007.
Pei J, Jiang B, Lin X, Yuan Y. Probabilistic skylines on uncertain data. In: 33rd International Conference on Very Large Data Bases, VLDB 2007 - Conference Proceedings. 2007. p. 15–26.
Pei, J., et al. “Probabilistic skylines on uncertain data.” 33rd International Conference on Very Large Data Bases, VLDB 2007 - Conference Proceedings, 2007, pp. 15–26.
Pei J, Jiang B, Lin X, Yuan Y. Probabilistic skylines on uncertain data. 33rd International Conference on Very Large Data Bases, VLDB 2007 - Conference Proceedings. 2007. p. 15–26.

Published In

33rd International Conference on Very Large Data Bases, VLDB 2007 - Conference Proceedings

Publication Date

January 1, 2007

Start / End Page

15 / 26