Scholars@Duke publication: SUBSKY: Efficient computation of skylines in subspaces

SUBSKY: Efficient computation of skylines in subspaces

Publication , Conference

Tao, Y; Xiao, K; Pei, J

Published in: Proceedings International Conference on Data Engineering

January 1, 2006

Given a set of multi-dimensional points, the skyline contains the best points according to any preference function that is monotone on all axes. In practice, applications that require skyline analysis usually provide numerous candidate attributes, and various users depending on their interests may issue queries regarding different (small) subsets of the dimensions. Formally, given a relation with a large number (e.g., > 10) of attributes, a query aims at finding the skyline in an arbitrary subspace with a low dimensionality (e.g., 2). The existing algorithms do not support subspace skyline retrieval efficiently because they (i) require scanning the entire database at least once, or (ii) are optimized for one particular subspace but incur significant overhead for other subspaces. In this paper, we propose a technique SUBSKY which settles the problem using a single B-tree, and can be implemented in any relational database. The core of SUBSKY is a transformation that converts multi-dimensional data to 1D values, and enables several effective pruning heuristics. Extensive experiments with real data confirm that SUBSKY outperforms alternative approaches significantly in both efficiency and scalability. © 2006 IEEE.

Duke Scholars

Author Jian Pei Computer Science

Published In

Proceedings International Conference on Data Engineering

DOI

10.1109/ICDE.2006.149

ISSN

1084-4627

Publication Date

January 1, 2006

Volume

2006

Start / End Page

66 / 75

Citation

APA

Chicago

ICMJE

MLA

NLM

Tao, Y., Xiao, K., & Pei, J. (2006). SUBSKY: Efficient computation of skylines in subspaces. In Proceedings International Conference on Data Engineering (Vol. 2006, pp. 66–75). https://doi.org/10.1109/ICDE.2006.149

Tao, Y., K. Xiao, and J. Pei. “SUBSKY: Efficient computation of skylines in subspaces.” In Proceedings International Conference on Data Engineering, 2006:66–75, 2006. https://doi.org/10.1109/ICDE.2006.149.

Tao Y, Xiao K, Pei J. SUBSKY: Efficient computation of skylines in subspaces. In: Proceedings International Conference on Data Engineering. 2006. p. 66–75.

Tao, Y., et al. “SUBSKY: Efficient computation of skylines in subspaces.” Proceedings International Conference on Data Engineering, vol. 2006, 2006, pp. 66–75. Scopus, doi:10.1109/ICDE.2006.149.

Tao Y, Xiao K, Pei J. SUBSKY: Efficient computation of skylines in subspaces. Proceedings International Conference on Data Engineering. 2006. p. 66–75.

Published In

Proceedings International Conference on Data Engineering

DOI

10.1109/ICDE.2006.149

ISSN

1084-4627

Publication Date

January 1, 2006

Volume

2006

Start / End Page

66 / 75