Skip to main content

Efficient Indexes for Diverse Top-k Range Queries

Publication ,  Conference
Agarwal, PK; Sintos, S; Steiger, A
Published in: Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems
June 14, 2020

Let P be a set of n (non-negatively) weighted points in Rd. We consider the problem of computing a subset of (at most) k diverse and high-valued points of P that lie inside a query range, a problem relevant to many areas such as search engines, recommendation systems, and online stores. The diversity and value of a set of points are measured as functions (say average or minimum) of their pairwise distances and weights, respectively. We study both bicriteria and constrained optimization problems. In the former, we wish to return a set of k points that maximize a weighted sum of their value and diversity measures, and in the latter, we wish to return a set of at most k points that maximize their value and satisfy a diversity constraint. We obtain three main types of results in this paper: Near-linear time (0.5-ϵ)-approximation algorithms for the bicriteria optimization problem in the offline setting. Near-linear size indexes for the bicriteria optimization problem that for a query rectangle return a (0.5-ϵ)-approximate solution in time O(k polylog(n)). The indexes can be constructed in O(n polylog(n)) time. Near-linear size indexes for answering constrained optimization range queries. For a query rectangle, a 0.5O(d)-approximate solution can be computed in O(k polylog(n)) time. If we allow some of the returned points to lie at most ϵ outside of the query rectangle then an (1-ϵ)-approximate solution can be computed in O(k polylog(n)) time. The indexes are constructed in O(n polylog(n)) and nO(1/ϵd) time, respectively.

Duke Scholars

Published In

Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems

DOI

Publication Date

June 14, 2020

Start / End Page

213 / 227
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Agarwal, P. K., Sintos, S., & Steiger, A. (2020). Efficient Indexes for Diverse Top-k Range Queries. In Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (pp. 213–227). https://doi.org/10.1145/3375395.3387667
Agarwal, P. K., S. Sintos, and A. Steiger. “Efficient Indexes for Diverse Top-k Range Queries.” In Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, 213–27, 2020. https://doi.org/10.1145/3375395.3387667.
Agarwal PK, Sintos S, Steiger A. Efficient Indexes for Diverse Top-k Range Queries. In: Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. 2020. p. 213–27.
Agarwal, P. K., et al. “Efficient Indexes for Diverse Top-k Range Queries.” Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, 2020, pp. 213–27. Scopus, doi:10.1145/3375395.3387667.
Agarwal PK, Sintos S, Steiger A. Efficient Indexes for Diverse Top-k Range Queries. Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. 2020. p. 213–227.

Published In

Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems

DOI

Publication Date

June 14, 2020

Start / End Page

213 / 227