Skip to main content

Logging every footstep: Quantile summaries for the entire history

Publication ,  Conference
Tao, Y; Yi, K; Sheng, C; Pei, J; Li, F
Published in: Proceedings of the ACM SIGMOD International Conference on Management of Data
July 23, 2010

Quantiles are a crucial type of order statistics in databases. Extensive research has been focused on maintaining a space-efficient structure for approximate quantile computation as the underlying dataset is updated. The existing solutions, however, are designed to support only the current, most-updated, snapshot of the dataset. Queries on the past versions of the data cannot be answered. This paper studies the problem of historical quantile search. The objective is to enable ε-approximate quantile retrieval on any snapshot of the dataset in history. The problem is very important in analyzing the evolution of a distribution, monitoring the quality of services, query optimization in temporal databases, and so on. We present the first formal results in the literature. First, we prove a novel theoretical lower bound on the space cost of supporting ε-approximate historical quantile queries. The bound reveals the fundamental difference between answering quantile queries about the past and those about the present time. Second, we propose a structure for finding ε-approximate historical quantiles, and show that it consumes more space than the lower bound by only a square-logarithmic factor. Extensive experiments demonstrate that in practice our technique performs much better than predicted by theory. In particular, the quantiles it returns are remarkably more accurate than the theoretical precision guarantee. © 2010 ACM.

Duke Scholars

Published In

Proceedings of the ACM SIGMOD International Conference on Management of Data

DOI

ISSN

0730-8078

Publication Date

July 23, 2010

Start / End Page

639 / 650
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Tao, Y., Yi, K., Sheng, C., Pei, J., & Li, F. (2010). Logging every footstep: Quantile summaries for the entire history. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 639–650). https://doi.org/10.1145/1807167.1807237
Tao, Y., K. Yi, C. Sheng, J. Pei, and F. Li. “Logging every footstep: Quantile summaries for the entire history.” In Proceedings of the ACM SIGMOD International Conference on Management of Data, 639–50, 2010. https://doi.org/10.1145/1807167.1807237.
Tao Y, Yi K, Sheng C, Pei J, Li F. Logging every footstep: Quantile summaries for the entire history. In: Proceedings of the ACM SIGMOD International Conference on Management of Data. 2010. p. 639–50.
Tao, Y., et al. “Logging every footstep: Quantile summaries for the entire history.” Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010, pp. 639–50. Scopus, doi:10.1145/1807167.1807237.
Tao Y, Yi K, Sheng C, Pei J, Li F. Logging every footstep: Quantile summaries for the entire history. Proceedings of the ACM SIGMOD International Conference on Management of Data. 2010. p. 639–650.

Published In

Proceedings of the ACM SIGMOD International Conference on Management of Data

DOI

ISSN

0730-8078

Publication Date

July 23, 2010

Start / End Page

639 / 650