Scholars@Duke publication: Execution and optimization of continuous queries with cyclops

Execution and optimization of continuous queries with cyclops

Publication , Journal Article

Lim, H; Babu, S

Published in: Proceedings of the ACM SIGMOD International Conference on Management of Data

July 29, 2013

As the data collected by enterprises grows in scale, there is a growing trend of performing data analytics on large datasets. Batch processing systems that can handle petabyte scale of data, such as Hadoop, have flourished and gained traction in the industry. As the results of batch analytics have been used to continuously improve front-facing user experience, there is a growing interest in pushing the processing latency down. This trend has fueled a resurgence in the development and usage of execution engines that can process continuous queries. An important class of continuous queries is windowed aggregation queries. Such queries arise in a wide range of applications such as generating personalized content and results. Today, considerable manual effort goes into finding the most suitable execution engine for these queries and on tuning query performance on these engines. An ecosystem composed of multiple execution engines may be needed in order to run the overall query workload efficiently given the diverse set of requirements that arise in practice. Cyclops is a continuous query processing platform that manages and orchestrates windowed aggregation queries in an ecosystem composed of multiple continuous query execution engines. Cyclops employs a cost-based approach for picking the most suitable engine and plan for executing a given query. This demonstration first presents an interactive visualization of the rich execution plan space of windowed aggregation queries, which allows users to analyze and understand the differences among plans. The next part of the demonstration will drill down into the design of Cyclops. For a given query, we show the cost spectrum of query execution plans across three different execution engines - Esper, Storm, and Hadoop - as estimated by Cyclops. Copyright © 2013 ACM.

Duke Scholars

Author Shivnath Babu Computer Science

Published In

Proceedings of the ACM SIGMOD International Conference on Management of Data

DOI

10.1145/2463676.2465248

ISSN

0730-8078

Publication Date

July 29, 2013

Start / End Page

1069 / 1072

Citation

APA

Chicago

ICMJE

MLA

NLM

Lim, H., & Babu, S. (2013). Execution and optimization of continuous queries with cyclops. Proceedings of the ACM SIGMOD International Conference on Management of Data, 1069–1072. https://doi.org/10.1145/2463676.2465248

Lim, H., and S. Babu. “Execution and optimization of continuous queries with cyclops.” Proceedings of the ACM SIGMOD International Conference on Management of Data, July 29, 2013, 1069–72. https://doi.org/10.1145/2463676.2465248.

Lim H, Babu S. Execution and optimization of continuous queries with cyclops. Proceedings of the ACM SIGMOD International Conference on Management of Data. 2013 Jul 29;1069–72.

Lim, H., and S. Babu. “Execution and optimization of continuous queries with cyclops.” Proceedings of the ACM SIGMOD International Conference on Management of Data, July 2013, pp. 1069–72. Scopus, doi:10.1145/2463676.2465248.

Lim H, Babu S. Execution and optimization of continuous queries with cyclops. Proceedings of the ACM SIGMOD International Conference on Management of Data. 2013 Jul 29;1069–1072.

Published In

Proceedings of the ACM SIGMOD International Conference on Management of Data

DOI

10.1145/2463676.2465248

ISSN

0730-8078

Publication Date

July 29, 2013

Start / End Page

1069 / 1072