Skip to main content

Chain: Operator Scheduling for Memory Minimization in Data Stream Systems

Publication ,  Journal Article
Babcock, B; Babu, S; Datar, M; Motwani, R
Published in: Proceedings of the ACM SIGMOD International Conference on Management of Data
December 1, 2003

In many applications involving continuous data streams, data arrival is bursty and data rate fluctuates over time. Systems that seek to give rapid or real-time query responses in such an environment must be prepared to deal gracefully with bursts in data arrival without compromising system performance. We discuss one strategy for processing bursty streams - adaptive, load-aware scheduling of query operators to minimize resource consumption during times of peak load. We show that the choice of an operator scheduling strategy can have significant impact on the run-time system memory usage. We then present Chain scheduling, an operator scheduling strategy for data stream systems that is near-optimal in minimizing run-time memory usage for any collection of single-stream queries involving selections, projections, and foreign-key joins with stored relations. Chain scheduling also performs well for queries with sliding-window joins over multiple streams, and multiple queries of the above types. A thorough experimental evaluation is provided where we demonstrate the potential benefits of Chain scheduling, compare it with competing scheduling strategies, and validate our analytical conclusions.

Duke Scholars

Published In

Proceedings of the ACM SIGMOD International Conference on Management of Data

ISSN

0730-8078

Publication Date

December 1, 2003

Start / End Page

253 / 264
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Babcock, B., Babu, S., Datar, M., & Motwani, R. (2003). Chain: Operator Scheduling for Memory Minimization in Data Stream Systems. Proceedings of the ACM SIGMOD International Conference on Management of Data, 253–264.
Babcock, B., S. Babu, M. Datar, and R. Motwani. “Chain: Operator Scheduling for Memory Minimization in Data Stream Systems.” Proceedings of the ACM SIGMOD International Conference on Management of Data, December 1, 2003, 253–64.
Babcock B, Babu S, Datar M, Motwani R. Chain: Operator Scheduling for Memory Minimization in Data Stream Systems. Proceedings of the ACM SIGMOD International Conference on Management of Data. 2003 Dec 1;253–64.
Babcock, B., et al. “Chain: Operator Scheduling for Memory Minimization in Data Stream Systems.” Proceedings of the ACM SIGMOD International Conference on Management of Data, Dec. 2003, pp. 253–64.
Babcock B, Babu S, Datar M, Motwani R. Chain: Operator Scheduling for Memory Minimization in Data Stream Systems. Proceedings of the ACM SIGMOD International Conference on Management of Data. 2003 Dec 1;253–264.

Published In

Proceedings of the ACM SIGMOD International Conference on Management of Data

ISSN

0730-8078

Publication Date

December 1, 2003

Start / End Page

253 / 264