Interaction-aware scheduling of report-generation workloads

Journal Article

The typical workload in a database system consists of a mix of multiple queries of different types that run concurrently. Interactions among the different queries in a query mix can have a significant impact on database performance. Hence, optimizing database performance requires reasoning about query mixes rather than considering queries individually. Current database systems lack the ability to do such reasoning. We propose a new approach based on planning experiments and statistical modeling to capture the impact of query interactions. Our approach requires no prior assumptions about the internal workings of the database system or the nature and cause of query interactions, making it portable across systems. To demonstrate the potential of modeling and exploiting query interactions, we have developed a novel interaction-aware query scheduler for report-generation workloads. Our scheduler, called QShuffler, uses two query scheduling algorithms that leverage models of query interactions. The first algorithm is optimized for workloads where queries are submitted in large batches. The second algorithm targets workloads where queries arrive continuously, and scheduling decisions have to be made online. We report an experimental evaluation of QShuffler using TPC-H workloads running on IBM DB2. The evaluation shows that QShuffler, by modeling and exploiting query interactions, can consistently outperform (up to 4x) query schedulers in current database systems. © 2011 Springer-Verlag.

Cited Authors

  • Ahmad, M; Aboulnaga, A; Babu, S; Munagala, K

Published Date

  • August 1, 2011

Published In

Volume / Issue

  • 20 / 4

Start / End Page

  • 589 - 615

Electronic International Standard Serial Number (EISSN)

  • 0949-877X

International Standard Serial Number (ISSN)

  • 1066-8888

Digital Object Identifier (DOI)

  • 10.1007/s00778-011-0217-y

Citation Source

  • Scopus