Scholars@Duke publication: Speedup your analytics: Automatic parameter tuning for databases and big data systems

Speedup your analytics: Automatic parameter tuning for databases and big data systems

Publication , Conference

Lu, J; Chen, Y; Herodotou, H; Babu, S

Published in: Proceedings of the VLDB Endowment

January 1, 2018

Database and big data analytics systems such as Hadoop and Spark have a large number of configuration parameters that control memory distribution, I/O optimization, parallelism, and compression. Improper parameter settings can cause significant performance degradation and stability issues. However, regular users and even expert administrators struggle to understand and tune them to achieve good performance. In this tutorial, we review existing approaches on automatic parameter tuning for databases, Hadoop, and Spark, which we classify into six categories: rule-based, cost modeling, simulation-based, experiment-driven, machine learning, and adaptive tuning. We describe the foundations of different automatic parameter tuning algorithms and present pros and cons of each approach. We also highlight real-world applications and systems, and identify research challenges for handling cloud services, resource heterogeneity, and real-time analytics.

Duke Scholars

Author Shivnath Babu Computer Science

Published In

Proceedings of the VLDB Endowment

DOI

10.14778/3352063.3352112

EISSN

2150-8097

Publication Date

January 1, 2018

Volume

Issue

Start / End Page

1970 / 1973

Related Subject Headings

4605 Data management and data science
0807 Library and Information Studies
0806 Information Systems
0802 Computation Theory and Mathematics

Citation

APA

Chicago

ICMJE

MLA

NLM

Lu, J., Chen, Y., Herodotou, H., & Babu, S. (2018). Speedup your analytics: Automatic parameter tuning for databases and big data systems. In Proceedings of the VLDB Endowment (Vol. 12, pp. 1970–1973). https://doi.org/10.14778/3352063.3352112

Lu, J., Y. Chen, H. Herodotou, and S. Babu. “Speedup your analytics: Automatic parameter tuning for databases and big data systems.” In Proceedings of the VLDB Endowment, 12:1970–73, 2018. https://doi.org/10.14778/3352063.3352112.

Lu J, Chen Y, Herodotou H, Babu S. Speedup your analytics: Automatic parameter tuning for databases and big data systems. In: Proceedings of the VLDB Endowment. 2018. p. 1970–3.

Lu, J., et al. “Speedup your analytics: Automatic parameter tuning for databases and big data systems.” Proceedings of the VLDB Endowment, vol. 12, no. 12, 2018, pp. 1970–73. Scopus, doi:10.14778/3352063.3352112.

Lu J, Chen Y, Herodotou H, Babu S. Speedup your analytics: Automatic parameter tuning for databases and big data systems. Proceedings of the VLDB Endowment. 2018. p. 1970–1973.

Published In

Proceedings of the VLDB Endowment

DOI

10.14778/3352063.3352112

EISSN

2150-8097

Publication Date

January 1, 2018

Volume

Issue

Start / End Page

1970 / 1973

Related Subject Headings

4605 Data management and data science
0807 Library and Information Studies
0806 Information Systems
0802 Computation Theory and Mathematics