Skip to main content

Speedup your analytics: Automatic parameter tuning for databases and big data systems

Publication ,  Conference
Lu, J; Chen, Y; Herodotou, H; Babu, S
Published in: Proceedings of the VLDB Endowment
January 1, 2018

Database and big data analytics systems such as Hadoop and Spark have a large number of configuration parameters that control memory distribution, I/O optimization, parallelism, and compression. Improper parameter settings can cause significant performance degradation and stability issues. However, regular users and even expert administrators struggle to understand and tune them to achieve good performance. In this tutorial, we review existing approaches on automatic parameter tuning for databases, Hadoop, and Spark, which we classify into six categories: rule-based, cost modeling, simulation-based, experiment-driven, machine learning, and adaptive tuning. We describe the foundations of different automatic parameter tuning algorithms and present pros and cons of each approach. We also highlight real-world applications and systems, and identify research challenges for handling cloud services, resource heterogeneity, and real-time analytics.

Duke Scholars

Published In

Proceedings of the VLDB Endowment

DOI

EISSN

2150-8097

Publication Date

January 1, 2018

Volume

12

Issue

12

Start / End Page

1970 / 1973

Related Subject Headings

  • 4605 Data management and data science
  • 0807 Library and Information Studies
  • 0806 Information Systems
  • 0802 Computation Theory and Mathematics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Lu, J., Chen, Y., Herodotou, H., & Babu, S. (2018). Speedup your analytics: Automatic parameter tuning for databases and big data systems. In Proceedings of the VLDB Endowment (Vol. 12, pp. 1970–1973). https://doi.org/10.14778/3352063.3352112
Lu, J., Y. Chen, H. Herodotou, and S. Babu. “Speedup your analytics: Automatic parameter tuning for databases and big data systems.” In Proceedings of the VLDB Endowment, 12:1970–73, 2018. https://doi.org/10.14778/3352063.3352112.
Lu J, Chen Y, Herodotou H, Babu S. Speedup your analytics: Automatic parameter tuning for databases and big data systems. In: Proceedings of the VLDB Endowment. 2018. p. 1970–3.
Lu, J., et al. “Speedup your analytics: Automatic parameter tuning for databases and big data systems.” Proceedings of the VLDB Endowment, vol. 12, no. 12, 2018, pp. 1970–73. Scopus, doi:10.14778/3352063.3352112.
Lu J, Chen Y, Herodotou H, Babu S. Speedup your analytics: Automatic parameter tuning for databases and big data systems. Proceedings of the VLDB Endowment. 2018. p. 1970–1973.

Published In

Proceedings of the VLDB Endowment

DOI

EISSN

2150-8097

Publication Date

January 1, 2018

Volume

12

Issue

12

Start / End Page

1970 / 1973

Related Subject Headings

  • 4605 Data management and data science
  • 0807 Library and Information Studies
  • 0806 Information Systems
  • 0802 Computation Theory and Mathematics