Scholars@Duke publication: Automated performance management for the big data stack

Publication, Conference

Arvanitis, A; Babu, S; Chu, E; Popescu, A; Simitsis, A; Wilkinson, K

Published in: CIDR 2019 - 9th Biennial Conference on Innovative Data Systems Research

January 1, 2019

More than 10,000 enterprises worldwide today use the big data stack, which is composed of multiple distributed systems. At Unravel, we have worked with a representative sample of these enterprises that covers most industry verticals. This sample also covers the spectrum of choices for deploying the big data stack across on-premises datacenters, private cloud deployments, public cloud deployments, and hybrid combinations of these. In this paper, we aim to bring attention to the performance management requirements that arise in big data stacks. We provide an overview of the requirements both at the level of individual applications and at the level of holistic clusters and workloads. We present an architecture that can provide automated solutions for these requirements and then do a deep dive into a few of these solutions.

Duke Scholars

Author: Shivnath Babu, Computer Science

Citation

APA: Arvanitis, A., Babu, S., Chu, E., Popescu, A., Simitsis, A., & Wilkinson, K. (2019). Automated performance management for the big data stack. In CIDR 2019 - 9th Biennial Conference on Innovative Data Systems Research.

Chicago: Arvanitis, A., S. Babu, E. Chu, A. Popescu, A. Simitsis, and K. Wilkinson. “Automated performance management for the big data stack.” In CIDR 2019 - 9th Biennial Conference on Innovative Data Systems Research, 2019.

ICMJE: Arvanitis A, Babu S, Chu E, Popescu A, Simitsis A, Wilkinson K. Automated performance management for the big data stack. In: CIDR 2019 - 9th Biennial Conference on Innovative Data Systems Research. 2019.

MLA: Arvanitis, A., et al. “Automated performance management for the big data stack.” CIDR 2019 - 9th Biennial Conference on Innovative Data Systems Research, 2019.

NLM: Arvanitis A, Babu S, Chu E, Popescu A, Simitsis A, Wilkinson K. Automated performance management for the big data stack. CIDR 2019 - 9th Biennial Conference on Innovative Data Systems Research. 2019.