Scholars@Duke publication: Active and accelerated learning of cost models for optimizing scientific applications

Active and accelerated learning of cost models for optimizing scientific applications

Publication , Conference

Shivam, P; Babu, S; Chase, J

Published in: VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases

January 1, 2006

We present the NIMO system that automatically learns cost models for predicting the execution time of computational-science applications running on large-scale networked utilities such as computational grids. Accurate cost models are important for selecting efficient plans for executing these applications on the utility. Computational-science applications are often scripts (written, e.g., in languages like Perl or Matlab) connected using a workflow-description language, and therefore, pose different challenges compared to modeling the execution of plans for declarative queries with well-Understood semantics. NIMO generates appropriate training samples for these applications to learn fairly-accurate cost models quickly using statistical learning techniques. NIMO's approach is active and noninvasive: it actively deploys and monitors the application under varying conditions, and obtains its training data from passive instrumentation streams that require no changes to the operating system or applications. Our experiments with real scientific applications demonstrate that NIMO significantly reduces the number of training samples and the time to learn fairly-accurate cost models. Copyright 2006 VLDB Endowment, ACM.

Duke Scholars

Author Shivnath Babu Computer Science

Author Jeffrey S. Chase Computer Science

Published In

VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases

Publication Date

January 1, 2006

Start / End Page

535 / 546

Citation

APA

Chicago

ICMJE

MLA

NLM

Shivam, P., Babu, S., & Chase, J. (2006). Active and accelerated learning of cost models for optimizing scientific applications. In VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases (pp. 535–546).

Shivam, P., S. Babu, and J. Chase. “Active and accelerated learning of cost models for optimizing scientific applications.” In VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases, 535–46, 2006.

Shivam P, Babu S, Chase J. Active and accelerated learning of cost models for optimizing scientific applications. In: VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases. 2006. p. 535–46.

Shivam, P., et al. “Active and accelerated learning of cost models for optimizing scientific applications.” VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases, 2006, pp. 535–46.

Shivam P, Babu S, Chase J. Active and accelerated learning of cost models for optimizing scientific applications. VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases. 2006. p. 535–546.

Published In

VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases

Publication Date

January 1, 2006

Start / End Page

535 / 546