Skip to main content

Minimizing completion time of a program by checkpointing and rejuvenation

Publication ,  Conference
Garg, S; Huang, Y; Kintala, C; Trivedi, KS
Published in: SIGMETRICS 1996 - Proceedings of the 1996 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems
May 15, 1996

Checkpointing with rollback-recovery is a well known technique to reduce the completion time of a program in the presence of failures. While checkpointing is corrective in nature, rejuvenation refers to preventive maintenance of software aimed to reduce unexpected failures mostly resulting from the "aging"phenomenon. In this paper, we show how both these techniques may be used together to further reduce the expected completion time of a program. The idea of using checkpoints to reduce the amount of rollback upon a failure is taken a step further by combining it with rejuvenation. We derive the equations for expected completion time of a program with finite failure free running time for the following three cases when; (a) neither checkpointing nor rejuvenation is employed, (b) only checkpointing is employed, and finally (c) both checkpointing and rejuvenation are employed.We also present numerical results for Weibull failure time distribution for the above three cases and discuss optimal checkpointing and rejuvenation that minimizes the expected completion time. Using the numerical results, some interesting conclusions are drawn about benefits of these techniques in relation to the nature of failure distribution.

Duke Scholars

Published In

SIGMETRICS 1996 - Proceedings of the 1996 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems

DOI

Publication Date

May 15, 1996

Start / End Page

252 / 261
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Garg, S., Huang, Y., Kintala, C., & Trivedi, K. S. (1996). Minimizing completion time of a program by checkpointing and rejuvenation. In SIGMETRICS 1996 - Proceedings of the 1996 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (pp. 252–261). https://doi.org/10.1145/233013.233050
Garg, S., Y. Huang, C. Kintala, and K. S. Trivedi. “Minimizing completion time of a program by checkpointing and rejuvenation.” In SIGMETRICS 1996 - Proceedings of the 1996 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 252–61, 1996. https://doi.org/10.1145/233013.233050.
Garg S, Huang Y, Kintala C, Trivedi KS. Minimizing completion time of a program by checkpointing and rejuvenation. In: SIGMETRICS 1996 - Proceedings of the 1996 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems. 1996. p. 252–61.
Garg, S., et al. “Minimizing completion time of a program by checkpointing and rejuvenation.” SIGMETRICS 1996 - Proceedings of the 1996 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 1996, pp. 252–61. Scopus, doi:10.1145/233013.233050.
Garg S, Huang Y, Kintala C, Trivedi KS. Minimizing completion time of a program by checkpointing and rejuvenation. SIGMETRICS 1996 - Proceedings of the 1996 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems. 1996. p. 252–261.

Published In

SIGMETRICS 1996 - Proceedings of the 1996 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems

DOI

Publication Date

May 15, 1996

Start / End Page

252 / 261