Recovery from Software Failures Caused by Mandelbugs

Publication, Journal Article
Grottke, M; Kim, DS; Mansharamani, R; Nambiar, M; Natella, R; Trivedi, KS
Published in: IEEE Transactions on Reliability
March 1, 2016

Software failures are still a major concern in mission- and enterprise-critical contexts, despite the significant effort spent on software testing. In fact, while software testing is effective against easily reproducible bugs (Bohrbugs), it is considerably less suitable for dealing with bugs that lead to hard-to-reproduce failures (Mandelbugs). On the positive side, the elusive nature of Mandelbugs provides opportunities for failure recovery, which are investigated in this paper. Based on real cases of Mandelbugs in eleven Information Technology (IT) systems running in production, the paper proposes a model that describes the recovery processes in IT systems. It then presents closed-form expressions for, and a numerical analysis of, the mean time to recovery and the software (un)availability. This analysis allows the designer to compare recovery strategies and to determine which parameters most strongly influence the efficacy of recovery from failures caused by Mandelbugs.

Published In

IEEE Transactions on Reliability

DOI

10.1109/TR.2015.2452933

ISSN

0018-9529

Publication Date

March 1, 2016

Volume

65

Issue

1

Start / End Page

70 / 87

Related Subject Headings

  • Operations Research
  • 4612 Software engineering
  • 4010 Engineering practice and education
  • 0906 Electrical and Electronic Engineering
  • 0803 Computer Software
 

Citation

APA: Grottke, M., Kim, D. S., Mansharamani, R., Nambiar, M., Natella, R., & Trivedi, K. S. (2016). Recovery from Software Failures Caused by Mandelbugs. IEEE Transactions on Reliability, 65(1), 70–87. https://doi.org/10.1109/TR.2015.2452933
Chicago: Grottke, M., D. S. Kim, R. Mansharamani, M. Nambiar, R. Natella, and K. S. Trivedi. “Recovery from Software Failures Caused by Mandelbugs.” IEEE Transactions on Reliability 65, no. 1 (March 1, 2016): 70–87. https://doi.org/10.1109/TR.2015.2452933.
ICMJE: Grottke M, Kim DS, Mansharamani R, Nambiar M, Natella R, Trivedi KS. Recovery from Software Failures Caused by Mandelbugs. IEEE Transactions on Reliability. 2016 Mar 1;65(1):70–87.
MLA: Grottke, M., et al. “Recovery from Software Failures Caused by Mandelbugs.” IEEE Transactions on Reliability, vol. 65, no. 1, Mar. 2016, pp. 70–87. Scopus, doi:10.1109/TR.2015.2452933.
NLM: Grottke M, Kim DS, Mansharamani R, Nambiar M, Natella R, Trivedi KS. Recovery from Software Failures Caused by Mandelbugs. IEEE Transactions on Reliability. 2016 Mar 1;65(1):70–87.