Scholars@Duke publication: Autonomic microprocessor execution via self-repairing arrays

Autonomic microprocessor execution via self-repairing arrays

Publication , Journal Article

Bower, FA; Ozev, S; Sorin, DJ

Published in: IEEE Transactions on Dependable and Secure Computing

January 1, 2005

To achieve high reliability despite hard faults that occur during operation and to achieve high yield despite defects introduced at fabrication, a microprocessor must be able to tolerate hard faults. In this paper, we present a framework for autonomic self-repair of the array structures in microprocessors (e.g., reorder buffer, instruction window, etc.). The framework consists of three aspects: 1) detecting/diagnosing the fault, 2) recovering from the resultant error, and 3) mapping out the faulty portion of the array. For each aspect, we present design options. Based on this framework, we develop two particular schemes for self-repairing array structures (SRAS). Simulation results show that one of our SRAS schemes adds some performance overhead in the fault-free case, but that both of them mask hard faults 1) with less hardware overhead cost than higher-level redundancy (e.g., IBM mainframes) and 2) without the per-error performance penalty of existing low-cost techniques that combine error detection with pipeline flushes for backward error recovery (BER). When hard faults are present in arrays, due to operational faults or fabrication defects, SRAS schemes outperform BER due to not having to frequently flush the pipeline. © 2005 IEEE.

Duke Scholars

Author Daniel J. Sorin Electrical and Computer Engineering

Altmetric Attention Stats

Dimensions Citation Stats

Published In

IEEE Transactions on Dependable and Secure Computing

DOI

10.1109/TDSC.2005.44

ISSN

1545-5971

Publication Date

January 1, 2005

Volume

Issue

Start / End Page

297 / 310

Related Subject Headings

Strategic, Defence & Security Studies
4606 Distributed computing and systems software
4604 Cybersecurity and privacy
0805 Distributed Computing
0804 Data Format
0803 Computer Software

Citation

APA

Chicago

ICMJE

MLA

NLM

Bower, F. A., Ozev, S., & Sorin, D. J. (2005). Autonomic microprocessor execution via self-repairing arrays. IEEE Transactions on Dependable and Secure Computing, 2(4), 297–310. https://doi.org/10.1109/TDSC.2005.44

Bower, F. A., S. Ozev, and D. J. Sorin. “Autonomic microprocessor execution via self-repairing arrays.” IEEE Transactions on Dependable and Secure Computing 2, no. 4 (January 1, 2005): 297–310. https://doi.org/10.1109/TDSC.2005.44.

Bower FA, Ozev S, Sorin DJ. Autonomic microprocessor execution via self-repairing arrays. IEEE Transactions on Dependable and Secure Computing. 2005 Jan 1;2(4):297–310.

Bower, F. A., et al. “Autonomic microprocessor execution via self-repairing arrays.” IEEE Transactions on Dependable and Secure Computing, vol. 2, no. 4, Jan. 2005, pp. 297–310. Scopus, doi:10.1109/TDSC.2005.44.

Bower FA, Ozev S, Sorin DJ. Autonomic microprocessor execution via self-repairing arrays. IEEE Transactions on Dependable and Secure Computing. 2005 Jan 1;2(4):297–310.

Published In

IEEE Transactions on Dependable and Secure Computing

DOI

10.1109/TDSC.2005.44

ISSN

1545-5971

Publication Date

January 1, 2005

Volume

Issue

Start / End Page

297 / 310

Related Subject Headings

Strategic, Defence & Security Studies
4606 Distributed computing and systems software
4604 Cybersecurity and privacy
0805 Distributed Computing
0804 Data Format
0803 Computer Software