Scholars@Duke publication: BOLT: Energy-efficient out-of-order latency-tolerant execution

BOLT: Energy-efficient out-of-order latency-tolerant execution

Publication , Conference

Hilton, A; Roth, A

Published in: Proceedings International Symposium on High Performance Computer Architecture

January 1, 2010

Published version (DOI) Open Access Copy (Duke)

LT (latency tolerant) execution is an attractive candidate technique for future out-of-order cores. LT defers the forward slices of LLC (last-level cache) misses to a slice buffer and re-executes them when the misses return. An LT core increases ILP without physically scaling the issue queue and register file and increases MLP without additional software threads that can reduce cache performance. Unfortunately, proposed LT designs are not energy ef.cient. They require too many additional structures and they defer and re-execute too many instructions to justify their performance gains. In this paper, we address these inefficiencies. We introduce a microarchitecture called BOLT (Better Out-of-Order Latency-Tolerance) that implements LT as an alternative use of SMT (Simultaneous Multi-Threading). We also present a new slice buffer organization and traversal scheme that increases performance and reduces overhead by pruning instances of useless and redundant LT. Collectively, these modifications turn out-of-order LT into a technique that improves performance in an energy-efficient way. ©2009 IEEE.

Duke Scholars

Author Andrew Douglas Hilton Electrical and Computer Engineering

Published In

Proceedings International Symposium on High Performance Computer Architecture

DOI

10.1109/hpca.2010.5416634

ISSN

1530-0897

Publication Date

January 1, 2010

Citation

APA

Chicago

ICMJE

MLA

NLM

Hilton, A., & Roth, A. (2010). BOLT: Energy-efficient out-of-order latency-tolerant execution. In Proceedings International Symposium on High Performance Computer Architecture. https://doi.org/10.1109/hpca.2010.5416634

Hilton, A., and A. Roth. “BOLT: Energy-efficient out-of-order latency-tolerant execution.” In Proceedings International Symposium on High Performance Computer Architecture, 2010. https://doi.org/10.1109/hpca.2010.5416634.

Hilton A, Roth A. BOLT: Energy-efficient out-of-order latency-tolerant execution. In: Proceedings International Symposium on High Performance Computer Architecture. 2010.

Hilton, A., and A. Roth. “BOLT: Energy-efficient out-of-order latency-tolerant execution.” Proceedings International Symposium on High Performance Computer Architecture, 2010. Scopus, doi:10.1109/hpca.2010.5416634.

Hilton A, Roth A. BOLT: Energy-efficient out-of-order latency-tolerant execution. Proceedings International Symposium on High Performance Computer Architecture. 2010.

Published In

Proceedings International Symposium on High Performance Computer Architecture

DOI

10.1109/hpca.2010.5416634

ISSN

1530-0897

Publication Date

January 1, 2010