Skip to main content

An efficient STT-RAM-based register file in GPU architectures

Publication ,  Conference
Liu, X; Mao, M; Bi, X; Li, H; Chen, Y
Published in: 20th Asia and South Pacific Design Automation Conference, ASP-DAC 2015
March 11, 2015

Modern GPGPUs employ a large register file (RF) to efficiently process heavily parallel threads in single instruction multiple thread (SIMT) fashion. The up-scaling of RF capacity, however, is greatly constrained by large cell area and high leakage power consumption of SRAM implementation. In this work, we propose a novel GPU RF design based on the emerging multi-level cell (MLC) spin-transfer torque RAM (STT-RAM) technology. Compared to SRAM, MLC STT-RAM (or MLC-STT) has much smaller cell area and almost zero standby power due to its non-volatility. Moreover, by leveraging the asymmetric performance of the soft and the hard bits of a MLC-STT cell, we propose a remapping strategy to perform a flexible tradeoff between the access time and the capacity of the RF based on run-time access patterns. A novel rescheduling scheme is also developed to minimize the waiting time of the issued warps to access register banks. Experimental results over ISPASS2009 and CUDA benchmarks show that on average, our proposed MLC-STT RF can achieve 3.28% performance improvement, 9.48% energy reduction, and 38.9% energy efficiency enhancement compared to conventional SRAM-based design.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

20th Asia and South Pacific Design Automation Conference, ASP-DAC 2015

DOI

Publication Date

March 11, 2015

Start / End Page

490 / 495
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Liu, X., Mao, M., Bi, X., Li, H., & Chen, Y. (2015). An efficient STT-RAM-based register file in GPU architectures. In 20th Asia and South Pacific Design Automation Conference, ASP-DAC 2015 (pp. 490–495). https://doi.org/10.1109/ASPDAC.2015.7059054
Liu, X., M. Mao, X. Bi, H. Li, and Y. Chen. “An efficient STT-RAM-based register file in GPU architectures.” In 20th Asia and South Pacific Design Automation Conference, ASP-DAC 2015, 490–95, 2015. https://doi.org/10.1109/ASPDAC.2015.7059054.
Liu X, Mao M, Bi X, Li H, Chen Y. An efficient STT-RAM-based register file in GPU architectures. In: 20th Asia and South Pacific Design Automation Conference, ASP-DAC 2015. 2015. p. 490–5.
Liu, X., et al. “An efficient STT-RAM-based register file in GPU architectures.” 20th Asia and South Pacific Design Automation Conference, ASP-DAC 2015, 2015, pp. 490–95. Scopus, doi:10.1109/ASPDAC.2015.7059054.
Liu X, Mao M, Bi X, Li H, Chen Y. An efficient STT-RAM-based register file in GPU architectures. 20th Asia and South Pacific Design Automation Conference, ASP-DAC 2015. 2015. p. 490–495.

Published In

20th Asia and South Pacific Design Automation Conference, ASP-DAC 2015

DOI

Publication Date

March 11, 2015

Start / End Page

490 / 495