ESSENCE: Exploiting Structured Stochastic Gradient Pruning for Endurance-Aware ReRAM-Based In-Memory Training Systems
Processing-in-memory (PIM) enables energy-efficient deployment of convolutional neural networks (CNNs) from edge to cloud. Resistive random-access memory (ReRAM) is one of the most commonly used technologies for PIM architectures. A primary limitation of ReRAM-based PIM for neural network training is the device's limited write endurance, which frequent weight updates quickly exhaust. To make ReRAM-based architectures viable for CNN training, this endurance issue must be addressed. This work aims to reduce the number of weight reprogramming operations without compromising final model accuracy. We propose the ESSENCE framework with an endurance-aware structured stochastic gradient pruning method, which dynamically adjusts the probability of each gradient update based on the current update counts. Experimental results with multiple CNNs and datasets demonstrate that the proposed method extends ReRAM lifetime during training. For instance, with the ResNet20 network and the CIFAR-10 dataset, ESSENCE reduces the mean update count by up to 10.29× compared to the stochastic gradient descent method and effectively reduces the maximum update count compared with the No Endurance method. Furthermore, an aggressive tuning method based on ESSENCE can boost the mean update count savings to up to 14.41×.
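To make the idea concrete, the following is a minimal sketch of endurance-aware structured stochastic gradient pruning. It assumes per-row update groups, a count-inverse probability rule, and 1/p gradient rescaling; these specific choices (function name, scaling formula, group granularity) are illustrative assumptions, not the paper's exact algorithm.

```python
import numpy as np

rng = np.random.default_rng(0)

def endurance_aware_update(W, grad, counts, lr=0.01, base_p=1.0):
    """Apply a structured stochastic gradient update to weight matrix W.

    Each row of W stands for one group of ReRAM cells; `counts` tracks how
    many times each row has been reprogrammed. Rows with higher update
    counts are updated with lower probability, spreading wear across the
    array. The probability rule below is an illustrative assumption.
    """
    # Probability of updating row i shrinks as its count exceeds the mean.
    mean_count = counts.mean() + 1e-8
    p = np.clip(base_p * mean_count / (counts + 1e-8), 0.0, 1.0)
    mask = rng.random(len(p)) < p          # Bernoulli draw per row
    # Rescale surviving gradients by 1/p so the update stays unbiased.
    scale = np.where(mask, 1.0 / np.maximum(p, 1e-8), 0.0)
    W -= lr * grad * scale[:, None]
    counts += mask.astype(counts.dtype)    # one reprogram per updated row
    return W, counts

# Toy usage: a 4x3 weight matrix trained for a few steps with random gradients.
W = rng.standard_normal((4, 3))
counts = np.zeros(4)
for _ in range(100):
    grad = rng.standard_normal((4, 3))
    W, counts = endurance_aware_update(W, grad, counts)
```

Rows that fall behind in update count see their probability rise above the mean-scaled baseline, so wear is balanced rather than merely reduced, which is what distinguishes an endurance-aware rule from plain random gradient dropping.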
Related Subject Headings
- Computer Hardware & Architecture
- 4607 Graphics, augmented reality and games
- 4009 Electronics, sensors and digital hardware
- 1006 Computer Hardware
- 0906 Electrical and Electronic Engineering