Skip to main content

A Novel Architecture Design for Output Significance Aligned Flow with Adaptive Control in ReRAM-based Neural Network Accelerator

Publication ,  Journal Article
Li, T; Jing, N; Jiang, J; Wang, Q; Mao, Z; Chen, Y
Published in: ACM Transactions on Design Automation of Electronic Systems
November 22, 2022

Resistive-RAM-based (ReRAM-based) computing shows great potential on accelerating DNN inference by its highly parallel structure. Regrettably, computing accuracy in practical is much lower than expected due to the non-ideal ReRAM device. Conventional computing flow with fixed wordline activation scheme can effectively protect computing accuracy but at the cost of significant performance and energy savings reduction. For such embarrassment of accuracy, performance and energy, this article proposes a new Adaptive-Wordline-Activation control scheme (AWA-control) and combines it with a theoretical Output-Significance-Aligned computing flow (OSA-flow) to enable fine-grained control on output significance with distinct impact on final result. We demonstrate AWA-control-supported OSA-flow architecture with maximal compatibility to conventional crossbar by input retiming and weight remapping using shifting registers to enable the new flow. However, in contrast to the conventional computing architecture, the OSA-flow architecture shows the better capability to exploit data sparsity commonly seen in DNN models. So we also design a sparsity-aware OSA-flow architecture for further DNN speedup. Evaluation results show that OSA-flow architecture can provide significant performance improvement of 21.6×, and energy savings of 96.2% over conventional computing architecture with similar DNN accuracy.

Duke Scholars

Published In

ACM Transactions on Design Automation of Electronic Systems

DOI

EISSN

1557-7309

ISSN

1084-4309

Publication Date

November 22, 2022

Volume

27

Issue

6

Related Subject Headings

  • Design Practice & Management
  • 4612 Software engineering
  • 4606 Distributed computing and systems software
  • 4009 Electronics, sensors and digital hardware
  • 1006 Computer Hardware
  • 0803 Computer Software
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Li, T., Jing, N., Jiang, J., Wang, Q., Mao, Z., & Chen, Y. (2022). A Novel Architecture Design for Output Significance Aligned Flow with Adaptive Control in ReRAM-based Neural Network Accelerator. ACM Transactions on Design Automation of Electronic Systems, 27(6). https://doi.org/10.1145/3510819
Li, T., N. Jing, J. Jiang, Q. Wang, Z. Mao, and Y. Chen. “A Novel Architecture Design for Output Significance Aligned Flow with Adaptive Control in ReRAM-based Neural Network Accelerator.” ACM Transactions on Design Automation of Electronic Systems 27, no. 6 (November 22, 2022). https://doi.org/10.1145/3510819.
Li T, Jing N, Jiang J, Wang Q, Mao Z, Chen Y. A Novel Architecture Design for Output Significance Aligned Flow with Adaptive Control in ReRAM-based Neural Network Accelerator. ACM Transactions on Design Automation of Electronic Systems. 2022 Nov 22;27(6).
Li, T., et al. “A Novel Architecture Design for Output Significance Aligned Flow with Adaptive Control in ReRAM-based Neural Network Accelerator.” ACM Transactions on Design Automation of Electronic Systems, vol. 27, no. 6, Nov. 2022. Scopus, doi:10.1145/3510819.
Li T, Jing N, Jiang J, Wang Q, Mao Z, Chen Y. A Novel Architecture Design for Output Significance Aligned Flow with Adaptive Control in ReRAM-based Neural Network Accelerator. ACM Transactions on Design Automation of Electronic Systems. 2022 Nov 22;27(6).

Published In

ACM Transactions on Design Automation of Electronic Systems

DOI

EISSN

1557-7309

ISSN

1084-4309

Publication Date

November 22, 2022

Volume

27

Issue

6

Related Subject Headings

  • Design Practice & Management
  • 4612 Software engineering
  • 4606 Distributed computing and systems software
  • 4009 Electronics, sensors and digital hardware
  • 1006 Computer Hardware
  • 0803 Computer Software