Skip to main content

BitSystolic: A 26.7 TOPS/W 2b8b NPU with Configurable Data Flows for Edge Devices

Publication ,  Journal Article
Yang, Q; Li, H
Published in: IEEE Transactions on Circuits and Systems I: Regular Papers
March 1, 2021

Efficient deployment of deep neural networks (DNNs) emerges with the exploding demand for artificial intelligence on edge devices. Mixed-precision inference with both compressed model and reduced computation cost enlightens a way for accurate and efficient DNN deployments. Despite obtaining mixed-precision DNN models at the algorithmic level, there still lacks sufficient hardware support. In this work, we propose BitSystolic, a neural processing unit based on a systolic array structure. In BitSystolic, the numerical precision of both weights and activations can be configured in the range of 2-8 bits, fulfilling different requirements across mixed-precision models and tasks. Moreover, BitSystolic can support various data flows presented in different types of neural layers (e.g., convolution, fully-connected, and recurrent neural layers) and adaptive optimization of data reuse by switching between the matrix-matrix mode and vector-matrix mode. We designed and fabricated the proposed BitSystolic composed of a 16\times 16 systolic array. Our measurement results show that BitSystolic features the unified power efficiency of up to 26.7 TOPS/W with 17.8 mW peak power consumption across various layer types.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

IEEE Transactions on Circuits and Systems I: Regular Papers

DOI

EISSN

1558-0806

ISSN

1549-8328

Publication Date

March 1, 2021

Volume

68

Issue

3

Start / End Page

1134 / 1145

Related Subject Headings

  • Electrical & Electronic Engineering
  • 4009 Electronics, sensors and digital hardware
  • 0906 Electrical and Electronic Engineering
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Yang, Q., & Li, H. (2021). BitSystolic: A 26.7 TOPS/W 2b8b NPU with Configurable Data Flows for Edge Devices. IEEE Transactions on Circuits and Systems I: Regular Papers, 68(3), 1134–1145. https://doi.org/10.1109/TCSI.2020.3043778
Yang, Q., and H. Li. “BitSystolic: A 26.7 TOPS/W 2b8b NPU with Configurable Data Flows for Edge Devices.” IEEE Transactions on Circuits and Systems I: Regular Papers 68, no. 3 (March 1, 2021): 1134–45. https://doi.org/10.1109/TCSI.2020.3043778.
Yang Q, Li H. BitSystolic: A 26.7 TOPS/W 2b8b NPU with Configurable Data Flows for Edge Devices. IEEE Transactions on Circuits and Systems I: Regular Papers. 2021 Mar 1;68(3):1134–45.
Yang, Q., and H. Li. “BitSystolic: A 26.7 TOPS/W 2b8b NPU with Configurable Data Flows for Edge Devices.” IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 68, no. 3, Mar. 2021, pp. 1134–45. Scopus, doi:10.1109/TCSI.2020.3043778.
Yang Q, Li H. BitSystolic: A 26.7 TOPS/W 2b8b NPU with Configurable Data Flows for Edge Devices. IEEE Transactions on Circuits and Systems I: Regular Papers. 2021 Mar 1;68(3):1134–1145.

Published In

IEEE Transactions on Circuits and Systems I: Regular Papers

DOI

EISSN

1558-0806

ISSN

1549-8328

Publication Date

March 1, 2021

Volume

68

Issue

3

Start / End Page

1134 / 1145

Related Subject Headings

  • Electrical & Electronic Engineering
  • 4009 Electronics, sensors and digital hardware
  • 0906 Electrical and Electronic Engineering