Scholars@Duke publication: Designing Efficient Bit-Level Sparsity-Tolerant Memristive Networks.

Designing Efficient Bit-Level Sparsity-Tolerant Memristive Networks.

Publication , Journal Article

Lyu, B; Wen, S; Yang, Y; Chang, X; Sun, J; Chen, Y; Huang, T

Published in: IEEE transactions on neural networks and learning systems

September 2024

With the rapid progress of deep neural network (DNN) applications on memristive platforms, there has been a growing interest in the acceleration and compression of memristive networks. As an emerging model optimization technique for memristive platforms, bit-level sparsity training (with the fixed-point quantization) can significantly reduce the demand for analog-to-digital converters (ADCs) resolution, which is critical for energy and area consumption. However, the bit sparsity and the fixed-point quantization will inevitably lead to a large performance loss. Different from the existing training and optimization techniques, this work attempts to explore more sparsity-tolerant architectures to compensate for performance degradation. We first empirically demonstrate that in a certain search space (e.g., 4-bit quantized DARTS space), network architectures differ in bit-level sparsity tolerance. It is reasonable and necessary to search the architectures for efficient deployment on memristive platforms by the neural architecture search (NAS) technology. We further introduce bit-level sparsity-tolerant NAS (BST-NAS), which encapsulates low-precision quantization and bit-level sparsity training into the differentiable NAS, to explore the optimal bit-level sparsity-tolerant architectures. Experimentally, with the same degree of sparsity and experiment settings, our searched architectures obtain a promising performance, which outperform the normal NAS-based DARTS-series architectures (about 5.8% higher than that of DARTS-V2 and 2.7% higher than that of PC-DARTS) on CIFAR10.

Duke Scholars

Author Yiran Chen Electrical and Computer Engineering

Published In

IEEE transactions on neural networks and learning systems

DOI

10.1109/tnnls.2023.3250437

EISSN

2162-2388

ISSN

2162-237X

Publication Date

September 2024

Volume

Issue

Start / End Page

11979 / 11988

Citation

APA

Chicago

ICMJE

MLA

NLM

Lyu, B., Wen, S., Yang, Y., Chang, X., Sun, J., Chen, Y., & Huang, T. (2024). Designing Efficient Bit-Level Sparsity-Tolerant Memristive Networks. IEEE Transactions on Neural Networks and Learning Systems, 35(9), 11979–11988. https://doi.org/10.1109/tnnls.2023.3250437

Lyu, Bo, Shiping Wen, Yin Yang, Xiaojun Chang, Junwei Sun, Yiran Chen, and Tingwen Huang. “Designing Efficient Bit-Level Sparsity-Tolerant Memristive Networks.” IEEE Transactions on Neural Networks and Learning Systems 35, no. 9 (September 2024): 11979–88. https://doi.org/10.1109/tnnls.2023.3250437.

Lyu B, Wen S, Yang Y, Chang X, Sun J, Chen Y, et al. Designing Efficient Bit-Level Sparsity-Tolerant Memristive Networks. IEEE transactions on neural networks and learning systems. 2024 Sep;35(9):11979–88.

Lyu, Bo, et al. “Designing Efficient Bit-Level Sparsity-Tolerant Memristive Networks.” IEEE Transactions on Neural Networks and Learning Systems, vol. 35, no. 9, Sept. 2024, pp. 11979–88. Epmc, doi:10.1109/tnnls.2023.3250437.

Lyu B, Wen S, Yang Y, Chang X, Sun J, Chen Y, Huang T. Designing Efficient Bit-Level Sparsity-Tolerant Memristive Networks. IEEE transactions on neural networks and learning systems. 2024 Sep;35(9):11979–11988.

Published In

IEEE transactions on neural networks and learning systems

DOI

10.1109/tnnls.2023.3250437

EISSN

2162-2388

ISSN

2162-237X

Publication Date

September 2024

Volume

Issue

Start / End Page

11979 / 11988