Exploration of Automatic Mixed-Precision Search for Deep Neural Networks
Neural networks have shown great performance in cognitive tasks. When deploying network models on mobile devices with limited computation and storage resources, weight quantization has been widely adopted. In practice, 8-bit or 16-bit quantization is most likely to be selected in order to maintain accuracy at the same level as models in 32-bit floating-point precision. Binary quantization, on the contrary, aims for the highest compression at the cost of a much larger accuracy drop. Applying different precisions to different layers/structures can potentially produce the most efficient model. Finding the best precision configuration, however, is difficult. In this work, we propose an automatic search algorithm to address this challenge. By relaxing the search space of quantization bitwidths from the discrete to the continuous domain, our algorithm can generate a mixed-precision quantization scheme that achieves a compression rate close to that of a binary-weight model while maintaining test accuracy similar to that of the original full-precision model.
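To make the core idea concrete, the sketch below shows one common way such a continuous relaxation can be realized: each layer keeps a softmax-weighted mixture of its weights quantized at several candidate bitwidths, so the bitwidth choice becomes differentiable. This is a minimal illustration only; the class and function names (MixedPrecisionConv2d, quantize, candidate_bits, alpha) and the specific quantizer and relaxation are assumptions for exposition, not the algorithm defined in this paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def quantize(w, bits):
    """Uniform symmetric quantization of a weight tensor to `bits` bits,
    with a straight-through estimator so gradients still reach `w`."""
    if bits >= 32:
        return w
    if bits == 1:
        q = w.sign() * w.abs().mean()          # binary weights (BinaryConnect-style)
    else:
        qmax = 2 ** (bits - 1) - 1
        scale = w.abs().max() / qmax
        q = torch.round(w / scale).clamp(-qmax - 1, qmax) * scale
    return w + (q - w).detach()                # straight-through estimator

class MixedPrecisionConv2d(nn.Module):
    """Conv layer whose effective weight is a softmax-weighted mixture of the
    same underlying weight quantized at several candidate bitwidths."""
    def __init__(self, in_ch, out_ch, kernel_size, candidate_bits=(1, 2, 4, 8)):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size, padding=kernel_size // 2)
        self.candidate_bits = candidate_bits
        # One learnable architecture parameter per candidate bitwidth.
        self.alpha = nn.Parameter(torch.zeros(len(candidate_bits)))

    def forward(self, x):
        probs = F.softmax(self.alpha, dim=0)   # continuous relaxation of the bitwidth choice
        w = self.conv.weight
        mixed_w = sum(p * quantize(w, b) for p, b in zip(probs, self.candidate_bits))
        return F.conv2d(x, mixed_w, self.conv.bias,
                        stride=self.conv.stride, padding=self.conv.padding)

    def chosen_bits(self):
        """After the search, commit the layer to its highest-scoring bitwidth."""
        return self.candidate_bits[int(torch.argmax(self.alpha))]
```

In a full search of this kind, the alpha parameters of all layers would typically be trained jointly with the network weights (possibly under a model-size penalty), after which each layer is fixed to its argmax bitwidth and the resulting mixed-precision model is fine-tuned; the exact objective and training schedule used by the proposed algorithm are described in the paper itself.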