Skip to main content

Exploration of Automatic Mixed-Precision Search for Deep Neural Networks

Publication ,  Conference
Guo, X; Huang, Y; Cheng, HP; Li, B; Wen, W; Ma, S; Li, H; Chen, Y
Published in: Proceedings 2019 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2019
March 1, 2019

Neural networks have shown great performance in cognitive tasks. When deploying network models on mobile devices with limited computation and storage resources, the weight quantization technique has been widely adopted. In practice, 8-bit or 16-bit quantization is mostly likely to be selected in order to maintain the accuracy at the same level as the models in 32-bit floating-point precision. Binary quantization, on the contrary, aims to obtain the highest compression at the cost of much bigger accuracy drop. Applying different precision in different layers/structures can potentially produce the most efficient model. Seeking for the best precision configuration, however, is difficult. In this work, we proposed an automatic search algorithm to address the challenge. By relaxing the search space of quantization bitwidth from discrete to continuous domain, our algorithm can generate a mixed-precision quantization scheme which achieves the compression rate close to the one from the binary-weighted model while maintaining the testing accuracy similar to the original full-precision model.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Proceedings 2019 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2019

DOI

ISBN

9781538678848

Publication Date

March 1, 2019

Start / End Page

276 / 278
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Guo, X., Huang, Y., Cheng, H. P., Li, B., Wen, W., Ma, S., … Chen, Y. (2019). Exploration of Automatic Mixed-Precision Search for Deep Neural Networks. In Proceedings 2019 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2019 (pp. 276–278). https://doi.org/10.1109/AICAS.2019.8771498
Guo, X., Y. Huang, H. P. Cheng, B. Li, W. Wen, S. Ma, H. Li, and Y. Chen. “Exploration of Automatic Mixed-Precision Search for Deep Neural Networks.” In Proceedings 2019 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2019, 276–78, 2019. https://doi.org/10.1109/AICAS.2019.8771498.
Guo X, Huang Y, Cheng HP, Li B, Wen W, Ma S, et al. Exploration of Automatic Mixed-Precision Search for Deep Neural Networks. In: Proceedings 2019 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2019. 2019. p. 276–8.
Guo, X., et al. “Exploration of Automatic Mixed-Precision Search for Deep Neural Networks.” Proceedings 2019 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2019, 2019, pp. 276–78. Scopus, doi:10.1109/AICAS.2019.8771498.
Guo X, Huang Y, Cheng HP, Li B, Wen W, Ma S, Li H, Chen Y. Exploration of Automatic Mixed-Precision Search for Deep Neural Networks. Proceedings 2019 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2019. 2019. p. 276–278.

Published In

Proceedings 2019 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2019

DOI

ISBN

9781538678848

Publication Date

March 1, 2019

Start / End Page

276 / 278