How to obtain and run lightweight and efficient deep learning networks
As the model size of deep neural networks (DNNs) grows in pursuit of better performance, the increasing computational cost of training and inference makes it extremely difficult to deploy DNNs on end/edge devices with limited resources while still satisfying response-time requirements. To address this challenge, model compression, which reduces model size and thus computational cost, is widely adopted in the deep learning community. However, these algorithm-level solutions often ignore practical hardware considerations, such as the increase in random accesses to the memory hierarchy and the constraints of memory capacity. Conversely, limited understanding of the computational needs at the algorithm level may lead to unrealistic assumptions in hardware design. In this work, we discuss this mismatch and show how our approach addresses it through an interactive design process spanning both the software and hardware levels.
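As a concrete illustration of the kind of algorithm-level compression referred to above, the sketch below applies magnitude-based weight pruning with PyTorch's built-in pruning utilities. The network architecture and the 50% sparsity level are illustrative assumptions, not values from this work; note that the unstructured sparsity it produces is precisely the case where hardware effects such as irregular memory accesses become significant.

```python
# Minimal sketch of magnitude-based weight pruning (one common form of model
# compression). Layer sizes and the 50% sparsity ratio are illustrative only.
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

model = nn.Sequential(
    nn.Linear(784, 256),
    nn.ReLU(),
    nn.Linear(256, 10),
)

# Zero out the 50% of weights with the smallest L1 magnitude in each linear layer.
for module in model.modules():
    if isinstance(module, nn.Linear):
        prune.l1_unstructured(module, name="weight", amount=0.5)
        prune.remove(module, "weight")  # make the induced sparsity permanent

total = sum(p.numel() for p in model.parameters())
nonzero = sum(int(p.count_nonzero()) for p in model.parameters())
print(f"Nonzero parameters: {nonzero}/{total}")
```

The parameter count drops roughly in half, but whether this translates into faster inference depends on how well the target hardware exploits unstructured sparsity, which is the software/hardware interaction this work examines.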