Scholars@Duke publication: MeDNN: A distributed mobile system with enhanced partition and deployment for large-scale DNNs

MeDNN: A distributed mobile system with enhanced partition and deployment for large-scale DNNs

Publication , Conference

Mao, J; Yang, Z; Wen, W; Wu, C; Song, L; Nixon, KW; Chen, X; Li, H; Chen, Y

Published in: IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD

December 13, 2017

Deep Neural Networks (DNNs) are pervasively used in a significant number of applications and platforms. To enhance the execution efficiency of large-scale DNNs, previous attempts focus mainly on client-server paradigms, relying on powerful external infrastructure, or model compression, with complicated pre-processing phases. Though effective, these methods overlook the optimization of DNNs on distributed mobile devices. In this work, we design and implement MeDNN, a local distributed mobile computing system with enhanced partitioning and deployment tailored for large-scale DNNs. In MeDNN, we first propose Greedy Two Dimensional Partition (GTDP), which can adaptively partition DNN models onto several mobile devices w.r.t. individual resource constraints. We also propose Structured Model Compact Deployment (SMCD), a mobile-friendly compression scheme which utilizes a structured sparsity pruning technique to further accelerate DNN execution. Experimental results show that, GTDP can accelerate the original DNN execution time by 1.86-2.44x with 2-4 worker nodes. By utilizing SMCD, 26.5% of additional computing time and 14.2% of extra communication time are saved, on average, with negligible effect on the model accuracy.

Duke Scholars

Author Hai "Helen" Li Electrical and Computer Engineering

Author Yiran Chen Electrical and Computer Engineering

Published In

IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD

DOI

10.1109/ICCAD.2017.8203852

ISSN

1092-3152

Publication Date

December 13, 2017

Volume

2017-November

Start / End Page

751 / 756

Citation

APA

Chicago

ICMJE

MLA

NLM

Mao, J., Yang, Z., Wen, W., Wu, C., Song, L., Nixon, K. W., … Chen, Y. (2017). MeDNN: A distributed mobile system with enhanced partition and deployment for large-scale DNNs. In IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD (Vol. 2017-November, pp. 751–756). https://doi.org/10.1109/ICCAD.2017.8203852

Mao, J., Z. Yang, W. Wen, C. Wu, L. Song, K. W. Nixon, X. Chen, H. Li, and Y. Chen. “MeDNN: A distributed mobile system with enhanced partition and deployment for large-scale DNNs.” In IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD, 2017-November:751–56, 2017. https://doi.org/10.1109/ICCAD.2017.8203852.

Mao J, Yang Z, Wen W, Wu C, Song L, Nixon KW, et al. MeDNN: A distributed mobile system with enhanced partition and deployment for large-scale DNNs. In: IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD. 2017. p. 751–6.

Mao, J., et al. “MeDNN: A distributed mobile system with enhanced partition and deployment for large-scale DNNs.” IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD, vol. 2017-November, 2017, pp. 751–56. Scopus, doi:10.1109/ICCAD.2017.8203852.

Mao J, Yang Z, Wen W, Wu C, Song L, Nixon KW, Chen X, Li H, Chen Y. MeDNN: A distributed mobile system with enhanced partition and deployment for large-scale DNNs. IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD. 2017. p. 751–756.

Published In

IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD

DOI

10.1109/ICCAD.2017.8203852

ISSN

1092-3152

Publication Date

December 13, 2017

Volume

2017-November

Start / End Page

751 / 756