A New Large-Scale RNNLM System Based on Distributed Neurons
A Recurrent Neural Network Language Model (RNNLM) retains the history of the training data in its previous hidden layer and feeds that state back as input during training, and it has become an active topic in Natural Language Processing research. However, its immense training time is a serious problem. The large output layer, hidden layer, previous hidden layer, and the connections among them produce enormous matrices during training, which is the main factor limiting efficiency and scalability. At the same time, common remedies such as class-based output layers and small hidden layers reduce the accuracy of the RNNLM. In general, the lack of parallelism among artificial neurons is the root cause of these problems. We therefore restructure the RNNLM and design a new large-scale RNNLM centered on distributed artificial neurons in the hidden layer, emulating the parallelism of biological neural systems. We also modify the training method and present a coordination strategy for the distributed neurons. Finally, we implement a prototype of the new large-scale RNNLM system on Spark. Testing and analysis show that the training time grows far more slowly than the number of distributed neurons in the hidden layer and the size of the training dataset, which indicates that our large-scale RNNLM system has efficiency and scalability advantages.
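As background for the size argument (a standard Mikolov-style formulation, not stated explicitly in the abstract; the notation below is ours), the RNNLM forward pass is

\[
s(t) = f\big(U\,w(t) + W\,s(t-1)\big), \qquad y(t) = g\big(V\,s(t)\big),
\]

where w(t) is the one-hot input word, s(t) the hidden state, y(t) the output distribution, f the sigmoid, and g the softmax. With vocabulary size |V_w| and hidden size H, U is H x |V_w|, W is H x H, and V is |V_w| x H, so the vocabulary-sized input and output matrices dominate the per-step training cost, which is what motivates distributing the hidden-layer neurons.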