Enhanced LSTM with batch normalization

Publication, Chapter
Wang, LN; Zhong, G; Yan, S; Dong, J; Huang, K
January 1, 2019

Recurrent neural networks (RNNs) are powerful models for sequence learning. However, training RNNs is complicated by the internal covariate shift problem, in which the input distribution changes at each training iteration as the parameters are updated. Although some work has applied batch normalization (BN) to alleviate this problem in long short-term memory (LSTM), BN has not been applied to the update of the LSTM cell. In this paper, to tackle the internal covariate shift problem of LSTM, we introduce a method that integrates BN into the update of the LSTM cell. Experimental results on two benchmark data sets, MNIST and Fashion-MNIST, show that the proposed method, enhanced LSTM with BN (eLSTM-BN), converges faster than LSTM and its variants while obtaining higher classification accuracy on sequence learning tasks.
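To make the idea in the abstract concrete, here is a minimal sketch of a single LSTM step in which batch normalization is also applied to the updated cell state. The abstract does not say exactly where eLSTM-BN places BN, so the placement below is an assumption, and the names `batch_norm`, `lstm_bn_step`, and `normalize_cell` are hypothetical. It uses only the Python standard library, batch statistics only (no running averages), and fixed scale and shift parameters.

```python
import math
import random

def batch_norm(rows, gamma=1.0, beta=0.0, eps=1e-5):
    """Normalize each feature (column) to zero mean and unit variance across the batch (rows)."""
    n = len(rows)
    cols = list(zip(*rows))
    means = [sum(c) / n for c in cols]
    variances = [sum((v - m) ** 2 for v in c) / n for c, m in zip(cols, means)]
    return [[gamma * (v - m) / math.sqrt(s + eps) + beta
             for v, m, s in zip(row, means, variances)] for row in rows]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def matmul(a, b):
    """Plain list-of-lists matrix product: a is [B][K], b is [K][N]."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def lstm_bn_step(x, h_prev, c_prev, W, U, b, normalize_cell=True):
    """One LSTM step on a batch.

    x: [B][D], h_prev and c_prev: [B][H], W: [D][4H], U: [H][4H], b: [4H].
    Gate order inside the 4H axis is input, forget, output, candidate.
    """
    H = len(h_prev[0])
    # Illustrative choice: normalize the input and recurrent projections separately.
    wx = batch_norm(matmul(x, W))
    uh = batch_norm(matmul(h_prev, U))
    pre = [[p + q + bias for p, q, bias in zip(r1, r2, b)] for r1, r2 in zip(wx, uh)]

    c_new, out_gates = [], []
    for row, c_row in zip(pre, c_prev):
        i = [sigmoid(v) for v in row[0:H]]
        f = [sigmoid(v) for v in row[H:2 * H]]
        o = [sigmoid(v) for v in row[2 * H:3 * H]]
        g = [math.tanh(v) for v in row[3 * H:4 * H]]
        # Standard LSTM cell update: c_t = f * c_{t-1} + i * g
        c_new.append([fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c_row, i, g)])
        out_gates.append(o)

    if normalize_cell:
        # ASSUMPTION: BN applied directly to the updated cell state.
        # The abstract does not specify the exact placement used by eLSTM-BN.
        c_new = batch_norm(c_new)

    h_new = [[ov * math.tanh(cv) for ov, cv in zip(o_row, c_row)]
             for o_row, c_row in zip(out_gates, c_new)]
    return h_new, c_new

# Usage on a toy batch: 4 sequences, 3 input features, 2 hidden units.
random.seed(0)
B, D, H = 4, 3, 2
x = [[random.gauss(0, 1) for _ in range(D)] for _ in range(B)]
h0 = [[0.0] * H for _ in range(B)]
c0 = [[0.0] * H for _ in range(B)]
W = [[random.gauss(0, 0.1) for _ in range(4 * H)] for _ in range(D)]
U = [[random.gauss(0, 0.1) for _ in range(4 * H)] for _ in range(H)]
b = [0.0] * (4 * H)
h1, c1 = lstm_bn_step(x, h0, c0, W, U, b)
```

With `normalize_cell=True`, each column of `c1` has approximately zero mean across the batch, which is the property BN is meant to provide against shifting distributions. Setting `normalize_cell=False` leaves the cell update un-normalized, the situation the abstract describes for earlier BN-based LSTM variants.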

Duke Scholars

DOI

10.1007/978-3-030-36708-4_61

Publication Date

January 1, 2019

Volume

11953 LNCS

Start / End Page

746 / 755

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences

Citation

APA

Wang, L. N., Zhong, G., Yan, S., Dong, J., & Huang, K. (2019). Enhanced LSTM with batch normalization (Vol. 11953 LNCS, pp. 746–755). https://doi.org/10.1007/978-3-030-36708-4_61

Chicago

Wang, L. N., G. Zhong, S. Yan, J. Dong, and K. Huang. “Enhanced LSTM with batch normalization,” 11953 LNCS:746–55, 2019. https://doi.org/10.1007/978-3-030-36708-4_61.

ICMJE

Wang LN, Zhong G, Yan S, Dong J, Huang K. Enhanced LSTM with batch normalization. In 2019. p. 746–55.

MLA

Wang, L. N., et al. Enhanced LSTM with batch normalization. Vol. 11953 LNCS, 2019, pp. 746–55. Scopus, doi:10.1007/978-3-030-36708-4_61.

NLM

Wang LN, Zhong G, Yan S, Dong J, Huang K. Enhanced LSTM with batch normalization. 2019. p. 746–755.
