Scholars@Duke publication: Sequence generation with optimal-transport-enhanced reinforcement learning

Sequence generation with optimal-transport-enhanced reinforcement learning

Publication , Conference

Chen, L; Bai, K; Tao, C; Zhang, Y; Wang, G; Wang, W; Henao, R; Carin, L

Published in: Aaai 2020 34th Aaai Conference on Artificial Intelligence

January 1, 2020

Reinforcement learning (RL) has been widely used to aid training in language generation. This is achieved by enhancing standard maximum likelihood objectives with user-specified reward functions that encourage global semantic consistency. We propose a principled approach to address the difficulties associated with RL-based solutions, namely, high-variance gradients, uninformative rewards and brittle training. By leveraging the optimal transport distance, we introduce a regularizer that significantly alleviates the above issues. Our formulation emphasizes the preservation of semantic features, enabling end-to-end training instead of ad-hoc fine-tuning, and when combined with RL, it controls the exploration space for more efficient model updates. To validate the effectiveness of the proposed solution, we perform a comprehensive evaluation covering a wide variety of NLP tasks: machine translation, abstractive text summarization and image caption, with consistent improvements over competing solutions.

Duke Scholars

Author Ricardo Henao Biostatistics & Bioinformatics, Division of Translational Bi ...

Author Lawrence Carin Electrical and Computer Engineering

Published In

Aaai 2020 34th Aaai Conference on Artificial Intelligence

Publication Date

January 1, 2020

Start / End Page

7512 / 7520

Citation

APA

Chicago

ICMJE

MLA

NLM

Chen, L., Bai, K., Tao, C., Zhang, Y., Wang, G., Wang, W., … Carin, L. (2020). Sequence generation with optimal-transport-enhanced reinforcement learning. In Aaai 2020 34th Aaai Conference on Artificial Intelligence (pp. 7512–7520).

Chen, L., K. Bai, C. Tao, Y. Zhang, G. Wang, W. Wang, R. Henao, and L. Carin. “Sequence generation with optimal-transport-enhanced reinforcement learning.” In Aaai 2020 34th Aaai Conference on Artificial Intelligence, 7512–20, 2020.

Chen L, Bai K, Tao C, Zhang Y, Wang G, Wang W, et al. Sequence generation with optimal-transport-enhanced reinforcement learning. In: Aaai 2020 34th Aaai Conference on Artificial Intelligence. 2020. p. 7512–20.

Chen, L., et al. “Sequence generation with optimal-transport-enhanced reinforcement learning.” Aaai 2020 34th Aaai Conference on Artificial Intelligence, 2020, pp. 7512–20.

Chen L, Bai K, Tao C, Zhang Y, Wang G, Wang W, Henao R, Carin L. Sequence generation with optimal-transport-enhanced reinforcement learning. Aaai 2020 34th Aaai Conference on Artificial Intelligence. 2020. p. 7512–7520.

Published In

Aaai 2020 34th Aaai Conference on Artificial Intelligence

Publication Date

January 1, 2020

Start / End Page

7512 / 7520