Scholars@Duke publication: Graph optimal transport for cross-domain alignment

Graph optimal transport for cross-domain alignment

Publication , Conference

Chen, L; Gan, Z; Cheng, Y; Li, L; Carin, L; Liu, J

Published in: 37th International Conference on Machine Learning, ICML 2020

January 1, 2020

Cross-domain alignment between two sets of entities (e.g., objects in an image, words in a sentence) is fundamental to both computer vision and natural language processing. Existing methods mainly focus on designing advanced attention mechanisms to simulate soft alignment, with no training signals to explicitly encourage alignment. The learned attention matrices are also dense and lacks interpretability. We propose Graph Optimal Transport (GOT), a principled framework that germinates from recent advances in Optimal Transport (OT). In GOT, cross-domain alignment is formulated as a graph matching problem, by representing entities into a dynamically-constructed graph. Two types of OT distances are considered: (i) Wasserstein distance (WD) for node (entity) matching; and (ii) Gromov-Wasserstein distance (GWD) for edge (structure) matching. Both WD and GWD can be incorporated into existing neural network models, effectively acting as a dropin regularizer. The inferred transport plan also yields sparse and self-normalized alignment, enhancing the interpretability of the learned model. Experiments show consistent outperformance of GOT over baselines across a wide range of tasks, including image-text retrieval, visual question answering, image captioning, machine translation, and text summarization.

Duke Scholars

Author Lawrence Carin Electrical and Computer Engineering

Published In

37th International Conference on Machine Learning, ICML 2020

Publication Date

January 1, 2020

Volume

PartF168147-2

Start / End Page

1520 / 1531

Citation

APA

Chicago

ICMJE

MLA

NLM

Chen, L., Gan, Z., Cheng, Y., Li, L., Carin, L., & Liu, J. (2020). Graph optimal transport for cross-domain alignment. In 37th International Conference on Machine Learning, ICML 2020 (Vol. PartF168147-2, pp. 1520–1531).

Chen, L., Z. Gan, Y. Cheng, L. Li, L. Carin, and J. Liu. “Graph optimal transport for cross-domain alignment.” In 37th International Conference on Machine Learning, ICML 2020, PartF168147-2:1520–31, 2020.

Chen L, Gan Z, Cheng Y, Li L, Carin L, Liu J. Graph optimal transport for cross-domain alignment. In: 37th International Conference on Machine Learning, ICML 2020. 2020. p. 1520–31.

Chen, L., et al. “Graph optimal transport for cross-domain alignment.” 37th International Conference on Machine Learning, ICML 2020, vol. PartF168147-2, 2020, pp. 1520–31.

Chen L, Gan Z, Cheng Y, Li L, Carin L, Liu J. Graph optimal transport for cross-domain alignment. 37th International Conference on Machine Learning, ICML 2020. 2020. p. 1520–1531.

Published In

37th International Conference on Machine Learning, ICML 2020

Publication Date

January 1, 2020

Volume

PartF168147-2

Start / End Page

1520 / 1531