Timezone: »
Cross-domain alignment between two sets of entities (e.g., objects in an image, words in a sentence) is fundamental to both computer vision and natural language processing. Existing methods mainly focus on designing advanced attention mechanisms to simulate soft alignment, where no training signals are provided to explicitly encourage alignment. Plus, the learned attention matrices are often dense and difficult to interpret. We propose Graph Optimal Transport (GOT), a principled framework that builds upon recent advances in Optimal Transport (OT). In GOT, cross-domain alignment is formulated as a graph matching problem, by representing entities as a dynamically-constructed graph. Two types of OT distances are considered: (i) Wasserstein distance (WD) for node (entity) matching; and (ii) Gromov-Wasserstein distance (GWD) for edge (structure) matching. Both WD and GWD can be incorporated into existing neural network models, effectively acting as a drop-in regularizer.
The inferred transport plan also yields sparse and self-normalized alignment, enhancing the interpretability of the learned model. Experiments show consistent outperformance of GOT over baselines across a wide range of tasks, including image-text retrieval, visual question answering, image captioning, machine translation, and text summarization.
Author Information
Liqun Chen (Duke University)
Zhe Gan (Microsoft)
Yu Cheng (Microsoft)
Linjie Li (Microsoft)
Lawrence Carin (Duke)
Jingjing Liu (Microsoft)
More from the Same Authors
-
2021 : Hölder Bounds for Sensitivity Analysis in Causal Reasoning »
Serge Assaad · Shuxi Zeng · Henry Pfister · Fan Li · Lawrence Carin -
2020 Poster: Learning Autoencoders with Relational Regularization »
Hongteng Xu · Dixin Luo · Ricardo Henao · Svati Shah · Lawrence Carin -
2020 Poster: On Leveraging Pretrained GANs for Generation with Limited Data »
Miaoyun Zhao · Yulai Cong · Lawrence Carin -
2020 Poster: CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information »
Pengyu Cheng · Weituo Hao · Shuyang Dai · Jiachang Liu · Zhe Gan · Lawrence Carin -
2019 Poster: Gromov-Wasserstein Learning for Graph Matching and Node Embedding »
Hongteng Xu · Dixin Luo · Hongyuan Zha · Lawrence Carin -
2019 Oral: Gromov-Wasserstein Learning for Graph Matching and Node Embedding »
Hongteng Xu · Dixin Luo · Hongyuan Zha · Lawrence Carin -
2019 Poster: Stochastic Blockmodels meet Graph Neural Networks »
Nikhil Mehta · Lawrence Carin · Piyush Rai -
2019 Poster: Variational Annealing of GANs: A Langevin Perspective »
Chenyang Tao · Shuyang Dai · Liqun Chen · Ke Bai · Junya Chen · Chang Liu · RUIYI (ROY) ZHANG · Georgiy Bobashev · Lawrence Carin -
2019 Oral: Stochastic Blockmodels meet Graph Neural Networks »
Nikhil Mehta · Lawrence Carin · Piyush Rai -
2019 Oral: Variational Annealing of GANs: A Langevin Perspective »
Chenyang Tao · Shuyang Dai · Liqun Chen · Ke Bai · Junya Chen · Chang Liu · RUIYI (ROY) ZHANG · Georgiy Bobashev · Lawrence Carin -
2018 Poster: Learning Registered Point Processes from Idiosyncratic Observations »
Hongteng Xu · Lawrence Carin · Hongyuan Zha -
2018 Poster: Policy Optimization as Wasserstein Gradient Flows »
RUIYI (ROY) ZHANG · Changyou Chen · Chunyuan Li · Lawrence Carin -
2018 Poster: JointGAN: Multi-Domain Joint Distribution Learning with Generative Adversarial Nets »
Yunchen Pu · Shuyang Dai · Zhe Gan · Weiyao Wang · Guoyin Wang · Yizhe Zhang · Ricardo Henao · Lawrence Carin -
2018 Oral: Policy Optimization as Wasserstein Gradient Flows »
RUIYI (ROY) ZHANG · Changyou Chen · Chunyuan Li · Lawrence Carin -
2018 Oral: JointGAN: Multi-Domain Joint Distribution Learning with Generative Adversarial Nets »
Yunchen Pu · Shuyang Dai · Zhe Gan · Weiyao Wang · Guoyin Wang · Yizhe Zhang · Ricardo Henao · Lawrence Carin -
2018 Oral: Learning Registered Point Processes from Idiosyncratic Observations »
Hongteng Xu · Lawrence Carin · Hongyuan Zha -
2018 Poster: Adversarial Time-to-Event Modeling »
Paidamoyo Chapfuwa · Chenyang Tao · Chunyuan Li · Courtney Page · Benjamin Goldstein · Lawrence Carin · Ricardo Henao -
2018 Oral: Adversarial Time-to-Event Modeling »
Paidamoyo Chapfuwa · Chenyang Tao · Chunyuan Li · Courtney Page · Benjamin Goldstein · Lawrence Carin · Ricardo Henao -
2018 Poster: Continuous-Time Flows for Efficient Inference and Density Estimation »
Changyou Chen · Chunyuan Li · Liquan Chen · Wenlin Wang · Yunchen Pu · Lawrence Carin -
2018 Poster: Chi-square Generative Adversarial Network »
Chenyang Tao · Liqun Chen · Ricardo Henao · Jianfeng Feng · Lawrence Carin -
2018 Poster: Variational Inference and Model Selection with Generalized Evidence Bounds »
Liqun Chen · Chenyang Tao · RUIYI (ROY) ZHANG · Ricardo Henao · Lawrence Carin -
2018 Oral: Chi-square Generative Adversarial Network »
Chenyang Tao · Liqun Chen · Ricardo Henao · Jianfeng Feng · Lawrence Carin -
2018 Oral: Continuous-Time Flows for Efficient Inference and Density Estimation »
Changyou Chen · Chunyuan Li · Liquan Chen · Wenlin Wang · Yunchen Pu · Lawrence Carin -
2018 Oral: Variational Inference and Model Selection with Generalized Evidence Bounds »
Liqun Chen · Chenyang Tao · RUIYI (ROY) ZHANG · Ricardo Henao · Lawrence Carin -
2017 Poster: Stochastic Gradient Monomial Gamma Sampler »
Yizhe Zhang · Changyou Chen · Zhe Gan · Ricardo Henao · Lawrence Carin -
2017 Poster: Adversarial Feature Matching for Text Generation »
Yizhe Zhang · Zhe Gan · Kai Fan · Zhi Chen · Ricardo Henao · Dinghan Shen · Lawrence Carin -
2017 Talk: Adversarial Feature Matching for Text Generation »
Yizhe Zhang · Zhe Gan · Kai Fan · Zhi Chen · Ricardo Henao · Dinghan Shen · Lawrence Carin -
2017 Talk: Stochastic Gradient Monomial Gamma Sampler »
Yizhe Zhang · Changyou Chen · Zhe Gan · Ricardo Henao · Lawrence Carin -
2017 Poster: Deep Generative Models for Relational Data with Side Information »
Changwei Hu · Piyush Rai · Lawrence Carin -
2017 Talk: Deep Generative Models for Relational Data with Side Information »
Changwei Hu · Piyush Rai · Lawrence Carin