Timezone: »

 
Counterfactual Generation with Identifiability Guarantees
Hanqi Yan · Lingjing Kong · Lin Gui · Yuejie Chi · Eric Xing · Yulan He · Kun Zhang

Counterfactual generation requires the identification of the disentangled latent representations, such as content and style, that underlie the observed data. Existing unsupervised methods crucially rely on oversimplified assumptions, such as assuming independent content and style variables, to identify the latent variables, even though such assumptions may not hold for complex data distributions. This problem is exacerbated when data are sampled from multiple domains, as required by prior work, since the dependence between content and style may vary significantly over domains. In this work, we tackle the dependence between the content and the style variables inherent in the counterfactual generation task. We show identification guarantees by leveraging the relative sparsity of the influences from different latent variables. Our theoretical insights enable the development of a doMain AdapTive conTrollable text gEneration model, called MATTE. It achieves state-of-art performance in unsupervised controllable text generation tasks on large-scale datasets.

Author Information

Hanqi Yan (University of Warwick)
Hanqi Yan

Hanqi Yan (颜寒祺) is a Ph.D. student (2020.10-) under the supervision of Prof. Yulan He, doing Natural Language Processing, focusing on interpretable and robust NLP models.

Lingjing Kong (Carnegie Mellon University)
Lin Gui (Department of Computer Science, University of Warwick, Coventry CV4 7AL, U.K.)
Yuejie Chi (CMU)
Eric Xing (Petuum Inc. and CMU)
Yulan He (King's College London)
Kun Zhang (Carnegie Mellon University)

More from the Same Authors