Skip to yearly menu bar Skip to main content


Poster
in
Workshop: DMLR Workshop: Data-centric Machine Learning Research

Self-supervised Autoencoder for Correlation-Preserving in Tabular GANs

Siddarth Ramesh · Surgan Jandial · Gauri Gupta · Piyush Gupta · Balaji Krishnamurthy


Abstract:

Preserving relationships and interactions between columns(or variables) is crucial for any synthetic tabular data generation approach. Despite their performances, the existing generative adversarial network (GAN)-based methods don't shed much importance on this aspect. In this work, we propose VSA+GAN, a framework that augments with the existing GANs to capture and learn inter-variable interactions with a self-supervised autoencoder trained on a novel pretext task. We show that the method is versatile and is applicable to any variation of Tabular Generative Adversarial Network implementations, and empirically show that our framework significantly improves their performance in terms of data similarity, pair-wise correlation and machine-learning utility metrics.

Chat is not available.