Timezone: »
In this paper, we propose the Self-Attention Generative Adversarial Network (SAGAN) which allows attention-driven, long-range dependency modeling for image generation tasks. Traditional convolutional GANs generate high-resolution details as a function of only spatially local points in lower-resolution feature maps. In SAGAN, details can be generated using cues from all feature locations. Moreover, the discriminator can check that highly detailed features in distant portions of the image are consistent with each other. Furthermore, recent work has shown that generator conditioning affects GAN performance. Leveraging this insight, we apply spectral normalization to the GAN generator and find that this improves training dynamics. The proposed SAGAN performs better than prior work, boosting the best published Inception score from 36.8 to 52.52 and reducing Fr\'echet Inception distance from 27.62 to 18.65 on the challenging ImageNet dataset. Visualization of the attention layers shows that the generator leverages neighborhoods that correspond to object shapes rather than local regions of fixed shape.
Author Information
Han Zhang (Google)
Ian Goodfellow (Google Brain)
Dimitris Metaxas (Rutgers)
Augustus Odena (Google Brain)
Related Events (a corresponding poster, oral, or spotlight)
-
2019 Poster: Self-Attention Generative Adversarial Networks »
Wed. Jun 12th 01:30 -- 04:00 AM Room Pacific Ballroom #11
More from the Same Authors
-
2020 Poster: Error-Bounded Correction of Noisy Labels »
Songzhu Zheng · Pengxiang Wu · Aman Goswami · Mayank Goswami · Dimitris Metaxas · Chao Chen -
2020 Poster: Small-GAN: Speeding up GAN Training using Core-Sets »
Samrath Sinha · Han Zhang · Anirudh Goyal · Yoshua Bengio · Hugo Larochelle · Augustus Odena -
2019 Poster: TensorFuzz: Debugging Neural Networks with Coverage-Guided Fuzzing »
Augustus Odena · Catherine Olsson · David Andersen · Ian Goodfellow -
2019 Oral: TensorFuzz: Debugging Neural Networks with Coverage-Guided Fuzzing »
Augustus Odena · Catherine Olsson · David Andersen · Ian Goodfellow -
2019 Poster: Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition »
Yao Qin · Nicholas Carlini · Garrison Cottrell · Ian Goodfellow · Colin Raffel -
2019 Oral: Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition »
Yao Qin · Nicholas Carlini · Garrison Cottrell · Ian Goodfellow · Colin Raffel -
2018 Poster: Is Generator Conditioning Causally Related to GAN Performance? »
Augustus Odena · Jacob Buckman · Catherine Olsson · Tom B Brown · Christopher Olah · Colin Raffel · Ian Goodfellow -
2018 Oral: Is Generator Conditioning Causally Related to GAN Performance? »
Augustus Odena · Jacob Buckman · Catherine Olsson · Tom B Brown · Christopher Olah · Colin Raffel · Ian Goodfellow -
2017 Poster: Dual Iterative Hard Thresholding: From Non-convex Sparse Minimization to Non-smooth Concave Maximization »
Bo Liu · Xiaotong Yuan · Lezi Wang · Qingshan Liu · Dimitris Metaxas -
2017 Talk: Dual Iterative Hard Thresholding: From Non-convex Sparse Minimization to Non-smooth Concave Maximization »
Bo Liu · Xiaotong Yuan · Lezi Wang · Qingshan Liu · Dimitris Metaxas -
2017 Poster: Conditional Image Synthesis with Auxiliary Classifier GANs »
Augustus Odena · Christopher Olah · Jon Shlens -
2017 Talk: Conditional Image Synthesis with Auxiliary Classifier GANs »
Augustus Odena · Christopher Olah · Jon Shlens