Timezone: »
Inspired by progress in unsupervised representation learning for natural language, we examine whether similar models can learn useful representations for images. We train a sequence Transformer to auto-regressively predict pixels, without incorporating knowledge of the 2D input structure. Despite training on low-resolution ImageNet without labels, we find that a GPT-2 scale model learns strong image representations as measured by linear probing, fine-tuning, and low-data classification. On CIFAR-10, we achieve 96.3% accuracy with a linear probe, outperforming a supervised Wide ResNet, and 99.0% accuracy with full fine-tuning, matching the top supervised pre-trained models. We are also competitive with self-supervised benchmarks on ImageNet when substituting pixels for a VQVAE encoding, achieving 69.0% top-1 accuracy on a linear probe of our features.
Author Information
Mark Chen (OpenAI)
Alec Radford (OpenAI)
Rewon Child (OpenAI)
Jeffrey K Wu (OpenAI)
Heewoo Jun (OpenAI)
David Luan (OpenAI)
Ilya Sutskever (OpenAI)
More from the Same Authors
-
2022 Poster: GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models »
Alexander Nichol · Prafulla Dhariwal · Aditya Ramesh · Pranav Shyam · Pamela Mishkin · Bob McGrew · Ilya Sutskever · Mark Chen -
2022 Spotlight: GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models »
Alexander Nichol · Prafulla Dhariwal · Aditya Ramesh · Pranav Shyam · Pamela Mishkin · Bob McGrew · Ilya Sutskever · Mark Chen -
2021 : Some Thoughts on Generalization, Robustness, and their application with CLIP »
Alec Radford -
2021 Poster: Learning Transferable Visual Models From Natural Language Supervision »
Alec Radford · Jong Wook Kim · Chris Hallacy · Aditya Ramesh · Gabriel Goh · Sandhini Agarwal · Girish Sastry · Amanda Askell · Pamela Mishkin · Jack Clark · Gretchen Krueger · Ilya Sutskever -
2021 Oral: Learning Transferable Visual Models From Natural Language Supervision »
Alec Radford · Jong Wook Kim · Chris Hallacy · Aditya Ramesh · Gabriel Goh · Sandhini Agarwal · Girish Sastry · Amanda Askell · Pamela Mishkin · Jack Clark · Gretchen Krueger · Ilya Sutskever -
2021 Poster: Zero-Shot Text-to-Image Generation »
Aditya Ramesh · Mikhail Pavlov · Gabriel Goh · Scott Gray · Chelsea Voss · Alec Radford · Mark Chen · Ilya Sutskever -
2021 Spotlight: Zero-Shot Text-to-Image Generation »
Aditya Ramesh · Mikhail Pavlov · Gabriel Goh · Scott Gray · Chelsea Voss · Alec Radford · Mark Chen · Ilya Sutskever -
2020 Poster: Distribution Augmentation for Generative Modeling »
Heewoo Jun · Rewon Child · Mark Chen · John Schulman · Aditya Ramesh · Alec Radford · Ilya Sutskever -
2019 Workshop: Workshop on Self-Supervised Learning »
Aaron van den Oord · Yusuf Aytar · Carl Doersch · Carl Vondrick · Alec Radford · Pierre Sermanet · Amir Zamir · Pieter Abbeel