Capsule networks aim to parse images into a hierarchy of objects, parts, and relations. While promising, they remain limited by an inability to learn effective low-level part descriptions. To address this issue, we propose a way to learn primary capsule encoders that detect atomic parts from a single image. During training, we exploit motion as a powerful perceptual cue for part definition, using an expressive decoder for part generation within a layered image model with occlusion. Experiments demonstrate robust part discovery in the presence of multiple objects, cluttered backgrounds, and occlusion. The learned part decoder is shown to infer the underlying shape masks, effectively filling in occluded regions of the detected shapes. We evaluate the resulting approach, FlowCapsules, on unsupervised part segmentation and unsupervised image classification.
Author Information
Sara Sabour Rouh Aghdam (Google)
Andrea Tagliasacchi (Google Inc.)
Soroosh Yazdani (Google Inc.)
Geoffrey Hinton (Google)
David Fleet (University of Toronto)
Related Events (a corresponding poster, oral, or spotlight)
- 2021 Poster: Unsupervised Part Representation by Flow Capsules
  Fri. Jul 23rd 04:00 -- 06:00 AM
More from the Same Authors
- 2023 Poster: Scalable Adaptive Computation for Iterative Generation
  Allan Jabri · David Fleet · Ting Chen
- 2020 Poster: Imputer: Sequence Modelling via Imputation and Dynamic Programming
  William Chan · Chitwan Saharia · Geoffrey Hinton · Mohammad Norouzi · Navdeep Jaitly
- 2020 Poster: A Simple Framework for Contrastive Learning of Visual Representations
  Ting Chen · Simon Kornblith · Mohammad Norouzi · Geoffrey Hinton
- 2019 Poster: Similarity of Neural Network Representations Revisited
  Simon Kornblith · Mohammad Norouzi · Honglak Lee · Geoffrey Hinton
- 2019 Poster: Analyzing and Improving Representations with the Soft Nearest Neighbor Loss
  Nicholas Frosst · Nicolas Papernot · Geoffrey Hinton
- 2019 Oral: Similarity of Neural Network Representations Revisited
  Simon Kornblith · Mohammad Norouzi · Honglak Lee · Geoffrey Hinton
- 2019 Oral: Analyzing and Improving Representations with the Soft Nearest Neighbor Loss
  Nicholas Frosst · Nicolas Papernot · Geoffrey Hinton