Timezone: »
The ability to continuously acquire new knowledge and skills is crucial for autonomous agents. However, existing methods are typically based on either fixed-size models that cannot capture many diverse behaviors, or growing-size models that scale poorly with the number of tasks. In this paper, we introduce Continual Subspace of Policies (CSP), a method that iteratively learns a subspace of policies in the continual reinforcement learning setting where tasks are presented sequentially. The subspace's high expressivity allows our method to strike a good balance between stability (i.e. not forgetting prior tasks) and plasticity (i.e. learning new tasks), while the number of parameters grows sublinearly with the number of tasks. In addition, CSP displays good transfer, being able to quickly adapt to new tasks including combinations of previously seen ones without additional training. Finally, CSP outperforms state-of-the-art methods on a wide range of scenarios in two different domains. An interactive visualization of the subspace can be found at https://share.streamlit.io/continual-subspace/policies/main.
Author Information
Jean-Baptiste Gaya (Facebook)
Thang Doan (McGill University)
Lucas Caccia (McGIll)
Laure Soulier (Sorbonne Université)
Ludovic Denoyer (Facebook)
Roberta Raileanu (FAIR / Meta)
More from the Same Authors
-
2021 Workshop: Theory and Foundation of Continual Learning »
Thang Doan · Bogdan Mazoure · Amal Rannen Triki · Rahaf Aljundi · Vincenzo Lomonaco · Xu He · Arslan Chaudhry -
2020 Poster: Online Learned Continual Compression with Adaptive Quantization Modules »
Lucas Caccia · Eugene Belilovsky · Massimo Caccia · Joelle Pineau -
2019 Poster: Multi-objective training of Generative Adversarial Networks with multiple discriminators »
Isabela Albuquerque · Joao Monteiro · Thang Doan · Breandan Considine · Tiago Falk · Ioannis Mitliagkas -
2019 Oral: Multi-objective training of Generative Adversarial Networks with multiple discriminators »
Isabela Albuquerque · Joao Monteiro · Thang Doan · Breandan Considine · Tiago Falk · Ioannis Mitliagkas -
2019 Poster: Context-Aware Zero-Shot Learning for Object Recognition »
Eloi Zablocki · Patrick Bordes · Laure Soulier · Benjamin Piwowarski · Patrick Gallinari -
2019 Oral: Context-Aware Zero-Shot Learning for Object Recognition »
Eloi Zablocki · Patrick Bordes · Laure Soulier · Benjamin Piwowarski · Patrick Gallinari