Timezone: »
We study a variety of notions of equivariance as an inductive bias in Reinforcement Learning (RL). In particular, we propose new mechanisms for learning representations that are equivariant to both the agent’s action, as well as symmetry transformations of the state-action pairs. Whereas prior work on exploiting symmetries in deep RL can only incorporate predefined linear transformations, our approach allows non-linear symmetry transformations of state-action pairs to be learned from the data. This is achieved through 1) equivariant Lie algebraic parameterization of state and action encodings, 2) equivariant latent transition models, and 3) the incorporation of symmetry-based losses. We demonstrate the advantages of our method, which we call Equivariant representations for RL (EqR), for Atari games in a data-efficient setting limited to 100K steps of interactions with the environment.
Author Information
Arnab Kumar Mondal (Mcgill University)

I am a third-year Ph.D. candidate in Computer Science at Mcgill University and Mila - Quebec Artificial Intelligence Institute, jointly supervised by Prof. Siamak Ravanbakhsh and Prof. Kaleem Siddiqi. My primary areas of interest include equivariant representation learning, self-supervised learning and reinforcement learning. Before moving to Montreal, I did my undergraduate studies in Electronics and Electrical Communication Engineering from the Indian Institute of Technology, Kharagpur. I have worked on VLSI engineering, Robotics, Computer Vision, Embedded systems, and Free-form Lens design during my undergrad. When I am not working, I like to spend time in nature, stargaze with my close friends and talk about life. I love mountains and lakes and have been on a few Himalayan treks. Next on my list is Banff National Park.
Vineet Jain (McGill University, Mila)
Kaleem Siddiqi (McGill University)
Siamak Ravanbakhsh (McGill - Mila)
Related Events (a corresponding poster, oral, or spotlight)
-
2022 Poster: EqR: Equivariant Representations for Data-Efficient Reinforcement Learning »
Thu. Jul 21st through Fri the 22nd Room Hall E #1026
More from the Same Authors
-
2022 : Galaxies on graph neural networks: towards robust synthetic galaxy catalogs with deep generative models »
Yesukhei Jagvaral · Rachel Mandelbaum · Francois Lanusse · Siamak Ravanbakhsh · Sukhdeep Singh · Duncan Campbell -
2022 Poster: Utility Theory for Sequential Decision Making »
Mehran Shakerinava · Siamak Ravanbakhsh -
2022 Spotlight: Utility Theory for Sequential Decision Making »
Mehran Shakerinava · Siamak Ravanbakhsh -
2021 Poster: Equivariant Networks for Pixelized Spheres »
Mehran Shakerinava · Siamak Ravanbakhsh -
2021 Spotlight: Equivariant Networks for Pixelized Spheres »
Mehran Shakerinava · Siamak Ravanbakhsh -
2020 Poster: Universal Equivariant Multilayer Perceptrons »
Siamak Ravanbakhsh -
2017 Poster: Equivariance Through Parameter-Sharing »
Siamak Ravanbakhsh · Jeff Schneider · Barnabás Póczos -
2017 Talk: Equivariance Through Parameter-Sharing »
Siamak Ravanbakhsh · Jeff Schneider · Barnabás Póczos