Timezone: »
Scaling up deep Reinforcement Learning (RL) agents beyond traditional benchmarks, without abundant computational resources, presents a significant challenge. Following recent developments in generative modelling, model-based RL positions itself as a strong contender to bring autonomous agents to new heights. In fact, the recently introduced IRIS agent provides evidence that advances in sequence modelling can be leveraged to build powerful world models. In the present work, we propose delta-IRIS, a new agent with a world model architecture that is amenable to scaling up to visually complex environments with longer time horizons. In the Crafter benchmark, delta-IRIS solves 16 out of 21 tasks after 10M frames of training, matching the current best method, DreamerV3. To facilitate research on efficient world models, we release our code at X.
Author Information
Eloi Alonso (University of Geneva)
Vincent Micheli (Microsoft Research UNIGE)
François Fleuret (University of Geneva)
More from the Same Authors
-
2023 : 🎤 Fast Causal Attention with Dynamic Sparsity »
Daniele Paliotta · Matteo Pagliardini · Martin Jaggi · François Fleuret -
2023 : DeepEMD: A Transformer-based Fast Estimation of the Earth Mover's Distance »
Atul Kumar Sinha · François Fleuret -
2023 Poster: Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models »
Nikolaos Dimitriadis · Pascal Frossard · François Fleuret -
2020 Poster: Optimizer Benchmarking Needs to Account for Hyperparameter Tuning »
Prabhu Teja Sivaprasad · Florian Mai · Thijs Vogels · Martin Jaggi · François Fleuret -
2020 Poster: Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention »
Angelos Katharopoulos · Apoorv Vyas · Nikolaos Pappas · François Fleuret