firstbacksecondback
539 Results
Poster
|
Wed 15:30 |
Goal Misgeneralization in Deep Reinforcement Learning Lauro Langosco di Langosco · Jack Koch · Lee Sharkey · Jacob Pfau · David Krueger |
|
Poster
|
Thu 15:00 |
Retrieval-Augmented Reinforcement Learning Anirudh Goyal · Abe Friesen Friesen · Andrea Banino · Theophane Weber · Nan Rosemary Ke · Adrià Puigdomenech Badia · Arthur Guez · Mehdi Mirza · Peter Humphreys · Ksenia Konyushkova · Michal Valko · Simon Osindero · Timothy Lillicrap · Nicolas Heess · Charles Blundell |
|
Spotlight
|
Thu 13:40 |
Retrieval-Augmented Reinforcement Learning Anirudh Goyal · Abe Friesen Friesen · Andrea Banino · Theophane Weber · Nan Rosemary Ke · Adrià Puigdomenech Badia · Arthur Guez · Mehdi Mirza · Peter Humphreys · Ksenia Konyushkova · Michal Valko · Simon Osindero · Timothy Lillicrap · Nicolas Heess · Charles Blundell |
|
Oral
|
Tue 10:30 |
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution Vihang Patil · Markus Hofmarcher · Marius-Constantin Dinu · Matthias Dorfer · Patrick Blies · Johannes Brandstetter · Jose A. Arjona-Medina · Sepp Hochreiter |
|
Poster
|
Tue 15:30 |
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution Vihang Patil · Markus Hofmarcher · Marius-Constantin Dinu · Matthias Dorfer · Patrick Blies · Johannes Brandstetter · Jose A. Arjona-Medina · Sepp Hochreiter |
|
Spotlight
|
Thu 8:20 |
Dataset Condensation via Efficient Synthetic-Data Parameterization Jang-Hyun Kim · Jinuk Kim · Seong Joon Oh · Sangdoo Yun · Hwanjun Song · Joonhyun Jeong · Jung-Woo Ha · Hyun Oh Song |
|
Poster
|
Thu 15:00 |
Dataset Condensation via Efficient Synthetic-Data Parameterization Jang-Hyun Kim · Jinuk Kim · Seong Joon Oh · Sangdoo Yun · Hwanjun Song · Joonhyun Jeong · Jung-Woo Ha · Hyun Oh Song |
|
Poster
|
Wed 15:30 |
History Compression via Language Models in Reinforcement Learning Fabian Paischer · Thomas Adler · Vihang Patil · Angela Bitto-Nemling · Markus Holzleitner · Sebastian Lehner · Hamid Eghbal-zadeh · Sepp Hochreiter |
|
Spotlight
|
Wed 8:00 |
History Compression via Language Models in Reinforcement Learning Fabian Paischer · Thomas Adler · Vihang Patil · Angela Bitto-Nemling · Markus Holzleitner · Sebastian Lehner · Hamid Eghbal-zadeh · Sepp Hochreiter |
|
Poster
|
Thu 15:00 |
General-purpose, long-context autoregressive modeling with Perceiver AR Curtis Hawthorne · Andrew Jaegle · Cătălina Cangea · Sebastian Borgeaud · Charlie Nash · Mateusz Malinowski · Sander Dieleman · Oriol Vinyals · Matthew Botvinick · Ian Simon · Hannah Sheahan · Neil Zeghidour · Jean-Baptiste Alayrac · Joao Carreira · Jesse Engel |
|
Spotlight
|
Thu 8:40 |
General-purpose, long-context autoregressive modeling with Perceiver AR Curtis Hawthorne · Andrew Jaegle · Cătălina Cangea · Sebastian Borgeaud · Charlie Nash · Mateusz Malinowski · Sander Dieleman · Oriol Vinyals · Matthew Botvinick · Ian Simon · Hannah Sheahan · Neil Zeghidour · Jean-Baptiste Alayrac · Joao Carreira · Jesse Engel |