firstbacksecondback
91 Results
Spotlight
|
Wed 13:55 |
Generalizing Gaussian Smoothing for Random Search Katelyn Gao · Ozan Sener |
|
Spotlight
|
Wed 14:40 |
A Natural Actor-Critic Framework for Zero-Sum Markov Games Ahmet Alacaoglu · Luca Viano · Niao He · Volkan Cevher |
|
Spotlight
|
Wed 14:00 |
Constrained Offline Policy Optimization Nicholas Polosky · Bruno C. da Silva · Madalina Fiterau · Jithin Jagannath |
|
Spotlight
|
Tue 11:45 |
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms Jeongyeol Kwon · Yonathan Efroni · Constantine Caramanis · Shie Mannor |
|
Poster
|
Wed 15:30 |
Generalizing Gaussian Smoothing for Random Search Katelyn Gao · Ozan Sener |
|
Spotlight
|
Wed 7:45 |
Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models and Amortized Policy Search Qi Wang · Herke van Hoof |
|
Poster
|
Wed 15:30 |
A Natural Actor-Critic Framework for Zero-Sum Markov Games Ahmet Alacaoglu · Luca Viano · Niao He · Volkan Cevher |
|
Spotlight
|
Wed 10:15 |
Biased Gradient Estimate with Drastic Variance Reduction for Meta Reinforcement Learning Yunhao Tang |
|
Poster
|
Wed 15:30 |
Constrained Offline Policy Optimization Nicholas Polosky · Bruno C. da Silva · Madalina Fiterau · Jithin Jagannath |
|
Spotlight
|
Thu 11:00 |
Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach Shuang Wu · Ling Shi · Jun Wang · Guangjian Tian |
|
Spotlight
|
Wed 11:05 |
A Temporal-Difference Approach to Policy Gradient Estimation Samuele Tosatto · Andrew Patterson · Martha White · A. Mahmood |
|
Spotlight
|
Wed 8:00 |
History Compression via Language Models in Reinforcement Learning Fabian Paischer · Thomas Adler · Vihang Patil · Angela Bitto-Nemling · Markus Holzleitner · Sebastian Lehner · Hamid Eghbal-zadeh · Sepp Hochreiter |