Poster | Tue 7:00 | Optimizing for the Future in Non-Stationary MDPs | Yash Chandak · Georgios Theocharous · Shiv Shankar · Martha White · Sridhar Mahadevan · Philip Thomas
Poster | Tue 7:00 | Private Reinforcement Learning with PAC and Regret Guarantees | Giuseppe Vietri · Borja de Balle Pigem · Akshay Krishnamurthy · Steven Wu
Poster | Tue 7:00 | Asynchronous Coagent Networks | James Kostas · Chris Nota · Philip Thomas
Poster | Tue 7:00 | Evaluating the Performance of Reinforcement Learning Algorithms | Scott Jordan · Yash Chandak · Daniel Cohen · Mengxue Zhang · Philip Thomas
Poster | Tue 8:00 | FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis | Aman Sinha · Matthew O'Kelly · Hongrui Zheng · Rahul Mangharam · John Duchi · Russ Tedrake
Poster | Tue 8:00 | Batch Reinforcement Learning with Hyperparameter Gradients | Byung-Jun Lee · Jongmin Lee · Peter Vrancx · Dongho Kim · Kee-Eung Kim
Poster | Tue 8:00 | Discount Factor as a Regularizer in Reinforcement Learning | Ron Amit · Ron Meir · Kamil Ciosek
Poster | Tue 8:00 | GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values | Shangtong Zhang · Bo Liu · Shimon Whiteson
Poster | Tue 8:00 | Representations for Stable Off-Policy Reinforcement Learning | Dibya Ghosh · Marc Bellemare
Poster | Tue 9:00 | Tightening Exploration in Upper Confidence Reinforcement Learning | Hippolyte Bourel · Odalric-Ambrym Maillard · Mohammad Sadegh Talebi
Poster | Tue 9:00 | Lookahead-Bounded Q-learning | Ibrahim El Shar · Daniel Jiang
Poster | Tue 9:00 | Structured Policy Iteration for Linear Quadratic Regulator | Youngsuk Park · Ryan A. Rossi · Zheng Wen · Gang Wu · Handong Zhao