396 Results
Poster | Thu 7:00 | ConQUR: Mitigating Delusional Bias in Deep Q-Learning DiJia Su · Jayden Ooi · Tyler Lu · Dale Schuurmans · Craig Boutilier
Affinity Workshop | Mon 11:35 | Breakout Session 4.3: Coping with Sample Inefficiency of Deep-Reinforcement Learning (DRL) for Embodied AI
Workshop | | Meta-Reinforcement Learning for Robotic Industrial Insertion Tasks Gerrit Schoettler
Poster | Tue 7:00 | From Importance Sampling to Doubly Robust Policy Gradient Jiawei Huang · Nan Jiang
Poster | Thu 6:00 | Bandits for BMO Functions Tianyu Wang · Cynthia Rudin
Workshop | Fri 6:05 | Invited Talk: Christoph H. Lampert "Learning Theory for Continual and Meta-Learning" Christoph H. Lampert
Poster | Tue 7:00 | Asynchronous Coagent Networks James Kostas · Chris Nota · Philip Thomas
Poster | Wed 5:00 | What can I do here? A Theory of Affordances in Reinforcement Learning Khimya Khetarpal · Zafarali Ahmed · Gheorghe Comanici · David Abel · Doina Precup
Poster | Thu 8:00 | AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation Jae Hyun Lim · Aaron Courville · Christopher Pal · Chin-Wei Huang
Poster | Tue 9:00 | Lookahead-Bounded Q-learning Ibrahim El Shar · Daniel Jiang
Poster | Tue 7:00 | Provably Efficient Exploration in Policy Optimization Qi Cai · Zhuoran Yang · Chi Jin · Zhaoran Wang
Poster | Thu 14:00 | Multi-Agent Determinantal Q-Learning Yaodong Yang · Ying Wen · Jun Wang · Liheng Chen · Kun Shao · David Mguni · Weinan Zhang