Spotlight | Wed 19:20 | Combinatorial Blocking Bandits with Stochastic Delays | Alexia Atsidakou · Orestis Papadigenopoulos · Soumya Basu · Constantine Caramanis · Sanjay Shakkottai
Oral | Wed 7:00 | High-dimensional Experimental Design and Kernel Bandits | Romain Camilleri · Kevin Jamieson · Julian Katz-Samuels
Poster | Wed 21:00 | Improved Regret Bounds of Bilinear Bandits using Action Space Analysis | Kyoungseok Jang · Kwang-Sung Jun · Se-Young Yun · Wanmo Kang
Spotlight | Wed 5:35 | Characterizing the Gap Between Actor-Critic and Policy Gradient | Junfeng Wen · Saurabh Kumar · Ramki Gummadi · Dale Schuurmans
Poster | Tue 9:00 | Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration | Seungyul Han · Youngchul Sung
Spotlight | Tue 18:45 | Emphatic Algorithms for Deep Reinforcement Learning | Ray Jiang · Tom Zahavy · Zhongwen Xu · Adam White · Matteo Hessel · Charles Blundell · Hado van Hasselt
Poster | Tue 9:00 | Inverse Constrained Reinforcement Learning | Shehryar Malik · Usman Anwar · Alireza Aghasi · Ali Ahmed
Poster | Thu 9:00 | Detecting Rewards Deterioration in Episodic Reinforcement Learning | Ido Greenberg · Shie Mannor
Poster | Tue 21:00 | Recomposing the Reinforcement Learning Building Blocks with Hypernetworks | Elad Sarafian · Shai Keynan · Sarit Kraus
Poster | Tue 9:00 | EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | Seyed Kamyar Seyed Ghasemipour · Dale Schuurmans · Shixiang Gu
Spotlight | Wed 5:25 | Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices | Evan Liu · Aditi Raghunathan · Percy Liang · Chelsea Finn
Spotlight | Tue 19:35 | Generalizable Episodic Memory for Deep Reinforcement Learning | Hao Hu · Jianing Ye · Guangxiang Zhu · Zhizhou Ren · Chongjie Zhang