firstbacksecondback
91 Results
Spotlight
|
Wed 11:15 |
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency Qi Cai · Zhuoran Yang · Zhaoran Wang |
|
Spotlight
|
Wed 13:40 |
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime James-Michael Leahy · Bekzhan Kerimkulov · David Siska · Lukasz Szpruch |
|
Poster
|
Thu 15:00 |
An Analytical Update Rule for General Policy Optimization Hepeng Li · Nicholas Clavette · Haibo He |
|
Poster
|
Tue 15:30 |
Interpretable Off-Policy Learning via Hyperbox Search Daniel Tschernutter · Tobias Hatt · Stefan Feuerriegel |
|
Poster
|
Wed 15:30 |
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime James-Michael Leahy · Bekzhan Kerimkulov · David Siska · Lukasz Szpruch |
|
Spotlight
|
Tue 14:40 |
Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis Ziyi Chen · Yi Zhou · Rong-Rong Chen · Shaofeng Zou |
|
Spotlight
|
Thu 13:50 |
Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL Siyi Hu · Chuanlong Xie · Xiaodan Liang · Xiaojun Chang |
|
Spotlight
|
Wed 8:55 |
Improving Policy Optimization with Generalist-Specialist Learning Zhiwei Jia · Xuanlin Li · Zhan Ling · Shuang Liu · Yiran Wu · Hao Su |
|
Spotlight
|
Wed 7:30 |
Modeling Strong and Human-Like Gameplay with KL-Regularized Search Athul Paul Jacob · David Wu · Gabriele Farina · Adam Lerer · Hengyuan Hu · Anton Bakhtin · Jacob Andreas · Noam Brown |
|
Poster
|
Wed 15:30 |
Modeling Strong and Human-Like Gameplay with KL-Regularized Search Athul Paul Jacob · David Wu · Gabriele Farina · Adam Lerer · Hengyuan Hu · Anton Bakhtin · Jacob Andreas · Noam Brown |
|
Spotlight
|
Thu 12:45 |
Safe Exploration for Efficient Policy Evaluation and Comparison Runzhe Wan · Branislav Kveton · Rui Song |
|
Oral
|
Wed 8:05 |
REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer Xingyu Liu · Deepak Pathak · Kris Kitani |