Spotlight
|
Wed 13:40
|
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime
James-Michael Leahy · Bekzhan Kerimkulov · David Siska · Lukasz Szpruch
|
|
Poster
|
Wed 15:30
|
Expression might be enough: representing pressure and demand for reinforcement learning based traffic signal control
Liang Zhang · Qiang Wu · Jun Shen · Linyuan Lü · Bo Du · Jianqing Wu
|
|
Poster
|
Thu 15:00
|
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation)
Huang Bojun
|
|
Poster
|
Wed 15:30
|
The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces
Chi Jin · Qinghua Liu · Tiancheng Yu
|
|
Spotlight
|
Thu 11:45
|
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Ruiqi Zhang · Xuezhou Zhang · Chengzhuo Ni · Mengdi Wang
|
|
Poster
|
Thu 15:00
|
Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes
Hongyi Guo · Qi Cai · Yufeng Zhang · Zhuoran Yang · Zhaoran Wang
|
|
Poster
|
Wed 15:30
|
Cascaded Gaps: Towards Logarithmic Regret for Risk-Sensitive Reinforcement Learning
Yingjie Fei · Ruitu Xu
|
|
Poster
|
Wed 15:30
|
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime
James-Michael Leahy · Bekzhan Kerimkulov · David Siska · Lukasz Szpruch
|
|
Spotlight
|
Wed 13:30
|
Improved Regret for Differentially Private Exploration in Linear MDP
Dung Ngo · Giuseppe Vietri · Steven Wu
|
|
Poster
|
Wed 15:30
|
Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation
Aivar Sootla · Alexander I Cowen-Rivers · Taher Jafferjee · Ziyan Wang · David Mguni · Jun Wang · Haitham Bou Ammar
|
|
Poster
|
Thu 15:00
|
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Ruiqi Zhang · Xuezhou Zhang · Chengzhuo Ni · Mengdi Wang
|
|
Spotlight
|
Wed 14:35
|
Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods
Yi Wan · Ali Rahimi-Kalahroudi · Janarthanan Rajendran · Ida Momennejad · Sarath Chandar · Harm van Seijen
|
|