Poster | Wed 15:30 | Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation | Pihe Hu · Yu Chen · Longbo Huang
Spotlight | Wed 8:40 | Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation | Pihe Hu · Yu Chen · Longbo Huang
Poster | Thu 15:00 | Learning Stochastic Shortest Path with Linear Function Approximation | Yifei Min · Jiafan He · Tianhao Wang · Quanquan Gu
Spotlight | Thu 8:40 | Learning Stochastic Shortest Path with Linear Function Approximation | Yifei Min · Jiafan He · Tianhao Wang · Quanquan Gu
Spotlight | Thu 8:30 | A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games | Wei Xiong · Han Zhong · Chengshuai Shi · Cong Shen · Tong Zhang
Poster | Thu 15:00 | A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games | Wei Xiong · Han Zhong · Chengshuai Shi · Cong Shen · Tong Zhang
Spotlight | Wed 7:50 | The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces | Chi Jin · Qinghua Liu · Tiancheng Yu
Poster | Wed 15:30 | The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces | Chi Jin · Qinghua Liu · Tiancheng Yu
Spotlight | Thu 11:50 | Lagrangian Method for Q-Function Learning (with Applications to Machine Translation) | Huang Bojun
Poster | Thu 15:00 | Lagrangian Method for Q-Function Learning (with Applications to Machine Translation) | Huang Bojun
Poster | Tue 15:30 | Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation | Xiaoyu Chen · Han Zhong · Zhuoran Yang · Zhaoran Wang · Liwei Wang
Spotlight | Tue 8:45 | Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation | Xiaoyu Chen · Han Zhong · Zhuoran Yang · Zhaoran Wang · Liwei Wang