firstbacksecondback
44 Results
Spotlight
|
Wed 14:45 |
Distributionally Robust QQ-Learning Zijian Liu · Jerry Bai · Jose Blanchet · Perry Dong · Wei Xu · Zhengqing Zhou · Zhengyuan Zhou |
|
Poster
|
Wed 15:30 |
Distributionally Robust QQ-Learning Zijian Liu · Jerry Bai · Jose Blanchet · Perry Dong · Wei Xu · Zhengqing Zhou · Zhengyuan Zhou |
|
Spotlight
|
Wed 14:40 |
A Natural Actor-Critic Framework for Zero-Sum Markov Games Ahmet Alacaoglu · Luca Viano · Niao He · Volkan Cevher |
|
Spotlight
|
Wed 11:20 |
Actor-Critic based Improper Reinforcement Learning Mohammadi Zaki · Avi Mohan · Aditya Gopalan · Shie Mannor |
|
Poster
|
Wed 15:30 |
A Natural Actor-Critic Framework for Zero-Sum Markov Games Ahmet Alacaoglu · Luca Viano · Niao He · Volkan Cevher |
|
Poster
|
Wed 15:30 |
Actor-Critic based Improper Reinforcement Learning Mohammadi Zaki · Avi Mohan · Aditya Gopalan · Shie Mannor |
|
Spotlight
|
Tue 8:20 |
Learning Infinite-horizon Average-reward Markov Decision Process with Constraints Liyu Chen · Rahul Jain · Haipeng Luo |
|
Spotlight
|
Wed 11:25 |
On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs Yuanzhou Chen · Jiafan He · Quanquan Gu |
|
Spotlight
|
Tue 14:40 |
Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis Ziyi Chen · Yi Zhou · Rong-Rong Chen · Shaofeng Zou |
|
Poster
|
Wed 15:30 |
On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs Yuanzhou Chen · Jiafan He · Quanquan Gu |
|
Poster
|
Tue 15:30 |
Learning Infinite-horizon Average-reward Markov Decision Process with Constraints Liyu Chen · Rahul Jain · Haipeng Luo |
|
Poster
|
Tue 15:30 |
Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis Ziyi Chen · Yi Zhou · Rong-Rong Chen · Shaofeng Zou |