Poster | Wed 21:00 | Randomized Exploration in Reinforcement Learning with General Value Function Approximation | Haque Ishfaq · Qiwen Cui · Viet Nguyen · Alex Ayoub · Zhuoran Yang · Zhaoran Wang · Doina Precup · Lin Yang
Spotlight | Wed 18:30 | Randomized Exploration in Reinforcement Learning with General Value Function Approximation | Haque Ishfaq · Qiwen Cui · Viet Nguyen · Alex Ayoub · Zhuoran Yang · Zhaoran Wang · Doina Precup · Lin Yang
Poster | Wed 21:00 | Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity | Dhruv Malik · Aldo Pacchiano · Vishwak Srinivasan · Yuanzhi Li
Poster | Thu 9:00 | PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models | Chaoyang He · Shen Li · Mahdi Soltanolkotabi · Salman Avestimehr
Spotlight | Thu 6:35 | PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models | Chaoyang He · Shen Li · Mahdi Soltanolkotabi · Salman Avestimehr
Spotlight | Wed 17:30 | Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity | Dhruv Malik · Aldo Pacchiano · Vishwak Srinivasan · Yuanzhi Li
Poster | Wed 9:00 | Bilinear Classes: A Structural Framework for Provable Generalization in RL | Simon Du · Sham Kakade · Jason Lee · Shachar Lovett · Gaurav Mahajan · Wen Sun · Ruosong Wang
Oral | Wed 6:00 | Bilinear Classes: A Structural Framework for Provable Generalization in RL | Simon Du · Sham Kakade · Jason Lee · Shachar Lovett · Gaurav Mahajan · Wen Sun · Ruosong Wang
Spotlight | Wed 17:35 | Leveraging Non-uniformity in First-order Non-convex Optimization | Jincheng Mei · Yue Gao · Bo Dai · Csaba Szepesvari · Dale Schuurmans
Oral | Wed 17:00 | UCB Momentum Q-learning: Correcting the bias without forgetting | Pierre Menard · Omar Darwiche Domingues · Xuedong Shang · Michal Valko
Poster | Wed 21:00 | UCB Momentum Q-learning: Correcting the bias without forgetting | Pierre Menard · Omar Darwiche Domingues · Xuedong Shang · Michal Valko
Poster | Wed 21:00 | Leveraging Non-uniformity in First-order Non-convex Optimization | Jincheng Mei · Yue Gao · Bo Dai · Csaba Szepesvari · Dale Schuurmans