64 Results
Type | Time | Title | Authors
Poster | Wed 15:30 | Actor-Critic based Improper Reinforcement Learning | Mohammadi Zaki · Avi Mohan · Aditya Gopalan · Shie Mannor
Spotlight | Wed 8:25 | LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation | David Ireland · Giovanni Montana
Poster | Thu 15:00 | Temporal Difference Learning for Model Predictive Control | Nicklas Hansen · Hao Su · Xiaolong Wang
Poster | Thu 15:00 | Thresholded Lasso Bandit | Kaito Ariu · Kenshi Abe · Alexandre Proutiere
Poster | Wed 15:30 | On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs | Yuanzhou Chen · Jiafan He · Quanquan Gu
Spotlight | Thu 10:50 | Offline Meta-Reinforcement Learning with Online Self-Supervision | Vitchyr Pong · Ashvin Nair · Laura Smith · Catherine Huang · Sergey Levine
Spotlight | Tue 14:20 | Greedy when Sure and Conservative when Uncertain about the Opponents | Haobo Fu · Ye Tian · Hongxiang Yu · Weiming Liu · Shuang Wu · Jiechao Xiong · Ying Wen · Kai Li · Junliang Xing · Qiang Fu · Wei Yang
Spotlight | Thu 8:30 | Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost | Dan Qiao · Ming Yin · Ming Min · Yu-Xiang Wang
Poster | Thu 15:00 | Safe Exploration for Efficient Policy Evaluation and Comparison | Runzhe Wan · Branislav Kveton · Rui Song
Spotlight | Wed 11:05 | A Temporal-Difference Approach to Policy Gradient Estimation | Samuele Tosatto · Andrew Patterson · Martha White · A. Mahmood
Spotlight | Tue 14:40 | Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis | Ziyi Chen · Yi Zhou · Rong-Rong Chen · Shaofeng Zou
Poster | Thu 15:00 | Branching Reinforcement Learning | Yihan Du · Wei Chen