Poster
|
Thu 15:00
|
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach
Andrew Wagenmaker · Yifang Chen · Max Simchowitz · Simon Du · Kevin Jamieson
|
|
Oral
|
Thu 7:30
|
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach
Andrew Wagenmaker · Yifang Chen · Max Simchowitz · Simon Du · Kevin Jamieson
|
|
Spotlight
|
Thu 11:45
|
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Ruiqi Zhang · Xuezhou Zhang · Chengzhuo Ni · Mengdi Wang
|
|
Poster
|
Thu 15:00
|
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Ruiqi Zhang · Xuezhou Zhang · Chengzhuo Ni · Mengdi Wang
|
|
Spotlight
|
Tue 11:05
|
Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems
Miguel Suau · Jinke He · Matthijs T. J. Spaan · Frans Oliehoek
|
|
Poster
|
Tue 15:30
|
Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems
Miguel Suau · Jinke He · Matthijs T. J. Spaan · Frans Oliehoek
|
|
Spotlight
|
Wed 8:25
|
LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation
David Ireland · Giovanni Montana
|
|
Poster
|
Thu 15:00
|
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Ching-An Cheng · Tengyang Xie · Nan Jiang · Alekh Agarwal
|
|
Oral
|
Thu 11:15
|
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Ching-An Cheng · Tengyang Xie · Nan Jiang · Alekh Agarwal
|
|
Poster
|
Wed 15:30
|
LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation
David Ireland · Giovanni Montana
|
|
Poster
|
Wed 15:30
|
Benefits of Overparameterized Convolutional Residual Networks: Function Approximation under Smoothness Constraint
Hao Liu · Minshuo Chen · Siawpeng Er · Wenjing Liao · Tong Zhang · Tuo Zhao
|
|
Spotlight
|
Wed 8:45
|
Benefits of Overparameterized Convolutional Residual Networks: Function Approximation under Smoothness Constraint
Hao Liu · Minshuo Chen · Siawpeng Er · Wenjing Liao · Tong Zhang · Tuo Zhao
|
|