Skip to yearly menu bar Skip to main content


(14 events)   Timezone:  
Show all
Toggle Poster Visibility
Spotlight
Wed Jul 20 10:15 AM -- 10:20 AM (PDT) @ Room 307
Biased Gradient Estimate with Drastic Variance Reduction for Meta Reinforcement Learning
Yunhao Tang
Spotlight
Wed Jul 20 10:20 AM -- 10:25 AM (PDT) @ Room 307
Analysis of Stochastic Processes through Replay Buffers
Shirli Di-Castro Shashua · Shie Mannor · Dotan Di Castro
Spotlight
Wed Jul 20 10:25 AM -- 10:30 AM (PDT) @ Room 307
Cascaded Gaps: Towards Logarithmic Regret for Risk-Sensitive Reinforcement Learning
Yingjie Fei · Ruitu Xu
Spotlight
Wed Jul 20 10:30 AM -- 10:35 AM (PDT) @ Room 307
Communicating via Markov Decision Processes
Samuel Sokota · Christian Schroeder · Maximilian Igl · Luisa Zintgraf · Phil Torr · Martin Strohmeier · Zico Kolter · Shimon Whiteson · Jakob Foerster
Spotlight
Wed Jul 20 10:35 AM -- 10:40 AM (PDT) @ Room 307
PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation
Matilde Gargiani · Andrea Zanelli · Andrea Martinelli · Tyler Summers · John Lygeros
Spotlight
Wed Jul 20 10:40 AM -- 10:45 AM (PDT) @ Room 307
DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning
Hassam Sheikh · Kizza Nandyose Frisbee · mariano phielipp
Oral
Wed Jul 20 10:45 AM -- 11:05 AM (PDT) @ Room 307
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner · Yilun Du · Josh Tenenbaum · Sergey Levine
Spotlight
Wed Jul 20 11:05 AM -- 11:10 AM (PDT) @ Room 307
A Temporal-Difference Approach to Policy Gradient Estimation
Samuele Tosatto · Andrew Patterson · Martha White · A. Mahmood
Spotlight
Wed Jul 20 11:10 AM -- 11:15 AM (PDT) @ Room 307
MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer
JEON JEEWON · WOOJUN KIM · Whiyoung Jung · Youngchul Sung
Spotlight
Wed Jul 20 11:15 AM -- 11:20 AM (PDT) @ Room 307
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency
Qi Cai · Zhuoran Yang · Zhaoran Wang
Spotlight
Wed Jul 20 11:20 AM -- 11:25 AM (PDT) @ Room 307
Actor-Critic based Improper Reinforcement Learning
Mohammadi Zaki · Avi Mohan · Aditya Gopalan · Shie Mannor
Spotlight
Wed Jul 20 11:25 AM -- 11:30 AM (PDT) @ Room 307
On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs
Yuanzhou Chen · Jiafan He · Quanquan Gu
Spotlight
Wed Jul 20 11:30 AM -- 11:35 AM (PDT) @ Room 307
The Geometry of Robust Value Functions
Kaixin Wang · Navdeep Kumar · Kuangqi Zhou · Bryan Hooi · Jiashi Feng · Shie Mannor
Spotlight
Wed Jul 20 11:35 AM -- 11:40 AM (PDT) @ Room 307
Denoised MDPs: Learning World Models Better Than the World Itself
Tongzhou Wang · Simon Du · Antonio Torralba · Phillip Isola · Amy Zhang · Yuandong Tian