Theoretical Foundations of Reinforcement Learning

Workshop

Theoretical Foundations of Reinforcement Learning

Emma Brunskill · Thodoris Lykouris · Max Simchowitz · Wen Sun · Mengdi Wang

Fri 17 Jul, 6:30 a.m. PDT

Keywords: Bandits Representation Learning Reinforcement Learning sample-efficient exploration policy gradient safety in RL human-in-the-loop RL mutli-agent RL off-policy learning

[ Abstract ] Workshop Website

In many settings such as education, healthcare, drug design, robotics, transportation, and achieving better-than-human performance in strategic games, it is important to make decisions sequentially. This poses two interconnected algorithmic and statistical challenges: effectively exploring to learn information about the underlying dynamics and effectively planning using this information. Reinforcement Learning (RL) is the main paradigm tackling both of these challenges simultaneously which is essential in the aforementioned applications. Over the last years, reinforcement learning has seen enormous progress both in solidifying our understanding on its theoretical underpinnings and in applying these methods in practice.

This workshop aims to highlight recent theoretical contributions, with an emphasis on addressing significant challenges on the road ahead. Such theoretical understanding is important in order to design algorithms that have robust and compelling performance in real-world applications. As part of the ICML 2020 conference, this workshop will be held virtually. It will feature keynote talks from six reinforcement learning experts tackling different significant facets of RL. It will also offer the opportunity for contributed material (see below the call for papers and our outstanding program committee). The authors of each accepted paper will prerecord a 10-minute presentation and will also appear in a poster session. Finally, the workshop will have a panel discussing important challenges in the road ahead.

Chat is not available.

Timezone: America/Los_Angeles

Schedule

Fri 6:30 a.m. - 7:15 a.m.	Exploration, Policy Gradient Methods, and the Deadly Triad - Sham Kakade ( Talk ) >	Sham Kakade 🔗
Fri 7:20 a.m. - 8:05 a.m.	A Unifying View of Optimism in Episodic Reinforcement Learning - Gergely Neu ( Talk ) >	Gergely Neu 🔗
Fri 8:10 a.m. - 9:25 a.m.	Poster Session 1 ( Poster Session ) >	🔗
Fri 9:30 a.m. - 10:25 a.m.	Speaker Panel ( Panel ) >	Csaba Szepesvari · Martha White · Sham Kakade · Gergely Neu · Shipra Agrawal · Akshay Krishnamurthy 🔗
Fri 10:30 a.m. - 11:15 a.m.	An Off-policy Policy Gradient Theorem: A Tale About Weightings - Martha White ( Talk ) > SlidesLive Video	Martha White 🔗
Fri 11:20 a.m. - 11:35 a.m.	Short Talk 1 - Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality ( Talk ) >	Kwang-Sung Jun 🔗
Fri 11:35 a.m. - 11:50 a.m.	Short Talk 2 - Adaptive Discretization for Model-Based Reinforcement Learning ( Talk ) >	Sean R. Sinclair 🔗
Fri 11:50 a.m. - 12:05 p.m.	Short Talk 3 - A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces ( Talk ) >	Omar Darwiche Domingues 🔗
Fri 12:05 p.m. - 12:20 p.m.	Short Talk 4 - Adaptive Regret for Online Control ( Talk ) >	Edgar Minasyan 🔗
Fri 12:20 p.m. - 12:35 p.m.	Short Talk 5 - Near-Optimal Reinforcement Learning with Self-Play ( Talk ) >	Tiancheng Yu 🔗
Fri 12:35 p.m. - 12:50 p.m.	Short Talk 6 - Preference learning along multiple criteria: A game-theoretic perspective ( Talk ) >	Kush Bhatia 🔗
Fri 1:00 p.m. - 2:15 p.m.	Poster Session 2 ( Poster Session ) >	🔗
Fri 2:20 p.m. - 3:05 p.m.	Representation learning and exploration in reinforcement learning - Akshay Krishnamurthy ( Talk ) > SlidesLive Video	Akshay Krishnamurthy 🔗
Fri 3:10 p.m. - 3:55 p.m.	Learning to price under the Bass model for dynamic demand - Shipra Agrawal ( Talk ) >	Shipra Agrawal 🔗
Fri 4:00 p.m. - 4:45 p.m.	Efficient Planning in Large MDPs with Weak Linear Function Approximation - Csaba Szepesvari ( Talk ) >	Csaba Szepesvari 🔗