Spotlight
Tue Jul 19 07:30 AM -- 07:35 AM (PDT) @ Hall F
Dynamic Regret of Online Markov Decision Processes
Spotlight
Tue Jul 19 07:35 AM -- 07:40 AM (PDT) @ Hall F
On the Impossibility of Learning to Cooperate with Adaptive Partner Strategies in Repeated Games
Spotlight
Tue Jul 19 07:40 AM -- 07:45 AM (PDT) @ Hall F
Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning
Spotlight
Tue Jul 19 07:45 AM -- 07:50 AM (PDT) @ Hall F
Provable Reinforcement Learning with a Short-Term Memory
Spotlight
Tue Jul 19 07:50 AM -- 07:55 AM (PDT) @ Hall F
Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer
Spotlight
Tue Jul 19 07:55 AM -- 08:00 AM (PDT) @ Hall F
Mirror Learning: A Unifying Framework of Policy Optimisation
Oral
Tue Jul 19 08:00 AM -- 08:20 AM (PDT) @ Hall F
Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP
Spotlight
Tue Jul 19 08:20 AM -- 08:25 AM (PDT) @ Hall F
Learning Infinite-horizon Average-reward Markov Decision Process with Constraints
Spotlight
Tue Jul 19 08:25 AM -- 08:30 AM (PDT) @ Hall F
A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning
Spotlight
Tue Jul 19 08:30 AM -- 08:35 AM (PDT) @ Hall F
Langevin Monte Carlo for Contextual Bandits
Spotlight
Tue Jul 19 08:35 AM -- 08:40 AM (PDT) @ Hall F
Prompting Decision Transformer for Few-Shot Policy Generalization
Spotlight
Tue Jul 19 08:40 AM -- 08:45 AM (PDT) @ Hall F
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Spotlight
Tue Jul 19 08:45 AM -- 08:50 AM (PDT) @ Hall F
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation