78 Results

Oral
Wed 5:00 Near Optimal Reward-Free Reinforcement Learning
Zhang Zihan, Simon Du, Xiangyang Ji
Spotlight
Wed 5:20 Batch Value-function Approximation with Only Realizability
Tengyang Xie, Nan Jiang
Spotlight
Wed 5:35 Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
Botao Hao, Xiang Ji, Yaqi Duan, Hao Lu, Csaba Szepesvari, Mengdi Wang
Oral
Wed 6:00 Dynamic Game Theoretic Neural Optimizer
Guan-Horng Liu, Chen Chen, Evangelos Theodorou
Oral
Wed 6:00 Bilinear Classes: A Structural Framework for Provable Generalization in RL
Simon Du, Sham Kakade, Jason Lee, Shachar Lovett, Gaurav Mahajan, Wen Sun, Ruosong Wang
Spotlight
Wed 6:20 Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
Yaqi Duan, Chi Jin, Zhiyuan Li
Spotlight
Wed 6:25 Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with √T Regret
Asaf Cassel, Tomer Koren
Spotlight
Wed 6:30 Reward Identification in Inverse Reinforcement Learning
Kuno Kim, Shivam Garg, Kiran Shiragur, Stefano Ermon
Spotlight
Wed 6:35 Online Optimization in Games via Control Theory: Connecting Regret, Passivity and Poincaré Recurrence
Yun Kuen Cheung, Georgios Piliouras
Spotlight
Wed 6:40 Efficient Performance Bounds for Primal-Dual Reinforcement Learning from Demonstrations
Angeliki Kamoutsi, Goran Banjac, John Lygeros
Spotlight
Wed 6:40 On Reinforcement Learning with Adversarial Corruption and Its Application to Block MDP
Tianhao Wu, Yunchang Yang, Simon Du, Liwei Wang
Spotlight
Wed 6:45 Kernel-Based Reinforcement Learning: A Finite-Time Analysis
Omar Darwiche Domingues, Pierre Menard, Matteo Pirotta, Emilie Kaufmann, Michal Valko
Spotlight
Wed 6:45 Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees
Kishan Panaganti, Dileep Kalathil
Poster
Wed 9:00 Near Optimal Reward-Free Reinforcement Learning
Zhang Zihan, Simon Du, Xiangyang Ji
Poster
Wed 9:00 Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees
Kishan Panaganti, Dileep Kalathil
Poster
Wed 9:00 On Reinforcement Learning with Adversarial Corruption and Its Application to Block MDP
Tianhao Wu, Yunchang Yang, Simon Du, Liwei Wang
Poster
Wed 9:00 Online Optimization in Games via Control Theory: Connecting Regret, Passivity and Poincaré Recurrence
Yun Kuen Cheung, Georgios Piliouras
Poster
Wed 9:00 Efficient Performance Bounds for Primal-Dual Reinforcement Learning from Demonstrations
Angeliki Kamoutsi, Goran Banjac, John Lygeros
Poster
Wed 9:00 Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with √T Regret
Asaf Cassel, Tomer Koren
Poster
Wed 9:00 Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
Botao Hao, Xiang Ji, Yaqi Duan, Hao Lu, Csaba Szepesvari, Mengdi Wang
Poster
Wed 9:00 Batch Value-function Approximation with Only Realizability
Tengyang Xie, Nan Jiang
Poster
Wed 9:00 Kernel-Based Reinforcement Learning: A Finite-Time Analysis
Omar Darwiche Domingues, Pierre Menard, Matteo Pirotta, Emilie Kaufmann, Michal Valko
Poster
Wed 9:00 Reward Identification in Inverse Reinforcement Learning
Kuno Kim, Shivam Garg, Kiran Shiragur, Stefano Ermon
Poster
Wed 9:00 Dynamic Game Theoretic Neural Optimizer
Guan-Horng Liu, Chen Chen, Evangelos Theodorou
Poster
Wed 9:00 Bilinear Classes: A Structural Framework for Provable Generalization in RL
Simon Du, Sham Kakade, Jason Lee, Shachar Lovett, Gaurav Mahajan, Wen Sun, Ruosong Wang
Poster
Wed 9:00 Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
Yaqi Duan, Chi Jin, Zhiyuan Li
Oral
Wed 17:00 Exponential Lower Bounds for Batch Reinforcement Learning: Batch RL can be Exponentially Harder than Online RL
Andrea Zanette
Oral
Wed 17:00 UCB Momentum Q-learning: Correcting the bias without forgetting
Pierre Menard, Omar Darwiche Domingues, Xuedong Shang, Michal Valko
Spotlight
Wed 17:25 Confidence-Budget Matching for Sequential Budgeted Learning
Yonathan Efroni, Nadav Merlis, Aadirupa Saha, Shie Mannor
Spotlight
Wed 17:30 Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity
Dhruv Malik, Aldo Pacchiano, Vishwak Srinivasan, Yuanzhi Li
Spotlight
Wed 17:30 Fast active learning for pure exploration in reinforcement learning
Pierre Menard, Omar Darwiche Domingues, Anders Jonsson, Emilie Kaufmann, Edouard Leurent, Michal Valko
Spotlight
Wed 17:35 Leveraging Non-uniformity in First-order Non-convex Optimization
Jincheng Mei, Yue Gao, Bo Dai, Csaba Szepesvari, Dale Schuurmans
Spotlight
Wed 17:40 Robust Policy Gradient against Strong Data Corruption
Xuezhou Zhang, Yiding Chen, Jerry Zhu, Wen Sun
Spotlight
Wed 17:45 Logarithmic Regret for Reinforcement Learning with Linear Function Approximation
Jiafan He, Dongruo Zhou, Quanquan Gu
Oral
Wed 18:00 Task-Optimal Exploration in Linear Dynamical Systems
Andrew Wagenmaker, Max Simchowitz, Kevin Jamieson
Oral
Wed 18:00 Provably Efficient Algorithms for Multi-Objective Competitive RL
Tiancheng Yu, Yi Tian, Jingzhao Zhang, Suvrit Sra
Spotlight
Wed 18:20 Online Learning in Unknown Markov Games
Yi Tian, Yuanhao Wang, Tiancheng Yu, Suvrit Sra
Spotlight
Wed 18:25 CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
Tengyu Xu, Yingbin Liang, Guanghui Lan
Spotlight
Wed 18:25 A Sharp Analysis of Model-based Reinforcement Learning with Self-Play
Qinghua Liu, Tiancheng Yu, Yu Bai, Chi Jin
Spotlight
Wed 18:30 Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang
Spotlight
Wed 18:30 Randomized Exploration in Reinforcement Learning with General Value Function Approximation
Haque Ishfaq, Qiwen Cui, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup, Lin Yang
Spotlight
Wed 18:35 Towards Tight Bounds on the Sample Complexity of Average-reward MDPs
Yujia Jin, Aaron Sidford
Spotlight
Wed 18:40 Finding the Stochastic Shortest Path with Low Regret: the Adversarial Cost and Unknown Transition Case
Liyu Chen, Haipeng Luo
Oral
Wed 19:00 Improved Regret Bound and Experience Replay in Regularized Policy Iteration
Nevena Lazic, Dong Yin, Yasin Abbasi-Yadkori, Csaba Szepesvari
Spotlight
Wed 19:20 Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs
Weichao Mao, Kaiqing Zhang, Ruihao Zhu, David Simchi-Levi, Tamer Basar
Spotlight
Wed 19:30 Provably Correct Optimization and Exploration with Non-linear Policies
Fei Feng, Wotao Yin, Alekh Agarwal, Lin Yang
Spotlight
Wed 19:35 Safe Reinforcement Learning Using Advantage-Based Intervention
Nolan Wagener, Byron Boots, Ching-An Cheng
Spotlight
Wed 19:45 Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap
Gokul Swamy, Sanjiban Choudhury, J. Bagnell, Steven Wu
Poster
Wed 21:00 Confidence-Budget Matching for Sequential Budgeted Learning
Yonathan Efroni, Nadav Merlis, Aadirupa Saha, Shie Mannor
Poster
Wed 21:00 Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs
Weichao Mao, Kaiqing Zhang, Ruihao Zhu, David Simchi-Levi, Tamer Basar
Poster
Wed 21:00 Fast active learning for pure exploration in reinforcement learning
Pierre Menard, Omar Darwiche Domingues, Anders Jonsson, Emilie Kaufmann, Edouard Leurent, Michal Valko
Poster
Wed 21:00 UCB Momentum Q-learning: Correcting the bias without forgetting
Pierre Menard, Omar Darwiche Domingues, Xuedong Shang, Michal Valko
Poster
Wed 21:00 Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang
Poster
Wed 21:00 Task-Optimal Exploration in Linear Dynamical Systems
Andrew Wagenmaker, Max Simchowitz, Kevin Jamieson
Poster
Wed 21:00 A Sharp Analysis of Model-based Reinforcement Learning with Self-Play
Qinghua Liu, Tiancheng Yu, Yu Bai, Chi Jin
Poster
Wed 21:00 CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
Tengyu Xu, Yingbin Liang, Guanghui Lan
Poster
Wed 21:00 Exponential Lower Bounds for Batch Reinforcement Learning: Batch RL can be Exponentially Harder than Online RL
Andrea Zanette
Poster
Wed 21:00 Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity
Dhruv Malik, Aldo Pacchiano, Vishwak Srinivasan, Yuanzhi Li
Poster
Wed 21:00 Improved Regret Bound and Experience Replay in Regularized Policy Iteration
Nevena Lazic, Dong Yin, Yasin Abbasi-Yadkori, Csaba Szepesvari
Poster
Wed 21:00 Provably Correct Optimization and Exploration with Non-linear Policies
Fei Feng, Wotao Yin, Alekh Agarwal, Lin Yang
Poster
Wed 21:00 Provably Efficient Algorithms for Multi-Objective Competitive RL
Tiancheng Yu, Yi Tian, Jingzhao Zhang, Suvrit Sra
Poster
Wed 21:00 Online Learning in Unknown Markov Games
Yi Tian, Yuanhao Wang, Tiancheng Yu, Suvrit Sra
Poster
Wed 21:00 Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap
Gokul Swamy, Sanjiban Choudhury, J. Bagnell, Steven Wu
Poster
Wed 21:00 Leveraging Non-uniformity in First-order Non-convex Optimization
Jincheng Mei, Yue Gao, Bo Dai, Csaba Szepesvari, Dale Schuurmans
Poster
Wed 21:00 Towards Tight Bounds on the Sample Complexity of Average-reward MDPs
Yujia Jin, Aaron Sidford
Poster
Wed 21:00 Randomized Exploration in Reinforcement Learning with General Value Function Approximation
Haque Ishfaq, Qiwen Cui, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup, Lin Yang
Poster
Wed 21:00 Safe Reinforcement Learning Using Advantage-Based Intervention
Nolan Wagener, Byron Boots, Ching-An Cheng
Poster
Wed 21:00 Robust Policy Gradient against Strong Data Corruption
Xuezhou Zhang, Yiding Chen, Jerry Zhu, Wen Sun
Poster
Wed 21:00 Logarithmic Regret for Reinforcement Learning with Linear Function Approximation
Jiafan He, Dongruo Zhou, Quanquan Gu
Poster
Wed 21:00 Finding the Stochastic Shortest Path with Low Regret: the Adversarial Cost and Unknown Transition Case
Liyu Chen, Haipeng Luo
Oral
Thu 6:00 Temporal Difference Learning as Gradient Splitting
Rui Liu, Alex Olshevsky
Spotlight
Thu 6:20 First-Order Methods for Wasserstein Distributionally Robust MDP
Julien Grand-Clement, Christian Kroer
Spotlight
Thu 6:30 Adaptive Sampling for Best Policy Identification in Markov Decision Processes
Aymen Al Marjani, Alexandre Proutiere
Spotlight
Thu 6:35 Quantum algorithms for reinforcement learning with a generative model
Daochen Wang, Aarthi Sundaram, Robin Kothari, Ashish Kapoor, Martin Roetteler
Poster
Thu 9:00 Quantum algorithms for reinforcement learning with a generative model
Daochen Wang, Aarthi Sundaram, Robin Kothari, Ashish Kapoor, Martin Roetteler
Poster
Thu 9:00 First-Order Methods for Wasserstein Distributionally Robust MDP
Julien Grand-Clement, Christian Kroer
Poster
Thu 9:00 Adaptive Sampling for Best Policy Identification in Markov Decision Processes
Aymen Al Marjani, Alexandre Proutiere
Poster
Thu 9:00 Temporal Difference Learning as Gradient Splitting
Rui Liu, Alex Olshevsky