84 Results
Spotlight | Thu 6:30 | Adaptive Sampling for Best Policy Identification in Markov Decision Processes | Aymen Al Marjani · Alexandre Proutiere
Spotlight | Wed 6:25 | Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with √T Regret | Asaf Cassel · Tomer Koren
Poster | Thu 9:00 | Adaptive Sampling for Best Policy Identification in Markov Decision Processes | Aymen Al Marjani · Alexandre Proutiere
Poster | Wed 9:00 | Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with √T Regret | Asaf Cassel · Tomer Koren
Spotlight | Wed 19:35 | Dropout: Explicit Forms and Capacity Control | Raman Arora · Peter Bartlett · Poorya Mianjy · Nati Srebro
Poster | Wed 21:00 | Dropout: Explicit Forms and Capacity Control | Raman Arora · Peter Bartlett · Poorya Mianjy · Nati Srebro
Spotlight | Wed 17:25 | Confidence-Budget Matching for Sequential Budgeted Learning | Yonathan Efroni · Nadav Merlis · Aadirupa Saha · Shie Mannor
Poster | Wed 21:00 | Confidence-Budget Matching for Sequential Budgeted Learning | Yonathan Efroni · Nadav Merlis · Aadirupa Saha · Shie Mannor
Poster | Wed 21:00 | Provably Efficient Algorithms for Multi-Objective Competitive RL | Tiancheng Yu · Yi Tian · Jingzhao Zhang · Suvrit Sra
Oral | Wed 18:00 | Provably Efficient Algorithms for Multi-Objective Competitive RL | Tiancheng Yu · Yi Tian · Jingzhao Zhang · Suvrit Sra
Poster | Wed 21:00 | Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality | Tengyu Xu · Zhuoran Yang · Zhaoran Wang · Yingbin LIANG
Spotlight | Wed 18:30 | Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality | Tengyu Xu · Zhuoran Yang · Zhaoran Wang · Yingbin LIANG