Timezone: »
We consider the setting of iterative learning control, or model-based policy learning in the presence of uncertain, time-varying dynamics. In this setting, we propose a new performance metric, planning regret, which replaces the standard stochastic uncertainty assumptions with worst case regret. Based on recent advances in non-stochastic control, we design a new iterative algorithm for minimizing planning regret that is more robust to model mismatch and uncertainty. We provide theoretical and empirical evidence that the proposed algorithm outperforms existing methods on several benchmarks.
Author Information
Naman Agarwal (Google Research)
Elad Hazan (Princeton University and Google Brain)
Anirudha Majumdar (Princeton University)
Anirudha Majumdar is an Assistant Professor in the Mechanical and Aerospace Engineering (MAE) department at Princeton University. He also holds a part-time visiting research scientist position at the Google AI Lab in Princeton. Majumdar received a Ph.D. in Electrical Engineering and Computer Science from the Massachusetts Institute of Technology in 2016, and a B.S.E. in Mechanical Engineering and Mathematics from the University of Pennsylvania in 2011. Subsequently, he was a postdoctoral scholar at Stanford University from 2016 to 2017 at the Autonomous Systems Lab in the Aeronautics and Astronautics department. He is a recipient of the Sloan Fellowship, ONR Young Investigator Program (YIP) award, the NSF CAREER award, the Google Faculty Research Award (twice), the Amazon Research Award (twice), the Young Faculty Researcher Award from the Toyota Research Institute, the Paper of the Year Award from the International Journal of Robotics Research (IJRR), the Best Conference Paper Award at the International Conference on Robotics and Automation (ICRA), the Alfred Rheinstein Faculty Award (Princeton), and the Excellence in Teaching Award (Princeton SEAS).
Karan Singh (Microsoft Research)
Related Events (a corresponding poster, oral, or spotlight)
-
2021 Spotlight: A Regret Minimization Approach to Iterative Learning Control »
Wed. Jul 21st 01:35 -- 01:40 PM Room
More from the Same Authors
-
2021 : Robust online control with model misspecification »
Xinyi Chen · Udaya Ghai · Elad Hazan · Alexandre Megretsky -
2021 : A Boosting Approach to Reinforcement Learning »
Nataly Brukhim · Elad Hazan · Karan Singh -
2022 : Non-convex online learning via algorithmic equivalence »
Udaya Ghai · Zhou Lu · Elad Hazan -
2023 Poster: Multi-User Reinforcement Learning with Low Rank Rewards »
Naman Agarwal · Prateek Jain · Dheeraj Nagaraj · Suhas Kowshik · Praneeth Netrapalli -
2023 Poster: Fundamental Tradeoffs in Learning with Prior Information »
Anirudha Majumdar -
2022 Poster: A Regret Minimization Approach to Multi-Agent Control »
Udaya Ghai · Udari Madhuhshani · Naomi Leonard · Elad Hazan -
2022 Spotlight: A Regret Minimization Approach to Multi-Agent Control »
Udaya Ghai · Udari Madhuhshani · Naomi Leonard · Elad Hazan -
2021 Poster: Boosting for Online Convex Optimization »
Elad Hazan · Karan Singh -
2021 Spotlight: Boosting for Online Convex Optimization »
Elad Hazan · Karan Singh -
2021 Poster: Acceleration via Fractal Learning Rate Schedules »
Naman Agarwal · Surbhi Goel · Cyril Zhang -
2021 Spotlight: Acceleration via Fractal Learning Rate Schedules »
Naman Agarwal · Surbhi Goel · Cyril Zhang -
2021 : Online and non-stochastic control »
Karan Singh -
2021 Tutorial: Online and non-stochastic control »
Elad Hazan · Karan Singh -
2021 : Online and non-stochastic control »
Elad Hazan -
2020 Poster: Boosting for Control of Dynamical Systems »
Naman Agarwal · Nataly Brukhim · Elad Hazan · Zhou Lu -
2019 Poster: Efficient Full-Matrix Adaptive Regularization »
Naman Agarwal · Brian Bullins · Xinyi Chen · Elad Hazan · Karan Singh · Cyril Zhang · Yi Zhang -
2019 Poster: Online Control with Adversarial Disturbances »
Naman Agarwal · Brian Bullins · Elad Hazan · Sham Kakade · Karan Singh -
2019 Oral: Efficient Full-Matrix Adaptive Regularization »
Naman Agarwal · Brian Bullins · Xinyi Chen · Elad Hazan · Karan Singh · Cyril Zhang · Yi Zhang -
2019 Oral: Online Control with Adversarial Disturbances »
Naman Agarwal · Brian Bullins · Elad Hazan · Sham Kakade · Karan Singh -
2019 Poster: Provably Efficient Maximum Entropy Exploration »
Elad Hazan · Sham Kakade · Karan Singh · Abby Van Soest -
2019 Oral: Provably Efficient Maximum Entropy Exploration »
Elad Hazan · Sham Kakade · Karan Singh · Abby Van Soest -
2018 Poster: On the Optimization of Deep Networks: Implicit Acceleration by Overparameterization »
Sanjeev Arora · Nadav Cohen · Elad Hazan -
2018 Oral: On the Optimization of Deep Networks: Implicit Acceleration by Overparameterization »
Sanjeev Arora · Nadav Cohen · Elad Hazan -
2017 Poster: Efficient Regret Minimization in Non-Convex Games »
Elad Hazan · Karan Singh · Cyril Zhang -
2017 Talk: Efficient Regret Minimization in Non-Convex Games »
Elad Hazan · Karan Singh · Cyril Zhang