Timezone: »
Recent work has shown that reinforcement learning (RL) is a promising approach to control dynamical systems described by partial differential equations (PDE). This paper shows how to use RL to tackle more general PDE control problems that have continuous high-dimensional action spaces with spatial relationship among action dimensions. In particular, we propose the concept of action descriptors, which encode regularities among spatially-extended action dimensions and enable the agent to control high-dimensional action PDEs. We provide theoretical evidence suggesting that this approach can be more sample efficient compared to a conventional approach that treats each action dimension separately and does not explicitly exploit the spatial regularity of the action space. The action descriptor approach is then used within the deep deterministic policy gradient algorithm. Experiments on two PDE control problems, with up to 256-dimensional continuous actions, show the advantage of the proposed approach over the conventional one.
Author Information
Yangchen Pan (University of Alberta)
Amir-massoud Farahmand (Vector Institute)
Martha White (University of Alberta)
Saleh Nabi
Piyush Grover (Mitsubishi Electric Research Labs)
Piyush Grover is a principal researcher at MERL. He obtained his Ph.D. in Engineering Mechanics in 2010 from Virginia Tech, under the supervision of Shane Ross. His work involves a mix of basic and applied research at the intersection of nonlinear dynamical systems, mechanics and control. He is interested in both geometric/topological and operator-theoretic (or statistical) descriptions of phase space transport in dynamical systems, and deriving low-order descriptions of distributed systems.
Daniel Nikovski (Mitsubishi Electric Research Labs)
Related Events (a corresponding poster, oral, or spotlight)
-
2018 Poster: Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control »
Fri. Jul 13th 04:15 -- 07:00 PM Room Hall B #39
More from the Same Authors
-
2022 : VIPer: Iterative Value-Aware Model Learning on the Value Improvement Path »
Romina Abachi · Claas Voelcker · Animesh Garg · Amir-massoud Farahmand -
2023 Poster: Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning »
Brett Daley · Martha White · Christopher Amato · Marlos C. Machado -
2022 Workshop: Decision Awareness in Reinforcement Learning »
Evgenii Nikishin · Pierluca D'Oro · Doina Precup · Andre Barreto · Amir-massoud Farahmand · Pierre-Luc Bacon -
2022 Poster: A Temporal-Difference Approach to Policy Gradient Estimation »
Samuele Tosatto · Andrew Patterson · Martha White · A. Mahmood -
2022 Spotlight: A Temporal-Difference Approach to Policy Gradient Estimation »
Samuele Tosatto · Andrew Patterson · Martha White · A. Mahmood -
2021 Poster: PID Accelerated Value Iteration Algorithm »
Amir-massoud Farahmand · Mohammad Ghavamzadeh -
2021 Spotlight: PID Accelerated Value Iteration Algorithm »
Amir-massoud Farahmand · Mohammad Ghavamzadeh -
2020 : Panel Discussion »
Eric Eaton · Martha White · Doina Precup · Irina Rish · Harm van Seijen -
2020 : QA for invited talk 5 White »
Martha White -
2020 : Invited talk 5 White »
Martha White -
2020 : An Off-policy Policy Gradient Theorem: A Tale About Weightings - Martha White »
Martha White -
2020 : Speaker Panel »
Csaba Szepesvari · Martha White · Sham Kakade · Gergely Neu · Shipra Agrawal · Akshay Krishnamurthy -
2020 Poster: Gradient Temporal-Difference Learning with Regularized Corrections »
Sina Ghiassian · Andrew Patterson · Shivam Garg · Dhawal Gupta · Adam White · Martha White -
2020 Poster: Can Increasing Input Dimensionality Improve Deep Reinforcement Learning? »
Kei Ota · Tomoaki Oiki · Devesh Jha · Toshisada Mariyama · Daniel Nikovski -
2020 Poster: Selective Dyna-style Planning Under Limited Model Capacity »
Zaheer Abbas · Samuel Sokota · Erin Talvitie · Martha White -
2020 Poster: Optimizing for the Future in Non-Stationary MDPs »
Yash Chandak · Georgios Theocharous · Shiv Shankar · Martha White · Sridhar Mahadevan · Philip Thomas -
2019 Workshop: Exploration in Reinforcement Learning Workshop »
Benjamin Eysenbach · Benjamin Eysenbach · Surya Bhupatiraju · Shixiang Gu · Harrison Edwards · Martha White · Pierre-Yves Oudeyer · Kenneth Stanley · Emma Brunskill -
2018 Poster: Improving Regression Performance with Distributional Losses »
Ehsan Imani · Martha White -
2018 Oral: Improving Regression Performance with Distributional Losses »
Ehsan Imani · Martha White