Timezone: »
Model-based reinforcement learning (MBRL) approaches rely on discrete-time state transition models whereas physical systems and the vast majority of control tasks operate in continuous-time. To avoid time-discretization approximation of the underlying process, we propose a continuous-time MBRL framework based on a novel actor-critic method. Our approach also infers the unknown state evolution differentials with Bayesian neural ordinary differential equations (ODE) to account for epistemic uncertainty. We implement and test our method on a new ODE-RL suite that explicitly solves continuous-time control systems. Our experiments illustrate that the model is robust against irregular and noisy data, and can solve classic control problems in a sample-efficient manner.
Author Information
Cagatay Yildiz (Aalto University)
Markus Heinonen (Aalto University)
Harri Lähdesmäki (Aalto University)
Related Events (a corresponding poster, oral, or spotlight)
-
2021 Poster: Continuous-time Model-based Reinforcement Learning »
Wed. Jul 21st 04:00 -- 06:00 PM Room
More from the Same Authors
-
2023 : AbODE: Ab initio antibody design using conjoined ODEs »
Yogesh Verma · Markus Heinonen · Vikas K Garg -
2023 : Adverse event prediction using a task-specific generative model »
Otto Lönnroth · Siddharth Ramchandran · Pekka Tiikkainen · Mine Öğretir · Jussi Leinonen · Harri Lähdesmäki -
2023 : Longitudinal Variational Autoencoder for Compositional Data Analysis »
Mine Öğretir · Harri Lähdesmäki · Jamie Norton -
2023 : AbODE: Ab initio antibody design using conjoined ODEs »
Yogesh Verma · Markus Heinonen · Vikas K Garg -
2023 Poster: AbODE: Ab initio antibody design using conjoined ODEs »
Yogesh Verma · Markus Heinonen · Vikas K Garg -
2022 Poster: Tackling covariate shift with node-based Bayesian neural networks »
Trung Trinh · Markus Heinonen · Luigi Acerbi · Samuel Kaski -
2022 Oral: Tackling covariate shift with node-based Bayesian neural networks »
Trung Trinh · Markus Heinonen · Luigi Acerbi · Samuel Kaski -
2019 : Poster Session & Lunch break »
Kay Wiese · Brandon Carter · Dan DeBlasio · Mohammad Hashir · Rachel Chan · Matteo Manica · Ali Oskooei · Zhenqin Wu · Karren Yang · François FAGES · Ruishan Liu · Nicasia Beebe-Wang · Bryan He · Jacopo Cirrone · Pekka Marttinen · Elior Rahmani · Harri Lähdesmäki · Nikhil Yadala · Andreea-Ioana Deac · Ava Soleimany · Mansi Ranjit Mane · Jason Ernst · Joseph Paul Cohen · Joel Mathew · Vishal Agarwal · AN ZHENG -
2018 Poster: Learning unknown ODE models with Gaussian processes »
Markus Heinonen · Cagatay Yildiz · Henrik Mannerström · Jukka Intosalmi · Harri Lähdesmäki -
2018 Oral: Learning unknown ODE models with Gaussian processes »
Markus Heinonen · Cagatay Yildiz · Henrik Mannerström · Jukka Intosalmi · Harri Lähdesmäki