Continuous-Time Model-Based Reinforcement Learning

Yıldız, Çağatay, Heinonen, Markus, Lähdesmäki, Harri

Feb-10-2021–arXiv.org Machine Learning

Model-based reinforcement learning (MBRL) approaches rely on discrete-time state transition models, whereas physical systems and the vast majority of control tasks operate in continuous time. To avoid the time-discretization approximation of the underlying process, we propose a continuous-time MBRL framework based on a novel actor-critic method. Our approach also infers the unknown state evolution differentials with Bayesian neural ordinary differential equations (ODEs) to account for epistemic uncertainty. We implement and test our method on a new ODE-RL suite that explicitly solves continuous-time control systems. Our experiments illustrate that the model is robust against irregular and noisy data, is sample-efficient, and can solve control problems that pose challenges to discrete-time MBRL methods.
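To make the continuous-time setting concrete, the following is a minimal sketch and not the authors' implementation. It assumes a known pendulum ODE in place of the learned Bayesian neural ODE, a hand-written damping controller in place of the learned actor, and a fixed-step Runge-Kutta integrator. All function names and constants here are hypothetical. The point it illustrates is that a model of the state derivative can be queried at arbitrary, irregularly spaced time stamps, which a fixed-step transition model cannot do directly.

```python
import numpy as np

def pendulum_dynamics(state, action, g=9.81, length=1.0):
    """State derivative of a frictionless pendulum; state = (angle, angular velocity).
    Assumption: stands in for a learned dynamics model."""
    theta, omega = state
    return np.array([omega, -(g / length) * np.sin(theta) + action])

def policy(state):
    """Hypothetical stand-in for a learned actor: velocity damping feedback."""
    return -0.5 * state[1]

def rk4_step(f, state, dt):
    """One classical fourth-order Runge-Kutta step for ds/dt = f(s)."""
    k1 = f(state)
    k2 = f(state + 0.5 * dt * k1)
    k3 = f(state + 0.5 * dt * k2)
    k4 = f(state + dt * k3)
    return state + (dt / 6.0) * (k1 + 2 * k2 + 2 * k3 + k4)

def rollout(s0, times, max_dt=0.01):
    """Integrate the closed-loop ODE and return the state at each requested time.
    The requested times may be irregularly spaced."""
    closed_loop = lambda s: pendulum_dynamics(s, policy(s))
    states = [np.asarray(s0, dtype=float)]
    for t0, t1 in zip(times[:-1], times[1:]):
        # Split each interval into sub-steps no longer than max_dt.
        n_sub = max(1, int(np.ceil((t1 - t0) / max_dt)))
        dt = (t1 - t0) / n_sub
        s = states[-1]
        for _ in range(n_sub):
            s = rk4_step(closed_loop, s, dt)
        states.append(s)
    return np.stack(states)

def energy(state, g=9.81, length=1.0):
    """Mechanical energy per unit mass (up to a constant) of the pendulum."""
    theta, omega = state
    return 0.5 * omega**2 + (g / length) * (1.0 - np.cos(theta))

# Irregular observation times, as might arise from asynchronous sensors.
times = np.array([0.0, 0.13, 0.2, 0.55, 0.6, 1.0])
trajectory = rollout([1.0, 0.0], times)
```

Because the damping controller removes energy whenever the pendulum moves, the energy of the final state should be lower than that of the initial state, which gives a quick sanity check on the integration.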


