Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs
Du, Jianzhun, Futoma, Joseph, Doshi-Velez, Finale
We present two elegant solutions for modeling continuous-time dynamics within a novel model-based reinforcement learning (RL) framework for semi-Markov decision processes (SMDPs), using neural ordinary differential equations (ODEs). Our models accurately characterize continuous-time dynamics and enable us to learn high-performing policies from small amounts of data. We also develop a model-based approach for optimizing time schedules that reduces the agent's interaction rate with the environment while maintaining near-optimal performance, something model-free methods cannot do. We experimentally demonstrate the efficacy of our methods across various continuous-time domains.
Oct-25-2020
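To make the core idea concrete, here is a minimal sketch (not the paper's implementation) of how a neural-ODE dynamics model handles the variable decision intervals that characterize an SMDP: the next state is predicted by integrating a learned vector field ds/dt = f(s, a) over the elapsed duration tau. The tiny network `f`, its random weights, and the fixed-step RK4 integrator are all illustrative assumptions; in practice f would be trained on observed (s, a, tau, s') transitions and integrated with an adaptive ODE solver.

```python
import numpy as np

def f(state, action, W1, W2):
    """ODE right-hand side: a one-hidden-layer network on [state; action].
    Hypothetical stand-in for a trained dynamics network."""
    x = np.concatenate([state, action])
    h = np.tanh(W1 @ x)
    return W2 @ h

def predict_next_state(state, action, tau, W1, W2, n_steps=20):
    """Integrate ds/dt = f(s, a) for a duration tau with fixed-step RK4.
    Because tau is an input, the same model serves arbitrary-length
    SMDP decision intervals."""
    h = tau / n_steps
    s = state.copy()
    for _ in range(n_steps):
        k1 = f(s, action, W1, W2)
        k2 = f(s + 0.5 * h * k1, action, W1, W2)
        k3 = f(s + 0.5 * h * k2, action, W1, W2)
        k4 = f(s + h * k3, action, W1, W2)
        s = s + (h / 6.0) * (k1 + 2 * k2 + 2 * k3 + k4)
    return s

# Toy usage: 2-D state, 1-D action, randomly initialized weights.
rng = np.random.default_rng(0)
W1 = rng.normal(scale=0.1, size=(8, 3))   # hidden dim 8, input dim 2 + 1
W2 = rng.normal(scale=0.1, size=(2, 8))
s0 = np.array([1.0, -0.5])
a = np.array([0.3])
s_half = predict_next_state(s0, a, tau=0.5, W1=W1, W2=W2)
s_two = predict_next_state(s0, a, tau=2.0, W1=W1, W2=W2)
print(s_half.shape)  # (2,)
```

A model-free method would need separate value estimates per interval length; here, one continuous-time model covers every tau, which is what makes optimizing the measurement schedule itself tractable.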