A0C: Alpha Zero in Continuous Action Space

Moerland, Thomas M., Broekens, Joost, Plaat, Aske, Jonker, Catholijn M.

May-24-2018–arXiv.org Artificial Intelligence

A core novelty of Alpha Zero is the interleaving of tree search and deep learning, which has proven very successful in board games like Chess, Shogi and Go. These games have a discrete action space. However, many real-world reinforcement learning domains have continuous action spaces, for example in robotic control, navigation and self-driving cars. This paper presents the necessary theoretical extensions of Alpha Zero to deal with continuous action space. We also provide some preliminary experiments on the Pendulum swing-up task, empirically showing the feasibility of our approach. Thereby, this work provides a first step towards the application of iterated search and learning in domains with a continuous action space.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

May-24-2018

arXiv.org PDF

Add feedback

Country:
- Europe > Netherlands (0.28)

Genre:
- Research Report (0.82)

Industry:
- Leisure & Entertainment > Games > Chess (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Representation & Reasoning (1.00)
  - Machine Learning
    - Reinforcement Learning (0.90)
    - Neural Networks > Deep Learning (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found