Random Actions vs Random Policies: Bootstrapping Model-Based Direct Policy Search

Hanna, Elias, Coninx, Alex, Doncieux, Stéphane

Oct-21-2022–arXiv.org Artificial Intelligence

This paper studies the impact of the initial data gathering method on the subsequent learning of a dynamics model. Dynamics models approximate the true transition function of a given task, in order to perform policy search directly on the model rather than on the costly real system. This study aims to determine how to bootstrap a model as efficiently as possible, by comparing initialization methods employed in two different policy search frameworks in the literature. The study focuses on the model performance under the episode-based framework of Evolutionary methods using probabilistic ensembles. Experimental results show that various task-dependant factors can be detrimental to each method, suggesting to explore hybrid approaches.

dynamic model, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

Oct-21-2022

arXiv.org PDF

Add feedback

Country:
- Europe > France > Île-de-France > Paris > Paris (0.05)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Representation & Reasoning (1.00)
  - Machine Learning
    - Neural Networks (0.48)
    - Performance Analysis > Accuracy (0.40)
    - Reinforcement Learning (0.30)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found