Reinforcement Learning Based on On-Line EM Algorithm

Sato, Masa-aki, Ishii, Shin

Neural Information Processing Systems 

The actor and the critic are approximated by Normalized Gaussian Networks (NGnet), which are networks of local linear regression units. The NGnet is trained by the on-line EM algorithm proposed in our previous paper. We apply our RL method to the task of swinging up and stabilizing a single pendulum and the task of balancing a double pendulum near the upright position.
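
To make "networks of local linear regression units" concrete, the following is a minimal sketch of an NGnet forward pass in the commonly used formulation: each unit has a Gaussian activation, the activations are normalized to sum to one, and the output is the activation-weighted sum of the units' linear predictions. The function name ngnet_predict, the parameter names, and the array shapes are assumptions made for this illustration and are not taken from the paper. Training by the on-line EM algorithm is not shown.

```python
import numpy as np

def ngnet_predict(x, centers, inv_covs, weights, biases):
    """Forward pass of a normalized Gaussian network (illustrative sketch).

    Hypothetical shapes, chosen for this example only:
      x        (D,)       input vector
      centers  (M, D)     Gaussian center of each of the M units
      inv_covs (M, D, D)  inverse covariance of each unit
      weights  (M, K, D)  linear regression matrix of each unit
      biases   (M, K)     linear regression offset of each unit
    Returns the (K,) network output.
    """
    diff = x - centers                                   # (M, D)
    # Squared Mahalanobis distance from x to each unit's center.
    maha = np.einsum("md,mde,me->m", diff, inv_covs, diff)
    # Log Gaussian density up to a constant shared by all units.
    log_g = -0.5 * maha + 0.5 * np.linalg.slogdet(inv_covs)[1]
    # Subtract the maximum before exponentiating for numerical stability.
    g = np.exp(log_g - log_g.max())
    gate = g / g.sum()                                   # normalized activations
    # Each unit's local linear prediction W_i x + b_i.
    local = np.einsum("mkd,d->mk", weights, x) + biases  # (M, K)
    return gate @ local                                  # (K,)
```

In an actor-critic setting of the kind the abstract describes, one such network would map the state to a control signal (the actor) and another would map the state to a value estimate (the critic); how the two are wired together here is left out because the abstract does not specify it.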
