Nonparametric Model-Based Reinforcement Learning
Neural Information Processing Systems
This paper describes some of the interactions of model learning algorithms and planning algorithms we have found in exploring model-based reinforcement learning. The paper focuses on how local trajectory optimizers can be used effectively with learned nonparametric models. We find that trajectory planners that are fully consistent with the learned model often have difficulty finding reasonable plans in the early stages of learning. Trajectory planners that balance obeying the learned model with minimizing cost (or maximizing reward) often do better, even if the plan is not fully consistent with the learned model.

1 INTRODUCTION

We are exploring the use of nonparametric models in robot learning (Atkeson et al., 1997b; Atkeson and Schaal, 1997). This paper describes the interaction of model learning algorithms and planning algorithms, focusing on how local trajectory optimization can be used effectively with nonparametric models in reinforcement learning. We find that trajectory optimizers that are fully consistent with the learned model often have difficulty finding reasonable plans in the early stages of learning. The message of this paper is that a planner should not be entirely consistent with the learned model during model-based reinforcement learning.
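The trade-off described above, obeying the learned model versus minimizing cost, can be illustrated with a penalty-method trajectory optimizer: instead of enforcing the learned dynamics as a hard constraint, violations are penalized with a finite weight, so the plan may deviate from an inaccurate model where that buys lower cost. The following is a minimal sketch of that idea, not the paper's implementation; the scalar "learned model" `f_hat`, the quadratic cost, and the penalty weight `lam` are all illustrative assumptions.

```python
# Soft-consistency trajectory optimization: penalize, rather than enforce,
# agreement with a learned dynamics model (illustrative sketch).

def f_hat(x, u):
    # Stand-in for a learned (e.g. nonparametric) model; here a crude
    # linear fit chosen for illustration only.
    return 0.9 * x + 0.5 * u

def objective(z, x0, T, lam):
    # z packs the states x[1..T] followed by the controls u[0..T-1].
    xs = [x0] + list(z[:T])
    us = list(z[T:])
    # Running quadratic cost on states and controls, plus a terminal cost.
    cost = sum(x * x + u * u for x, u in zip(xs[:-1], us))
    cost += xs[-1] ** 2
    # Soft dynamics penalty: a finite lam lets the plan deviate from the
    # learned model; lam -> infinity recovers a fully consistent planner.
    cost += lam * sum((xs[t + 1] - f_hat(xs[t], us[t])) ** 2
                      for t in range(T))
    return cost

def optimize(x0=1.0, T=5, lam=10.0, steps=2000, lr=0.01, eps=1e-5):
    # Plain gradient descent with finite-difference gradients, to keep the
    # sketch dependency-free; any smooth optimizer would do.
    z = [0.0] * (2 * T)
    for _ in range(steps):
        base = objective(z, x0, T, lam)
        grad = []
        for i in range(len(z)):
            zp = z[:]
            zp[i] += eps
            grad.append((objective(zp, x0, T, lam) - base) / eps)
        z = [zi - lr * g for zi, g in zip(z, grad)]
    return z, objective(z, x0, T, lam)

z, val = optimize()
```

Raising `lam` drives the plan toward full consistency with `f_hat`; early in learning, when the model is poor, a smaller `lam` lets the optimizer find reasonable low-cost plans that the model alone would rule out.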
Dec-31-1998