Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification

Neural Information Processing Systems 

Reinforcement learning (RL) algorithms assume that users specify tasks by manually writing down a reward function.
