Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification
