ReplacingRewardswithExamples: Example-Based PolicySearchviaRecursiveClassification
–Neural Information Processing Systems
Forexample,theusercouldsolvethe task using actions unavailable to the agent (e.g., the user may have two arms, but a robotic agent mayhaveonlyone) ortheusercould findsuccess examples bysearching theinternet.
Neural Information Processing Systems
Feb-8-2026, 23:14:52 GMT
- Country:
- North America > United States
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Pennsylvania > Allegheny County
- Asia > Middle East
- Jordan (0.04)
- North America > United States
- Technology: