Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement Benjamin Eysenbach

Aug-15-2025, 16:31:18 GMT–Neural Information Processing Systems

Several prior works have found that relabeling past experience with different reward functions can improve sample efficiency.

inverse rl, reward function, trajectory, (14 more...)

Neural Information Processing Systems

Aug-15-2025, 16:31:18 GMT

Conferences PDF

Country:
- North America
  - Canada (0.04)
  - United States > Pennsylvania
    - Allegheny County > Pittsburgh (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
a97da629b098b75c294dffdc3e463904-Paper.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found