Goto

Collaborating Authors

 Reinforcement Learning


OfflineRLWithoutOff-PolicyEvaluation

Neural Information Processing Systems

Inaddition, wehypothesize thatthestrong performance of the one-step algorithm is due to a combination of favorable structure in the environmentandbehaviorpolicy.





2bba9f4124283edd644799e0cecd45ca-Paper.pdf

Neural Information Processing Systems

The problem of inverse reinforcement learning (IRL) is relevant to a variety of tasks including valuealignment androbot learning fromdemonstration.