Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees

Neural Information Processing Systems 

Inverse reinforcement learning (IRL) aims to recover the reward function and the associated optimal policy that best fits observed sequences of states and actions implemented by an expert.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found