Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment

Neural Information Processing Systems 

However, the inferred reward function often fails to capture the underlying task objective.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found