The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning Harm van Seijen

Neural Information Processing Systems 

This is a great development, but the lack of a consistent metric to evaluate such methods makes it difficult to compare various approaches.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found