The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning
Harm van Seijen
–Neural Information Processing Systems
This is a great development, but the lack of a consistent metric to evaluate such methods makes it difficult to compare various approaches.
Oct-2-2025, 20:18:20 GMT
- Country:
- North America > Canada > Quebec > Montreal (0.04)
- Genre:
- Research Report > New Finding (0.68)
- Industry:
- Leisure & Entertainment > Games (0.68)
- Technology: