Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning
–Neural Information Processing Systems
The main idea is to update the most accurate prediction head so that each head specializes in environments with similar dynamics, i.e., the heads end up clustering the environments.
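The update rule described above can be illustrated with a minimal sketch. It is not the authors' implementation: the linear prediction heads, the mean-squared-error loss, the plain gradient step, and the names `trajectory_losses` and `mcl_update` are all assumptions made for illustration. The sketch scores every head on one trajectory segment and then updates only the head with the lowest error.

```python
import numpy as np


def trajectory_losses(heads, states, actions, next_states):
    """Mean squared prediction error of each head over one trajectory segment.

    heads: list of weight matrices, each of shape (state_dim + action_dim, state_dim).
    Linear heads are an assumption made to keep the sketch short.
    """
    inputs = np.concatenate([states, actions], axis=1)  # shape (T, state_dim + action_dim)
    return np.array([np.mean((inputs @ W - next_states) ** 2) for W in heads])


def mcl_update(heads, states, actions, next_states, lr=0.1):
    """Take one gradient step on the most accurate head only (winner-take-all).

    Returns the index of the updated head and the losses measured before the update.
    """
    losses = trajectory_losses(heads, states, actions, next_states)
    best = int(np.argmin(losses))  # head with the lowest trajectory-wise error

    inputs = np.concatenate([states, actions], axis=1)
    err = inputs @ heads[best] - next_states
    grad = 2.0 * inputs.T @ err / err.size  # gradient of the mean squared error
    heads[best] = heads[best] - lr * grad   # all other heads stay untouched
    return best, losses
```

Because the loss is computed over a whole segment rather than a single transition, the same head is selected for every step of that segment, which is what pushes different heads toward different groups of environments.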
Nov-14-2025, 15:04:46 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- North America
- Canada (0.04)
- United States > California
- Alameda County > Berkeley (0.04)
- Industry:
- Education (1.00)
- Technology: