Reining Generalization in Offline Reinforcement Learning via Representation Distinction
–Neural Information Processing Systems
Offline Reinforcement Learning (RL) aims to address the challenge of distribution shift between the dataset and the learned policy, where the value of out-of-distribution (OOD) data may be erroneously estimated due to overgeneralization.
Neural Information Processing Systems
Feb-15-2026, 13:02:56 GMT
- Country:
- Asia > China
- Tianjin Province > Tianjin (0.04)
- North America
- Canada > Quebec
- Montreal (0.04)
- United States > Montana (0.04)
- Canada > Quebec
- Asia > China
- Genre:
- Research Report (0.46)
- Technology: