Adaptive Q-Aid for Conditional Supervised Learning in Offline Reinforcement Learning
Jeonghye Kim
Neural Information Processing Systems
Offline reinforcement learning (RL) has progressed with return-conditioned supervised learning (RCSL), but its lack of stitching ability remains a limitation.
Nov-19-2025, 23:26:46 GMT