Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL Qin-Wen Luo
–Neural Information Processing Systems
Offline reinforcement learning (RL) aims to learn a policy from a fixed dataset without additional interactions with the environment.
Neural Information Processing Systems
Oct-10-2025, 15:54:56 GMT
- Country:
- Asia
- China > Jiangsu Province
- Nanjing (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- China > Jiangsu Province
- North America > United States
- Montana (0.04)
- Washington > King County
- Seattle (0.04)
- Asia
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Education > Educational Setting
- Online (0.45)
- Information Technology (0.67)
- Education > Educational Setting
- Technology: