Iteratively Refined Behavior Regularization for Offline Reinforcement Learning Yi Ma
–Neural Information Processing Systems
Unfortunately, behavior regularization, a simple yet effective offline RL algorithm, tends to struggle in this regard.
Neural Information Processing Systems
Feb-15-2026, 12:42:32 GMT
- Country:
- Asia > China
- Guangdong Province > Shenzhen (0.04)
- Tianjin Province > Tianjin (0.04)
- North America > United States
- Montana (0.04)
- Asia > China
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.93)
- Research Report
- Industry:
- Education > Educational Setting > Online (0.46)
- Technology: