Provably (More) Sample-Efficient Offline RL with Options
–Neural Information Processing Systems
Planning in long-horizon tasks is challenging in reinforcement learning (RL) (Co-Reyes et al., 2018;
Neural Information Processing Systems
Oct-9-2025, 08:59:16 GMT
- Country:
- Asia > China
- Hong Kong (0.04)
- North America > United States (0.04)
- Asia > China
- Industry:
- Transportation (0.46)
- Technology: