Provably (More) Sample-Efficient Offline RL with Options

Neural Information Processing Systems 

Planning in long-horizon tasks is challenging in reinforcement learning (RL) (Co-Reyes et al., 2018;

Similar Docs  Excel Report  more

TitleSimilaritySource
None found