Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction

Neural Information Processing Systems 

Tree Search (TS) is crucial to some of the most influential successes in reinforcement learning.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found