Policy Improvement via Imitation of Multiple Oracles

Neural Information Processing Systems 

Despite its promise, reinforcement learning's real-world adoption has been hampered by the need for costly exploration to learn a good policy.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found