Conservative Q-Learning for Offline Reinforcement Learning A viral Kumar

Neural Information Processing Systems 

Effectively leveraging large, previously collected datasets in reinforcement learning (RL) is a key challenge for large-scale real-world applications. Offline RL algorithms promise to learn effective policies from previously-collected, static datasets without further interaction.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found