Goto

Collaborating Authors

 Reinforcement Learning



AConsciousness-InspiredPlanningAgentfor Model-Based ReinforcementLearning

Neural Information Processing Systems

Whether when planning our paths home from the office or from a hotel to an airport in an unfamiliar city, we typically focus on a small subset of relevant variables,e.g. the changeinposition orthepresence oftraffic.


Conservative Q-Learning for Offline Reinforcement Learning A viral Kumar

Neural Information Processing Systems

Effectively leveraging large, previously collected datasets in reinforcement learning (RL) is a key challenge for large-scale real-world applications. Offline RL algorithms promise to learn effective policies from previously-collected, static datasets without further interaction.