Probabilistic Inference in Reinforcement Learning Done Right Jean T arbouriech Google DeepMind

Neural Information Processing Systems 

A popular perspective in Reinforcement learning (RL) casts the problem as probabilistic inference on a graphical model of the Markov decision process (MDP). The core object of study is the probability of each state-action pair being visited under the optimal policy.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found