Goto

Collaborating Authors

 Markov Models






Probabilistic Inference in Reinforcement Learning Done Right Jean T arbouriech Google DeepMind

Neural Information Processing Systems

A popular perspective in Reinforcement learning (RL) casts the problem as probabilistic inference on a graphical model of the Markov decision process (MDP). The core object of study is the probability of each state-action pair being visited under the optimal policy.