Probabilistic Inference in Reinforcement Learning Done Right Jean T arbouriech Google DeepMind
–Neural Information Processing Systems
A popular perspective in Reinforcement learning (RL) casts the problem as probabilistic inference on a graphical model of the Markov decision process (MDP). The core object of study is the probability of each state-action pair being visited under the optimal policy.
Neural Information Processing Systems
Oct-8-2025, 20:37:00 GMT
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East
- Jordan (0.04)
- Europe > United Kingdom
- Genre:
- Workflow (0.45)
- Technology: