Linearly-solvable Markov decision problems

Todorov, Emanuel

Neural Information Processing Systems 

We introduce a class of MDPs which greatly simplify Reinforcement Learning. They have discrete state spaces and continuous control spaces. The controls have the effect of rescaling the transition probabilities of an underlying Markov chain. A control cost penalizing KL divergence between controlled and uncontrolled transition probabilities makes the minimization problem convex, and allows analytical computation of the optimal controls given the optimal value function. An exponential transformation of the optimal value function makes the minimized Bellman equation linear.
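The construction summarized above can be illustrated numerically. The sketch below is a hypothetical first-exit example (a toy 5-state chain with made-up passive dynamics and state costs, not taken from the paper): the desirability function z(x) = exp(-v(x)) satisfies a linear equation, so the optimal value function is obtained from a single linear solve, and the optimal controlled transition probabilities follow analytically by reweighting the passive dynamics with z.

```python
import numpy as np

# Minimal sketch of a first-exit linearly-solvable MDP (LMDP).
# Toy problem: 5 states on a chain, state 4 is an absorbing goal.
# All dynamics and costs here are hypothetical illustrations.
n = 5
terminal = np.array([False, False, False, False, True])

# Passive (uncontrolled) dynamics: a lazy random walk on the chain.
P = np.zeros((n, n))
for i in range(n - 1):
    P[i, max(i - 1, 0)] += 1 / 3
    P[i, i] += 1 / 3
    P[i, i + 1] += 1 / 3
P[n - 1, n - 1] = 1.0                       # goal state is absorbing

q = np.array([1.0, 1.0, 1.0, 1.0, 0.0])     # state costs; zero at the goal

# Desirability z(x) = exp(-v(x)).  The minimized Bellman equation is linear:
#   z(x) = exp(-q(x)) * sum_x' p(x'|x) z(x')   for non-terminal x
#   z(x) = exp(-q(x))                           for terminal x
M = np.diag(np.exp(-q)) @ P
free = ~terminal
z = np.zeros(n)
z[terminal] = np.exp(-q[terminal])

# Solve (I - M_NN) z_N = M_NT z_T for the non-terminal desirabilities.
A = np.eye(free.sum()) - M[np.ix_(free, free)]
b = M[np.ix_(free, terminal)] @ z[terminal]
z[free] = np.linalg.solve(A, b)

v = -np.log(z)                              # optimal value function

# Optimal controls rescale the passive transition probabilities:
#   u*(x'|x) proportional to p(x'|x) * z(x')
U = P * z[None, :]
U /= U.sum(axis=1, keepdims=True)

print("optimal value function:", np.round(v, 3))
print("optimal transition probabilities:\n", np.round(U, 3))
```

Because the Bellman equation has been made linear in z, no iterative dynamic programming is needed for this formulation; the one linear solve replaces value iteration, and the optimal controlled chain U is read off directly from z.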
