Plotting

 Munos, Rémi


Barycentric Interpolators for Continuous Space and Time Reinforcement Learning

Neural Information Processing Systems

In order to find the optimal control of continuous state-space and time reinforcement learning (RL) problems, we approximate the value function (VF) with a particular class of functions called the barycentric interpolators. We establish sufficient conditions under which a RL algorithm converges to the optimal VF, even when we use approximate models of the state dynamics and the reinforcement functions.


Reinforcement Learning for Continuous Stochastic Control Problems

Neural Information Processing Systems

Here we sudy the continuous time, continuous state-spacestochastic case, which covers a wide variety of control problems including target, viability, optimization problems (see [FS93], [KP95])}or which a formalism is the following.


Reinforcement Learning for Continuous Stochastic Control Problems

Neural Information Processing Systems

Here we sudy the continuous time, continuous state-space stochastic case, which covers a wide variety of control problems including target, viability, optimization problems (see [FS93], [KP95])}or which a formalism is the following.