Barycentric Interpolators for Continuous Space and Time Reinforcement Learning
Neural Information Processing Systems
To find the optimal control for reinforcement learning (RL) problems with continuous state space and continuous time, we approximate the value function (VF) with a particular class of functions called barycentric interpolators. We establish sufficient conditions under which an RL algorithm converges to the optimal VF, even when we use approximate models of the state dynamics and the reinforcement functions.
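As a rough illustration of what a barycentric interpolator computes, the sketch below evaluates a value estimate inside a single 2D triangle as a convex combination of the values stored at its vertices. This is only a minimal example under stated assumptions: the abstract does not specify the discretization, so the triangle, the vertex values, and the function names `barycentric_weights` and `interpolate_value` are hypothetical.

```python
def barycentric_weights(point, triangle):
    """Return weights (l1, l2, l3) with l1 + l2 + l3 = 1 such that
    point = l1*v1 + l2*v2 + l3*v3 for the triangle vertices v1, v2, v3."""
    (x, y) = point
    (x1, y1), (x2, y2), (x3, y3) = triangle
    det = (y2 - y3) * (x1 - x3) + (x3 - x2) * (y1 - y3)
    if det == 0:
        raise ValueError("degenerate triangle: vertices are collinear")
    l1 = ((y2 - y3) * (x - x3) + (x3 - x2) * (y - y3)) / det
    l2 = ((y3 - y1) * (x - x3) + (x1 - x3) * (y - y3)) / det
    l3 = 1.0 - l1 - l2
    return l1, l2, l3


def interpolate_value(point, triangle, vertex_values):
    """Interpolate the value at `point` as the weighted average of the
    values stored at the triangle's vertices."""
    weights = barycentric_weights(point, triangle)
    return sum(w * v for w, v in zip(weights, vertex_values))


# Hypothetical data: a unit right triangle with made-up vertex values.
triangle = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
vertex_values = [0.0, 1.0, 2.0]
value = interpolate_value((0.25, 0.25), triangle, vertex_values)
```

For a point inside the triangle all three weights are non-negative, so the interpolated value always lies between the smallest and largest vertex values.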
Dec-31-1999
- Country:
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
- Technology: