Barycentric Interpolators for Continuous Space and Time Reinforcement Learning

Apr-6-2023, 17:41:09 GMT–Neural Information Processing Systems

In order to find the optimal control of continuous state-space and time reinforcement learning (RL) problems, we approximate the value function (VF) with a particular class of functions called the barycentric interpolators. We establish sufficient conditions under which a RL algorithm converges to the optimal VF, even when we use approximate models of the state dynamics and the reinforce(cid:173) ment functions .

barycentric interpolator, continuous space, space and time reinforcement learning

Neural Information Processing Systems

Apr-6-2023, 17:41:09 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)