Budgeted Reinforcement Learning in Continuous State Space

Neural Information Processing Systems 

So far, BMDPs could only be solved in the case of finite state spaces with known dynamics. This work extends the state-of-the-art to continuous spaces environments and unknown dynamics. We show that the solution to a BMDP is a fixed point of a novel Budgeted Bellman Optimality operator. This observation allows us to introduce natural extensions of Deep Reinforcement Learning algorithms to address large-scale BMDPs.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found