Budgeted Reinforcement Learning in Continuous State Space
–Neural Information Processing Systems
So far, BMDPs could only be solved in the case of finite state spaces with known dynamics. This work extends the state-of-the-art to continuous spaces environments and unknown dynamics. We show that the solution to a BMDP is a fixed point of a novel Budgeted Bellman Optimality operator. This observation allows us to introduce natural extensions of Deep Reinforcement Learning algorithms to address large-scale BMDPs.
Neural Information Processing Systems
Oct-2-2025, 17:17:53 GMT
- Country:
- Europe
- France > Hauts-de-France
- Nord > Lille (0.04)
- Pas-de-Calais (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- France > Hauts-de-France
- North America > Canada
- Europe
- Industry:
- Automobiles & Trucks (0.68)