Budgeted Reinforcement Learning in Continuous State Space

Nicolas Carrara, SequeL team, INRIA Lille - Nord Europe

May-31-2025, 17:04:14 GMT–Neural Information Processing Systems

A Budgeted Markov Decision Process (BMDP) is an extension of a Markov Decision Process to critical applications requiring safety constraints. It relies on a notion of risk implemented as a cost signal constrained to lie below an adjustable threshold. So far, BMDPs could only be solved in the case of finite state spaces with known dynamics. This work extends the state of the art to environments with continuous state spaces and unknown dynamics. We show that the solution to a BMDP is a fixed point of a novel Budgeted Bellman Optimality operator. This observation allows us to introduce natural extensions of Deep Reinforcement Learning algorithms to address large-scale BMDPs.
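To make the budget constraint concrete, here is a minimal sketch of how one might check from sampled episodes whether a policy's cost stays below the threshold. It is an illustration only, not the paper's algorithm: the names `discounted_sum`, `within_budget`, `gamma` and `budget` are hypothetical, and the sketch assumes the constraint is on the expected discounted sum of costs, estimated here by a plain average over episodes.

```python
from typing import Sequence


def discounted_sum(signal: Sequence[float], gamma: float) -> float:
    """Discounted sum of a per-step signal (rewards or costs) for one episode."""
    total = 0.0
    # Accumulate backwards so each step applies exactly one discount factor.
    for value in reversed(signal):
        total = value + gamma * total
    return total


def within_budget(
    cost_episodes: Sequence[Sequence[float]], budget: float, gamma: float
) -> bool:
    """Check whether the average discounted cost over episodes is <= budget.

    Assumption: the budget bounds the expected discounted cost, which is
    approximated here by a Monte Carlo average over the given episodes.
    """
    estimates = [discounted_sum(costs, gamma) for costs in cost_episodes]
    mean_cost = sum(estimates) / len(estimates)
    return mean_cost <= budget


if __name__ == "__main__":
    # Two hypothetical episodes of per-step costs.
    episodes = [[1.0, 0.0], [0.0, 1.0]]
    print(within_budget(episodes, budget=0.8, gamma=0.5))
    print(within_budget(episodes, budget=0.7, gamma=0.5))
```

Because the threshold is adjustable, the same sampled episodes can pass under a loose budget and fail under a tighter one, which is the trade-off a budgeted policy is meant to expose.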

artificial intelligence, machine learning, reinforcement learning

Country:
- Europe > France
  - Hauts-de-France (0.14)
- North America > Canada (0.28)

Industry:
- Automobiles & Trucks (0.68)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.87)
  - Reinforcement Learning (1.00)
