An Efficient Solution to s-Rectangular Robust Markov Decision Processes

Kumar, Navdeep, Levy, Kfir, Wang, Kaixin, Mannor, Shie

Jan-31-2023–arXiv.org Artificial Intelligence

In Markov Decision Processes (MDPs), an agent interacts with the environment and learns to optimally behave in it [28]. However, the MDP solution may be very sensitive to little changes in the model parameters [23]. Hence we should be cautious applying the solution of the MDP, when the model is changing or when there is uncertainty in the model parameters. Robust MDPs provide a way to address this issue, where an agent can learn to optimally behave even when the model parameters are uncertain [15, 29, 18]. Another motivation to study robust MDPs is that they can lead to better generalization [33, 34, 25] compared to non-robust solutions.

artificial intelligence, iteration, machine learning, (19 more...)

arXiv.org Artificial Intelligence

Jan-31-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.47)

Genre:
- Research Report (0.63)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.61)
    - Statistical Learning (0.67)
  - Representation & Reasoning > Optimization (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found