MDP Geometry, Normalization and Value Free Solvers
Mustafin, Arsenii, Pakharev, Aleksei, Olshevsky, Alex, Paschalidis, Ioannis Ch.
–arXiv.org Artificial Intelligence
Markov Decision Process (MDP) is a common mathematical model for sequential decision-making problems. In this paper, we present a new geometric interpretation of MDP, which is useful for analyzing the dynamics of main MDP algorithms. Based on this interpretation, we demonstrate that MDPs can be split into equivalence classes with indistinguishable algorithm dynamics. The related normalization procedure allows for the design of a new class of MDP-solving algorithms that find optimal policies without computing policy values.
arXiv.org Artificial Intelligence
Jul-9-2024
- Country:
- North America > United States
- New York > New York County
- New York City (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- New York > New York County
- Europe > France
- Nouvelle-Aquitaine > Gironde > Bordeaux (0.04)
- North America > United States
- Genre:
- Research Report (0.50)