Analysis of Value Iteration Through Absolute Probability Sequences
Mustafin, Arsenii, Colla, Sebastien, Olshevsky, Alex, Paschalidis, Ioannis Ch.
–arXiv.org Artificial Intelligence
Value Iteration is a widely used algorithm for solving Markov Decision Processes (MDPs). While previous studies have extensively analyzed its convergence properties, they primarily focus on convergence with respect to the infinity norm. In this work, we use absolute probability sequences to develop a new line of analysis and examine the algorithm's convergence in terms of the $L^2$ norm, offering a new perspective on its behavior and performance.
arXiv.org Artificial Intelligence
Feb-5-2025
- Country:
- North America > United States
- Massachusetts > Suffolk County > Boston (0.05)
- Europe > Belgium
- Wallonia > Walloon Brabant > Louvain-la-Neuve (0.04)
- North America > United States
- Genre:
- Research Report (0.40)
- Technology: