Analysis of Value Iteration Through Absolute Probability Sequences

Mustafin, Arsenii, Colla, Sebastien, Olshevsky, Alex, Paschalidis, Ioannis Ch.

Feb-5-2025–arXiv.org Artificial Intelligence

Value Iteration is a widely used algorithm for solving Markov Decision Processes (MDPs). While previous studies have extensively analyzed its convergence properties, they primarily focus on convergence with respect to the infinity norm. In this work, we use absolute probability sequences to develop a new line of analysis and examine the algorithm's convergence in terms of the $L^2$ norm, offering a new perspective on its behavior and performance.

artificial intelligence, machine learning, sequence, (14 more...)

arXiv.org Artificial Intelligence

Feb-5-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Massachusetts > Suffolk County > Boston (0.05)
- Europe > Belgium
  - Wallonia > Walloon Brabant > Louvain-la-Neuve (0.04)

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.49)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found