On Value Iteration Convergence in Connected MDPs

Mustafin, Arsenii, Olshevsky, Alex, Paschalidis, Ioannis Ch.

Jun-13-2024–arXiv.org Artificial Intelligence

This paper establishes that an MDP with a unique optimal policy and ergodic associated transition matrix ensures the convergence of various versions of the Value Iteration algorithm at a geometric rate that exceeds the discount factor γ for both discounted and average-reward criteria.
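As a rough illustration of the kind of behavior the abstract describes, the following minimal sketch runs standard synchronous Value Iteration on a small discounted MDP and records two quantities per sweep: the sup-norm and the span (max minus min) of the change in the value vector. The two-state MDP, its rewards, the names TRANSITIONS, REWARDS and GAMMA, and the choice of the span as the measured quantity are all assumptions made for this example; none of them are taken from the paper, and the sketch does not reproduce the paper's analysis.

```python
def bellman_update(values, transitions, rewards, gamma):
    """One synchronous sweep: V(s) <- max_a [r(s,a) + gamma * sum_s' P(s'|s,a) V(s')]."""
    new_values = []
    for s in range(len(values)):
        action_values = [
            rewards[s][a]
            + gamma * sum(p * v for p, v in zip(transitions[s][a], values))
            for a in range(len(rewards[s]))
        ]
        new_values.append(max(action_values))
    return new_values


def value_iteration(transitions, rewards, gamma, tol=1e-10, max_iters=10_000):
    """Run Value Iteration from V = 0, recording per-sweep change statistics."""
    values = [0.0] * len(rewards)
    sup_gaps = []  # sup-norm of V_{k+1} - V_k
    spans = []     # span (max - min) of V_{k+1} - V_k
    for _ in range(max_iters):
        new_values = bellman_update(values, transitions, rewards, gamma)
        diff = [n - o for n, o in zip(new_values, values)]
        sup_gaps.append(max(abs(d) for d in diff))
        spans.append(max(diff) - min(diff))
        values = new_values
        if sup_gaps[-1] < tol:
            break
    return values, sup_gaps, spans


# Hypothetical toy MDP (not from the paper): TRANSITIONS[s][a] is the
# next-state distribution, REWARDS[s][a] the immediate reward. All
# transition probabilities are strictly positive.
TRANSITIONS = [
    [[0.9, 0.1], [0.2, 0.8]],
    [[0.5, 0.5], [0.1, 0.9]],
]
REWARDS = [[1.0, 0.0], [0.0, 2.0]]
GAMMA = 0.9

values, sup_gaps, spans = value_iteration(TRANSITIONS, REWARDS, GAMMA)

# Per-sweep contraction ratios. Span ratios are only computed while the
# span is well above floating-point noise.
sup_ratios = [b / a for a, b in zip(sup_gaps, sup_gaps[1:])]
span_ratios = [b / a for a, b in zip(spans, spans[1:]) if a > 1e-6]

print("sweeps:", len(sup_gaps))
print("first sup-norm ratios:", [round(r, 4) for r in sup_ratios[:5]])
print("first span ratios:", [round(r, 4) for r in span_ratios[:5]])
```

In this toy example the sup-norm of successive differences shrinks by a factor no larger than γ per sweep, as the standard contraction argument guarantees, while the span shrinks by a noticeably smaller factor. That gap is the sort of faster-than-γ behavior the abstract refers to, shown here only for one hand-picked instance.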
