On the convergence of optimistic policy iteration for stochastic shortest path problem

Aug-29-2018–arXiv.org Machine Learning

In this paper, we prove convergence results for a special case of the optimistic policy iteration algorithm for the stochastic shortest path problem mentioned in [5]. We consider both Monte Carlo and TD(λ) methods for the policy evaluation step, under the condition that the termination state will eventually be reached almost surely.
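To make the setting concrete, the following is a minimal sketch of optimistic policy iteration with Monte Carlo policy evaluation on a toy stochastic shortest path problem. Everything in it is an assumption made for illustration and is not taken from the paper: the three-state chain, the two actions and their costs, the 1/t step size, and the choice to simulate one trajectory from every state at each iteration. It may differ from the exact special case the paper analyzes. Every policy in the toy problem reaches the termination state almost surely, which matches the condition stated in the abstract.

```python
import random

# Hypothetical toy problem: states 0, 1, 2 and an absorbing terminal state 3.
TERMINAL = 3
STATES = [0, 1, 2]
ACTIONS = [0, 1]  # 0 = "safe", 1 = "risky" (names are illustrative only)


def transitions(state, action):
    """Return a list of (probability, next_state, cost) triples."""
    if action == 0:
        # Safe: always advance one state, at cost 2.
        return [(1.0, state + 1, 2.0)]
    # Risky: cost 1 per attempt; advance with prob. 0.8, else stay put.
    return [(0.8, state + 1, 1.0), (0.2, state, 1.0)]


def greedy_action(J, state):
    """Action minimizing the expected one-step cost plus cost-to-go under J."""
    def q(action):
        return sum(p * (c + J[s2]) for p, s2, c in transitions(state, action))
    return min(ACTIONS, key=q)


def sample_step(rng, state, action):
    """Sample one transition; returns (next_state, cost)."""
    r, acc = rng.random(), 0.0
    for p, s2, c in transitions(state, action):
        acc += p
        if r < acc:
            return s2, c
    return s2, c  # guard against floating-point round-off


def rollout_cost(rng, policy, start):
    """Total cost of one simulated trajectory from `start` to termination."""
    state, total = start, 0.0
    while state != TERMINAL:
        state, cost = sample_step(rng, state, policy[state])
        total += cost
    return total


def optimistic_policy_iteration(num_iters=20000, seed=0):
    rng = random.Random(seed)
    J = {s: 0.0 for s in STATES}
    J[TERMINAL] = 0.0  # cost-to-go at termination is zero
    for t in range(1, num_iters + 1):
        # Policy improvement: greedy with respect to the current estimate.
        policy = {s: greedy_action(J, s) for s in STATES}
        # "Optimistic" evaluation: a single Monte Carlo sample per state,
        # blended into J with step size 1/t instead of evaluating to convergence.
        step = 1.0 / t
        returns = {s: rollout_cost(rng, policy, s) for s in STATES}
        for s in STATES:
            J[s] = (1.0 - step) * J[s] + step * returns[s]
    return J


if __name__ == "__main__":
    for s, v in sorted(optimistic_policy_iteration().items()):
        print(s, round(v, 3))
```

In this toy problem the risky action has expected cost 1/0.8 = 1.25 per state advanced, against 2 for the safe action, so the optimal cost-to-go from state i is 1.25 · (3 − i). The estimates should settle near 3.75, 2.5, and 1.25. A TD(λ) variant would replace the full-trajectory return with a λ-weighted sum of temporal differences; it is not sketched here.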

artificial intelligence, machine learning, stochastic shortest path problem


