Improving Training Result of Partially Observable Markov Decision Process by Filtering Beliefs

Jan-4-2021–arXiv.org Artificial Intelligence

In this study, I propose a belief-filtering method for improving the performance of Partially Observable Markov Decision Processes (POMDPs), a framework widely used in autonomous robotics and many other domains concerned with control policies. The method searches for and compares every pair of similar beliefs. Because a belief that closely resembles another has an insignificant influence on the control policy, it is filtered out to reduce training time. The empirical results show that the proposed method outperforms point-based approximate POMDPs in terms of both the quality of the training results and the efficiency of the method.
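The filtering idea described in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: the abstract does not say how similarity is measured, so the L1 distance, the `epsilon` threshold, and the function names `l1_distance` and `filter_beliefs` are all assumptions made for illustration.

```python
from typing import List, Sequence


def l1_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """L1 distance between two belief vectors (assumed similarity measure)."""
    return sum(abs(x - y) for x, y in zip(a, b))


def filter_beliefs(
    beliefs: Sequence[Sequence[float]], epsilon: float = 0.05
) -> List[List[float]]:
    """Keep a belief only if it is farther than `epsilon` from every
    belief already kept. `epsilon` is a hypothetical threshold, not a
    value taken from the paper."""
    kept: List[List[float]] = []
    for belief in beliefs:
        # Compare the candidate against each retained belief (pairwise check).
        if all(l1_distance(belief, k) > epsilon for k in kept):
            kept.append(list(belief))
    return kept


if __name__ == "__main__":
    sampled = [[0.5, 0.5], [0.51, 0.49], [0.9, 0.1]]
    # The second belief is within epsilon of the first and is dropped.
    print(filter_beliefs(sampled, epsilon=0.05))
```

A point-based solver would then back up values only at the retained beliefs, which is where the reduction in training time would come from under these assumptions.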

control policy, sample belief, vector

Genre:
- Research Report > New Finding (0.57)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
