Feng

Feb-8-2022, 11:14:57 GMT–AAAI Conferences

We describe an approximate dynamic programming algorithm for partially observable Markov decision processes represented in factored form. Two complementary forms of approximation are used to simplify a piecewise linear and convex value function, where each linear facet of the function is represented compactly by an algebraic decision diagram.

approximation, linear facet, value function, (1 more...)

AAAI Conferences

Feb-8-2022, 11:14:57 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)