Exponential Family PCA for Belief Compression in POMDPs

Apr-6-2023, 16:27:42 GMT–Neural Information Processing Systems

Standard value function approaches to finding policies for Partially Observable Markov Decision Processes (POMDPs) are intractable for large models. The intractability of these algorithms is due, to a great extent, to their generating an optimal policy over the entire belief space. However, in real POMDP problems most belief states are unlikely, and there is a structured, low-dimensional manifold of plausible beliefs embedded in the high-dimensional belief space. We introduce a new method for solving large-scale POMDPs by taking advantage of belief space sparsity. We reduce the dimensionality of the belief space by exponential family Principal Components Analysis [1], which allows us to turn the sparse, high-dimensional belief space into a compact, low-dimensional representation in terms of learned features of the belief state.
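To make the compression step concrete, the following is a minimal sketch of exponential family PCA applied to a matrix of sampled beliefs. The abstract does not specify the loss or the optimizer, so several choices here are assumptions made for illustration: an exponential link with a Poisson-style loss, plain gradient descent with a backtracking step size, synthetic Gaussian-shaped beliefs over 20 states, and all function and variable names (`epca_fit`, `reconstruct`, `poisson_loss`, `B`, `U`, `V`).

```python
import numpy as np


def poisson_loss(B, U, V):
    """Assumed loss: sum(exp(UV) - B * UV), minimized when exp(UV) matches B."""
    Z = U @ V
    with np.errstate(over="ignore", invalid="ignore"):
        return float(np.sum(np.exp(Z) - B * Z))


def epca_fit(B, n_components, n_iters=200, seed=0):
    """Fit B ~ exp(U V) by gradient descent with backtracking (illustrative only).

    B has one belief per column (n_states x n_samples).
    Returns the basis U, the low-dimensional coordinates V, and the loss history.
    """
    rng = np.random.default_rng(seed)
    n_states, n_samples = B.shape
    U = 0.01 * rng.standard_normal((n_states, n_components))
    V = 0.01 * rng.standard_normal((n_components, n_samples))
    loss = poisson_loss(B, U, V)
    history = [loss]
    step = 1.0
    for _ in range(n_iters):
        R = np.exp(U @ V) - B          # residual in the expectation space
        grad_U, grad_V = R @ V.T, U.T @ R
        # Halve the step until the loss strictly decreases.
        while step > 1e-12:
            U_new, V_new = U - step * grad_U, V - step * grad_V
            new_loss = poisson_loss(B, U_new, V_new)
            if new_loss < loss:
                break
            step *= 0.5
        else:
            break                      # no improving step found; stop early
        U, V, loss = U_new, V_new, new_loss
        history.append(loss)
        step *= 2.0                    # try a larger step next iteration
    return U, V, history


def reconstruct(U, V):
    """Map low-dimensional coordinates back to normalized beliefs."""
    P = np.exp(U @ V)
    return P / P.sum(axis=0, keepdims=True)


# Synthetic beliefs (an assumption): narrow bumps at varying positions.
states = np.arange(20)[:, None]
centers = np.linspace(2, 17, 30)[None, :]
B = np.exp(-0.5 * ((states - centers) / 1.5) ** 2)
B /= B.sum(axis=0, keepdims=True)

U, V, history = epca_fit(B, n_components=3)
B_hat = reconstruct(U, V)
print(V.shape, history[0], history[-1])
```

Here each 20-dimensional belief is summarized by a 3-dimensional column of `V`, and `reconstruct` maps those coordinates back to a valid probability distribution. Because the reconstruction passes through an exponential, every recovered belief is strictly positive, which is one reason an exponential family loss suits belief vectors better than ordinary squared-error PCA.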

belief space, exponential family pca, pomdp

