Sampling Networks and Aggregate Simulation for Online POMDP Planning

Hao (Jackson) Cui, Roni Khardon

Neural Information Processing Systems 

The paper introduces a new algorithm for planning in partially observable Markov decision processes (POMDPs) based on the idea of aggregate simulation. The algorithm uses product distributions to approximate the belief state and shows how to build a representation graph of an approximate action-value function over belief space.
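
As a rough illustration of what approximating a belief state with a product distribution can mean, the following minimal sketch projects a joint belief over binary state variables onto the product of its marginals. It is not the paper's implementation: the function names `marginals` and `product_belief` and the two-variable example belief are hypothetical, chosen only for this illustration.

```python
import itertools


def marginals(belief, n_vars):
    """Return P(x_i = 1) for each binary variable of a joint belief.

    `belief` maps each state (a tuple of 0/1 values) to its probability.
    """
    return [
        sum(p for state, p in belief.items() if state[i] == 1)
        for i in range(n_vars)
    ]


def product_belief(margs):
    """Rebuild a joint belief as a product of independent Bernoulli marginals."""
    joint = {}
    for state in itertools.product((0, 1), repeat=len(margs)):
        p = 1.0
        for value, m in zip(state, margs):
            p *= m if value == 1 else 1.0 - m
        joint[state] = p
    return joint


# Hypothetical belief over two correlated binary state variables.
belief = {(0, 0): 0.4, (0, 1): 0.1, (1, 0): 0.1, (1, 1): 0.4}
margs = marginals(belief, 2)
approx = product_belief(margs)
```

In this example the product form stores one number per variable instead of one per joint state, at the cost of dropping the correlation between the variables: the original belief assigns 0.4 to state (1, 1), while the product approximation assigns it 0.25.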
