Sampling Networks and Aggregate Simulation for Online POMDP Planning
Hao (Jackson) Cui, Roni Khardon
Neural Information Processing Systems
The paper introduces a new algorithm for planning in partially observable Markov decision processes (POMDPs) based on the idea of aggregate simulation. The algorithm approximates the belief state with product distributions and builds a graph representing an approximate action-value function over belief space.
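To make the product-distribution idea concrete, the following minimal sketch approximates a joint belief over binary state variables by the product of its marginals. It is an illustration only, not the paper's algorithm: the function names `marginals` and `product_prob` and the two-variable belief `joint` are assumptions introduced here.

```python
from itertools import product


def marginals(joint, n):
    """Return p[i] = P(x_i = 1) under a joint belief given as {state: prob}."""
    p = [0.0] * n
    for state, prob in joint.items():
        for i, bit in enumerate(state):
            if bit:
                p[i] += prob
    return p


def product_prob(p, state):
    """Probability of `state` when each variable is an independent Bernoulli."""
    result = 1.0
    for p_i, bit in zip(p, state):
        result *= p_i if bit else 1.0 - p_i
    return result


# Hypothetical belief over two correlated binary state variables.
joint = {(0, 0): 0.4, (0, 1): 0.1, (1, 0): 0.1, (1, 1): 0.4}

# Keep only one number per variable instead of one per joint state.
p = marginals(joint, 2)

# The product distribution implied by those marginals.
approx = {s: product_prob(p, s) for s in product((0, 1), repeat=2)}
```

The factored form stores n numbers rather than 2^n, at the cost of discarding correlations between variables: in this example the joint belief favors the states where both variables agree, while the product approximation treats the variables as independent.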
Mar-26-2025, 04:21:07 GMT