Point Based Value Iteration with Optimal Belief Compression for Dec-POMDPs
–Neural Information Processing Systems
This paper presents four major results towards solving decentralized partially observable Markov decision problems (DecPOMDPs) culminating in an algorithm that outperforms all existing algorithms on all but one standard infinite-horizon benchmark problems. The program is notable because its linear relaxation is very often integral. These actions correspond to strategies of a CBG. We choose one such algorithm, point-based valued iteration, and modify it to produce the first tractable value iteration method for DecPOMDPs which outperforms existing algorithms.
Neural Information Processing Systems
Sep-30-2025, 12:48:29 GMT
- Technology: