Information-guided Planning: An Online Approach for Partially Observable Problems
–Neural Information Processing Systems
This paper presents IB-POMCP, a novel algorithm for online planning under partial observability. Our approach enhances the decision-making process by using estimations of the world belief's entropy to guide a tree search process and surpass the limitations of planning in scenarios with sparse reward configurations. By performing what we denominate as an information-guided planning process, the algorithm, which incorporates a novel I-UCB function, shows significant improvements in reward and reasoning time compared to state-of-the-art baselines in several benchmark scenarios, along with theoretical convergence guarantees.
Neural Information Processing Systems
Apr-29-2026, 23:35:11 GMT
- Country:
- Europe > United Kingdom (0.28)
- Genre:
- Research Report
- Experimental Study (0.67)
- New Finding (0.46)
- Research Report
- Industry:
- Leisure & Entertainment > Games (0.46)
- Technology: