Reviews: QMDP-Net: Deep Learning for Planning under Partial Observability

Oct-8-2024, 12:10:46 GMT–Neural Information Processing Systems

The paper proposes a novel policy network architecture for partially observable environments. The network includes a filtering module and a planning module. The filtering module mimics computation of the current belief of the agent given its previous belief, the last action and the last observation. The model of the environment and the observation function are replaced with trainable neural modules. The planning module runs value iteration for an MDP, whose transition function and reward function are also trainable neural modules.

deep learning, module, partial observability, (9 more...)

Neural Information Processing Systems

Oct-8-2024, 12:10:46 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.60)