Learning Others' Intentional Models in Multi-Agent Settings Using Interactive POMDPs

Yanlin Han, Piotr Gmytrasiewicz

Neural Information Processing Systems 

The interactive POMDP (I-POMDP) framework extends POMDPs to multi-agent settings by including models of other agents in the state space, which gives rise to a hierarchical belief structure.
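To make the nesting concrete, the following is a minimal sketch of one way such a hierarchical belief could be represented. It is an illustration under stated assumptions, not the authors' implementation: the class `Belief`, the helper `physical_marginal`, and the state names `s0` and `s1` are all hypothetical, and the other agent's model is reduced here to a nested belief alone, leaving out the rest of an intentional model.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class Belief:
    """Hypothetical level-l belief: a distribution over interactive states."""
    level: int
    # Each entry is (physical_state, model_of_other_agent, probability).
    # At level 0 the model is None, so the belief is an ordinary POMDP belief.
    entries: Tuple[Tuple[str, Optional["Belief"], float], ...]

    def __post_init__(self) -> None:
        total = sum(p for _, _, p in self.entries)
        if abs(total - 1.0) > 1e-9:
            raise ValueError("probabilities must sum to 1")
        for _, model, _ in self.entries:
            if self.level == 0 and model is not None:
                raise ValueError("level-0 beliefs hold no models of others")
            if self.level > 0 and (model is None or model.level != self.level - 1):
                raise ValueError("a nested model must sit exactly one level lower")


def physical_marginal(belief: Belief) -> Dict[str, float]:
    """Sum out the other agent's model, leaving a belief over physical states."""
    marginal: Dict[str, float] = {}
    for state, _, p in belief.entries:
        marginal[state] = marginal.get(state, 0.0) + p
    return marginal


# Two candidate level-0 beliefs that agent j might hold (assumed values).
j_uniform = Belief(0, (("s0", None, 0.5), ("s1", None, 0.5)))
j_sure = Belief(0, (("s0", None, 1.0),))

# Agent i's level-1 belief ranges over pairs (physical state, model of j).
i_belief = Belief(1, (
    ("s0", j_uniform, 0.3),
    ("s0", j_sure, 0.3),
    ("s1", j_uniform, 0.4),
))
```

Calling `physical_marginal(i_belief)` collapses the level-1 belief back to a distribution over physical states only, which shows that an ordinary POMDP belief is recovered once the models of the other agent are summed out.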