Emergence of Theory of Mind Collaboration in Multiagent Systems
Attempts to integrate ToM in opponent modeling has profound cognitive science origin de2014theory; baker2017rational. LOLA in foerster2018learning learns the best response to evolving opponents. Yet, opponents/partners' real-time believes are not considered into policy. However, I-POMDP requires extensive sampling to approximate the nested integration over the belief space, action space and observation space, limiting its scalability. The Bayesian action decoder (BAD)-MDP proposed by Foerster et al.
Oct-4-2021, 19:28:54 GMT
- Technology: