A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents

YAN ZHENG, Zhaopeng Meng, Jianye Hao, Zongzhang Zhang, Tianpei Yang, Changjie Fan

Neural Information Processing Systems 

Inmultiagent domains, coping withnon-stationary agents thatchange behaviors from time to time is a challenging problem, where an agent is usually required to be able to quickly detect the other agent's policy during online interaction, and then adapt its own policy accordingly.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found