A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents

Yan Zheng, Zhaopeng Meng, Jianye Hao, Zongzhang Zhang, Tianpei Yang, Changjie Fan

Neural Information Processing Systems 

Many application scenarios involve interactions among multiple agents; such settings are commonly known as multiagent systems (MAS).