A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents
Yan Zheng, Zhaopeng Meng, Jianye Hao, Zongzhang Zhang, Tianpei Yang, Changjie Fan
–Neural Information Processing Systems
In multiagent domains, coping with non-stationary agents that change behaviors from time to time is a challenging problem, where an agent is usually required to be able to quickly detect the other agent's policy during online interaction, and then adapt its own policy accordingly.
Feb-13-2026, 13:17:08 GMT
- Country:
- Asia > China > Tianjin Province > Tianjin (0.05)
- Asia > China > Zhejiang Province > Hangzhou (0.04)
- Asia > Middle East > Jordan (0.04)
- North America > Canada
- Technology: