A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents

Mar-16-2026, 23:01:05 GMT–Neural Information Processing Systems

In multiagent domains, coping with non-stationary agents that change behaviors from time to time is a challenging problem, where an agent is usually required to be able to quickly detect the other agent's policy during online interaction, and then adapt its own policy accordingly.

artificial intelligence, machine learning, proceedings, (7 more...)

Neural Information Processing Systems

Mar-16-2026, 23:01:05 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.37)