SPD: Synergy Pattern Diversifying Oriented Unsupervised Multi-agent Reinforcement Learning
–Neural Information Processing Systems
As for the single agent, unsupervised learning has been incorporated into RL to acquire diverse skills for the agent without extrinsic reward from the environment, and this scenario is known as unsupervised reinforcement learning (URL).
Neural Information Processing Systems
Aug-16-2025, 12:53:43 GMT