Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

Dec-23-2025, 17:36:59 GMT–Neural Information Processing Systems

Measuring and promoting policy diversity is critical for solving games with strong non-transitive dynamics where strategic cycles exist, and there is no consistent winner (e.g., Rock-Paper-Scissors). With that in mind, maintaining a pool of diverse policies via open-ended learning is an attractive solution, which can generate auto-curricula to avoid being exploited. However, in conventional open-ended learning algorithms, there are no widely accepted definitions for diversity, making it hard to construct and evaluate the diverse policies. In this work, we summarize previous concepts of diversity and work towards offering a unified measure of diversity in multi-agent open-ended learning to include all elements in Markov games, based on both Behavioral Diversity (BD) and Response Diversity (RD).

name change, open-ended learning, unifying behavioral and response diversity, (7 more...)

Neural Information Processing Systems

Dec-23-2025, 17:36:59 GMT

Conferences Web Page

Add feedback

Industry:
- Leisure & Entertainment > Games (0.38)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.38)