Social diversity and social preferences in mixed-motive reinforcement learning
McKee, Kevin R., Gemp, Ian, McWilliams, Brian, Duéñez-Guzmán, Edgar A., Hughes, Edward, Leibo, Joel Z.
–arXiv.org Artificial Intelligence
Recent research on reinforcement learning in pure-conflict and pure-common interest games has emphasized the importance of population heterogeneity. In contrast, studies of reinforcement learning in mixed-motive games have primarily leveraged homogeneous approaches. Given the defining characteristic of mixed-motive games--the imperfect correlation of incentives between group members--we study the effect of population heterogeneity on mixed-motive reinforcement learning. We draw on interdependence theory from social psychology and imbue reinforcement learning agents with Social Value Orientation (SVO), a flexible formalization of preferences over group outcome distributions. We subsequently explore the effects of diversity in SVO on populations of reinforcement learning agents in two mixed-motive Markov games. We demonstrate that heterogeneity in SVO generates meaningful and complex behavioral variation among agents similar to that suggested by interdependence theory. Empirical results in these mixed-motive dilemmas suggest agents trained in heterogeneous populations develop particularly generalized, high-performing policies relative to those trained in homogeneous populations.
arXiv.org Artificial Intelligence
Feb-12-2020
- Country:
- Europe
- Norway > Norwegian Sea (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Oceania > New Zealand
- North Island > Auckland Region > Auckland (0.05)
- Europe
- Genre:
- Research Report > New Finding (0.89)
- Industry:
- Leisure & Entertainment > Games (1.00)
- Technology: