Evolutionary Dynamics of Q-Learning over the Sequence Form
Panozzo, Fabio (Politecnico di Milano) | Gatti, Nicola (Politecnico di Milano) | Restelli, Marcello (Politecnico di Milano)
Multi-agent learning is a challenging open task in artificial intelligence. It is known an interesting connection between multi-agent learning algorithms and evolutionary game theory, showing that the learning dynamics of some algorithms can be modeled as replicator dynamics with a mutation term. Inspired by the recent sequence-form replicator dynamics, we develop a new version of the Q-learning algorithm working on the sequence form of an extensive-form game allowing thus an exponential reduction of the dynamics length w.r.t. those of the normal form. The dynamics of the proposed algorithm can be modeled by using the sequence-form replicator dynamics with a mutation term. We show that, although sequence-form and normal-form replicator dynamics are realization equivalent, the Q-learning algorithm applied to the two forms have non-realization equivalent dynamics. Originally from the previous works on evolutionary game theory models form multi-agent learning, we produce an experimental evaluation to show the accuracy of the model.
Jul-14-2014
- Country:
- North America > United States
- Massachusetts > Middlesex County > Cambridge (0.04)
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Italy > Lombardy
- Milan (0.04)
- United Kingdom > England
- North America > United States
- Industry:
- Leisure & Entertainment > Games (1.00)
- Technology: