Calibration of Shared Equilibria in General Sum Partially Observable Markov Games

Neural Information Processing Systems 

This paper aims at i) formally understanding equilibria reached by such agents, and ii) matching emergent phenomena of such equilibria to real-world targets. Parameter sharing with decentralized execution has been introduced as an efficient way to train multiple agents using a single policy network.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found