Multi-Agent Imitation by Learning and Sampling from Factorized Soft Q-Function
