Multiagent Soft Q-Learning