Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning