Measuring Policy Distance for Multi-Agent Reinforcement Learning