Explanation of a Reinforcement Learning Model in a Dynamic Multi-Agent System