Path Planning through Multi-Agent Reinforcement Learning in Dynamic Environments