Multi-Agent Reinforcement Learning Simulation for Environmental Policy Synthesis