Multi-Agent Reinforcement Learning for Power Grid Topology Optimization