Reinforcement Learning for Resilient Power Grids