Ctrl-Z: Recovering from Instability in Reinforcement Learning