Counterexample-Guided Repair of Reinforcement Learning Systems Using Safety Critics