Iterative Reachability Estimation for Safe Reinforcement Learning