Verification-Guided Falsification for Safe RL via Explainable Abstraction and Risk-Aware Exploration