Improving RL Exploration for LLM Reasoning through Retrospective Replay