History Rhymes: Accelerating LLM Reinforcement Learning with RhymeRL