CurveRL: Principled Distribution-Aware Context Reweighting for LLM Reasoning