Hard Examples Are All You Need: Maximizing GRPO Post-Training Under Annotation Budgets