Quagmires in SFT-RL Post-Training: When High SFT Scores Mislead and What to Use Instead