How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning