Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning