Efficient Test-Time Scaling for Small Vision-Language Models