Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering