Robust Question Answering against Distribution Shifts with Test-Time Adaptation: An Empirical Study