Empirical Study on Optimizer Selection for Out-of-Distribution Generalization