Rethinking Sharpness-Aware Minimization as Variational Inference