Information-Theoretic Perspectives on Optimizers