Generalization Bounds: Perspectives from Information Theory and PAC-Bayes