Information-Theoretic Generalization Bounds for Sequential Decision Making