Tail-Aware Information-Theoretic Generalization for RLHF and SGLD

Open in new window