Gradient Ascent Post-training Enhances Language Model Generalization
