Investigating Regularization of Self-Play Language Models

Open in new window