Language Model Alignment with Elastic Reset

Neural Information Processing Systems 

We propose Elastic Reset, a new algorithm that achieves higher reward with less drift without explicitly modifying the training objective.
