Aligner: Efficient Alignment by Learning to Correct

Neural Information Processing Systems 

Aligner can be applied to any powerful, large-scale upstream models. Moreover, it can even iteratively bootstrap the upstream models using corrected responses as synthetic human preference data, breaking through the model's performance ceiling.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found