Goto

Collaborating Authors

 Education





Y our contrastive learning problem is secretly a distribution alignment problem

Neural Information Processing Systems

Intuitively, by using more information from the distribution of latents, our approach allows a more distribution-aware manipulation of the relationships within augmented sample sets.



Aligner: Efficient Alignment by Learning to Correct

Neural Information Processing Systems

Aligner can be applied to any powerful, large-scale upstream models. Moreover, it can even iteratively bootstrap the upstream models using corrected responses as synthetic human preference data, breaking through the model's performance ceiling.