Panacea: Pareto Alignment via Preference Adaptation for LLMs

Neural Information Processing Systems 

Panacea trains a single model capable of adapting online and Pareto-optimally to diverse sets of preferences without the need for further tuning.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found