InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models

Neural Information Processing Systems 

Model fusion combines multiple Large Language Models (LLMs) with different strengths into a more powerful, integrated model through lightweight training methods.