Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment

Open in new window