InfoPO: On Mutual Information Maximization for Large Language Model Alignment

Open in new window