SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masks

Open in new window