C-3DPO: Constrained Controlled Classification for Direct Preference Optimization