SEE-DPO: Self Entropy Enhanced Direct Preference Optimization