DCRM: A Heuristic to Measure Response Pair Quality in Preference Optimization