Disentangling Length Bias In Preference Learning Via Response-Conditioned Modeling