Compositional preference models for aligning LMs