Beyond the Binary: Capturing Diverse Preferences With Reward Regularization