Beyond the Binary: Capturing Diverse Preferences With Reward Regularization
