Generalizing while preserving monotonicity in comparison-based preference learning models
Neural Information Processing Systems
If you tell a learning model that you prefer an alternative a over another alternative b, then you probably expect the model to be monotone, that is, the valuation of a increases and that of b decreases. Yet, perhaps surprisingly, many widely deployed comparison-based preference learning models, including large language models, lack this guarantee. Until now, the only comparison-based preference learning algorithms proved to be monotone were the Generalized Bradley-Terry models [10]. However, these models cannot generalize to uncompared data. In this paper, we advance the understanding of the set of models that both generalize and are monotone. Namely, we propose a new class of Linear Generalized Bradley-Terry models with Diffusion Priors, and identify sufficient conditions on the alternatives' embeddings that guarantee monotonicity. Our experiments show that monotonicity is far from being a general guarantee, and that our new class of generalizing models improves accuracy, especially when the dataset is limited.
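To make the notion of monotonicity concrete, the following is a minimal sketch, not the paper's method. It fits a classic Bradley-Terry model with a zero-mean Gaussian prior (assumed here to be a simple member of the Generalized Bradley-Terry family) by plain gradient descent, then checks that adding one comparison "a preferred over b" raises the score of a and lowers the score of b. The function name `fit_bt`, the toy comparisons, and the hyperparameters are all illustrative assumptions; the Linear models with Diffusion Priors proposed in the paper are not implemented here.

```python
import math

def fit_bt(n_items, comparisons, sigma2=1.0, lr=0.1, steps=5000):
    """MAP scores for Bradley-Terry with a zero-mean Gaussian prior.

    comparisons is a list of (winner, loser) index pairs. The objective is
    sum log(1 + exp(-(theta_w - theta_l))) + ||theta||^2 / (2 * sigma2),
    minimized with fixed-step gradient descent (hypothetical settings).
    """
    theta = [0.0] * n_items
    for _ in range(steps):
        # Gradient of the Gaussian prior term.
        grad = [t / sigma2 for t in theta]
        for winner, loser in comparisons:
            # Model probability that the observed winner would have lost.
            p_lose = 1.0 / (1.0 + math.exp(theta[winner] - theta[loser]))
            grad[winner] -= p_lose
            grad[loser] += p_lose
        theta = [t - lr * g for t, g in zip(theta, grad)]
    return theta

# Toy dataset over 4 alternatives (illustrative only).
comparisons = [(0, 1), (1, 2), (2, 0), (3, 1)]
a, b = 1, 0  # new preference: a over b

before = fit_bt(4, comparisons)
after = fit_bt(4, comparisons + [(a, b)])

print("score of a:", before[a], "->", after[a])
print("score of b:", before[b], "->", after[b])
```

In this sketch each alternative has its own free score, so nothing is learned about alternatives that never appear in a comparison; that is the generalization gap the abstract describes.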
Jun-18-2026, 18:44:34 GMT
- Country:
- Europe (1.00)
- North America > United States (0.67)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.87)