Direct Preference-Based Evolutionary Multi-Objective Optimization with Dueling Bandits

Neural Information Processing Systems 

SOI by directly leveraging human feedback without being restricted by a predefined reward model nor cumbersome model selection.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found