Direct Preference-Based Evolutionary Multi-Objective Optimization with Dueling Bandits

Neural Information Processing Systems 

The ultimate goal of multi-objective optimization (MO) is to assist human decision-makers (DMs) in identifying solutions of interest (SOI) that optimally reconcile multiple objectives according to their preferences.