A Generalized Acquisition Function for Preference-based Reward Learning
