A Generalized Acquisition Function for Preference-based Reward Learning