CRED: Counterfactual Reasoning and Environment Design for Active Preference Learning

Open in new window