Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions
