Can Differentiable Decision Trees Learn Interpretable Reward Functions?

Open in new window