Evaluating Interpretable Reinforcement Learning by Distilling Policies into Programs

Open in new window