Upside-Down Reinforcement Learning for More Interpretable Optimal Control