Reward is enough for convex MDPs

Open in new window