Choquet regularization for reinforcement learning