On the Sample Complexity of Discounted Reinforcement Learning with Optimized Certainty Equivalents

Open in new window