Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning