Variance Reduction Methods for Sublinear Reinforcement Learning