Projection by Convolution: Optimal Sample Complexity for Reinforcement Learning in Continuous-Space MDPs