Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model