Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model

Open in new window