Exploring and Learning in Sparse Linear MDPs without Computationally Intractable Oracles

Open in new window