Learning with Good Feature Representations in Bandits and in RL with a Generative Model
