Is Q-Learning Provably Efficient?

Open in new window