Sample-Optimal Parametric Q-Learning with Linear Transition Models
