Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration

Open in new window