QXplore: Q-learning Exploration by Maximizing Temporal Difference Error

Open in new window