Demystifying the Recency Heuristic in Temporal-Difference Learning

Open in new window