Successor Uncertainties: exploration and uncertainty in temporal difference learning

Open in new window