Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning

David Janz, Jiri Hron, Przemysław Mazur, Katja Hofmann, José Miguel Hernández-Lobato, Sebastian Tschiatschek

Oct-2-2025, 06:36:33 GMT–Neural Information Processing Systems

Randomised value functions (RVF) can be viewed as a promising approach to scaling PSRL.

artificial intelligence, machine learning, reinforcement learning, (12 more...)

Neural Information Processing Systems

Oct-2-2025, 06:36:33 GMT

Conferences PDF

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.46)

Duplicate Docs Excel Report

Title
Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning

Similar Docs Excel Report more

Title	Similarity	Source
None found