The Successor Representation, $\gamma$-Models, br / and Infinite-Horizon Prediction

Dec-22-2021, 19:35:43 GMT–#artificialintelligence

Standard single-step models have a horizon of one. This post describes a method for training predictive dynamics models in continuous state spaces with an infinite, probabilistic horizon. Reinforcement learning algorithms are frequently categorized by whether they predict future states at any point in their decision-making process. Those that do are called model-based, and those that do not are dubbed model-free. This classification is so common that we mostly take it for granted these days; I am guilty of using it myself.

gamma, mathbf, value function, (14 more...)

#artificialintelligence

Dec-22-2021, 19:35:43 GMT

News Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)