Reviews: Temporal Regularization for Markov Decision Process

Oct-7-2024, 09:41:33 GMT–Neural Information Processing Systems

This paper is very interesting. One previous assumption in TD learning is that reward are close with states in proximity of the state space, which has been pointed out by many papers is not realistic and have problems for spatial value function regularization. Instead, this paper make the assumption that rewards are close for states. Overall this paper has a very good motivation, and the literature review shows that the author is knowledgable of this field. This paper could open a novel area of temporal regularization that received inadequate attention before.

function approximation, markov decision process, temporal regularization, (8 more...)

Neural Information Processing Systems

Oct-7-2024, 09:41:33 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (0.56)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.52)