Reviews: Large Scale Markov Decision Processes with Changing Rewards

Jan-24-2025, 00:53:36 GMT–Neural Information Processing Systems

The paper contributes new algorithmic ideas and theoretical results for regret minimization in Markov Decision Processes with known transition kernels but arbitrary cost functions. The reviewers broadly agree that the theoretical and algorithmic techniques introduced by the paper -- using the FTRL online learning idea and the extension to large MDPs via linear function approximation -- are novel, and thus the paper deserves to be published; however, the known-MDP-unknown-cost setting may be somewhat narrow in its applicability in practice.

markov decision process, scale markov decision process

Neural Information Processing Systems

Jan-24-2025, 00:53:36 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology
  - Decision Support Systems (0.72)
  - Artificial Intelligence > Machine Learning
    - Learning Graphical Models > Undirected Networks > Markov Models (0.72)