Hardness in Markov Decision Processes: Theory and Practice

Oct-11-2024, 06:48:46 GMT – Neural Information Processing Systems

Meticulously analysing the empirical strengths and weaknesses of reinforcement learning methods in hard (challenging) environments is essential to inspire innovations and assess progress in the field. In tabular reinforcement learning, there is no well-established standard selection of environments to conduct such analysis, which is partially due to the lack of a widespread understanding of the rich theory of hardness of environments. The goal of this paper is to unlock the practical usefulness of this theory through four main contributions. First, we present a systematic survey of the theory of hardness, which also identifies promising research directions. Second, we introduce Colosseum, a pioneering package that enables empirical hardness analysis and implements a principled benchmark composed of environments that are diverse with respect to different measures of hardness.

hardness, markov decision process, theory and practice

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (0.91)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.40)