Review for NeurIPS paper: Sample-Efficient Reinforcement Learning of Undercomplete POMDPs

Feb-6-2025, 20:14:55 GMT–Neural Information Processing Systems

Weaknesses: A few comments that are needed to be addressed: 1) The first comment is about the presentation of the derivations. There are steps in the appendix, and also in the main text that are skipped. Some of them took me a while to rederive, some I couldn't spend more time to rederive. Some steps are also taken as granted in the main text. It is useful to elaborate on them more.

derivation, sample-efficient reinforcement learning, undercomplete pomdp, (8 more...)

Neural Information Processing Systems

Feb-6-2025, 20:14:55 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (0.40)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.54)