Reviews: Budgeted Reinforcement Learning in Continuous State Space
–Neural Information Processing Systems
The paper formulates a budgeted Markov decision process (BMDP) able to deal with large search spaces. All reviewers feel the proposed method is novel, interesting and could be an important step in trying to address some existing problems with "modern" RL approaches.
Neural Information Processing Systems
Jan-23-2025, 15:19:53 GMT
- Technology: