Reviews: A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
–Neural Information Processing Systems
Even after the discussion and the author response there was still some disagreement between the reviewers. The paper proposes a simple yet novel and very interesting idea. There still are a few concerns about clarity, but those can be fixed in the final version (see updated reviews). Overall this is a solid paper, that (as always) would benefit from more thorough empirical evaluation. One reviewer proposed to add an additional baseline of a domain-randomized robust policy that is trained on various tasks.
Neural Information Processing Systems
Jan-26-2025, 21:32:54 GMT
- Technology: