Reviews: Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning

Feb-4-2025, 21:08:03 GMT–Neural Information Processing Systems

The authors propose remapping value functions into a logarithmic space, leading to "logarithmic Q-learning" which is demonstrated to perform quite well in practice. This paper has by far the strongest overall scores (9, 9, 8) in my paper batch. All three reviewers are enthusiastic about the paper and its contributions and results. I am recommending that NeurIPS accept the paper for Oral presentation.

enable lower discount factor, logarithmic mapping, reinforcement learning, (1 more...)

Neural Information Processing Systems

Feb-4-2025, 21:08:03 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)