Review for NeurIPS paper: Non-Crossing Quantile Regression for Distributional Reinforcement Learning
–Neural Information Processing Systems
The strong rebuttal with additional results on NC-IQN swayed multiple initially hesitant reviewers to argue for acceptance, and I concur. The one unresolved concern is about reproducing the baseline results more accurately: I assume this is a matter of codebase/implementation details that does not detract from fair head-to-head comparisons.
Neural Information Processing Systems
Jan-27-2025, 20:55:43 GMT
- Technology: