Review for NeurIPS paper: Adaptive Discretization for Model-Based Reinforcement Learning
–Neural Information Processing Systems
Additional Feedback: Medium points: In Table 1, the "Lower Bounds" method has no "Time complexity" or "Space complexity" entries. Also, why is it separated from the other prior work? Does it assume something that the other baselines in Table 1 do not? If so, is this a fair comparison? Everything except red (epsilonQL) seems to perform the same.
Jan-22-2025, 20:26:00 GMT