Reviews: Near-Optimal Reinforcement Learning in Dynamic Treatment Regimes

Jan-25-2025, 04:37:35 GMT–Neural Information Processing Systems

Update after rebuttal: Due to author comments and, in particular, discussions with the other reviewers, I have updated my score from 4 to a weak accept 6. For the future draft, aside from the revisions and clarifications the authors have promised in the rebuttal, I recommend the following (slight) modifications to improve the manuscript: The motivation in the introduction would be strengthened by drawing clearer connections to the real world. The authors should consider picking a specific real world example and illustrating the method through that example (even if it's not possible to provide simulation results on such an example). In line with this, the authors should be careful about discussion of safe-RL. Typically such methods involve use of constraints to ensure safety, but it does not appear the authors explicitly use or discuss such methods here.

author comment, dynamic treatment regime, near-optimal reinforcement learning, (9 more...)

Neural Information Processing Systems

Jan-25-2025, 04:37:35 GMT

Conferences Web Page

Add feedback

Genre:
- Summary/Review (0.35)

Industry:
- Health & Medicine (0.55)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)