EpiCare: A Reinforcement Learning Benchmark for Dynamic Treatment Regimes
–Neural Information Processing Systems
Healthcare applications pose significant challenges to existing reinforcement learning (RL) methods due to implementation risks, limited data availability, short treatment episodes, sparse rewards, partial observations, and heterogeneous treatment effects. Despite significant interest in using RL to generate dynamic treatment regimes for longitudinal patient care scenarios, no standardized benchmark has yet been developed. To fill this need we introduce Episodes of Care (EpiCare), a benchmark designed to mimic the challenges associated with applying RL to longitudinal healthcare settings. We leverage this benchmark to test five stateof-the-art offline RL models as well as five common off-policy evaluation (OPE) techniques. Our results suggest that while offline RL may be capable of improving upon existing standards of care given sufficient data, its applicability does not appear to extend to the moderate to low data regimes typical of current healthcare settings. Additionally, we demonstrate that several OPE techniques standard in the the medical RL literature fail to perform adequately on our benchmark. These results suggest that the performance of RL models in dynamic treatment regimes may be difficult to meaningfully evaluate using current OPE methods, indicating that RL for this application domain may still be in its early stages. We hope that these results along with the benchmark will facilitate better comparison of existing methods and inspire further research into techniques that increase the practical applicability of medical RL.
Neural Information Processing Systems
Mar-27-2025, 13:43:44 GMT
- Country:
- North America > United States > California > Santa Cruz County > Santa Cruz (0.14)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Research Report
- Industry:
- Health & Medicine
- Diagnostic Medicine (0.92)
- Epidemiology (0.68)
- Health Care Technology (0.92)
- Pharmaceuticals & Biotechnology (1.00)
- Therapeutic Area
- Endocrinology > Diabetes (1.00)
- Immunology (0.93)
- Musculoskeletal (0.67)
- Neurology (1.00)
- Health & Medicine