Medical Dead-ends and Learning to Identify High-Risk States and Treatments

May-26-2025, 17:03:50 GMT–Neural Information Processing Systems

Machine learning has successfully framed many sequential decision making problems as either supervised prediction, or optimal decision-making policy identification via reinforcement learning. In data-constrained offline settings, both approaches may fail as they assume fully optimal behavior or rely on exploring alternatives that may not exist. We introduce an inherently different approach that identifies "dead-ends" of a state space. We focus on patient condition in the intensive care unit, where a "medical dead-end" indicates that a patient will expire, regardless of all potential future treatment sequences. We postulate "treatment security" as avoiding treatments with probability proportional to their chance of leading to dead-ends, present a formal proof, and frame discovery as an RL problem.

artificial intelligence, machine learning, reinforcement learning, (2 more...)

Neural Information Processing Systems

May-26-2025, 17:03:50 GMT

Conferences Web Page

Add feedback

Industry:
- Health & Medicine (0.84)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.65)