Reinforcement Learning with a Terminator

Dec-25-2025, 14:28:00 GMT–Neural Information Processing Systems

We present the problem of reinforcement learning with exogenous termination. We define the Termination Markov Decision Process (TerMDP), an extension of the MDP framework, in which episodes may be interrupted by an external non-Markovian observer.
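As a rough illustration of what exogenous, non-Markovian termination can look like, the sketch below simulates a toy episode in which an external observer may end the episode at any step. The abstract does not say how the observer decides, so everything here is an assumption made for illustration: the logistic rule over accumulated cost, the `bias` parameter, the two-action reward and cost values, and the names `termination_prob` and `run_episode` are all hypothetical. The point it shows is only that the stopping probability depends on the whole history of the episode rather than on the current state.

```python
import math
import random


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def termination_prob(cost_history, bias=3.0):
    """Hypothetical observer: stopping probability grows with the total
    cost accrued over the whole episode (an assumed rule, not taken
    from the source)."""
    return sigmoid(sum(cost_history) - bias)


def run_episode(policy, horizon=20, bias=3.0, seed=0):
    """Roll out a toy episode that the external observer may interrupt.

    Returns (total_reward, steps_taken, was_terminated).
    """
    rng = random.Random(seed)
    state, total_reward, cost_history = 0, 0.0, []
    for t in range(horizon):
        action = policy(state)                 # 0 = cautious, 1 = risky
        total_reward += 2.0 if action == 1 else 1.0
        cost_history.append(1.0 if action == 1 else 0.0)
        state += 1
        # The decision uses the full cost history, so it is not a
        # function of the current state alone.
        if rng.random() < termination_prob(cost_history, bias):
            return total_reward, t + 1, True
    return total_reward, horizon, False


if __name__ == "__main__":
    print(run_episode(lambda s: 0))  # always cautious
    print(run_episode(lambda s: 1))  # always risky
```

In this toy setup a risky policy earns more reward per step but raises the chance of being stopped early, which is the kind of trade-off an agent facing an external terminator would have to learn.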

name change, reinforcement learning, termination
