Reinforcement Learning with a Terminator

Guy Tennenholtz

Aug-19-2025, 15:24:05 GMT – Neural Information Processing Systems

We present the problem of reinforcement learning with exogenous termination. We define the Termination Markov Decision Process (TerMDP), an extension of the MDP framework, in which episodes may be interrupted by an external non-Markovian observer.

machine learning, reinforcement learning, termination

Country:
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East
  - Israel (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Information Technology (0.68)
- Transportation (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.34)
