Reward Machines for Deep RL in Noisy and Uncertain Environments

Feb-18-2026, 02:22:08 GMT–Neural Information Processing Systems

Reward Machines provide an automaton-inspired structure for specifying instructions, safety constraints, and other temporally extended reward-worthy behaviour. By exposing the underlying structure of a reward function, they enable the decomposition of an RL task, leading to impressive gains in sample efficiency.

abstraction model, logic & formal reasoning, machine learning, (21 more...)

Neural Information Processing Systems

Feb-18-2026, 02:22:08 GMT

Conferences PDF

Add feedback

Country:
- South America > Chile (0.04)
- Europe > Italy (0.04)
- North America > Canada
  - Ontario > Toronto (0.14)
- Asia > Middle East
  - Jordan (0.04)
  - Republic of Türkiye > Aksaray Province
    - Aksaray (0.04)

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Information Technology (0.46)
- Government (0.46)
- Education (0.46)
- Transportation > Ground
  - Road (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Natural Language (1.00)
  - Representation & Reasoning
    - Logic & Formal Reasoning (0.68)
    - Agents (0.67)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Neural Networks > Deep Learning (0.46)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.69)

Duplicate Docs Excel Report

Title
c71769e2715835d37c3e25cc1173bd62-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found