Reward Machines for Deep RL in Noisy and Uncertain Environments
–Neural Information Processing Systems
Reward Machines provide an automaton-inspired structure for specifying instructions, safety constraints, and other temporally extended reward-worthy behaviour. By exposing the underlying structure of a reward function, they enable the decomposition of an RL task, leading to impressive gains in sample efficiency.
Neural Information Processing Systems
Feb-18-2026, 02:22:08 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Republic of Türkiye > Aksaray Province
- Aksaray (0.04)
- Europe > Italy (0.04)
- North America > Canada
- South America > Chile (0.04)
- Asia > Middle East
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Education (0.46)
- Government (0.46)
- Information Technology (0.46)
- Transportation > Ground
- Road (0.47)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Learning Graphical Models > Undirected Networks
- Markov Models (0.69)
- Neural Networks > Deep Learning (0.46)
- Reinforcement Learning (1.00)
- Learning Graphical Models > Undirected Networks
- Natural Language (1.00)
- Representation & Reasoning
- Agents (0.67)
- Logic & Formal Reasoning (0.68)
- Robots (1.00)
- Machine Learning
- Information Technology > Artificial Intelligence