On the Expressivity of Markov Reward
–Neural Information Processing Systems
Reward is the driving force for reinforcement-learning agents. This paper is dedicated to understanding the expressivity of reward as a way to capture tasks that we would want an agent to perform.
Neural Information Processing Systems
Nov-20-2025, 08:52:30 GMT
- Country:
- North America > United States
- Massachusetts (0.04)
- Michigan (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Education (0.46)
- Technology: