A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment

Felix Leibfried, Sergio Pascual-Díaz, Jordi Grau-Moya

May-31-2025, 06:21:37 GMT–Neural Information Processing Systems

Empowerment is an information-theoretic method that can be used to intrinsically motivate learning agents. It attempts to maximize an agent's control over the environment by encouraging visiting states with a large number of reachable next states. Empowered learning has been shown to lead to complex behaviors, without requiring an explicit reward signal. In this paper, we investigate the use of empowerment in the presence of an extrinsic reward signal. We hypothesize that empowerment can guide reinforcement learning (RL) agents to find good early behavioral solutions by encouraging highly empowered states.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

May-31-2025, 06:21:37 GMT

Conferences PDF

Add feedback

Country:
- North America > Canada (0.28)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Neural Networks > Deep Learning (0.46)
    - Reinforcement Learning (1.00)
  - Representation & Reasoning (1.00)