On Learning Intrinsic Rewards for Policy Gradient Methods

Zeyu Zheng, Junhyuk Oh, Satinder Singh

Nov-20-2025, 16:27:07 GMT–Neural Information Processing Systems

In this paper we build on the Optimal Rewards Framework of Singh et al. [2010] that defines the optimal intrinsic reward function as one that when used by an RL agent achieves behavior that optimizes the

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Nov-20-2025, 16:27:07 GMT

Conferences PDF

Country:
- North America
  - United States > Michigan (0.04)
  - Canada > Quebec
    - Montreal (0.04)

Genre:
- Research Report (0.68)

Industry:
- Leisure & Entertainment > Games > Computer Games (0.31)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning
    - Search (0.68)
    - Agents (0.68)
  - Machine Learning
    - Neural Networks (1.00)
    - Reinforcement Learning (0.92)
    - Statistical Learning > Gradient Descent (0.47)

Duplicate Docs Excel Report

Title
On Learning Intrinsic Rewards for Policy Gradient Methods
On Learning Intrinsic Rewards for Policy Gradient Methods

Similar Docs Excel Report more

Title	Similarity	Source
None found