Intrinsic Reward Functions

Apr-25-2026, 01:35:00 GMT–Neural Information Processing Systems

In our approach, the intrinsic reward can be separated into two parts. One is related to action-aware diversity, while the other is related to observation-aware diversity. We revisit the formulation of our information-theoretic objective (Eq. A.1 Intrinsic Rewards for Action-Aware Diversity First we analyze term 2, which is related to action-aware diversity. T 1 T 1 X p(at| t,id) Xp(at| t,id) 2 = Eid, log q(at| t) DKL (p(at| t)kq(at| t)) Eid, log q(at| t) .

artificial intelligence, diversity, machine learning, (15 more...)

Neural Information Processing Systems

Apr-25-2026, 01:35:00 GMT

Conferences PDF

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (0.72)
  - Representation & Reasoning > Agents (0.48)

Duplicate Docs Excel Report

Title
Inourapproach, diversity information-theoretic A.1 Intrinsic Firstwe2, which 2 = T1X t=0 Eid, log p(at\| t,id) p(at\| t) Thecomputationalp(at\| t), which p(at\| t) = X

Similar Docs Excel Report more

Title	Similarity	Source
None found