On Learning Intrinsic Rewards for Policy Gradient Methods
Zeyu Zheng, Junhyuk Oh, Satinder Singh
–Neural Information Processing Systems
Whether itispossible tolearn intrinsic reward functions for learning agents remains an open problem.
Neural Information Processing Systems
Feb-12-2026, 20:13:39 GMT