Regularity as Intrinsic Reward for Free Play

Jan-19-2025, 21:36:02 GMT–Neural Information Processing Systems

We propose regularity as a novel reward signal for intrinsically-motivated reinforcement learning. Taking inspiration from child development, we postulate that striving for structure and order helps guide exploration towards a subspace of tasks that are not favored by naive uncertainty-based intrinsic rewards. Our generalized formulation of Regularity as Intrinsic Reward (RaIR) allows us to operationalize it within model-based reinforcement learning. In a synthetic environment, we showcase the plethora of structured patterns that can emerge from pursuing this regularity objective. We also demonstrate the strength of our method in a multi-object robotic manipulation environment.

free play, intrinsic reward, regularity, (2 more...)

Neural Information Processing Systems

Jan-19-2025, 21:36:02 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Robots (0.65)
  - Machine Learning > Reinforcement Learning (0.56)