Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Neural Information Processing Systems
We propose a different approach that uses reward functions encoded by neural networks. These are trained iteratively to reward more complex behavior.
Oct-2-2025, 09:02:06 GMT