Video Prediction Models as Rewards for Reinforcement Learning

Dec-26-2025, 22:19:24 GMT–Neural Information Processing Systems

Specifying reward signals that allow agents to learn complex behaviors is a long-standing challenge in reinforcement learning.A promising approach is to extract preferences for behaviors from unlabeled videos, which are widely available on the internet.

name change, reinforcement learning, video prediction model, (5 more...)

Neural Information Processing Systems

Dec-26-2025, 22:19:24 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.50)