Fostering Video Reasoning via Next-Event Prediction

Open in new window