Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities

Open in new window