Temporal DINO: A Self-supervised Video Strategy to Enhance Action Prediction