Self-supervisedCo-TrainingforVideoRepresentationLearning