Reviews: Joint-task Self-supervised Learning for Temporal Correspondence

Neural Information Processing Systems 

The work does not include original ideas. It is exclusively a collection of previous ideas combined in a rather classical way. Major remarks: Equation (6) makes the loss non-smooth and non-differentiable. The authors do not discuss how they handle this. I assume they use the typical approach of selecting the right 'case' in the forward step and then back-propagating through the corresponding fixed smooth function.
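
To make the assumed handling of the non-differentiable loss concrete, here is a minimal sketch of the typical approach. It does not reproduce Equation (6) from the paper. The hinge-style loss, the `margin` parameter, and the function names are hypothetical stand-ins chosen only to illustrate selecting a case in the forward pass and differentiating that fixed branch in the backward pass.

```python
def piecewise_forward(x, margin=1.0):
    """Forward pass of a hypothetical piecewise loss (NOT the paper's Eq. 6).

    Evaluates the loss and records which case was active, so the backward
    pass can differentiate that branch alone.
    """
    active = x > margin                      # case selection happens here
    value = (x - margin) if active else 0.0  # linear branch or flat branch
    return value, active


def piecewise_backward(active):
    """Backward pass: gradient of the branch chosen in the forward pass.

    At the kink (x == margin) the flat branch is selected, so the returned
    value 0.0 is one valid subgradient rather than a true derivative.
    """
    return 1.0 if active else 0.0


# Usage: one forward/backward evaluation on each side of the kink.
for x in (0.3, 2.5):
    value, active = piecewise_forward(x)
    grad = piecewise_backward(active)
    print(f"x={x}: loss={value}, dloss/dx={grad}")
```

Each branch is smooth on its own, so the gradient is well defined everywhere except at the switching point, where this sketch simply commits to one branch. Whether the authors do the same is not stated in the paper.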