Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning