Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding