Masked Vision-language Transformer in Fashion - Machine Intelligence Research