TVLT: Textless Vision-Language Transformer
