Vision Language Transformers: A Survey

Open in new window