Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey
