InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning

Open in new window