VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

Open in new window