OMG-LLaV A: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding

Open in new window