Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models

Yang Jiao 1,2,3
