VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
