Slot-MLLM: Object-Centric Visual Tokenization for Multimodal LLM

Open in new window