Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Open in new window