VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM
