Strefer: Empowering Video LLMs with Space-Time Referring and Reasoning via Synthetic Instruction Data

Open in new window