STER-VLM: Spatio-Temporal With Enhanced Reference Vision-Language Models

Open in new window