Tarsier: Recipes for Training and Evaluating Large Video Description Models
