PG-Video-LLaVA: Pixel Grounding Large Video-Language Models
