HawkEye: Training Video-Text LLMs for Grounding Text in Videos
