TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Open in new window