One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
–Neural Information Processing Systems
We introduce VideoLISA, a video-based multimodal large language model designed to tackle the problem of language-instructed reasoning segmentation in videos.
Feb-7-2026, 19:54:46 GMT
- Country:
- Oceania > Australia
- Western Australia > Perth (0.04)
- Europe > Netherlands
- North Holland > Amsterdam (0.04)
- Genre:
- Research Report (0.93)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Machine Learning (1.00)
- Representation & Reasoning (0.94)
- Natural Language > Large Language Model (0.90)