SlowFocus: Enhancing Fine-grained Temporal Understanding in Video LLM
–Neural Information Processing Systems
However, current Vid-LLMs struggle to simultaneously retain high-quality frame-level semantic information ( i.e., a sufficient
Neural Information Processing Systems
Oct-10-2025, 10:09:42 GMT
- Genre:
- Research Report
- Experimental Study (0.93)
- New Finding (0.67)
- Research Report
- Industry:
- Education (0.46)
- Technology: