6c7c9811d06b41b320b69abf37234f84-Paper-Datasets_and_Benchmarks_Track.pdf
–Neural Information Processing Systems
To quantify this stagnation, we introduce LIVEVQA, the first-of-its-kind dataset featuring 107,143 samples and 12 categories data specifically designed to support research in both seeking and updating with live visual knowledge. Drawing from recent news articles, video platforms, and academic publications in April 2024-May 2025, LIVEVQA enables evaluation of how models handle latest visual information beyond their knowledge boundaries and how current methods help to update them. Our comprehensive benchmarking of 17 state-of-the-art MLLMs reveals significant performance gaps on content beyond knowledge cutoff, and tool-use or agentic visual seeking framework drastically gain an average of 327% improvement. Furthermore, we explore parameter-efficient fine-tuning (PEFT) methods to update MLLMs with new visual knowledge. We dive deeply to the critical balance between adapter capacity and model capability when updating MLLMs with new visual knowledge. All the experimental dataset and source code are publicly available at: https://livevqa.github.io.
Neural Information Processing Systems
Jun-18-2026, 05:02:06 GMT
- Country:
- Asia (0.92)
- Europe (0.67)
- North America > United States
- California > Los Angeles County (0.27)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.92)
- Research Report
- Industry:
- Law (1.00)
- Information Technology (1.00)
- Health & Medicine (0.93)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.92)
- Leisure & Entertainment > Sports (0.92)
- Education (0.67)
- Media
- News (1.00)
- Film (1.00)
- Television (0.92)
- Government > Regional Government
- Technology:
- Information Technology
- Sensing and Signal Processing > Image Processing (1.00)
- Information Management (1.00)
- Data Science (1.00)
- Communications (1.00)
- Artificial Intelligence
- Vision (1.00)
- Representation & Reasoning (1.00)
- Cognitive Science > Problem Solving (0.92)
- Natural Language
- Text Processing (1.00)
- Large Language Model (1.00)
- Chatbot (1.00)
- Question Answering (0.93)
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Information Technology