6c7c9811d06b41b320b69abf37234f84-Paper-Datasets_and_Benchmarks_Track.pdf

Jun-18-2026, 05:02:06 GMT–Neural Information Processing Systems

To quantify this stagnation, we introduce LIVEVQA, a first-of-its-kind dataset of 107,143 samples across 12 categories, specifically designed to support research on both seeking and updating live visual knowledge. Drawing on recent news articles, video platforms, and academic publications from April 2024 to May 2025, LIVEVQA enables evaluation of how models handle the latest visual information beyond their knowledge boundaries and how well current methods help update them. Our comprehensive benchmarking of 17 state-of-the-art MLLMs reveals significant performance gaps on content beyond the knowledge cutoff, while tool-use or agentic visual seeking frameworks yield an average improvement of 327%. Furthermore, we explore parameter-efficient fine-tuning (PEFT) methods for updating MLLMs with new visual knowledge, examining in depth the critical balance between adapter capacity and model capability. All experimental datasets and source code are publicly available at: https://livevqa.github.io.

large language model, machine learning, question answering

Country:
- Asia (0.92)
- Europe (0.67)
- North America > United States
  - California > Los Angeles County (0.27)

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (0.92)

Industry:
- Law (1.00)
- Information Technology (1.00)
- Health & Medicine (0.93)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.92)
- Leisure & Entertainment > Sports (0.92)
- Education (0.67)
- Media
  - News (1.00)
  - Film (1.00)
  - Television (0.92)
- Government > Regional Government
  - North America Government > United States Government (1.00)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (1.00)
  - Information Management (1.00)
  - Data Science (1.00)
  - Communications (1.00)
  - Artificial Intelligence
    - Vision (1.00)
    - Representation & Reasoning (1.00)
    - Cognitive Science > Problem Solving (0.92)
    - Natural Language
      - Text Processing (1.00)
      - Large Language Model (1.00)
      - Chatbot (1.00)
      - Question Answering (0.93)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)
