Towards Evaluating Proactive Risk Awareness of Multimodal Language Models

Jun-14-2026, 14:12:03 GMT–Neural Information Processing Systems

Human safety awareness gaps often prevent the timely recognition of everyday risks. In solving this problem, a proactive safety artificial intelligence (AI) system would work better than a reactive one. Instead of just reacting to users' questions, it would actively watch people's behavior and their environment to detect potential dangers in advance. Our Proactive Safety Bench (PaSBench2) evaluates this capability through 416 multimodal scenarios (128 image sequences, 288 text logs) spanning 5 safety-critical domains. Evaluation of 36 advanced models reveals fundamental limitations: Top performers like Gemini-2.5-pro

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Jun-14-2026, 14:12:03 GMT

Conferences PDF

Add feedback

Country:
- Asia > China (0.28)
- North America > United States (0.28)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Transportation (1.00)
- Information Technology > Security & Privacy (1.00)
- Government (1.00)
- Food & Agriculture (0.92)
- Education (0.67)
- Health & Medicine
  - Consumer Health (1.00)
  - Therapeutic Area (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found