MLLM-ISU: The First-Ever Comprehensive Benchmark for Multimodal Large Language Models based Intrusion Scene Understanding

Jun-13-2026, 13:12:03 GMT–Neural Information Processing Systems

Vision-based intrusion detection has multiple applications in practical scenarios, e.g., autonomous driving, intelligent monitoring, and security. Previous works mainly focus on improving the intrusion detection performance, without a comprehensive and in-depth understanding of the intrusion scene. To fill this gap, we explore a novel task called Multimodal Large Language Models based Intrusion Scene Understanding (MLLM-ISU) and report a comprehensive benchmark for the task.

artificial intelligence, large language model, natural language, (5 more...)

Neural Information Processing Systems

Jun-13-2026, 13:12:03 GMT

Conferences Web Page

Add feedback

Industry:
- Information Technology (0.83)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.60)