HAWK: Learning to Understand Open-World Video Anomalies

May-27-2025, 21:57:04 GMT–Neural Information Processing Systems

Video Anomaly Detection (VAD) systems can autonomously monitor and identify disturbances, reducing the need for manual labor and associated costs. However, current VAD systems are often limited by their superficial semantic understanding of scenes and minimal user interaction. Additionally, the prevalent data scarcity in existing datasets restricts their applicability in open-world scenarios.In this paper, we introduce HAWK, a novel framework that leverages interactive large Visual Language Models (VLM) to interpret video anomalies precisely. Recognizing the difference in motion information between abnormal and normal videos, HAWK explicitly integrates motion modality to enhance anomaly identification. To reinforce motion attention, we construct an auxiliary consistency loss within the motion and video space, guiding the video branch to focus on the motion modality.

learning, open-world scenario, open-world video anomaly, (2 more...)

Neural Information Processing Systems

May-27-2025, 21:57:04 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology
  - Artificial Intelligence (0.64)
  - Data Science > Data Mining
    - Anomaly Detection (1.00)