f-score
- Asia > Taiwan (0.04)
- Asia > South Korea > Gyeonggi-do > Suwon (0.04)
- Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- North America > Canada > Ontario > Toronto (0.04)
- (2 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Information Technology > Communications (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser
Audio-visual learning has been a major pillar of multi-modal machine learning, where the community mostly focused on its $\textit{modality-aligned}$ setting, $\textit{i.e.}$, the audio and visual modality are $\textit{both}$ assumed to signal the prediction target.With the Look, Listen, and Parse dataset (LLP), we investigate the under-explored $\textit{unaligned}$ setting, where the goal is to recognize audio and visual events in a video with only weak labels observed.Such weak video-level labels only tell what events happen without knowing the modality they are perceived (audio, visual, or both).To enhance learning in this challenging setting, we incorporate large-scale contrastively pre-trained models as the modality teachers. A simple, effective, and generic method, termed $\textbf{V}$isual-$\textbf{A}$udio $\textbf{L}$abel Elab$\textbf{or}$ation (VALOR), is innovated to harvest modality labels for the training events.Empirical studies show that the harvested labels significantly improve an attentional baseline by $\textbf{8.0}$ in average F-score (Type@AV).Surprisingly, we found that modality-independent teachers outperform their modality-fused counterparts since they are noise-proof from the other potentially unaligned modality.Moreover, our best model achieves the new state-of-the-art on all metrics of LLP by a substantial margin ($\textbf{+5.4}$
A Problem-Oriented Taxonomy of Evaluation Metrics for Time Series Anomaly Detection
Yang, Kaixiang, Liu, Jiarong, Song, Yupeng, Yang, Shuanghua, Zhou, Yujue
Abstract--Time series anomaly detection is widely used in IoT and cyber-physical systems, yet its evaluation remains challenging due to diverse application objectives and heterogeneous metric assumptions. This study introduces a problem-oriented framework that reinterprets existing metrics based on the specific evaluation challenges they are designed to address, rather than their mathematical forms or output structures. We categorize over twenty commonly used metrics into six dimensions: (1) basic accuracy-driven evaluation, (2) timeliness-aware reward mechanisms, (3) tolerance to labeling imprecision, (4) penalties reflecting human-audit cost, (5) robustness against random or inflated scores, and (6) parameter-free comparability for cross-dataset benchmark-ing. Comprehensive experiments are conducted to examine metric behavior under genuine, random, and oracle detection scenarios. By comparing their resulting score distributions, we quantify each metric's discriminative ability--its capability to distinguish meaningful detections from random noise. The results show that while most event-level metrics exhibit strong separability, several widely used metrics (e.g., NAB, Point-Adjust) demonstrate limited resistance to random-score inflation. These findings reveal that metric suitability must be inherently task-dependent and aligned with the operational objectives of IoT applications. The proposed framework offers a unified analytical perspective for understanding existing metrics and provides practical guidance for selecting or developing more context-aware, robust, and fair evaluation methodologies for time series anomaly detection. He emergence of the Internet of Things (IoT) has accelerated digital transformation across numerous domains. Its defining characteristic lies in the large-scale deployment of intelligent and heterogeneous devices--such as sensors, actuators, and RFID systems--that are interconnected via the Internet to enable autonomous communication without human intervention [1]. Currently, more than 12 billion IoT devices are in operation, and this number is projected to reach 125 billion by 2030 [2]. Consequently, the volume of data generated by these devices continues to soar, with an expected total of 79.4 ZB by 2025 [3]. In industrial contexts, the integration of IoT technologies has driven the ongoing Industry 4.0 revolution, emphasizing connectivity, automation, and intelligence. Kaixiang Y ang, Jiarong Liu, Y upeng Song, and Y ujue Zhou are with the School of Artificial Intelligence, Y unnan University, Kunming 650091, China. Shuanghua Y ang is with Beijing Normal University - Hong Kong Baptist University, Zhuhai 519087, China. This work was supported in part by the Y unnan Fundamental Research Projects under Grant 202401AU070151, and in part by the Y unnan Provincial Science and Technology Talent and Platform Plan under Grant 202505AF350053.
- Health & Medicine (0.46)
- Energy (0.46)
- Information Technology > Smart Houses & Appliances (0.35)
- North America > United States > Illinois > Cook County > Chicago (0.04)
- North America > United States > California > Los Angeles County > Long Beach (0.04)
- North America > Canada > Ontario > Waterloo Region > Waterloo (0.04)
- Europe > Hungary > Budapest > Budapest (0.04)