TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection
Cao, Yang, Yang, Sikun, Li, Chen, Xiang, Haolong, Qi, Lianyong, Liu, Bo, Li, Rongsheng, Liu, Ming
–arXiv.org Artificial Intelligence
Existing studies often lack Anomaly detection is a critical task in machine systematic evaluations of how different embeddings learning, with applications ranging from fraud detection perform across diverse anomaly types, raising and content moderation to user behavior questions about their generalization capabilities analysis (Pang et al., 2021). Within natural language in complex, real-world scenarios such as multilingual processing (NLP), anomaly detection has become settings or domain-specific anomalies. Recent increasingly relevant for identifying outliers efforts, such as AD-NLP (Bejan et al., 2023) such as harmful content, phishing attempts, and and NLP-ADBench (Li et al., 2024), have significantly spam reviews. However, while AD tasks in structured advanced anomaly detection in NLP. ADdata (e.g., tabular, time series, graphs) (Steinbuss NLP provides valuable insights into different types and Böhm, 2021; Blázquez-García et al., 2021; of anomalies, while NLP-ADBench expands evaluations Qiao et al., 2024) have achieved significant maturity, to a wide range of algorithms and datasets.
arXiv.org Artificial Intelligence
Jan-21-2025
- Country:
- Oceania > Australia (0.04)
- North America > United States
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California > Santa Clara County
- Mountain View (0.04)
- Minnesota > Hennepin County
- Asia
- Japan (0.04)
- China
- Jiangsu Province > Nanjing (0.04)
- Henan Province > Zhengzhou (0.04)
- Heilongjiang Province > Harbin (0.04)
- Genre:
- Research Report (0.50)
- Industry:
- Information Technology > Security & Privacy (0.54)
- Technology: