Interpretable Transformation and Analysis of Timelines through Learning via Surprisability
Mokryn, Osnat; Lazebnik, Teddy; Shoshan, Hagit Ben
arXiv.org Artificial Intelligence
Analyzing high-dimensional timeline data and identifying outliers and anomalies are critical tasks across diverse domains, including sensor readings, biological and medical data, historical records, and global statistics. However, conventional analysis techniques often struggle with high dimensionality, complex distributions, and sparsity. These limitations make it difficult to extract meaningful insights from complex temporal datasets and to identify trending features, outliers, and anomalies effectively. Inspired by surprisability, a cognitive science concept describing how humans instinctively focus on unexpected deviations, we propose Learning via Surprisability (LvS), a novel approach for transforming high-dimensional timeline data. LvS quantifies and prioritizes anomalies in time-series data by formalizing deviations from expected behavior. It bridges cognitive theories of attention with computational methods, enabling the detection of anomalies and shifts in a way that preserves critical context and offers a new lens for interpreting complex datasets. We demonstrate the usefulness of LvS on three high-dimensional timeline use cases: a time series of sensor data, a global dataset of mortality causes over multiple years, and a textual corpus containing over two centuries of State of the Union Addresses by U.S. presidents. Our results show that the LvS transformation enables efficient and interpretable identification of outliers, anomalies, and the most variable features along the timeline.
Mar-6-2025
- Country:
- Indian Ocean (0.04)
- Africa > Rwanda (0.04)
- North America
- United States
- Illinois (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Haiti > Ouest
- Port-au-Prince (0.04)
- Europe
- United Kingdom > England
- Greater London > London (0.04)
- Italy > Tuscany
- Pisa Province > Pisa (0.04)
- France > Île-de-France
- Croatia > Primorje-Gorski Kotar County
- Rijeka (0.04)
- Asia
- Singapore (0.04)
- Myanmar (0.04)
- Middle East > Israel
- Haifa District > Haifa (0.04)
- China > Jiangsu Province
- Nanjing (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Water & Waste Management > Water Management (1.00)
- Information Technology > Security & Privacy (1.00)
- Health & Medicine > Therapeutic Area (1.00)
- Energy (0.93)
- Government
- Technology:
- Information Technology
- Security & Privacy (1.00)
- Information Management (0.93)
- Data Science > Data Mining
- Anomaly Detection (1.00)
- Artificial Intelligence
- Representation & Reasoning (1.00)
- Natural Language (1.00)
- Cognitive Science (1.00)
- Machine Learning
- Statistical Learning (1.00)
- Neural Networks > Deep Learning (0.68)
- Performance Analysis > Accuracy (0.67)