Data Understanding Survey: Pursuing Improved Dataset Characterization Via Tensor-based Methods
Merris, Matthew D., Andersen, Tim
arXiv.org Artificial Intelligence
In the evolving domains of Machine Learning and Data Analytics, existing dataset characterization methods such as statistical, structural, and model-based analyses often fail to deliver the deep understanding and insights essential for innovation and explainability. This work surveys state-of-the-art conventional data analytic techniques and examines their limitations. It then discusses a variety of tensor-based methods and how they may provide a more robust alternative to these traditional characterization techniques. Through examples, we illustrate how tensor methods unveil nuanced data characteristics, offering enhanced interpretability and actionable intelligence. We advocate for the adoption of tensor-based characterization, which promises a leap forward in understanding complex datasets and paves the way for intelligent, explainable data-driven discoveries.
Oct-17-2025
- Country:
  - Africa > Senegal
    - Kolda Region > Kolda (0.04)
  - Asia
    - Afghanistan > Parwan Province
      - Charikar (0.04)
    - China (0.04)
    - Middle East > Republic of Türkiye
      - Istanbul Province > Istanbul (0.04)
  - Europe
    - Belgium > Flanders
      - Flemish Brabant > Leuven (0.04)
    - Czechia > Liberec Region
      - Liberec (0.04)
    - Middle East > Republic of Türkiye
      - Istanbul Province > Istanbul (0.04)
    - Netherlands > North Holland
      - Amsterdam (0.04)
  - North America
    - Canada > British Columbia
      - Vancouver (0.04)
    - United States
      - California
        - Alameda County > Livermore (0.04)
        - San Diego County > San Diego (0.04)
      - Colorado > Denver County
        - Denver (0.04)
      - Idaho > Ada County
        - Boise (0.04)
      - New Jersey > Hudson County
        - Hoboken (0.04)
      - New Mexico > Bernalillo County
        - Albuquerque (0.04)
      - Texas > Travis County
        - Austin (0.04)
  - South America > Chile
- Genre:
  - Overview (1.00)
  - Research Report (1.00)
- Technology:
  - Information Technology
    - Artificial Intelligence
      - Machine Learning
        - Neural Networks (1.00)
        - Statistical Learning > Clustering (1.00)
      - Natural Language (1.00)
      - Representation & Reasoning > Uncertainty (0.67)
    - Data Science
      - Data Mining > Big Data (1.00)
      - Data Quality (0.92)
    - Information Management (0.93)