A Comprehensive Survey on Imbalanced Data Learning
Gao, Xinyi, Xie, Dongting, Zhang, Yihang, Wang, Zhengren, He, Conghui, Yin, Hongzhi, Zhang, Wentao
–arXiv.org Artificial Intelligence
With the expansion of data availability, machine learning (ML) has achieved remarkable breakthroughs in both academia and industry. However, imbalanced data distributions are prevalent in various types of raw data and severely hinder the performance of ML by biasing the decision-making processes. To deepen the understanding of imbalanced data and facilitate the related research and applications, this survey systematically analyzing various real-world data formats and concludes existing researches for different data formats into four distinct categories: data re-balancing, feature representation, training strategy, and ensemble learning. This structured analysis help researchers comprehensively understand the pervasive nature of imbalance across diverse data format, thereby paving a clearer path toward achieving specific research goals. we provide an overview of relevant open-source libraries, spotlight current challenges, and offer novel insights aimed at fostering future advancements in this critical area of study.
arXiv.org Artificial Intelligence
Feb-12-2025
- Country:
- Asia > China (0.46)
- Europe (1.00)
- North America > Canada
- Quebec (0.14)
- Oceania > Australia
- Queensland (0.14)
- Genre:
- Overview (1.00)
- Research Report (1.00)
- Industry:
- Education (1.00)
- Health & Medicine > Diagnostic Medicine (0.93)
- Information Technology (0.67)
- Media > News (0.67)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Ensemble Learning (0.92)
- Evolutionary Systems (1.00)
- Inductive Learning (1.00)
- Neural Networks > Deep Learning (1.00)
- Performance Analysis > Accuracy (1.00)
- Statistical Learning (1.00)
- Natural Language (1.00)
- Representation & Reasoning (1.00)
- Vision (1.00)
- Machine Learning
- Communications (0.93)
- Data Science > Data Mining (1.00)
- Information Management (1.00)
- Sensing and Signal Processing > Image Processing (1.00)
- Artificial Intelligence
- Information Technology