CLIMB: Class-imbalanced Learning Benchmark on Tabular Data

Liu, Zhining, Li, Zihao, Yang, Ze, Wei, Tianxin, Kang, Jian, Zhu, Yada, Hamann, Hendrik, He, Jingrui, Tong, Hanghang

Oct-21-2025, arXiv.org Artificial Intelligence

Class-imbalanced learning (CIL) on tabular data is important in many real-world applications where the minority class represents critical but rare outcomes. In this paper, we present CLIMB, a comprehensive benchmark for class-imbalanced learning on tabular data. CLIMB includes 73 real-world datasets across diverse domains and imbalance levels, along with unified implementations of 29 representative CIL algorithms. Built on a high-quality open-source Python package with unified API designs, detailed documentation, and rigorous code quality controls, CLIMB supports easy implementation and comparison of different CIL algorithms. Through extensive experiments, we provide practical insights on method accuracy and efficiency, highlighting the limitations of naive rebalancing, the effectiveness of ensembles, and the importance of data quality. Our code, documentation, and examples are available at https://github.com/ZhiningLiu1998/imbalanced-ensemble.
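To make the terms in the abstract concrete, the following is a minimal sketch of what "imbalance level" and "naive rebalancing" can mean on a toy tabular dataset. It uses only the Python standard library and is not taken from the CLIMB package: the helper names `imbalance_ratio`, `random_oversample`, and `balanced_accuracy` are hypothetical, random oversampling is assumed here as one simple example of naive rebalancing, and balanced accuracy is assumed as one common imbalance-aware metric.

```python
import random
from collections import Counter


def imbalance_ratio(labels):
    """Ratio of the largest class size to the smallest class size."""
    counts = Counter(labels)
    return max(counts.values()) / min(counts.values())


def random_oversample(rows, labels, seed=0):
    """Naive rebalancing (illustrative only): duplicate randomly chosen rows
    of each smaller class until it matches the size of the largest class."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, y in zip(rows, labels) if y == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall, so every class counts equally."""
    recalls = []
    for cls in set(y_true):
        idx = [i for i, y in enumerate(y_true) if y == cls]
        recalls.append(sum(y_pred[i] == cls for i in idx) / len(idx))
    return sum(recalls) / len(recalls)


# Toy tabular data: 95 majority rows (label 0) and 5 minority rows (label 1).
rows = [[float(i), float(i % 7)] for i in range(100)]
labels = [0] * 95 + [1] * 5

ratio_before = imbalance_ratio(labels)
new_rows, new_labels = random_oversample(rows, labels)
ratio_after = imbalance_ratio(new_labels)

# A model that always predicts the majority class scores high on plain
# accuracy while never detecting a single minority example.
always_majority = [0] * len(labels)
plain_acc = sum(p == y for p, y in zip(always_majority, labels)) / len(labels)
bal_acc = balanced_accuracy(labels, always_majority)
```

In this toy setup, oversampling equalizes the class counts only by duplicating existing minority rows and adds no new information. The gap between `plain_acc` and `bal_acc` shows why plain accuracy can mislead when the minority class is the one that matters.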

Country:
- North America > United States (1.00)

Genre:
- Research Report (1.00)

Industry:
- Banking & Finance (1.00)
- Education (0.68)
- Government (0.67)
- Health & Medicine
  - Therapeutic Area > Oncology (1.00)
  - Diagnostic Medicine (0.92)

Technology:
- Information Technology
  - Information Management (0.92)
  - Data Science
    - Data Quality (1.00)
    - Data Mining (0.92)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Natural Language (0.92)
    - Machine Learning
      - Statistical Learning (0.92)
      - Ensemble Learning (0.67)
      - Neural Networks > Deep Learning (0.46)
