LabelBench: A Comprehensive Framework for Benchmarking Adaptive Label-Efficient Learning
Zhang, Jifan, Chen, Yifang, Canal, Gregory, Mussmann, Stephen, Das, Arnav M., Bhatt, Gantavya, Zhu, Yinglun, Bilmes, Jeffrey, Du, Simon Shaolei, Jamieson, Kevin, Nowak, Robert D
–arXiv.org Artificial Intelligence
Labeled data are critical to modern machine learning applications, but obtaining labels can be expensive. To mitigate this cost, machine learning methods, such as transfer learning, semi-supervised learning and active learning, aim to be label-efficient: achieving high predictive performance from relatively few labeled examples. While obtaining the best label-efficiency in practice often requires combinations of these techniques, existing benchmark and evaluation frameworks do not capture a concerted combination of all such techniques. This paper addresses this deficiency by introducing LabelBench, a new computationally-efficient framework for joint evaluation of multiple label-efficient learning techniques. As an application of LabelBench, we introduce a novel benchmark of state-of-the-art active learning methods in combination with semi-supervised learning for fine-tuning pretrained vision transformers. Our benchmark demonstrates better label-efficiencies than previously reported in active learning. LabelBench's modular codebase is open-sourced for the broader community to contribute label-efficient learning methods and benchmarks. The repository can be found at: https://github.com/EfficientTraining/LabelBench.
arXiv.org Artificial Intelligence
Jan-12-2024
- Country:
- Europe (0.67)
- North America > United States
- California > Riverside County
- Riverside (0.14)
- Washington > King County
- Seattle (0.14)
- Wisconsin > Dane County
- Madison (0.14)
- California > Riverside County
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology (0.46)
- Technology: