Quickly Tuning Foundation Models for Image Segmentation
Das, Breenda, Purucker, Lennart, Carstensen, Timur, Hutter, Frank
–arXiv.org Artificial Intelligence
Foundation models like SAM (Segment Anything Model) exhibit strong zero-shot image segmentation performance, but often fall short on domain-specific tasks. Fine-tuning these models typically requires significant manual effort and domain expertise. In this work, we introduce QTT-SEG, a meta-learning-driven approach for automating and accelerating the fine-tuning of SAM for image segmentation. Built on the Quick-Tune hyperparameter optimization framework, QTT-SEG predicts high-performing configurations using meta-learned cost and performance models, efficiently navigating a search space of over 200 million possibilities. We evaluate QTT-SEG on eight binary and five multiclass segmentation datasets under tight time constraints. Our results show that QTT-SEG consistently improves upon SAM's zero-shot performance and surpasses AutoGluon Multimodal, a strong AutoML baseline, on most binary tasks within three minutes. On multiclass datasets, QTT-SEG delivers consistent gains as well. These findings highlight the promise of meta-learning in automating model adaptation for specialized segmentation tasks. Code available at: https://github.com/ds-brx/QTT-SEG/
arXiv.org Artificial Intelligence
Aug-26-2025
- Country:
- Asia > Middle East
- Qatar (0.04)
- Europe
- Croatia > Split-Dalmatia County
- Split (0.04)
- Finland > Pirkanmaa
- Tampere (0.04)
- Germany > Baden-Württemberg
- Freiburg (0.04)
- Tübingen Region > Tübingen (0.04)
- Croatia > Split-Dalmatia County
- North America > Canada
- Quebec > Capitale-Nationale Region
- Quebec City (0.04)
- Québec (0.04)
- Quebec > Capitale-Nationale Region
- Asia > Middle East
- Genre:
- Research Report > New Finding (0.86)
- Industry:
- Health & Medicine
- Diagnostic Medicine > Imaging (1.00)
- Therapeutic Area (1.00)
- Health & Medicine
- Technology: