A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys
Luo, Yufeng, Myers, Adam D., Drlica-Wagner, Alex, Dematties, Dario, Borchani, Salma, Valdes, Frank, Dey, Arjun, Schlegel, David, Zhou, Rongpu, Team, DESI Legacy Imaging Surveys
–arXiv.org Artificial Intelligence
As the data volume of astronomical imaging surveys rapidly increases, traditional methods for image anomaly detection, such as visual inspection by human experts, are becoming impractical. We introduce a machine-learning-based approach to detect poor-quality exposures in large imaging surveys, with a focus on the DECam Legacy Survey (DECaLS) in regions of low extinction (i.e., $E(B-V)<0.04$). Our semi-supervised pipeline integrates a vision transformer (ViT), trained via self-supervised learning (SSL), with a k-Nearest Neighbor (kNN) classifier. We train and validate our pipeline using a small set of labeled exposures observed by surveys with the Dark Energy Camera (DECam). A clustering-space analysis of where our pipeline places images labeled in ``good'' and ``bad'' categories suggests that our approach can efficiently and accurately determine the quality of exposures. Applied to new imaging being reduced for DECaLS Data Release 11, our pipeline identifies 780 problematic exposures, which we subsequently verify through visual inspection. Being highly efficient and adaptable, our method offers a scalable solution for quality control in other large imaging surveys.
arXiv.org Artificial Intelligence
Jul-18-2025
- Country:
- Asia > China
- Europe
- Germany > North Rhine-Westphalia
- Upper Bavaria > Munich (0.04)
- Spain
- Andalusia > Cádiz Province
- Cadiz (0.04)
- Galicia > Madrid (0.04)
- Andalusia > Cádiz Province
- Switzerland > Zürich
- Zürich (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Nottinghamshire > Nottingham (0.04)
- Germany > North Rhine-Westphalia
- North America
- Canada > Alberta
- Census Division No. 13 > Athabasca County (0.04)
- United States
- District of Columbia > Washington (0.04)
- Pennsylvania (0.04)
- Illinois
- Cook County
- Kane County > Batavia (0.04)
- Ohio (0.04)
- Michigan (0.04)
- Wyoming > Albany County
- Laramie (0.04)
- California > Alameda County
- Berkeley (0.04)
- Arizona > Pima County
- Tucson (0.04)
- Texas (0.04)
- Canada > Alberta
- South America
- Brazil > Rio de Janeiro
- Rio de Janeiro (0.04)
- Chile (0.04)
- Brazil > Rio de Janeiro
- Genre:
- Research Report (0.64)
- Industry:
- Technology: