CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion

Meng, Boyuan, Zhang, Xiaohan, Li, Peilin, Wu, Zhe, Li, Yiming, Zhao, Wenkai, Yu, Beinan, Shen, Hui-Liang

May-5-2025–arXiv.org Artificial Intelligence

The object-background confusion refers to the confusion between expected objects and background. As illustrated in Figure 1(a), in underwater scenes, the boundaries between the target object and the background are often ambiguous, leading to missed detections. The object-object confusion refers to the confusion between different classes of objects. As illustrated in Figure 1(b), the similarity between different classes results in false detections. In the field of CD-FSOD, CD-ViTO [8] represents the state-of-the-art work, which devises various fine-tuning modules and achieves significant performance improvements. To address object-background confusion, CD-ViTO re-weights manually selected background features and combines them with object features in a weighted sum. However, manually designed features lack adaptability when the target domain distribution differs [4], [24]. To address object-object confusion, CD-ViTO [8] enhances class distinction by directly adjusting the support class features.

artificial intelligence, confusion, machine learning, (17 more...)

arXiv.org Artificial Intelligence

May-5-2025

arXiv.org PDF

Add feedback

Country:
- Asia > China (0.14)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Vision (0.69)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found