High-Quality Pseudo-Label Generation Based on Visual Prompt Assisted Cloud Model Update

Xu, Xinrun, Zhang, Qiuhong, Yang, Jianwen, Lian, Zhanbiao, Yan, Jin, Ding, Zhiming, Jiang, Shan

arXiv.org Artificial Intelligence 

--Generating high-quality pseudo-labels on the cloud side is crucial for cloud-edge collaborative object detection, especially in dynamic traffic monitoring scenarios where the target data distribution continuously evolves. Existing methods often assume a perfectly reliable cloud model, neglecting the potential for errors in the cloud's predictions, or employ simple adaptation techniques that struggle to handle complex distribution shifts. This paper proposes a novel Cloud-Adaptive High-Quality Pseudo-label generation algorithm (CA-HQP) that addresses these limitations by incorporating a learnable Visual Prompt Generator (VPG) and a dual feature alignment strategy into the cloud model updating process. The VPG enables parameter-efficient adaptation of the large pre-trained cloud model by injecting task-specific visual prompts into the model's input, enhancing its flexibility without extensive fine-tuning. T o mitigate domain discrepancies, CA-HQP introduces two complementary feature alignment techniques: a global Domain Query Feature Alignment (DQF A) that captures scene-level distribution shifts and a fine-grained T emporal Instance-A ware Feature Embedding Alignment (TIAF A) that addresses instance-level variations. Extensive experiments on the Bellevue traffic dataset, a challenging real-world traffic monitoring dataset, demonstrate that CA-HQP significantly improves the quality of pseudo-labels compared to existing state-of-the-art cloud-edge collaborative object detection methods. Further ablation studies validate the contribution of each individual component (DQF A, TIAF A, VPG) and confirm the synergistic effect of combining global and instance-level feature alignment strategies.