Feature-Proxy Transformer for Few-Shot Segmentation

Oct-10-2024, 11:22:52 GMT–Neural Information Processing Systems

These two keypoints are easily integrated into the vision transformer backbone with the prompting mechanism in the transformer. Given the learned features and proxies, FPTrans directly compares their cosine similarity for segmentation. Although the framework is straightforward, we show that FPTrans achieves competitive FSS accuracy on par with state-of-the-art decoder-based methods.

feature-proxy transformer, few-shot segmentation, linear classification head, (5 more...)

Neural Information Processing Systems

Oct-10-2024, 11:22:52 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.41)