Feature-Proxy Transformer for Few-Shot Segmentation
–Neural Information Processing Systems
These two keypoints are easily integrated into the vision transformer backbone with the prompting mechanism in the transformer. Given the learned features and proxies, FPTrans directly compares their cosine similarity for segmentation. Although the framework is straightforward, we show that FPTrans achieves competitive FSS accuracy on par with state-of-the-art decoder-based methods.
Neural Information Processing Systems
Oct-10-2024, 11:22:52 GMT
- Technology: