TransMatcher: Deep Image Matching Through Transformers for Generalizable Person Re-identification

Oct-9-2024, 13:09:02 GMT–Neural Information Processing Systems

Transformers have recently gained increasing attention in computer vision. However, existing studies mostly use Transformers for feature representation learning, e.g. for image classification and dense predictions, and the generalizability of Transformers is unknown. In this work, we further investigate the possibility of applying Transformers for image matching and metric learning given pairs of images. We find that the Vision Transformer (ViT) and the vanilla Transformer with decoders are not adequate for image matching due to their lack of image-to-image attention. The latter improves the performance, but it is still limited.

generalizable person re-identification, transformer, transmatcher, (4 more...)

Neural Information Processing Systems

Oct-9-2024, 13:09:02 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Machine Learning (1.00)