AITopics | xcit

a655fbe4b8d7439994aa37ddad80de56-Paper.pdf

Neural Information Processing Systems | Feb-10-2026, 12:02:06 GMT

arxiv preprint arxiv, computer vision, transformer, (12 more...)

Neural Information Processing Systems

Country: South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.68)

XCiT: Cross-Covariance Image Transformers

Neural Information Processing Systems | Dec-24-2025, 16:22:09 GMT

Following their success in natural language processing, transformers have recently shown much promise for computer vision. The self-attention operation underlying transformers yields global interactions between all tokens.

cross-covariance image transformer, name change, xcit, (6 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.97)
Information Technology > Sensing and Signal Processing > Image Processing (0.61)

XCiT: Cross-Covariance Image Transformers

Neural Information Processing Systems | Aug-16-2025, 14:44:38 GMT

We propose a "transposed" version of self-attention that operates

arxiv preprint arxiv, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country: South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.68)

XCiT: Cross-Covariance Image Transformers

Neural Information Processing Systems | May-27-2025, 01:42:23 GMT

Following their success in natural language processing, transformers have recently shown much promise for computer vision. The self-attention operation underlying transformers yields global interactions between all tokens. This flexibility, however, comes with a quadratic complexity in time and memory, hindering application to long sequences and high-resolution images. We propose a "transposed" version of self-attention that operates across feature channels rather than tokens, where the interactions are based on the cross-covariance matrix between keys and queries. The resulting cross-covariance attention (XCA) has linear complexity in the number of tokens, and allows efficient processing of high-resolution images. Our cross-covariance image transformer (XCiT) is built upon XCA.
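
The contrast drawn in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: it uses a single head, omits the learned query/key/value projections, and the function names, the per-channel L2 normalisation and the `temperature` parameter are assumptions made for illustration. Token self-attention builds an N x N map (quadratic in the number of tokens N), whereas the transposed variant builds a d x d map over feature channels, so its cost grows linearly with N.

```python
import numpy as np


def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def token_attention(q, k, v):
    """Standard self-attention over tokens; q, k, v have shape (N, d)."""
    d = q.shape[1]
    attn = softmax(q @ k.T / np.sqrt(d), axis=-1)  # (N, N): quadratic in N
    return attn @ v, attn


def cross_covariance_attention(q, k, v, temperature=1.0):
    """Transposed attention over channels; q, k, v have shape (N, d).

    Assumption: each channel (column) is L2-normalised over the token
    axis and the logits are scaled by a temperature before the softmax.
    """
    eps = 1e-12
    q_hat = q / (np.linalg.norm(q, axis=0, keepdims=True) + eps)
    k_hat = k / (np.linalg.norm(k, axis=0, keepdims=True) + eps)
    # Cross-covariance between keys and queries: (d, d), independent of N.
    attn = softmax(k_hat.T @ q_hat / temperature, axis=0)
    # Each output channel is a convex combination of the value channels.
    return v @ attn, attn


rng = np.random.default_rng(0)
n_tokens, d = 196, 8  # hypothetical sizes, e.g. a 14 x 14 grid of patches
q, k, v = (rng.standard_normal((n_tokens, d)) for _ in range(3))

out_tok, a_tok = token_attention(q, k, v)
out_xc, a_xc = cross_covariance_attention(q, k, v)
print(a_tok.shape, a_xc.shape)  # (196, 196) (8, 8)
```

Doubling `n_tokens` quadruples the size of the token attention map but leaves the channel map at (d, d); only the matrix products that form and apply it grow, and they grow linearly in the number of tokens.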

artificial intelligence, cross-covariance image transformer, natural language, (5 more...)

Neural Information Processing Systems

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)

XCiT: Cross-Covariance Image Transformers

Neural Information Processing Systems | Jan-18-2025, 12:29:22 GMT

Following their success in natural language processing, transformers have recently shown much promise for computer vision. The self-attention operation underlying transformers yields global interactions between all tokens. This flexibility, however, comes with a quadratic complexity in time and memory, hindering application to long sequences and high-resolution images. We propose a "transposed" version of self-attention that operates across feature channels rather than tokens, where the interactions are based on the cross-covariance matrix between keys and queries. The resulting cross-covariance attention (XCA) has linear complexity in the number of tokens, and allows efficient processing of high-resolution images. Our cross-covariance image transformer (XCiT) is built upon XCA.

cross-covariance image transformer, interaction, xcit, (3 more...)

Neural Information Processing Systems

Technology: