DCT-Former: Efficient Self-Attention with Discrete Cosine Transform

Scribano, Carmelo, Franchini, Giorgia, Prato, Marco, Bertogna, Marko

Mar-15-2023–arXiv.org Artificial Intelligence

Transformers are a family of recently introduced Deep Learning (DL) models which leverage the mechanism of dot-product attention to map a sequence of tokens of arbitrary length into a new set of tokens. Thanks to their outstanding performance in a variety of tasks, transformers are nowadays ubiquitous in state-of-the-art techniques that gain any benefit from modeling long-term interactions between elements of a sequence. Another important advantage of transformers is the ability to process sequences of arbitrary length in a single forward pass without incurring the limitations of recurrent approaches: no other standard Machine Learning (ML) or DL methods in the literature have shown this great adaptability so far. In the domain of Natural Language Processing (NLP) transformers are pervasive in any sort of task, such as Machine Translation [1-4], text classification, document retrieval, document summarization and several others more. More recently, researchers started to focus on exploiting the benefits of the self-attention mechanism for computer vision tasks [5-7], either standalone or applied downstream to a convolutional backbone and even to multimodal problems where the language and visual input needs to be correlated.

data quality, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Mar-15-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Oregon > Multnomah County
    - Portland (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
- Europe > Italy
  - Emilia-Romagna > Modeno Province
    - Modena (0.04)
  - Calabria > Catanzaro Province
    - Catanzaro (0.04)

Genre:
- Research Report (0.70)

Technology:
- Information Technology
  - Data Science > Data Quality
    - Data Transformation (0.85)
  - Artificial Intelligence
    - Vision (1.00)
    - Natural Language > Machine Translation (0.88)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found