An Empirical Evaluation of Encoder Architectures for Fast Real-Time Long Conversational Understanding

Senthilnathan, Annamalai, Arumae, Kristjan, Khalilia, Mohammed, Xing, Zhengzheng, Colak, Aaron R.

Feb-17-2025–arXiv.org Artificial Intelligence

Analyzing long text data such as customer call transcripts is a cost-intensive and tedious task. Machine learning methods, namely Transformers, are leveraged to model agent-customer interactions. Unfortunately, Transformers adhere to fixed-length architectures and their self-attention mechanism scales quadratically with input length. Such limitations make it challenging to leverage traditional Transformers for long sequence tasks, such as conversational understanding, especially in real-time use cases. In this paper we explore and evaluate recently proposed efficient Transformer variants (e.g. Performer, Reformer) and a CNN-based architecture for real-time and near real-time long conversational understanding tasks. We show that CNN-based models are dynamic, ~2.6x faster to train, ~80% faster inference and ~72% more memory efficient compared to Transformers on average. Additionally, we evaluate the CNN model using the Long Range Arena benchmark to demonstrate competitiveness in general long document analysis.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

Feb-17-2025

arXiv.org PDF

Add feedback

Country:
- Asia > Middle East
  - Qatar > Ad-Dawhah > Doha (0.04)
- Europe > Italy
  - Tuscany > Florence (0.04)
- North America > United States
  - Louisiana > Orleans Parish
    - New Orleans (0.04)
  - Minnesota > Hennepin County
    - Minneapolis (0.14)
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)

Genre:
- Research Report (0.82)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
  - Natural Language > Chatbot (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found