Reducing Sequence Length Learning Impacts on Transformer Models

Baillargeon, Jean-Thomas, Lamontagne, Luc

Dec-16-2022–arXiv.org Artificial Intelligence

Classification algorithms using Transformer architectures can be affected by the sequence length learning problem whenever observations from different classes have a different length distribution. This problem brings models to use sequence length as a predictive feature instead of relying on important textual information. Even if most public datasets are not affected by this problem, privately corpora for fields such as medicine and insurance may carry this data bias. This poses challenges throughout the value chain given their usage in a machine learning application. In this paper, we empirically expose this problem and present approaches to minimize its impacts.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

Dec-16-2022

arXiv.org PDF

Add feedback

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America
  - United States > Colorado (0.04)
  - Canada > Quebec (0.04)

Genre:
- Research Report (1.00)

Industry:
- Education (0.49)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.89)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found