Anatomy of Neural Language Models
Saleh, Majd, Paquelet, Stéphane
–arXiv.org Artificial Intelligence
Generative AI and transfer learning fields have experienced remarkable advancements in recent years especially in the domain of Natural Language Processing (NLP). Transformers were at the heart of these advancements where the cutting-edge transformer-based Language Models (LMs) enabled new state-of-the-art results in a wide spectrum of applications. While the number of research works involving neural LMs is exponentially increasing, their vast majority are high-level and far from self-contained. Consequently, a deep understanding of the literature in this area is a tough task especially at the absence of a unified mathematical framework explaining the main types of neural LMs. We address the aforementioned problem in this tutorial where the objective is to explain neural LMs in a detailed, simplified and unambiguous mathematical framework accompanied with clear graphical illustrations. Concrete examples on widely used models like BERT and GPT2 are explored. Finally, since transformers pretrained on language-modeling-like tasks have been widely adopted in computer vision and time series applications, we briefly explore some examples of such solutions in order to enable readers understand how transformers work in the aforementioned domains and compare this use with the original one in NLP.
arXiv.org Artificial Intelligence
Jan-8-2024
- Country:
- North America
- United States
- New Jersey > Middlesex County
- New Brunswick (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Arizona > Maricopa County
- Scottsdale (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Michigan > Wayne County
- Detroit (0.04)
- Alaska > Anchorage Municipality
- Anchorage (0.04)
- California
- Santa Clara County > Palo Alto (0.04)
- San Diego County > San Diego (0.04)
- Los Angeles County > Long Beach (0.04)
- New York > New York County
- New York City (0.04)
- New Jersey > Middlesex County
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.14)
- United States
- Europe
- France (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Asia
- Africa > Rwanda
- North America
- Genre:
- Research Report (0.63)
- Instructional Material > Course Syllabus & Notes (0.46)
- Overview (0.46)
- Technology: