Finding Structure in Language Models
–arXiv.org Artificial Intelligence
When we speak, write or listen, we continuously make predictions based on our knowledge of a language's grammar. Remarkably, children acquire this grammatical knowledge within just a few years, enabling them to understand and generalise to novel constructions that have never been uttered before. Language models are powerful tools that create representations of language by incrementally predicting the next word in a sentence, and they have had a tremendous societal impact in recent years. The central research question of this thesis is whether these models possess a deep understanding of grammatical structure similar to that of humans. This question lies at the intersection of natural language processing, linguistics, and interpretability. To address it, we will develop novel interpretability techniques that enhance our understanding of the complex nature of large-scale language models. We approach our research question from three directions. First, we explore the presence of abstract linguistic information through structural priming, a key paradigm in psycholinguistics for uncovering grammatical structure in human language processing. Next, we examine various linguistic phenomena, such as adjective order and negative polarity items, and connect a model's comprehension of these phenomena to the data distribution on which it was trained. Finally, we introduce a controlled testbed for studying hierarchical structure in language models using various synthetic languages of increasing complexity and examine the role of feature interactions in modelling this structure. Our findings offer a detailed account of the grammatical knowledge embedded in language model representations and provide several directions for investigating fundamental linguistic questions using computational methods.
arXiv.org Artificial Intelligence
Nov-25-2024
- Country:
- Africa
- Ethiopia > Addis Ababa
- Addis Ababa (0.04)
- Rwanda > Kigali
- Kigali (0.04)
- Ethiopia > Addis Ababa
- Asia
- Europe
- Germany > Saarland
- Saarbrücken (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Norway > Western Norway
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- Finland > Uusimaa
- Helsinki (0.04)
- Italy
- Portugal > Lisbon
- Lisbon (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- Denmark (0.04)
- Netherlands
- North Holland > Amsterdam (0.04)
- South Holland
- Zeeland (0.04)
- Spain
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Valencian Community > Valencia Province
- Valencia (0.04)
- Catalonia > Barcelona Province
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Middle East > Malta
- Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Austria > Vienna (0.13)
- Switzerland > Zürich
- Zürich (0.13)
- Germany > Saarland
- North America
- Canada
- Alberta > Census Division No. 15
- Improvement District No. 9 > Banff (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Ontario > Toronto (0.04)
- Quebec > Montreal (0.04)
- Alberta > Census Division No. 15
- Cuba (0.04)
- Dominican Republic (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- California
- Los Angeles County > Long Beach (0.04)
- San Diego County > San Diego (0.04)
- San Francisco County > San Francisco (0.13)
- Santa Clara County > San Jose (0.04)
- Massachusetts
- Hampden County > Springfield (0.04)
- Hampshire County > Amherst (0.04)
- Middlesex County > Cambridge (0.04)
- District of Columbia > Washington (0.04)
- Washington > King County
- Seattle (0.04)
- Indiana > Monroe County
- Bloomington (0.04)
- Illinois > Cook County
- Chicago (0.04)
- New York > Monroe County
- Rochester (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Maryland > Baltimore (0.04)
- Florida (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Texas > Travis County
- Austin (0.04)
- California
- Canada
- Oceania > Australia
- New South Wales > Sydney (0.13)
- Victoria > Melbourne (0.04)
- Africa
- Genre:
- Research Report
- Experimental Study > Negative Result (0.54)
- New Finding (1.00)
- Research Report
- Industry:
- Consumer Products & Services (0.67)
- Education (1.00)
- Government (0.67)
- Health & Medicine > Therapeutic Area
- Neurology (0.92)
- Information Technology (0.92)
- Leisure & Entertainment (1.00)
- Media (0.67)
- Transportation (0.68)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science > Simulation of Human Behavior (0.67)
- Machine Learning
- Neural Networks > Deep Learning (1.00)
- Statistical Learning (0.92)
- Natural Language
- Chatbot (1.00)
- Grammars & Parsing (1.00)
- Large Language Model (1.00)
- Text Processing (1.00)
- Representation & Reasoning
- Logic & Formal Reasoning (0.67)
- Rule-Based Reasoning (0.67)
- Information Technology > Artificial Intelligence