Towards Knowledge-Grounded Natural Language Understanding and Generation
–arXiv.org Artificial Intelligence
This thesis investigates how natural language understanding and generation with transformer models can benefit from grounding the models with knowledge representations. Currently, the most prevailing paradigm for training language models is through pre-training on abundant raw text data and fine-tuning on downstream tasks. Although language models continue to advance, especially the recent trend of Large Language Models (LLMs) such as ChatGPT, there seem to be limits to what can be achieved with text data alone and it is desirable to study the impact of applying and integrating rich forms of knowledge representation to improve model performance. The most widely used form of knowledge for language modelling is structured knowledge in the form of triples consisting of entities and their relationships, often in English. This thesis explores beyond this conventional approach and aims to address several key questions: Can knowledge of entities extend its benefits beyond entity-centric tasks such as entity linking? How can we faithfully and effectively extract such structured knowledge from raw text, especially noisy web text? How do other types of knowledge, beyond structured knowledge, contribute to improving NLP tasks?
arXiv.org Artificial Intelligence
Mar-22-2024
- Country:
- South America > Colombia
- Meta Department > Villavicencio (0.04)
- Oceania > Australia
- North America
- Dominican Republic (0.04)
- United States
- West Virginia (0.04)
- New Jersey (0.04)
- Texas > Travis County
- Austin (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Washington > King County
- Seattle (0.14)
- California
- San Francisco County > San Francisco (0.04)
- San Diego County > San Diego (0.04)
- Canada
- Ontario > Toronto (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Europe
- Germany > Berlin (0.04)
- Western Europe (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Middle East > Malta
- Eastern Region > Northern Harbour District > St. Julian's (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Romania > Sud - Muntenia Development Region
- Giurgiu County > Giurgiu (0.04)
- United Kingdom > England
- Greater London > London (0.27)
- Cambridgeshire > Cambridge (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Italy
- Tuscany > Florence (0.04)
- Calabria > Catanzaro Province
- Catanzaro (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Indonesia > Bali (0.04)
- Singapore (0.04)
- India (0.04)
- East Asia (0.04)
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Republic of Türkiye > Batman Province
- Batman (0.04)
- Japan > Kyūshū & Okinawa
- Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
- China
- Hong Kong (0.04)
- Tianjin Province > Tianjin (0.04)
- South America > Colombia
- Genre:
- Research Report > New Finding (1.00)
- Overview (1.00)
- Industry:
- Media (1.00)
- Leisure & Entertainment (1.00)
- Transportation (0.92)
- Health & Medicine > Therapeutic Area (0.68)
- Education (0.67)
- Information Technology (0.67)
- Government > Regional Government
- Technology:
- Information Technology > Artificial Intelligence
- Representation & Reasoning > Expert Systems (1.00)
- Cognitive Science > Problem Solving (1.00)
- Natural Language
- Text Processing (1.00)
- Machine Translation (1.00)
- Large Language Model (1.00)
- Chatbot (1.00)
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Information Technology > Artificial Intelligence