Towards Knowledge-Grounded Natural Language Understanding and Generation
–arXiv.org Artificial Intelligence
This thesis investigates how natural language understanding and generation with transformer models can benefit from grounding the models with knowledge representations. Currently, the most prevailing paradigm for training language models is through pre-training on abundant raw text data and fine-tuning on downstream tasks. Although language models continue to advance, especially the recent trend of Large Language Models (LLMs) such as ChatGPT, there seem to be limits to what can be achieved with text data alone and it is desirable to study the impact of applying and integrating rich forms of knowledge representation to improve model performance. The most widely used form of knowledge for language modelling is structured knowledge in the form of triples consisting of entities and their relationships, often in English. This thesis explores beyond this conventional approach and aims to address several key questions: Can knowledge of entities extend its benefits beyond entity-centric tasks such as entity linking? How can we faithfully and effectively extract such structured knowledge from raw text, especially noisy web text? How do other types of knowledge, beyond structured knowledge, contribute to improving NLP tasks?
arXiv.org Artificial Intelligence
Mar-22-2024
- Country:
- Asia (1.00)
- Europe > United Kingdom
- England > Greater London > London (0.13)
- North America > United States
- California (0.27)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Washington > King County
- Seattle (0.14)
- Oceania > Australia
- Genre:
- Overview (1.00)
- Research Report > New Finding (1.00)
- Industry:
- Education (0.67)
- Government > Regional Government
- Health & Medicine > Therapeutic Area (0.68)
- Information Technology (0.67)
- Leisure & Entertainment (1.00)
- Media (1.00)
- Transportation (0.92)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science > Problem Solving (1.00)
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Natural Language
- Chatbot (1.00)
- Large Language Model (1.00)
- Machine Translation (1.00)
- Text Processing (1.00)
- Representation & Reasoning > Expert Systems (1.00)
- Information Technology > Artificial Intelligence