Towards Knowledge-Grounded Natural Language Understanding and Generation

Mar-22-2024–arXiv.org Artificial Intelligence

This thesis investigates how natural language understanding and generation with transformer models can benefit from grounding the models in knowledge representations. Currently, the prevailing paradigm for training language models is pre-training on abundant raw text data followed by fine-tuning on downstream tasks. Although language models continue to advance, especially with the recent trend of Large Language Models (LLMs) such as ChatGPT, there seem to be limits to what can be achieved with text data alone, and it is desirable to study the impact of applying and integrating rich forms of knowledge representation to improve model performance. The most widely used form of knowledge for language modelling is structured knowledge in the form of triples consisting of entities and their relationships, often in English. This thesis goes beyond this conventional approach and aims to address several key questions: Can knowledge of entities extend its benefits beyond entity-centric tasks such as entity linking? How can we faithfully and effectively extract such structured knowledge from raw text, especially noisy web text? How do other types of knowledge, beyond structured knowledge, contribute to improving NLP tasks?

generative information extraction, large language model, machine learning

Country:
- Asia (1.00)
- Europe > United Kingdom
  - England > Greater London > London (0.13)
- North America > United States
  - California (0.27)
  - Minnesota > Hennepin County
    - Minneapolis (0.14)
  - Washington > King County
    - Seattle (0.14)
- Oceania > Australia
  - Victoria > Melbourne (0.13)

Genre:
- Overview (1.00)
- Research Report > New Finding (1.00)

Industry:
- Education (0.67)
- Government > Regional Government
  - North America Government > United States Government (1.00)
- Health & Medicine > Therapeutic Area (0.68)
- Information Technology (0.67)
- Leisure & Entertainment (1.00)
- Media (1.00)
- Transportation (0.92)

Technology:
- Information Technology > Artificial Intelligence
  - Cognitive Science > Problem Solving (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
  - Natural Language
    - Chatbot (1.00)
    - Large Language Model (1.00)
    - Machine Translation (1.00)
    - Text Processing (1.00)
  - Representation & Reasoning > Expert Systems (1.00)
