TaxoKnow: Taxonomy as Prior Knowledge in the Loss Function of Multi-class Classification

Pourvali, Mohsen, Meng, Yao, Sheng, Chen, Du, Yangzhou

May-24-2023–arXiv.org Artificial Intelligence

Chiriatti 2020), have made significant advances in Natural Language Processing (NLP). In general, pre-training, where a model first trains on massive amounts of data before being fine-tuned for a specific task, has proven to be assumption in the real world. Moreover, compared to human an efficient technique for improving the performance of a capabilities, DNNs still lack in various aspects, such wide range of language tasks (Min et al. 2021). For example, as Adaptability, Generalizability, Robustness, Explainability, BERT (Devlin et al. 2018) is a pre-trained transformerbased Abstraction, Common sense, and Causal reasoning. In encoder model that can be fine-tuned for various NLP general, Multi-Layer Perceptrons (MLPs) are good at generalizing tasks, such as sentence classification, question answering, within the space of training examples, but they perform and named entity recognition. In fact, large language models poorly at generalizing outside the space of training examples, have shown a so-called few-shot learning capability to be and this limitation is not improved even by adding efficiently adapted to downstream tasks.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

May-24-2023

arXiv.org PDF

Add feedback

Country:
- Oceania > New Zealand
  - North Island > Auckland Region > Auckland (0.04)
- Asia > China
  - Beijing > Beijing (0.04)

Genre:
- Research Report > New Finding (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (0.89)
  - Machine Learning
    - Inductive Learning (1.00)
    - Neural Networks
      - Deep Learning (0.68)
      - Perceptrons (0.54)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found