How to Encode Domain Information in Relation Classification

Bassignana, Elisa, Gascou, Viggo Unmack, Laustsen, Frida Nøhr, Kristensen, Gustav, Petersen, Marie Haahr, van der Goot, Rob, Plank, Barbara

Apr-21-2024–arXiv.org Artificial Intelligence

Current language models require a lot of training data to obtain high performance. For Relation Classification (RC), many datasets are domain-specific, so combining datasets to obtain better performance is non-trivial. We explore a multi-domain training setup for RC, and attempt to improve performance by encoding domain information. Our proposed models improve > 2 Macro-F1 against the baseline setup, and our analysis reveals that not all the labels benefit the same: The classes which occupy a similar space across domains (i.e., their interpretation is close across them, for example "physical") benefit the least, while domain-dependent relations (e.g., "part-of'') improve the most when encoding domain information.

computational linguistic, dataset, information, (14 more...)

arXiv.org Artificial Intelligence

Apr-21-2024

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- Europe
  - Ukraine > Kyiv Oblast
    - Kyiv (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - Germany
    - North Rhine-Westphalia > Düsseldorf Region
      - Düsseldorf (0.04)
    - Bavaria > Upper Bavaria
      - Munich (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
  - Bulgaria > Sofia City Province
    - Sofia (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia > Middle East
  - UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found