Hitachi at SemEval-2023 Task 3: Exploring Cross-lingual Multi-task Strategies for Genre and Framing Detection in Online News

Koreeda, Yuta, Yokote, Ken-ichi, Ozaki, Hiroaki, Yamaguchi, Atsuki, Tsunokake, Masaya, Sogawa, Yasuhiro

Apr-25-2023–arXiv.org Artificial Intelligence

This paper explains the participation of team Hitachi to SemEval-2023 Task 3 "Detecting the genre, the framing, and the persuasion techniques in online news in a multi-lingual setup.'' Based on the multilingual, multi-task nature of the task and the low-resource setting, we investigated different cross-lingual and multi-task strategies for training the pretrained language models. Through extensive experiments, we found that (a) cross-lingual/multi-task training, and (b) collecting an external balanced dataset, can benefit the genre and framing detection. We constructed ensemble models from the results and achieved the highest macro-averaged F1 scores in Italian and Russian genre categorization subtasks.

information retrieval, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

Apr-25-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.14)
- Europe
  - Germany (0.04)
  - Latvia > Riga Municipality
    - Riga (0.04)
- Asia > Japan
  - Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)

Genre:
- Research Report (0.70)

Industry:
- Media > News (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Statistical Learning (0.46)
  - Natural Language > Information Retrieval (0.35)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found