Friend-training: Learning from Models of Different but Related Tasks

Mian Zhang, Lifeng Jin, Linfeng Song, Haitao Mi, Xiabing Zhou, Dong Yu

Jan-31-2023, arXiv.org Artificial Intelligence

Current self-training methods, such as standard self-training, co-training, and tri-training, often focus on improving model performance on a single task by exploiting differences in input features, model architectures, and training processes. However, many tasks in natural language processing concern different but related aspects of language, and models trained for one task can be great teachers for other related tasks. In this work, we propose friend-training, a cross-task self-training framework in which models trained on different tasks take part in an iterative process of training, pseudo-labeling, and retraining, and help each other select better pseudo-labels. Using two dialogue understanding tasks, conversational semantic role labeling and dialogue rewriting, as a case study, we show that models trained with the friend-training framework outperform strong baselines.
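
The iterative loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not say how pseudo-labels are compared across tasks, so the token-overlap score, the threshold, and the names `token_overlap`, `select_by_agreement`, and `friend_training` are all hypothetical stand-ins. The sketch assumes both models emit text, so that their predictions can be compared directly.

```python
from typing import Callable, List, Tuple

Example = Tuple[str, str]                   # (input text, label text)
Model = Callable[[str], str]                # maps input text to a predicted label text
Trainer = Callable[[List[Example]], Model]  # fits a model on labeled examples


def token_overlap(a: str, b: str) -> float:
    """Jaccard overlap of whitespace tokens (a placeholder agreement score)."""
    ta, tb = set(a.split()), set(b.split())
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def select_by_agreement(
    preds_a: List[str],
    preds_b: List[str],
    agreement: Callable[[str, str], float],
    threshold: float,
) -> List[int]:
    """Return indices where the two friends' predictions agree enough."""
    return [
        i
        for i, (ya, yb) in enumerate(zip(preds_a, preds_b))
        if agreement(ya, yb) >= threshold
    ]


def friend_training(
    train_a: Trainer,
    train_b: Trainer,
    labeled_a: List[Example],
    labeled_b: List[Example],
    unlabeled: List[str],
    agreement: Callable[[str, str], float] = token_overlap,
    threshold: float = 0.5,   # assumed value, chosen only for illustration
    rounds: int = 3,
):
    """Train, pseudo-label, select by cross-task agreement, then retrain."""
    labeled_a, labeled_b = list(labeled_a), list(labeled_b)
    pool = list(unlabeled)
    model_a, model_b = train_a(labeled_a), train_b(labeled_b)
    for _ in range(rounds):
        if not pool:
            break
        preds_a = [model_a(x) for x in pool]
        preds_b = [model_b(x) for x in pool]
        keep = set(select_by_agreement(preds_a, preds_b, agreement, threshold))
        if not keep:
            break
        # Each task keeps its own model's pseudo-label for the selected inputs.
        labeled_a += [(pool[i], preds_a[i]) for i in sorted(keep)]
        labeled_b += [(pool[i], preds_b[i]) for i in sorted(keep)]
        pool = [x for i, x in enumerate(pool) if i not in keep]
        model_a, model_b = train_a(labeled_a), train_b(labeled_b)
    return model_a, model_b, labeled_a, labeled_b
```

In this sketch an unlabeled input is promoted to the training sets only when the two task models produce compatible outputs, which is one simple way to read "help each other for better selection of pseudo-labels". A real system would need a task-specific way to map the two output formats into a comparable form.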
