Friend-training: Learning from Models of Different but Related Tasks
Zhang, Mian, Jin, Lifeng, Song, Linfeng, Mi, Haitao, Zhou, Xiabing, Yu, Dong
–arXiv.org Artificial Intelligence
Current self-training methods such as standard self-training, co-training, tri-training, and others often focus on improving model performance on a single task, utilizing differences in input features, model architectures, and training processes. However, many tasks in natural language processing are about different but related aspects of language, and models trained for one task can be great teachers for other related tasks. In this work, we propose friend-training, a cross-task self-training framework, where models trained to do different tasks are used in an iterative training, pseudo-labeling, and retraining process to help each other for better selection of pseudo-labels. With two dialogue understanding tasks, conversational semantic role labeling and dialogue rewriting, chosen for a case study, we show that the models trained with the friend-training framework achieve the best performance compared to strong baselines.
arXiv.org Artificial Intelligence
Jan-31-2023
- Country:
- Africa > Ethiopia (0.04)
- North America
- Dominican Republic (0.04)
- Canada > British Columbia (0.04)
- United States
- New York (0.04)
- Maryland > Baltimore (0.04)
- Colorado (0.04)
- Washington > King County
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts
- Suffolk County > Boston (0.04)
- Middlesex County > Cambridge (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California > Los Angeles County
- Long Beach (0.04)
- Europe
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Italy > Tuscany
- Florence (0.04)
- United Kingdom > Scotland
- Asia
- Genre:
- Research Report (0.50)
- Industry:
- Education (0.46)
- Technology: