Towards Better Multi-task Learning: A Framework for Optimizing Dataset Combinations in Large Language Models
–arXiv.org Artificial Intelligence
To efficiently select optimal dataset combinations for enhancing multi-task learning (MTL) performance in large language models, we proposed a novel framework that leverages a neural network to predict the best dataset combinations. The framework iteratively refines the selection, greatly improving efficiency, while being model-, dataset-, and domain-independent. Through experiments on 12 biomedical datasets across four tasks - named entity recognition, relation extraction, event extraction, and text classification-we demonstrate that our approach effectively identifies better combinations, even for tasks that may seem unpromising from a human perspective. This verifies that our framework provides a promising solution for maximizing MTL potential.
arXiv.org Artificial Intelligence
Dec-16-2024
- Country:
- North America > United States
- Minnesota > Hennepin County > Minneapolis (0.28)
- Asia > China
- Hong Kong (0.04)
- North America > United States
- Genre:
- Research Report (1.00)
- Industry:
- Health & Medicine (1.00)
- Technology: