Challenging Common Assumptions in Multi-task Learning
Elich, Cathrin, Kirchdorfer, Lukas, Köhler, Jan M., Schott, Lukas
–arXiv.org Artificial Intelligence
While multi-task learning (MTL) has gained significant attention in recent years, its underlying mechanisms remain poorly understood. Recent methods did not yield consistent performance improvements over single task learning (STL) baselines, underscoring the importance of gaining more profound insights about challenges specific to MTL. In our study, we challenge common assumptions in MTL in the context of STL: First, the choice of optimizer has only been mildly investigated in MTL. We show the pivotal role of common STL tools such as the Adam optimizer in MTL. We deduce the effectiveness of Adam to its partial loss-scale invariance. Second, the notion of gradient conflicts has often been phrased as a specific problem in MTL. We delve into the role of gradient conflicts in MTL and compare it to STL. For angular gradient alignment we find no evidence that this is a unique problem in MTL. We emphasize differences in gradient magnitude as the main distinguishing factor. Lastly, we compare the transferability of features learned through MTL and STL on common image corruptions, and find no conclusive evidence that MTL leads to superior transferability. Overall, we find surprising similarities between STL and MTL suggesting to consider methods from both fields in a broader context.
arXiv.org Artificial Intelligence
Nov-10-2023
- Country:
- Africa > Rwanda
- Europe > Germany
- Baden-Württemberg > Tübingen Region > Tübingen (0.14)
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- United States
- California > Los Angeles County
- Long Beach (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Maryland > Baltimore (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- California > Los Angeles County
- Canada
- South America > Chile
- Genre:
- Research Report > New Finding (0.66)
- Technology: