In Defense of the Unitary Scalarization for Deep Multi-Task Learning

Oct-11-2024, 00:30:05 GMT–Neural Information Processing Systems

Recent multi-task learning research argues against unitary scalarization, where training simply minimizes the sum of the task losses. Several ad-hoc multi-task optimization algorithms have instead been proposed, inspired by various hypotheses about what makes multi-task settings difficult. The majority of these optimizers require per-task gradients, and introduce significant memory, runtime, and implementation overhead. We show that unitary scalarization, coupled with standard regularization and stabilization techniques from single-task learning, matches or improves upon the performance of complex multi-task optimizers in popular supervised and reinforcement learning settings. We then present an analysis suggesting that many specialized multi-task optimizers can be partly interpreted as forms of regularization, potentially explaining our surprising results.

deep multi-task learning, multi-task optimizer, unitary scalarization

Neural Information Processing Systems

Oct-11-2024, 00:30:05 GMT

Conferences Web Page

Add feedback

Genre:
- Play > Prospect (0.66)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.30)