Excess risk bounds for multitask learning with trace norm regularization
Maurer, Andreas, Pontil, Massimiliano
A fundamental limitation of supervised learning is the cost incurred by the preparation of the large training samples required for good generalization. A potential remedy is offered by multi-task learning: in many cases, while individual sample sizes are rather small, there are samples to represent a large number of learning tasks, which share some constraining or generative property. This common property can be estimated using the entire collection of training samples, and if this property is sufficiently simple it should allow better estimation of the individual tasks from small individual samples. The machine learning community has tried multi-task learning for many years (see [3, 4, 12, 13, 14, 20, 21, 26], contributions and references therein), but there are few theoretical investigations which clearly expose the conditions under which multi-task learning is preferable to independent learning. Following the seminal work of Baxter ([7, 8]) several authors have given generalization and 1 performance bounds under different assumptions of task-relatedness. In this paper we consider multi-task learning with trace-norm regularization (TNML), a technique for which efficient algorithms exist and which has been successfully applied many times (see e.g.
Jan-14-2013
- Country:
- Europe > Germany > North Rhine-Westphalia > Upper Bavaria > Munich (0.04)
- Genre:
- Research Report (0.50)
- Technology: