Online Multi-task Learning with Hard Constraints
Lugosi, Gabor, Papaspiliopoulos, Omiros, Stoltz, Gilles
We discuss multi-task online learning when a decision maker has to deal simultaneously with M tasks. The tasks are related, which is modeled by imposing that the M-tuple of actions taken by the decision maker needs to satisfy certain constraints. We give natural examples of such restrictions and then discuss a general class of tractable constraints, for which we introduce computationally efficient ways of selecting actions, essentially by reducing to an on-line shortest path problem. We briefly discuss "tracking" and "bandit" versions of the problem and extend the model in various ways, including non-additive global losses and uncountably infinite sets of tasks.
Mar-27-2009
- Country:
- North America > United States
- New York (0.05)
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- France > Île-de-France
- United Kingdom > England
- North America > United States
- Genre:
- Research Report (0.40)
- Industry:
- Education > Educational Setting > Online (0.48)
- Technology: