Online optimization and regret guarantees for non-additive long-term constraints

Jenatton, Rodolphe, Huang, Jim, Csiba, Dominik, Archambeau, Cedric

Jun-8-2016–arXiv.org Machine Learning

We consider online optimization in the 1-lookahead setting, where the objective does not decompose additively over the rounds of the online game. The resulting formulation enables us to deal with non-stationary and/or long-term constraints , which arise, for example, in online display advertising problems. We propose an on-line primal-dual algorithm for which we obtain dynamic cumulative regret guarantees. They depend on the convexity and the smoothness of the non-additive penalty, as well as terms capturing the smoothness with which the residuals of the non-stationary and long-term constraints vary over the rounds. We conduct experiments on synthetic data to illustrate the benefits of the non-additive penalty and show vanishing regret convergence on live traffic data collected by a display advertising platform in production.

artificial intelligence, constraint, information technology services, (19 more...)

arXiv.org Machine Learning

Jun-8-2016

arXiv.org PDF

Add feedback

Country:
- Europe (0.28)
- North America > United States (0.46)

Genre:
- Research Report (0.40)

Industry:
- Information Technology > Services (0.54)
- Marketing (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Representation & Reasoning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found