Online Learning under Delayed Feedback

Joulani, Pooria, György, András, Szepesvári, Csaba

Jun-4-2013, arXiv.org Artificial Intelligence

Online learning with delayed feedback has received increasing attention recently because of its many applications in distributed, web-based learning problems. In this paper we provide a systematic study of the topic and analyze the effect of delay on the regret of online learning algorithms. Somewhat surprisingly, delay increases the regret multiplicatively in adversarial problems and additively in stochastic problems. We give meta-algorithms that transform, in a black-box fashion, algorithms developed for the non-delayed case into ones that can handle delays in the feedback loop. We also develop modifications of the well-known UCB algorithm for the bandit problem with delayed feedback; these have the advantage over the meta-algorithms that they can be implemented with lower complexity.
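To make the delayed-bandit setting concrete, here is a minimal Python sketch of a UCB1-style learner whose indices are computed only from rewards that have already arrived. It is an illustration under stated assumptions, not the algorithm from the paper: the class name `DelayedUCB1`, the helper `simulate`, the fixed constant delay, and the Bernoulli rewards are all hypothetical choices made for this example.

```python
import math
import random
from collections import defaultdict


class DelayedUCB1:
    """UCB1-style learner that only uses rewards that have already arrived.

    Hypothetical illustration; not the exact algorithm from the paper.
    """

    def __init__(self, n_arms):
        self.n_arms = n_arms
        self.observed_counts = [0] * n_arms  # rewards received so far, per arm
        self.observed_sums = [0.0] * n_arms  # sum of received rewards, per arm
        self.t = 0                           # number of rounds played so far

    def select_arm(self):
        self.t += 1
        # An arm with no observed reward yet is treated as having an infinite index.
        for arm in range(self.n_arms):
            if self.observed_counts[arm] == 0:
                return arm

        def index(arm):
            n = self.observed_counts[arm]
            mean = self.observed_sums[arm] / n
            return mean + math.sqrt(2.0 * math.log(self.t) / n)

        return max(range(self.n_arms), key=index)

    def update(self, arm, reward):
        # Called when a (possibly old) reward finally reaches the learner.
        self.observed_counts[arm] += 1
        self.observed_sums[arm] += reward


def simulate(means, horizon, delay, seed=0):
    """Run the learner on Bernoulli arms with an assumed fixed feedback delay.

    A reward generated in round t becomes visible at the start of
    round t + 1 + delay, so delay=0 is the usual non-delayed setting.
    """
    rng = random.Random(seed)
    learner = DelayedUCB1(len(means))
    pending = defaultdict(list)  # arrival round -> list of (arm, reward)
    pulls = [0] * len(means)
    for t in range(horizon):
        # Deliver all feedback scheduled to arrive at the start of this round.
        for arm, reward in pending.pop(t, []):
            learner.update(arm, reward)
        arm = learner.select_arm()
        pulls[arm] += 1
        reward = 1.0 if rng.random() < means[arm] else 0.0
        pending[t + 1 + delay].append((arm, reward))
    return pulls
```

Calling `simulate([0.1, 0.9], horizon=2000, delay=10)` returns the number of pulls per arm. Comparing runs with different `delay` values gives a rough feel for how late feedback slows down the early exploration phase in a stochastic problem.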

algorithm, computer based training, educational technology

Country:
- Europe (0.93)
- North America
  - Canada > Alberta (0.28)
  - United States (1.00)

Industry:
- Education > Educational Setting > Online (1.00)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning (0.93)
  - Enterprise Applications > Human Resources
    - Learning Management (0.83)
