A note on continuous-time online learning

May-16-2024–arXiv.org Machine Learning

In online learning, the data is provided in a sequential order, and the goal of the learner is to make online decisions to minimize overall regrets. This note is concerned with continuous-time models and algorithms for several online learning problems: online linear optimization, adversarial bandit, and adversarial linear bandit. For each problem, we extend the discrete-time algorithm to the continuous-time setting and provide a concise proof of the optimal regret bound.

algorithm, bandit, legendre transform, (13 more...)

arXiv.org Machine Learning

May-16-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California > Santa Clara County
    - Palo Alto (0.05)
    - Stanford (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.05)

Genre:
- Research Report (0.50)

Industry:
- Education > Educational Setting > Online (0.97)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (1.00)
  - Enterprise Applications > Human Resources
    - Learning Management (0.87)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found