Derivative-Free & Order-Robust Optimisation

Gabillon, Victor, Tutunov, Rasul, Valko, Michal, Ammar, Haitham Bou

Oct-22-2019–arXiv.org Machine Learning

In this paper, we formalise order-robust optimisation as an instance of online learning minimising simple regret, and propose VROOM, a zero'th order optimisation algorithm capable of achieving vanishing regret in non-stationary environments, while recovering favorable rates under stochastic reward-generating processes. Our results are the first to target simple regret definitions in adversarial scenarios unveiling a challenge that has been rarely considered in prior work.

algorithm, log 2, simple regret, (16 more...)

arXiv.org Machine Learning

Oct-22-2019

arXiv.org PDF

Add feedback

Country:
- Europe (0.04)

Genre:
- Research Report > New Finding (0.34)

Industry:
- Education > Educational Setting (0.34)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.46)
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning > Optimization (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found