Optimal Stochastic Nonconvex Optimization with Bandit Feedback

Mar-30-2021–arXiv.org Machine Learning

In this paper, we analyze the continuous armed bandit problems for nonconvex cost functions under certain smoothness and sublevel set assumptions. We first derive an upper bound on the expected cumulative regret of a simple bin splitting method. We then propose an adaptive bin splitting method, which can significantly improve the performance. Furthermore, a minimax lower bound is derived, which shows that our new adaptive method achieves locally minimax optimal expected cumulative regret.

bin, query, splitting method, (14 more...)

arXiv.org Machine Learning

Mar-30-2021

arXiv.org PDF

Add feedback

Country:
- North America > United States > California > Yolo County > Davis (0.14)

Genre:
- Research Report (1.00)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.50)
  - Artificial Intelligence
    - Representation & Reasoning > Search (0.55)
    - Machine Learning > Statistical Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found