Exploiting Higher Order Smoothness in Derivative-free Optimization and Continuous Bandits

Oct-10-2024, 10:05:36 GMT–Neural Information Processing Systems

We address the problem of zero-order optimization of a strongly convex function. The goal is to find the minimizer of the function by a sequential exploration of its function values, under measurement noise. We study the impact of higher order smoothness properties of the function on the optimization error and on the online regret. To solve this problem we consider a randomized approximation of the projected gradient descent algorithm. The gradient is estimated by a randomized procedure involving two function evaluations and a smoothing kernel.

algorithm, derivative-free optimization and continuous bandit, exploiting higher order smoothness

Neural Information Processing Systems

Oct-10-2024, 10:05:36 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.44)