Efficient Hyper-parameter Optimization with Cubic Regularization

May-27-2025, 09:07:55 GMT–Neural Information Processing Systems

As hyper-parameters are ubiquitous and can significantly affect the model performance, hyper-parameter optimization is extremely important in machine learning. In this paper, we consider a sub-class of hyper-parameter optimization problems, where the hyper-gradients are not available. Such problems frequently appear when the performance metric is non-differentiable or the hyper-parameter is not continuous. However, existing algorithms, like Bayesian optimization and reinforcement learning, often get trapped in local optimals with poor performance. To address the above limitations, we propose to use cubic regularization to accelerate convergence and avoid saddle points.

artificial intelligence, efficient hyper-parameter optimization, machine learning, (2 more...)

Neural Information Processing Systems

May-27-2025, 09:07:55 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.83)