ReLIZO: Sample Reusable Linear Interpolation-based Zeroth-order Optimization Xiaoxing Wang

May-28-2025, 15:52:15 GMT–Neural Information Processing Systems

Gradient estimation is critical in zeroth-order optimization methods, which aims to obtain the descent direction by sampling update directions and querying function evaluations. Extensive research has been conducted including smoothing and linear interpolation. The former methods smooth the objective function, causing a biased gradient estimation, while the latter often enjoys more accurate estimates, at the cost of large amounts of samples and queries at each iteration to update variables. This paper resorts to the linear interpolation strategy and proposes to reduce the complexity of gradient estimation by reusing queries in the prior iterations while maintaining the sample size unchanged. Specifically, we model the gradient estimation as a quadratically constrained linear program problem and manage to derive the analytical solution. It innovatively decouples the required sample size from the variable dimension without extra conditions required, making it able to leverage the queries in the prior iterations. Moreover, part of the intermediate variables that contribute to the gradient estimation can be directly indexed, significantly reducing the computation complexity.

artificial intelligence, machine learning, natural language, (20 more...)

Neural Information Processing Systems

May-28-2025, 15:52:15 GMT

Conferences PDF

Add feedback

Country:
- Asia > Middle East
  - Israel (0.14)
- North America > United States (0.28)

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (0.67)

Industry:
- Information Technology (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (1.00)
  - Natural Language (0.93)
  - Representation & Reasoning > Optimization (1.00)
  - Vision (0.93)