Stochastic Chebyshev Gradient Descent for Spectral Optimization

Neural Information Processing Systems 

A large class of machine learning techniques requires the solution of optimization problems involving spectral functions of parametric matrices, e.g.