A Gradient Sampling Method With Complexity Guarantees for Lipschitz Functions in High and Low Dimensions

Neural Information Processing Systems 

Their work, however, relies on a nonstandard subgradient oracle model and requires the function to be directionally differentiable.