Improving SGD convergence by tracing multiple promising directions and estimating distances to their extrema

Open in new window