Surfing: Iterative Optimization Over Incrementally Trained Deep Networks

Ganlin Song, Zhou Fan, John Lafferty

Neural Information Processing Systems 

The approach is to optimize a sequence of objective functions that use network parameters obtained during different stages of the training process.