Backpropagation Convergence Via Deterministic Nonmonotone Perturbed Minimization

Neural Information Processing Systems 

Under certain natural assumptions, such as divergence of the series of learning rates together with convergence of the series of their squares, it is established that every accumulation point of the online backpropagation (BP) iterates is a stationary point of the BP error function. The results presented cover serial and parallel online BP, modified BP with a momentum term, and BP with weight decay.
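
The learning-rate condition can be illustrated with a small sketch. Everything below is an assumption made for illustration and is not taken from the paper: the step-size schedule eta_i = eta0 / (i + 1) (one common choice whose series diverges while the series of its squares converges), the toy data, a single linear unit in place of a multilayer network, and the function names. The sketch runs serial online gradient descent, updating on one sample at a time, and then evaluates the gradient of the total error at the final iterate, which should be close to zero if the iterate is near a stationary point.

```python
def learning_rate(i, eta0=0.5):
    """Hypothetical schedule eta_i = eta0 / (i + 1).

    The series of eta_i diverges (harmonic series) while the series of
    eta_i**2 converges, matching the assumption quoted in the abstract.
    """
    return eta0 / (i + 1)


def online_gd(samples, w0, epochs):
    """Serial online gradient descent on a toy error function.

    Total error: f(w) = sum_j 0.5 * (w * x_j - y_j)**2, with one weight w.
    Each update uses the gradient of a single term, not of the full sum.
    """
    w = w0
    i = 0  # global iteration counter that drives the step size
    for _ in range(epochs):
        for x, y in samples:
            grad_j = (w * x - y) * x  # gradient of the j-th term only
            w -= learning_rate(i) * grad_j
            i += 1
    return w


def total_gradient(samples, w):
    """Gradient of the full error function f at w."""
    return sum((w * x - y) * x for x, y in samples)


# Toy data (assumed); the unique stationary point is sum(x*y) / sum(x*x).
samples = [(1.0, 2.0), (2.0, 3.0), (3.0, 7.0)]
w_final = online_gd(samples, w0=0.0, epochs=2000)
grad_norm = abs(total_gradient(samples, w_final))

print("final weight:", w_final)
print("|gradient of total error|:", grad_norm)
```

With a constant learning rate the iterates of such a scheme would typically keep oscillating within each pass over the data; the diminishing schedule is what lets the oscillation die out in this toy setting.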