Understanding Long Range Memory Effects in Deep Neural Networks