Adaptive Low-Rank Regularization with Damping Sequences to Restrict Lazy Weights in Deep Networks