Layerwise Sparsifying Training and Sequential Learning Strategy for Neural Architecture Adaptation