Training Dynamics of Deep Networks using Stochastic Gradient Descent via Neural Tangent Kernel