Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations