Towards Efficient Real-Time Video Motion Transfer via Generative Time Series Modeling